Backpressure and Flow Control: Low-Level Design

Backpressure is a mechanism that allows a downstream component to signal an upstream component to slow down — preventing fast producers from overwhelming slow consumers. Without backpressure, intermediate buffers grow without bound until memory is exhausted, causing out-of-memory crashes or massive latency spikes. Effective backpressure design is foundational to building reliable data pipelines, streaming systems, and any producer-consumer architecture.

Why Unbounded Queues Fail

An unbounded queue between a fast producer and a slow consumer appears to solve the speed mismatch: the producer enqueues at its rate; the consumer dequeues at its rate. But if the producer is persistently faster than the consumer, the queue grows without limit. At a 10 MB/s production rate and an 8 MB/s consumption rate, the backlog grows at 2 MB/s; after 10 minutes, 1.2 GB is queued. The system eventually runs out of memory, dies, and loses all queued data. Bounded queues force the producer to deal with backpressure explicitly.
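The backlog arithmetic above can be checked directly; a minimal sketch using the rates from the text:

```go
package main

import "fmt"

func main() {
	// Backlog growth for a persistently mismatched producer/consumer,
	// using the rates from the text (10 MB/s in, 8 MB/s out).
	produceMBps := 10.0
	consumeMBps := 8.0
	seconds := 600.0 // 10 minutes

	backlogMB := (produceMBps - consumeMBps) * seconds
	fmt.Printf("backlog after %.0f s: %.0f MB (%.1f GB)\n",
		seconds, backlogMB, backlogMB/1024)
	// prints: backlog after 600 s: 1200 MB (1.2 GB)
}
```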

Backpressure Strategies

Block the Producer

When the queue is full, block the producing thread until space is available. Go channels with fixed capacity implement this naturally: sending to a full channel blocks until a receiver reads an item. This propagates pressure upstream: the slow consumer slows the producer, which slows whatever feeds the producer. This approach is simple and correct for in-process pipelines, but problematic when blocking the producer also blocks I/O, or when the blocked thread holds locks and causes deadlock.

Drop with Policy

When the buffer is full, drop incoming items according to a policy: drop the newest (fail-fast: tell the producer to retry later), drop the oldest (sliding window: keep the most recent data, lose old data), or drop randomly (probabilistic shedding). Dropping is appropriate when data has bounded utility — a metrics sample from 5 minutes ago is less valuable than the current one, so dropping old metrics under load is acceptable. Never drop without emitting a counter — track drops as a critical metric.
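The first two policies can be sketched with a non-blocking `select` on a bounded channel. This is a single-goroutine illustration; a concurrent drop-oldest would need a mutex or a ring buffer, and the helper names (`offerDropNewest`, `offerDropOldest`, `drain`) are illustrative, not a standard API:

```go
package main

import "fmt"

// offerDropNewest rejects the incoming item when the buffer is full (fail-fast).
func offerDropNewest(buf chan int, v int, dropped *int) {
	select {
	case buf <- v:
	default:
		*dropped++ // never drop silently: count it
	}
}

// offerDropOldest evicts the oldest item to make room (sliding window).
func offerDropOldest(buf chan int, v int, dropped *int) {
	for {
		select {
		case buf <- v:
			return
		default:
			select {
			case <-buf: // evict the oldest queued item
				*dropped++
			default:
			}
		}
	}
}

// drain closes the channel and returns its remaining contents.
func drain(ch chan int) []int {
	close(ch)
	var out []int
	for v := range ch {
		out = append(out, v)
	}
	return out
}

func main() {
	newest, oldest := make(chan int, 3), make(chan int, 3)
	var dn, do int
	for v := 1; v <= 5; v++ {
		offerDropNewest(newest, v, &dn)
		offerDropOldest(oldest, v, &do)
	}
	fmt.Println("drop-newest kept:", drain(newest), "dropped:", dn) // [1 2 3], 2
	fmt.Println("drop-oldest kept:", drain(oldest), "dropped:", do) // [3 4 5], 2
}
```

Note how the two policies keep opposite ends of the stream: fail-fast preserves the oldest items, the sliding window preserves the newest.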

Reactive Pull Model

Instead of the producer pushing data to the consumer, the consumer pulls: “I am ready for N more items, send them.” The producer sends exactly N items and waits for the next pull request. This is the Reactive Streams protocol (used by Akka Streams, Project Reactor, RxJava). The consumer controls the flow rate entirely — it requests more only when it has capacity to process them. Kafka consumers implement this: the consumer polls for a batch of messages, processes them, commits offsets, then polls again. The producer (Kafka broker) only delivers what is requested.
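A stripped-down pull protocol can be sketched with two channels, one carrying demand and one carrying items. This mirrors the request(n) idea from Reactive Streams but is a toy, not the actual protocol API:

```go
package main

import "fmt"

func main() {
	// Pull-based flow: the consumer signals demand ("N more"), and the
	// producer emits exactly N items, then waits for the next request.
	demand := make(chan int)
	items := make(chan int)

	go func() { // producer: sends only what was requested
		next := 0
		for n := range demand {
			for i := 0; i < n; i++ {
				items <- next
				next++
			}
		}
		close(items)
	}()

	// consumer: pulls two batches of 3, processing each before re-requesting
	for batch := 0; batch < 2; batch++ {
		demand <- 3
		for i := 0; i < 3; i++ {
			fmt.Println("got", <-items) // prints got 0 .. got 5
		}
	}
	close(demand)
}
```

The consumer never receives more than it asked for, so its buffering needs are bounded by its own request size, just as a Kafka consumer bounds its work by the batch it polls.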

TCP Flow Control

TCP’s receive window implements backpressure at the network level. The receiver advertises how many bytes its buffer can accept (the receive window). The sender limits in-flight data to the window size. When the receiver’s buffer fills (slow application layer), the window shrinks to zero — the sender stops transmitting. When the application reads from the buffer, the window expands. This mechanism propagates backpressure from a slow application all the way back to the remote sender without any application-level protocol changes.

Backpressure in Stream Processing

Apache Flink implements end-to-end backpressure: if a downstream operator is slow, it stops reading from its input buffers. This fills the upstream operator’s output buffer. The upstream operator blocks when its output buffer is full, which fills its own input buffer, which propagates back to the source. The entire pipeline slows to the rate of the slowest stage — no data is lost, no buffers overflow. Monitor backpressure ratios in Flink’s metrics: a stage with 100% backpressure is the bottleneck; optimize it or add parallelism.

Design Checklist

For any producer-consumer system: (1) use bounded buffers — never unbounded queues in production; (2) decide the backpressure strategy (block, drop, pull) based on the data’s value and latency requirements; (3) emit metrics for queue depth and drop rate — monitor them; (4) test under sustained overload — run a load test where the producer rate exceeds consumer capacity for 10 minutes and verify the system stabilizes rather than crashing; (5) propagate backpressure through the entire chain — a bottleneck in stage 3 should slow stage 1, not fill stage 2’s buffer.
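Item (4) can be sketched as a deterministic, single-threaded overload test: produce at twice the consumption rate for many ticks, then assert that queue depth stays bounded and every overflow was counted rather than crashing the process. The rates, tick count, and capacity here are arbitrary:

```go
package main

import "fmt"

func main() {
	// Overload test sketch: sustained 2:1 producer/consumer mismatch.
	// With a bounded queue and a drop counter, the system stabilizes:
	// depth stays at or below capacity and overflow is observable.
	const capacity = 100
	queue := make(chan int, capacity)
	produced, consumed, dropped := 0, 0, 0

	for tick := 0; tick < 10000; tick++ {
		for i := 0; i < 2; i++ { // producer: 2 items per tick
			produced++
			select {
			case queue <- tick:
			default:
				dropped++ // bounded: overflow is counted, not fatal
			}
		}
		select { // consumer: 1 item per tick (half the producer's rate)
		case <-queue:
			consumed++
		default:
		}
	}

	fmt.Printf("produced=%d consumed=%d dropped=%d depth=%d (cap %d)\n",
		produced, consumed, dropped, len(queue), capacity)
}
```

The invariant worth asserting in a real test is conservation: every produced item is either consumed, dropped (and counted), or still queued within the bound.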
