A publish-subscribe (pub/sub) messaging system decouples producers (publishers) from consumers (subscribers) through a message broker. Publishers emit events to topics without knowing who consumes them; subscribers register interest in topics and receive messages asynchronously. This pattern powers event-driven architectures, real-time notifications, and microservice communication. Systems like Kafka, Google Pub/Sub, AWS SNS/SQS, and RabbitMQ implement variations of pub/sub with different consistency and delivery guarantees.
Core Components
A pub/sub system has five core components: Topics (logical channels named by subject, e.g., order.created), Publishers (produce messages to a topic without knowing consumers), Subscriptions (a named durable association between a topic and a consumer group), Messages (payload + metadata: message ID, publish timestamp, ordering key, attributes), and Brokers (store messages, match subscriptions, deliver to consumers). The broker must handle fan-out: if 10 subscriptions exist on a topic, each published message must be delivered to all 10.
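Fan-out can be sketched with a toy in-memory broker: every message published to a topic is delivered to every subscription registered on that topic. The Broker, Subscribe, and Publish names here are illustrative, not any specific system's API.

```go
package main

import "fmt"

// Broker is a toy in-memory broker illustrating fan-out: every message
// published to a topic is delivered to all subscriptions on that topic.
type Broker struct {
	subs map[string][]chan string // topic -> subscription channels
}

func NewBroker() *Broker {
	return &Broker{subs: make(map[string][]chan string)}
}

// Subscribe registers a new subscription on a topic and returns its channel.
func (b *Broker) Subscribe(topic string) <-chan string {
	ch := make(chan string, 16)
	b.subs[topic] = append(b.subs[topic], ch)
	return ch
}

// Publish delivers the payload to every subscription on the topic.
func (b *Broker) Publish(topic, payload string) {
	for _, ch := range b.subs[topic] {
		ch <- payload
	}
}

func main() {
	b := NewBroker()
	s1 := b.Subscribe("order.created")
	s2 := b.Subscribe("order.created")
	b.Publish("order.created", "order#1")
	fmt.Println(<-s1, <-s2) // both subscriptions receive the same message
}
```

A real broker adds durability, delivery retries, and per-subscription cursors on top of this basic matching step.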
Message Storage and Partitioning
Messages are stored in partitioned logs (Kafka-style) or per-subscription queues. Partition-based storage: each topic has N partitions; messages are assigned to partitions by a routing key hash (e.g., user_id). Within a partition, messages are ordered and assigned sequential offsets. Each subscription tracks its offset per partition. Benefits: horizontal scalability (add partitions), ordering guarantees within a partition, and efficient replay (seek to offset). Trade-off: messages with different partition keys are unordered relative to each other.
// Message structure (requires "time")
type Message struct {
	ID          string // globally unique
	Topic       string
	Partition   int
	Offset      int64 // monotonically increasing per partition
	Key         string // routing key (determines partition)
	Payload     []byte
	Attributes  map[string]string
	PublishedAt time.Time
}

// Partition assignment: hash the routing key so the same key always
// maps to the same partition (requires "hash/fnv"). Taking the modulo
// in uint32 before converting avoids a negative result on platforms
// where int is 32 bits.
func partition(key string, numPartitions int) int {
	h := fnv.New32a()
	h.Write([]byte(key))
	return int(h.Sum32() % uint32(numPartitions))
}
Delivery Semantics
Three delivery semantics exist: At-most-once: acknowledge before processing; if consumer crashes, message is lost. Lowest latency. At-least-once: acknowledge after processing; if consumer crashes before ack, message redelivers. Consumer must be idempotent. Exactly-once: requires transactional producers + idempotent consumers + offset commit in the same transaction. Kafka supports this with transactional APIs, but at higher latency cost. Most systems use at-least-once with idempotent consumers — the pragmatic default.
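The pragmatic at-least-once default can be sketched as an idempotent consumer: deduplicate on message ID before processing, and acknowledge only after processing succeeds. The handle, process, and ack names here are illustrative, not a real client API, and the in-memory dedupe map stands in for a durable store.

```go
package main

import "fmt"

// seen records processed message IDs so redeliveries become no-ops.
// In production this would be a durable store keyed by message ID,
// not an in-memory map.
var seen = make(map[string]bool)

// handle processes a message idempotently under at-least-once delivery:
// a redelivered message with a known ID is skipped, and the ack happens
// only after successful processing.
func handle(id, payload string, process func(string) error, ack func(string)) error {
	if seen[id] {
		ack(id) // already processed: ack so the broker stops redelivering
		return nil
	}
	if err := process(payload); err != nil {
		return err // no ack: the broker will redeliver
	}
	seen[id] = true
	ack(id) // ack after processing, never before
	return nil
}

func main() {
	processed := 0
	process := func(p string) error { processed++; return nil }
	ack := func(id string) {}
	handle("m1", "hello", process, ack)
	handle("m1", "hello", process, ack) // simulated redelivery: deduplicated
	fmt.Println(processed)              // 1
}
```

Acking before the process call instead would turn this into at-most-once: a crash mid-processing loses the message.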
Consumer Groups and Load Balancing
A consumer group is a set of consumer instances that collectively consume a subscription. Each partition is assigned to exactly one consumer in the group at a time (exclusive ownership). The broker maintains partition assignments and handles rebalancing when consumers join or leave. With N partitions and M consumers (M ≤ N), each consumer owns roughly N/M partitions. This provides parallel processing without duplicate delivery within the group. Multiple independent consumer groups on the same topic each receive all messages (fan-out).
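The assignment step can be sketched as a round-robin spread of partitions over consumers, so every partition has exactly one owner and loads differ by at most one partition. The assignPartitions name is illustrative; real brokers such as Kafka use pluggable assignor strategies.

```go
package main

import "fmt"

// assignPartitions spreads numPartitions across consumers so that each
// partition has exactly one owner (exclusive ownership) and per-consumer
// loads differ by at most one partition.
func assignPartitions(consumers []string, numPartitions int) map[string][]int {
	out := make(map[string][]int, len(consumers))
	for p := 0; p < numPartitions; p++ {
		owner := consumers[p%len(consumers)] // round-robin over the group
		out[owner] = append(out[owner], p)
	}
	return out
}

func main() {
	a := assignPartitions([]string{"c1", "c2", "c3"}, 8)
	fmt.Println(a["c1"], a["c2"], a["c3"]) // [0 3 6] [1 4 7] [2 5]
}
```

Rerunning this function with the new membership list is, in essence, what a rebalance does when a consumer joins or leaves.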
Dead Letter Topics
When a consumer fails to process a message after max retries, move it to a dead letter topic (DLT): order.created.dlq. A DLT preserves the original message with metadata: failure reason, attempt count, original topic, original timestamp. A separate consumer processes DLT messages for alerting, manual review, or replay after fixing the bug. Configure DLT routing per subscription: Google Pub/Sub and SQS support this natively. Without DLTs, poison pill messages (malformed payloads) block partition processing indefinitely.
Key Interview Discussion Points
- Push vs. pull delivery: push (broker sends to consumer endpoint) can overwhelm slow consumers, so the broker needs flow control; pull (consumer polls broker) gives consumers natural backpressure by fetching at their own pace, at the cost of polling overhead
- Message ordering: ordering guaranteed within a partition key; across partitions, use a sequencer service or accept out-of-order processing
- Backpressure: consumer groups track lag (current offset vs. latest offset); alert when lag grows, scale consumers when lag persists
- Schema registry: enforce Avro/Protobuf schemas on publish to prevent malformed messages from entering the pipeline
- Compacted topics: retain only the latest message per key — useful for changelog topics (database change events, config updates)
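The backpressure point above can be sketched as a lag computation: lag is the gap between the latest offset the broker has assigned and the offset the consumer group has committed, summed over partitions. The type and field names here are illustrative.

```go
package main

import "fmt"

// PartitionState tracks the head of a partition's log and a consumer
// group's committed position within it.
type PartitionState struct {
	LatestOffset    int64 // next offset the broker will assign
	CommittedOffset int64 // next offset the group will consume
}

// totalLag sums per-partition lag for a consumer group: alert when it
// exceeds a threshold, scale out consumers when it keeps growing.
func totalLag(parts []PartitionState) int64 {
	var lag int64
	for _, p := range parts {
		lag += p.LatestOffset - p.CommittedOffset
	}
	return lag
}

func main() {
	parts := []PartitionState{
		{LatestOffset: 1200, CommittedOffset: 1100}, // 100 behind
		{LatestOffset: 540, CommittedOffset: 540},   // caught up
	}
	fmt.Println(totalLag(parts)) // 100
}
```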