Inbox Pattern Low-Level Design: Exactly-Once Message Processing

The inbox pattern is the receiving counterpart to the outbox pattern. Where the outbox ensures a service reliably publishes events it produces, the inbox ensures a service reliably processes events it receives — exactly once, even when the message broker delivers the same message multiple times (at-least-once delivery). Without an inbox, a consumer that crashes mid-processing will re-process the message on restart, potentially duplicating side effects like charging a credit card or sending an email.

Core Data Model

CREATE TABLE InboxMessage (
    message_id      VARCHAR(255) PRIMARY KEY,  -- unique ID from the message broker
    source          VARCHAR(100) NOT NULL,      -- which service/topic produced this
    event_type      VARCHAR(100) NOT NULL,      -- 'order.created', 'payment.completed'
    payload         JSONB NOT NULL,
    status          VARCHAR(20) NOT NULL DEFAULT 'pending',
    -- 'pending' -> 'processing' -> 'processed' | 'failed'
    received_at     TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    processed_at    TIMESTAMPTZ,
    attempt_count   INT NOT NULL DEFAULT 0,
    error_message   TEXT
);

CREATE INDEX idx_inbox_status ON InboxMessage(status, received_at)
    WHERE status IN ('pending', 'failed');

Idempotent Message Processing

import json

# `db` is the application's database helper, assumed here: `transaction()`
# yields a transactional context, `execute()` returns the affected row count.

def handle_message(message_id: str, source: str, event_type: str, payload: dict):
    """
    Called by the Kafka/RabbitMQ consumer for each incoming message.
    Guarantees exactly-once processing via the inbox table.
    """
    with db.transaction():
        # Attempt to insert the message ID into the inbox
        # ON CONFLICT DO NOTHING: if already processed, skip silently
        affected = db.execute("""
            INSERT INTO InboxMessage (message_id, source, event_type, payload, status)
            VALUES (%s, %s, %s, %s, 'pending')
            ON CONFLICT (message_id) DO NOTHING
        """, [message_id, source, event_type, json.dumps(payload)])

        if affected == 0:
            # Duplicate delivery -- already processed or in progress.
            # (Rows stuck in 'processing' after a crash need a separate
            # reaper job that resets them to 'pending' after a timeout.)
            return

        # Mark as processing (within same transaction)
        db.execute("""
            UPDATE InboxMessage
            SET status = 'processing', attempt_count = attempt_count + 1
            WHERE message_id = %s
        """, [message_id])

    # Business logic runs outside the inbox transaction
    # so the inbox row is committed before we start processing
    try:
        process_event(event_type, payload)
        db.execute("""
            UPDATE InboxMessage
            SET status = 'processed', processed_at = NOW()
            WHERE message_id = %s
        """, [message_id])
    except Exception as e:
        db.execute("""
            UPDATE InboxMessage
            SET status = 'failed', error_message = %s
            WHERE message_id = %s
        """, [str(e), message_id])
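Kafka does not attach a globally unique message ID to each record, so consumers commonly derive one from the (topic, partition, offset) triple, which identifies a record uniquely within a cluster and stays stable across redeliveries. A minimal sketch (the helper name is illustrative, not from the article):

```python
def broker_message_id(topic: str, partition: int, offset: int) -> str:
    # (topic, partition, offset) uniquely identifies a Kafka record;
    # a redelivered record carries the same triple, so this ID is
    # stable enough to key the inbox table on.
    return f"{topic}:{partition}:{offset}"

msg_id = broker_message_id("orders", 3, 1042)  # "orders:3:1042"
# handle_message(msg_id, "order-service", "order.created", payload)
```

For brokers that do assign an ID (e.g. SQS MessageId), use that directly instead.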

Transactional Inbox (Atomic with Business Data)

For maximum reliability, wrap both the inbox INSERT and the business operation in a single transaction:

def handle_order_created(message_id: str, payload: dict):
    with db.transaction():
        # Idempotency guard
        affected = db.execute("""
            INSERT INTO InboxMessage (message_id, source, event_type, payload, status)
            VALUES (%s, 'order-service', 'order.created', %s, 'processed')
            ON CONFLICT (message_id) DO NOTHING
        """, [message_id, json.dumps(payload)])

        if affected == 0:
            return  # duplicate

        # Business operation in the same transaction -- atomic with idempotency guard
        db.execute("""
            INSERT INTO Fulfillment (order_id, status, created_at)
            VALUES (%s, 'pending', NOW())
            ON CONFLICT (order_id) DO NOTHING
        """, [payload['order_id']])
        # If this transaction commits, both the inbox record and fulfillment row exist.
        # If it rolls back, neither exists. Broker will redeliver; we process again safely.
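The atomicity argument above can be exercised end-to-end with an in-memory SQLite database. This is a pared-down sketch, not the article's Postgres schema: `INSERT OR IGNORE` stands in for `ON CONFLICT DO NOTHING`, and the tables carry only the columns the demo needs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE InboxMessage (message_id TEXT PRIMARY KEY, status TEXT);
    CREATE TABLE Fulfillment  (order_id   TEXT PRIMARY KEY, status TEXT);
""")

def handle_order_created(message_id: str, order_id: str) -> str:
    with conn:  # one transaction: idempotency guard + business row together
        cur = conn.execute(
            "INSERT OR IGNORE INTO InboxMessage VALUES (?, 'processed')",
            (message_id,))
        if cur.rowcount == 0:
            return "duplicate"  # guard row already exists: skip silently
        conn.execute(
            "INSERT OR IGNORE INTO Fulfillment VALUES (?, 'pending')",
            (order_id,))
        return "processed"

print(handle_order_created("msg-1", "order-42"))  # processed
print(handle_order_created("msg-1", "order-42"))  # duplicate (redelivery)
print(conn.execute("SELECT COUNT(*) FROM Fulfillment").fetchone()[0])  # 1
```

The second call is a no-op because the first call committed the guard row and the fulfillment row atomically.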

Retry and Dead-Letter Queue

def retry_failed_messages(max_attempts: int = 3):
    """Background job: retry failed inbox messages with exponential backoff."""
    with db.transaction():  # FOR UPDATE locks must be held inside a transaction
        failed = db.fetchall("""
            SELECT * FROM InboxMessage
            WHERE status = 'failed'
              AND attempt_count < %s
              AND received_at > NOW() - INTERVAL '24 hours'
            ORDER BY received_at ASC
            LIMIT 50
            FOR UPDATE SKIP LOCKED
        """, [max_attempts])

        for msg in failed:
            # 30s, 2m, 8m, ... capped at one hour
            backoff = min(30 * (4 ** msg['attempt_count']), 3600)
            # Only retry once the backoff window has elapsed. The schema has
            # no last-attempt timestamp, so received_at is the approximation.
            db.execute("""
                UPDATE InboxMessage SET status = 'pending'
                WHERE message_id = %s
                  AND received_at < NOW() - make_interval(secs => %s)
                  AND processed_at IS NULL  -- not already succeeded in parallel
            """, [msg['message_id'], backoff])
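The backoff formula above yields the schedule 30s, 2m, 8m, 32m, then caps at one hour:

```python
def retry_backoff_seconds(attempt_count: int) -> int:
    # Same formula as retry_failed_messages: base 30s, quadrupling
    # per attempt, capped at 3600s (one hour).
    return min(30 * (4 ** attempt_count), 3600)

print([retry_backoff_seconds(n) for n in range(5)])
# [30, 120, 480, 1920, 3600]
```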

def move_to_dlq():
    """Move exhausted messages to dead-letter table for manual inspection."""
    with db.transaction():
        # DELETE ... RETURNING moves rows atomically: they cannot be picked
        # up again by the retry job, and re-runs don't insert duplicates.
        db.execute("""
            WITH moved AS (
                DELETE FROM InboxMessage
                WHERE status = 'failed' AND attempt_count >= 3
                RETURNING *
            )
            INSERT INTO InboxDeadLetter SELECT * FROM moved
        """)
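move_to_dlq references an InboxDeadLetter table that is not defined above. A minimal assumption is a structural copy of InboxMessage, so that the `SELECT *` insert lines up column-for-column:

```sql
-- Assumed dead-letter table: same columns as InboxMessage, so
-- INSERT INTO InboxDeadLetter SELECT * FROM InboxMessage works as-is.
CREATE TABLE InboxDeadLetter (LIKE InboxMessage INCLUDING ALL);
```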

Key Interview Points

  • The inbox PRIMARY KEY on message_id plus ON CONFLICT DO NOTHING is the entire idempotency mechanism — it’s a deduplication table keyed by the broker’s message ID.
  • Message brokers guarantee at-least-once delivery, not exactly-once. The inbox pattern converts at-least-once delivery into exactly-once business effect.
  • Inbox vs outbox: outbox is about reliable publishing (you wrote to DB, you must publish to broker). Inbox is about reliable consuming (broker delivered to you, you must process exactly once). Use both for end-to-end reliability.
  • The transactional inbox (business operation in the same DB transaction as the inbox INSERT) is the strongest form — no window where the inbox record exists but the business effect hasn’t happened yet.
  • Prune processed inbox messages after a retention period (e.g., 7 days) to prevent unbounded table growth. Keep enough history to deduplicate late-arriving duplicates.
  • Alert on inbox dead-letter queue depth — failed messages represent lost business events that need investigation.
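The pruning job mentioned above can be a nightly DELETE; the 7-day retention is an assumption to tune against your broker's worst-case redelivery window:

```sql
-- Nightly retention job: drop processed rows older than the dedup window.
DELETE FROM InboxMessage
WHERE status = 'processed'
  AND processed_at < NOW() - INTERVAL '7 days';
```

At very high throughput, partitioning InboxMessage by received_at and dropping old partitions avoids row-by-row deletes.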


