Question 1

What is the dual write problem and how does the outbox pattern solve it?

Accepted Answer

The dual write problem: a service needs to update its database AND publish an event to Kafka. These are separate systems with no shared transaction. If the database write succeeds but the Kafka publish fails, downstream services miss the event. If the Kafka publish succeeds but the database write fails, downstream services act on a change that never happened.

The outbox pattern solves this: write the event to an outbox table in the SAME database as the business data, in the SAME transaction. Both writes succeed or both fail -- database atomicity guarantees consistency. A separate relay process reads the outbox table and publishes the events to Kafka. Options: polling (query for unpublished events on a short interval, for example every second, and mark each event as published once Kafka acknowledges it) or CDC with Debezium (reads the database transaction log and publishes automatically, typically with sub-second latency; it tracks its own position in the log, so no published flag is needed).

This converts the dual write into a single atomic database write plus a reliable async publish. Delivery is at-least-once: if the relay crashes after publishing but before recording its progress, the event is published again, so consumers must tolerate duplicates.
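The polling variant described above can be sketched as follows. This is a minimal illustration, not a production relay: it uses an in-memory SQLite database in place of the service's real database, the table and column names (`orders`, `outbox`, `published`) are assumptions, and `publish` is a hypothetical callback standing in for a Kafka producer.

```python
import json
import sqlite3


def init_db(conn):
    # Hypothetical schema: one business table plus the outbox table.
    conn.executescript("""
        CREATE TABLE orders (id INTEGER PRIMARY KEY, item TEXT NOT NULL);
        CREATE TABLE outbox (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            topic TEXT NOT NULL,
            payload TEXT NOT NULL,
            published INTEGER NOT NULL DEFAULT 0
        );
    """)


def create_order(conn, order_id, item):
    # One transaction: the order row and the outbox row commit together,
    # or the whole block rolls back if either insert raises.
    with conn:
        conn.execute("INSERT INTO orders (id, item) VALUES (?, ?)", (order_id, item))
        conn.execute(
            "INSERT INTO outbox (topic, payload) VALUES (?, ?)",
            ("orders", json.dumps({"type": "OrderCreated", "order_id": order_id})),
        )


def relay_once(conn, publish):
    """Publish pending outbox rows in insertion order; return how many were sent."""
    rows = conn.execute(
        "SELECT id, topic, payload FROM outbox WHERE published = 0 ORDER BY id"
    ).fetchall()
    sent = 0
    for event_id, topic, payload in rows:
        publish(topic, payload)  # if this raises, the row stays unpublished
        with conn:
            conn.execute("UPDATE outbox SET published = 1 WHERE id = ?", (event_id,))
        sent += 1
    return sent
```

A caller would run `relay_once` on a timer. Because the row is marked only after `publish` returns, a crash between the two steps causes a repeat publish on the next pass rather than a lost event.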

Question 2

What is the difference between saga orchestration and choreography?

Accepted Answer

Choreography: each service listens for events and independently decides what to do next. Order Service publishes OrderCreated. Payment Service hears it, charges the card, and publishes PaymentProcessed. Inventory Service hears PaymentProcessed and reserves items. No central coordinator. Pros: simple, decoupled. Cons: hard to follow the full workflow across services, difficult to handle complex compensation logic, and adding or modifying steps requires changes in multiple services. Best for: simple 3-4 step flows.

Orchestration: a central saga orchestrator service drives the workflow. It sends commands to each service in sequence, handles responses, and triggers compensations on failure. The orchestrator holds the complete workflow definition. Pros: clear workflow visibility, easier to debug, simpler compensation logic. Cons: the orchestrator is a potential bottleneck and single point of failure (mitigate by running it in a highly available setup). Best for: complex multi-step workflows with branching and conditional logic. Orchestration is the more common choice for non-trivial sagas.
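The two styles can be contrasted in a small in-process sketch. Everything here is illustrative: `EventBus` is a toy stand-in for a message broker, `InventoryReserved` is an assumed event name that does not appear in the answer above, and `run_orchestrated` is a hypothetical helper, not a real saga framework.

```python
from collections import defaultdict


class EventBus:
    """Toy in-memory broker: delivers each event to its subscribers synchronously."""

    def __init__(self):
        self.handlers = defaultdict(list)
        self.log = []  # event types in the order they were published

    def subscribe(self, event_type, handler):
        self.handlers[event_type].append(handler)

    def publish(self, event_type, data):
        self.log.append(event_type)
        for handler in self.handlers[event_type]:
            handler(data)


def wire_choreography(bus):
    # Each service knows only the event it reacts to and the event it emits.
    # The overall workflow exists nowhere as a single definition.
    bus.subscribe("OrderCreated", lambda d: bus.publish("PaymentProcessed", d))
    bus.subscribe("PaymentProcessed", lambda d: bus.publish("InventoryReserved", d))


def run_orchestrated(steps, order):
    """Orchestration: steps is a list of (name, action, compensation) tuples.

    Runs the actions in order. If one raises, runs the compensations of the
    already completed steps in reverse order and returns False.
    """
    completed = []
    for name, action, compensation in steps:
        try:
            action(order)
        except Exception:
            for _, _, undo in reversed(completed):
                undo(order)
            return False
        completed.append((name, action, compensation))
    return True
```

In the choreographed version, adding a step means touching the subscriptions of neighbouring services. In the orchestrated version, it means editing the single `steps` list, which is the visibility trade-off described above.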

Question 3

How do compensating transactions work in sagas?

Accepted Answer

When a saga step fails, previously completed steps must be undone via compensating transactions -- semantic reversals, not true rollbacks. Example: in an order saga, step 3 (inventory) fails. Compensations run in reverse order: for step 2, Payment Service issues a refund; for step 1, Order Service marks the order as cancelled.

Design requirements:
(1) Idempotent -- compensations may be retried (for example after a network failure), so a refund request delivered twice must pay out only once. Use idempotency keys.
(2) Handle partial state -- the original operation may have only partially completed, and the compensation must cope with that.
(3) Order-independent -- in choreography, compensations may arrive in any order.

Irreversible actions: some operations cannot be compensated (sending an email, shipping a package). Strategy: place irreversible actions at the END of the saga. If an earlier step fails, the irreversible action has not occurred. If the irreversible step itself fails, all previous steps have already succeeded, so the partial completion may be acceptable.
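Requirements (1) and (2) can be sketched with an idempotency key on the refund. This is a minimal in-memory illustration: `PaymentService`, its fields, and the key format are assumptions made for the example, and a real service would store the processed keys durably in the same transaction as the refund.

```python
class PaymentService:
    """Toy payment service whose refund is safe to retry."""

    def __init__(self):
        self.charges = {}        # order_id -> amount currently charged
        self.refunds = {}        # idempotency key -> result already returned
        self.refunded_total = 0

    def charge(self, order_id, amount):
        self.charges[order_id] = amount

    def refund(self, idempotency_key, order_id):
        # Retry of a request we already handled: return the stored result
        # without moving money again.
        if idempotency_key in self.refunds:
            return self.refunds[idempotency_key]

        # Partial state: the charge may never have happened, in which case
        # the compensation is a recorded no-op rather than an error.
        amount = self.charges.pop(order_id, None)
        if amount is None:
            result = "nothing_to_refund"
        else:
            self.refunded_total += amount
            result = "refunded"

        self.refunds[idempotency_key] = result
        return result
```

Calling `refund("saga-1:refund", "o1")` twice after a single charge refunds the amount once and returns the same result both times. The key would typically be derived from the saga ID and step so that every retry of the same compensation reuses it.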

Question 4

When should you use the outbox pattern versus direct event publishing?

Accepted Answer

Default to the outbox pattern whenever a service needs to update its database and publish an event. Direct publishing (write to DB, then publish to Kafka) has the dual write problem: if the publish fails or the service crashes between the DB write and the publish, the event is lost. A DB write followed by a Kafka publish cannot be made atomic, because the two systems share no transaction.

The outbox pattern is the standard solution: write the business data and the event in the same DB transaction. A separate process publishes events from the outbox to Kafka. With Debezium CDC, this typically adds sub-second latency and requires no polling.

The only case where direct publishing is acceptable: when losing an occasional event is tolerable (analytics, non-critical logging) AND the probability of failure between the DB write and the publish is acceptably low. For any business-critical event (order created, payment processed, user registered), use the outbox pattern. In interviews, mentioning the outbox pattern demonstrates awareness of a subtle but critical consistency issue that many candidates miss.
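The failure window in direct publishing can be made concrete with a short simulation. The names are hypothetical and the two lists merely stand in for the database and the Kafka topic; the point is only that nothing durable records that an event is still owed after the crash.

```python
class ProcessCrash(Exception):
    """Stands in for the service dying between the two writes."""


def save_then_publish(db_rows, broker_events, order_id, crash_after_commit=False):
    db_rows.append(order_id)  # write 1: committed to the database
    if crash_after_commit:
        # The process dies here; no record exists that an event must be sent.
        raise ProcessCrash(order_id)
    broker_events.append(("OrderCreated", order_id))  # write 2: separate system


db_rows, broker_events = [], []
save_then_publish(db_rows, broker_events, 1)
try:
    save_then_publish(db_rows, broker_events, 2, crash_after_commit=True)
except ProcessCrash:
    pass

# Orders that exist in the database but were never announced downstream.
missing = [o for o in db_rows if ("OrderCreated", o) not in broker_events]
```

After this run, order 2 is stored but its event was never published, and the service has no way to discover that on restart. With an outbox row written in the same transaction, the relay would find the pending event and publish it.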

System Design: Microservices Data Patterns — Saga, Outbox Pattern, Dual Write Problem, Transactional Messaging, CQRS

The Dual Write Problem

The Outbox Pattern

Saga Pattern Revisited

Compensating Transactions

CQRS with Event-Driven Microservices

Choosing the Right Pattern