Question 1

What is the difference between event sourcing and traditional CRUD?

Accepted Answer

Traditional CRUD stores only the current state of an entity. An order record has status=SHIPPED, amount=99.99. When the status changes, the row is updated in place, and the previous state is lost unless you add audit logging.

Event sourcing stores every state change as an immutable event in an append-only stream. The order stream contains: OrderCreated(amount=99.99), PaymentReceived, ItemsPacked, OrderShipped. The current state is derived by replaying the events in order. Because events are never updated or deleted, no history is lost and you can reconstruct the state at any point in time.

Advantages of event sourcing: a complete audit trail (every change is recorded), temporal queries (what was the state at 3 PM?), event replay (fix a bug in projection logic, then replay the events through the corrected code), and multiple read models (different consumers build different views from the same events).

Disadvantages: increased complexity (developers must think in events, not state), eventual consistency between the event store and the read models, event schema evolution (old events may not match the current schema, so versioning must be handled), and harder debugging (the current state is computed, not stored directly).

Use event sourcing when audit trails are required (finance, healthcare), when you need temporal queries, or when multiple services need to react to state changes.
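The replay idea can be sketched in a few lines of Python. This is a minimal illustration, not a production event store: the Event class, the apply and replay functions, the status names, and the timestamps (minutes since midnight, so 3 PM is 900) are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    type: str
    at: int      # hypothetical logical timestamp: minutes since midnight
    data: dict


def apply(state: dict, event: Event) -> dict:
    """Return the state after one event; the input state is not mutated."""
    if event.type == "OrderCreated":
        return {"status": "CREATED", "amount": event.data["amount"]}
    if event.type == "PaymentReceived":
        return {**state, "status": "PAID"}
    if event.type == "ItemsPacked":
        return {**state, "status": "PACKED"}
    if event.type == "OrderShipped":
        return {**state, "status": "SHIPPED"}
    return state  # unknown event types are ignored in this sketch


def replay(events, until=None):
    """Fold events into a state, optionally stopping after timestamp `until`."""
    state = {}
    for event in events:
        if until is not None and event.at > until:
            break
        state = apply(state, event)
    return state


stream = [
    Event("OrderCreated", 840, {"amount": 99.99}),
    Event("PaymentReceived", 845, {}),
    Event("ItemsPacked", 900, {}),
    Event("OrderShipped", 960, {}),
]

current = replay(stream)             # state after all events
at_3pm = replay(stream, until=900)   # temporal query: state as of 3 PM
```

Here current ends with status SHIPPED, while at_3pm stops at PACKED because the OrderShipped event happened later. Fixing a bug in apply and calling replay again is the "event replay" advantage in miniature.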

Question 2

How does the saga pattern handle distributed transactions across microservices?

Accepted Answer

A saga replaces a single distributed transaction with a sequence of local transactions coordinated by events or by a central orchestrator. Each local transaction updates one service and then publishes an event or notifies the orchestrator. There are two implementations.

Choreography-based saga: each service listens for events and reacts. The order service creates an order and emits OrderCreated. The inventory service hears OrderCreated, reserves stock, and emits StockReserved. The payment service hears StockReserved, charges the card, and emits PaymentProcessed. The shipping service hears PaymentProcessed and schedules delivery. There is no central coordinator; each service knows only its own part. This is simple for 3-4 steps but hard to follow for complex workflows.

Orchestration-based saga: a central OrderSaga orchestrator drives the process. It sends a ReserveStock command to inventory and waits for the response, sends a ChargeCard command to payment and waits for the response, then sends ScheduleDelivery to shipping. The orchestrator has a clear view of the entire workflow, making it easier to debug and extend.

If any step fails, the orchestrator triggers compensating transactions for the steps that have already completed, in reverse order: refund payment, release stock, cancel order. In a choreography-based saga the same compensations are triggered by failure events instead of by the orchestrator. Each compensating action must be idempotent because retries may occur.
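The orchestration variant and its reverse-order compensation can be sketched as follows. This is a simplified in-process illustration under stated assumptions: run_saga, StepFailed, and the step functions are hypothetical names, and real services would be called over the network with retries and persisted saga state.

```python
class StepFailed(Exception):
    """Raised by a step to signal that its local transaction failed."""


def run_saga(steps, log):
    """Run (name, action, compensation) steps in order.

    On failure, undo the completed steps in reverse order and return False.
    """
    completed = []
    for name, action, compensate in steps:
        try:
            action()
        except StepFailed:
            log.append(f"failed:{name}")
            for done_name, undo in reversed(completed):
                undo()
                log.append(f"undone:{done_name}")
            return False
        log.append(f"done:{name}")
        completed.append((name, compensate))
    return True


# Hypothetical in-memory stand-ins for the order and inventory services.
orders, reserved = set(), set()

def create_order(): orders.add("order-1")
def cancel_order(): orders.discard("order-1")      # discard is a no-op if repeated, so it is idempotent
def reserve_stock(): reserved.add("order-1")
def release_stock(): reserved.discard("order-1")   # idempotent for the same reason
def charge_card(): raise StepFailed("card declined")
def refund_payment(): pass
def schedule_delivery(): pass
def cancel_delivery(): pass

log = []
ok = run_saga(
    [
        ("create_order", create_order, cancel_order),
        ("reserve_stock", reserve_stock, release_stock),
        ("charge_card", charge_card, refund_payment),
        ("schedule_delivery", schedule_delivery, cancel_delivery),
    ],
    log,
)
```

Because charge_card fails, only the two completed steps are compensated: stock is released first, then the order is cancelled. The payment step itself never succeeded, so no refund is issued, and schedule_delivery is never reached.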

Question 3

How does Kafka guarantee message ordering and how do partitions affect it?

Accepted Answer

Kafka guarantees ordering within a partition but not across partitions. Each partition is an ordered, append-only log. Messages written to a partition are assigned sequential offsets (0, 1, 2, ...), and a consumer reading from that partition sees them in offset order, that is, in the order they were appended to the log.

Across partitions there is no ordering guarantee. If a topic has 4 partitions, two messages produced at times T1 and T2 may be consumed in either order if they were written to different partitions.

The partition key determines which partition a message goes to. Conceptually, partition = hash(key) % num_partitions (Kafka's default partitioner uses a murmur2 hash of the key bytes). All messages with the same key go to the same partition and are therefore ordered relative to each other, as long as the number of partitions does not change.

Design principle: choose a partition key that groups related messages. For an order system, use order_id as the key. All events for order-12345 (OrderCreated, PaymentReceived, ItemsShipped) go to the same partition and arrive in order. Different orders may go to different partitions, which is fine because they do not need relative ordering.

If you need global ordering across all messages, use a topic with a single partition. This limits the topic to one active consumer per consumer group, which caps throughput, but guarantees total order.
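The key-to-partition mapping can be illustrated without a broker. The sketch below is an assumption-laden simulation: partition_for is a hypothetical helper, and zlib.crc32 stands in for the hash only because it is in the Python standard library, so the partition numbers it yields will not match what a real Kafka producer would choose.

```python
import zlib


def partition_for(key: str, num_partitions: int) -> int:
    """Map a key to a partition using partition = hash(key) % num_partitions.

    crc32 is a stand-in hash for illustration, not Kafka's actual partitioner.
    """
    return zlib.crc32(key.encode("utf-8")) % num_partitions


NUM_PARTITIONS = 4

# Events in production order, keyed by order_id.
events = [
    ("order-12345", "OrderCreated"),
    ("order-67890", "OrderCreated"),
    ("order-12345", "PaymentReceived"),
    ("order-67890", "PaymentReceived"),
    ("order-12345", "ItemsShipped"),
]

# Each partition is modeled as an append-only list.
partitions = {p: [] for p in range(NUM_PARTITIONS)}
for key, name in events:
    partitions[partition_for(key, NUM_PARTITIONS)].append((key, name))

# All events for one order live in a single partition, in the order produced.
home = partition_for("order-12345", NUM_PARTITIONS)
order_12345_events = [name for key, name in partitions[home] if key == "order-12345"]
```

Reading partitions[home] from start to end yields the three order-12345 events in the order they were produced. Calling partition_for with a different num_partitions can return a different partition for the same key, which is why changing the partition count of a live topic can break per-key ordering.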

Question 4

How do you handle eventual consistency in event-driven systems?

Accepted Answer

Eventual consistency means that after a write, there is a delay before all read models reflect the change. The write side commits to the event store, but the read side (consuming events via Kafka) may lag by anything from around 100 ms to several seconds. Mitigation strategies:

(1) Read-your-own-writes: after a user creates an order, read from the write model (event store) for that specific user session, bypassing the eventually consistent read model. The user sees their order immediately. Other users see it after the read model catches up.

(2) Optimistic UI: the frontend assumes the write succeeded and shows the result immediately. If the backend later rejects it (detected asynchronously), show an error and revert. This is common in modern SPAs.

(3) Polling or WebSocket: after a write, the frontend polls the read model or listens via WebSocket for the projected state, displaying a loading indicator until the read model catches up.

(4) Causal consistency: include a version or sequence number in the response to the write. The client passes this version to the read endpoint, and the read endpoint waits until its projection has reached that version before responding. This guarantees the read reflects at least the client's last write, at the cost of slightly higher read latency.
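Strategy (4) can be sketched as a read model that blocks until it has caught up to the version the client asks for. This is a minimal single-process illustration: the Projection class, its apply and read methods, and the timeout behaviour are assumptions made for the example, and a real read endpoint would track the version per stream or per partition.

```python
import threading


class Projection:
    """Hypothetical read model that records the last event version it applied."""

    def __init__(self):
        self._cond = threading.Condition()
        self._version = 0
        self._orders = {}

    def apply(self, version, order_id, status):
        """Called by the event consumer when the projection processes an event."""
        with self._cond:
            self._orders[order_id] = status
            self._version = version
            self._cond.notify_all()  # wake any readers waiting for this version

    def read(self, order_id, min_version, timeout=1.0):
        """Block until the projection has reached min_version, then read."""
        with self._cond:
            caught_up = self._cond.wait_for(
                lambda: self._version >= min_version, timeout=timeout
            )
            if not caught_up:
                raise TimeoutError("read model has not reached the requested version")
            return self._orders.get(order_id)


projection = Projection()

# Simulate projection lag: the event for version 1 is applied 50 ms after the write.
threading.Timer(0.05, projection.apply, args=(1, "order-1", "CREATED")).start()

# The client received version 1 from the write and passes it to the read.
status = projection.read("order-1", min_version=1)
```

The read returns only after the delayed event has been applied, so the client never sees a state older than its own write. The timeout keeps a stalled projection from blocking readers indefinitely; what to return in that case (an error, or a stale result flagged as such) is a design choice left open here.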

System Design: Event-Driven Architecture — Kafka, Event Sourcing, CQRS, Saga Pattern, Eventual Consistency

Events vs Commands vs Queries

Apache Kafka as the Event Backbone

Event Sourcing

CQRS: Command Query Responsibility Segregation

The Saga Pattern for Distributed Transactions

Compensating Transactions

Eventual Consistency and Its Implications