What is Event Sourcing?
In a traditional system, the database stores the current state: “Account balance = $500.” In event sourcing, the database stores the history of events that led to the current state: “AccountCreated → Deposited($300) → Deposited($400) → Withdrawn($200).” The current state is derived by replaying the event log. This provides a complete audit trail, enables temporal queries (“what was the balance on March 1?”), and makes it easy to build new projections from historical data.
Core Concepts
- Event: an immutable record of something that happened. Never mutated or deleted. Events are the source of truth.
- Aggregate: a domain object (e.g., BankAccount, Order) whose state is derived by replaying its events.
- Event Store: append-only storage for events, ordered per aggregate by sequence number.
- Projection (Read Model): a materialized view derived from the event stream for efficient querying.
- Command: a request to change state (DepositFunds, PlaceOrder). Commands produce events.
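The concepts above can be made concrete with a minimal sketch. The `BankAccount` aggregate and `DepositedEvent` match the names used later in this article; the exact field layout and the `WithdrawnEvent` class are illustrative assumptions.

```python
from dataclasses import dataclass

@dataclass(frozen=True)  # frozen: events are immutable, never mutated
class DepositedEvent:
    account_id: str
    amount: int
    sequence_number: int

@dataclass(frozen=True)
class WithdrawnEvent:
    account_id: str
    amount: int
    sequence_number: int

class BankAccount:
    """Aggregate: state is derived purely by replaying events."""
    def __init__(self):
        self.balance = 0

    def apply(self, event):
        if isinstance(event, DepositedEvent):
            self.balance += event.amount
        elif isinstance(event, WithdrawnEvent):
            self.balance -= event.amount

# Replaying the history from the introduction: +300, +400, -200
account = BankAccount()
for e in [DepositedEvent("a1", 300, 1),
          DepositedEvent("a1", 400, 2),
          WithdrawnEvent("a1", 200, 3)]:
    account.apply(e)
print(account.balance)  # 500
```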
Data Model
Event(event_id UUID, aggregate_type VARCHAR, aggregate_id UUID,
event_type VARCHAR, payload JSONB, sequence_number BIGINT,
created_at TIMESTAMP, metadata JSONB)
-- UNIQUE constraint on (aggregate_id, sequence_number) prevents concurrent writes
Snapshot(aggregate_id UUID, aggregate_type, sequence_number,
state JSONB, created_at)
-- Optimization: avoid replaying all events from the beginning
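The effect of the UNIQUE constraint can be demonstrated with an in-memory SQLite database (a sketch: the article's schema assumes PostgreSQL types such as JSONB, simplified here to TEXT, and the table is trimmed to the columns that matter for the constraint).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE events (
        event_id        TEXT PRIMARY KEY,
        aggregate_id    TEXT NOT NULL,
        event_type      TEXT NOT NULL,
        payload         TEXT NOT NULL,   -- JSONB in PostgreSQL
        sequence_number INTEGER NOT NULL,
        UNIQUE (aggregate_id, sequence_number)
    )
""")
conn.execute("INSERT INTO events VALUES (?, ?, ?, ?, ?)",
             ("e1", "acct-1", "Deposited", '{"amount": 300}', 1))

# A second write at the same (aggregate_id, sequence_number) is rejected:
# this is what makes optimistic concurrency safe.
try:
    conn.execute("INSERT INTO events VALUES (?, ?, ?, ?, ?)",
                 ("e2", "acct-1", "Deposited", '{"amount": 400}', 1))
    conflict = False
except sqlite3.IntegrityError:
    conflict = True
print(conflict)  # True
```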
Command Handler Pattern
def handle_deposit(account_id, amount):
    # 1. Load current state by replaying events
    events = event_store.load(aggregate_id=account_id)
    account = BankAccount()
    for event in events:
        account.apply(event)  # mutate state based on event type

    # 2. Validate the command
    if amount <= 0:
        raise ValueError("Amount must be positive")

    # 3. Produce new event
    new_event = DepositedEvent(account_id=account_id, amount=amount,
                               sequence_number=len(events) + 1)

    # 4. Append to event store (optimistic concurrency via sequence_number)
    event_store.append(new_event, expected_version=len(events))

    # 5. Publish to event bus for projections
    event_bus.publish(new_event)
Optimistic Concurrency Control
Two concurrent commands on the same aggregate must not overwrite each other. Solution: the UNIQUE constraint on (aggregate_id, sequence_number) prevents two writes at the same sequence position. The command handler reads the current version (N), writes at version N+1. If another command already wrote N+1, the INSERT fails with a unique constraint violation — retry with the latest version. This is optimistic locking without explicit locks.
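The read-then-retry loop described above can be sketched with an in-memory stand-in for the event store. `InMemoryEventStore`, `ConcurrencyError`, and `with_retry` are illustrative assumptions, not part of the article's storage layer; the version check plays the same role as the database's unique constraint.

```python
class ConcurrencyError(Exception):
    pass

class InMemoryEventStore:
    """Illustrative store: append fails if expected_version is stale."""
    def __init__(self):
        self.streams = {}  # aggregate_id -> ordered list of events

    def load(self, aggregate_id):
        return list(self.streams.get(aggregate_id, []))

    def append(self, aggregate_id, event, expected_version):
        stream = self.streams.setdefault(aggregate_id, [])
        if len(stream) != expected_version:
            # Same role as the UNIQUE (aggregate_id, sequence_number) violation
            raise ConcurrencyError(f"expected {expected_version}, at {len(stream)}")
        stream.append(event)

def with_retry(store, aggregate_id, make_event, max_attempts=3):
    for _ in range(max_attempts):
        events = store.load(aggregate_id)  # read current version N
        try:
            store.append(aggregate_id, make_event(len(events) + 1),
                         expected_version=len(events))
            return
        except ConcurrencyError:
            continue  # reload and retry with the latest version
    raise ConcurrencyError("gave up after retries")

store = InMemoryEventStore()
with_retry(store, "acct-1", lambda seq: {"type": "Deposited", "seq": seq})
with_retry(store, "acct-1", lambda seq: {"type": "Deposited", "seq": seq})
print(len(store.load("acct-1")))  # 2
```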
Projections and Read Models
The event store is the source of truth, but it is poorly suited to ad-hoc queries. Projections consume the event stream and build read-optimized views:
class AccountBalanceProjection:
    def on_event(self, event):
        if event.type == 'Deposited':
            db.execute("UPDATE account_balances SET balance = balance + %s "
                       "WHERE account_id = %s", (event.amount, event.account_id))
        elif event.type == 'Withdrawn':
            db.execute("UPDATE account_balances SET balance = balance - %s "
                       "WHERE account_id = %s", (event.amount, event.account_id))
Projections can be rebuilt from scratch by replaying all events. This enables: fixing bugs in projection logic, creating new read models for new features, auditing — all without touching the source of truth.
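Rebuilding is just a full replay into a fresh read model. A minimal sketch, using plain dicts in place of a database table; the event shape is an assumption.

```python
def rebuild_balances(events):
    """Replay the full event stream into a fresh read model."""
    balances = {}
    for event in events:
        balances.setdefault(event["account_id"], 0)
        if event["type"] == "Deposited":
            balances[event["account_id"]] += event["amount"]
        elif event["type"] == "Withdrawn":
            balances[event["account_id"]] -= event["amount"]
    return balances

history = [
    {"type": "Deposited", "account_id": "a1", "amount": 300},
    {"type": "Deposited", "account_id": "a1", "amount": 400},
    {"type": "Withdrawn", "account_id": "a1", "amount": 200},
]
print(rebuild_balances(history))  # {'a1': 500}
```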
Snapshots
An aggregate with 10,000 events requires replaying all 10,000 to reconstruct its current state. Snapshots periodically save the materialized state: every 100 events, serialize the current aggregate state as a Snapshot. On load: find the latest snapshot (if any), load state from the snapshot, then replay only events after the snapshot's sequence_number. This reduces replay cost from O(all events) to O(events since last snapshot).
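The load path can be sketched as follows. In production you would query only events with sequence_number greater than the snapshot's; here the filter is done in Python for brevity, and the tuple-shaped snapshot is an illustrative assumption.

```python
def load_with_snapshot(snapshot, events):
    """snapshot: (sequence_number, balance) or None; events: full ordered log."""
    if snapshot is not None:
        last_seq, balance = snapshot
    else:
        last_seq, balance = 0, 0
    # Replay only events after the snapshot's sequence_number
    for event in events:
        if event["seq"] <= last_seq:
            continue
        if event["type"] == "Deposited":
            balance += event["amount"]
        else:
            balance -= event["amount"]
    return balance

events = [{"seq": i, "type": "Deposited", "amount": 10} for i in range(1, 201)]
snapshot = (100, 1000)  # state after the first 100 events: 100 * 10
print(load_with_snapshot(snapshot, events))  # 2000 (replays only events 101-200)
print(load_with_snapshot(None, events))      # 2000 (full replay, same result)
```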
When to Use Event Sourcing
- Audit trail is a hard requirement (financial, compliance, legal)
- Need temporal queries (“what was the state at time T?”)
- Need to rebuild projections when requirements change
- Complex domain logic where events carry business meaning
- Avoid when: simple CRUD, small team, read-heavy (projections add complexity)
Frequently Asked Questions

What is the difference between event sourcing and traditional CRUD?

Traditional CRUD stores only the current state: you see the account balance is $500 but have no record of how it got there. Event sourcing stores the complete history of state changes as an immutable sequence of events: AccountCreated, Deposited($300), Deposited($400), Withdrawn($200). The current state is derived by replaying events. Benefits: (1) a complete audit trail: every change is recorded with who, what, and when, as required for financial, compliance, and legal systems; (2) temporal queries: reconstruct state at any point in time by replaying events up to that timestamp; (3) rebuildable projections: create new read models from historical events without losing data; (4) debugging: replay events to reproduce bugs. Trade-offs: more complex implementation, eventual consistency between the event store and projections, and event schemas that must be versioned carefully (you cannot change past events).

What is CQRS and how does it relate to event sourcing?

CQRS (Command Query Responsibility Segregation) separates write operations (commands) from read operations (queries). The write side handles commands that produce events. The read side consists of projections: materialized views optimized for specific queries. CQRS is often used with event sourcing, but they are independent: you can have CQRS without event sourcing (a separate write DB and read DB), and event sourcing without strict CQRS. In a full event sourcing + CQRS system: commands → aggregate validation → events written to the event store → event bus publishes events → projections consume events and update read models → queries read from projections. Read models can use any storage: PostgreSQL for relational queries, Elasticsearch for full-text search, Redis for cached lookups. Each projection can be rebuilt independently by replaying the event stream.

How do you handle event schema evolution in event sourcing?

Events are immutable and stored forever, so when the business logic changes you must handle events written under the old schema. Strategies: (1) Event upcasting: when loading old events, transform them to the new schema before passing them to the aggregate. Upcast functions are versioned: if event.version == 1, add the new required field with a default value. (2) New event versions: instead of modifying OrderPlaced, add OrderPlacedV2 with the new fields; old aggregates understand V1, new aggregates understand both. (3) Weak schema: use a JSONB payload with optional fields; new code checks whether a field exists before using it. (4) Snapshot migration: when a snapshot is taken, serialize using the current schema; on load, use the snapshot (current schema) plus only the events since it. Recommendation: include an event version field from day one, write explicit upcasters for each version transition, and test that events from two years ago still replay correctly.

How do snapshots work and when should you use them?

An aggregate reconstructed by replaying all its events is correct but slow for long-lived aggregates with many events: an order processed 10,000 times would require 10,000 event replays. Snapshots solve this. Every N events (e.g., every 50), serialize the aggregate's current state to a Snapshot record: (aggregate_id, sequence_number, state_json, created_at). On load, query for the most recent snapshot, load the state from it, then replay only events with sequence_number > snapshot.sequence_number. Worst case, you replay N-1 events between snapshots instead of the full history. Use snapshots when aggregates have more than about 100 events, replay time is measurable (more than a few ms), or aggregates are accessed frequently. Avoid them when aggregates are short-lived (orders completed in minutes), event counts are small, or snapshot storage adds complexity you don't need yet.

How does optimistic concurrency control work in an event store?

Multiple concurrent commands targeting the same aggregate could create conflicting events: for example, two transfers from the same account running simultaneously, both seeing balance = $100, both succeeding, producing an overdraft. With optimistic concurrency, appending a new event specifies the expected version (the last known sequence_number), and the event store inserts at expected_version + 1. If another command already wrote that sequence_number, the unique constraint on (aggregate_id, sequence_number) makes the INSERT fail. The application catches the conflict, reloads the aggregate from current state (replaying the new events), re-runs the command validation, and retries. This is optimistic locking without explicit row locks: low cost under low contention, correct under concurrent writes. If the retry also fails (a hot aggregate under high contention), back off and retry with exponential jitter; for very hot aggregates, consider partitioning or actor-based serialization.
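The event upcasting strategy described for schema evolution can be sketched as a chain of per-version transforms. The version numbers and the added currency field are illustrative assumptions, not from this article.

```python
CURRENT_VERSION = 2

def upcast_v1_to_v2(event):
    # Hypothetical migration: v1 Deposited events had no currency field,
    # so default it when loading old events.
    event = dict(event, version=2)
    event.setdefault("currency", "USD")
    return event

UPCASTERS = {1: upcast_v1_to_v2}  # version -> function producing version + 1

def upcast(event):
    """Apply upcasters in sequence until the event reaches the current version."""
    while event["version"] < CURRENT_VERSION:
        event = UPCASTERS[event["version"]](event)
    return event

old_event = {"type": "Deposited", "amount": 300, "version": 1}
new_event = upcast(old_event)
print(new_event["version"], new_event["currency"])  # 2 USD
```

Each upcaster handles exactly one version transition, so adding a v3 later means writing one new function rather than touching existing migrations.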