Why Multi-Region?
Single-region deployments have two problems: (1) Latency — a user in Tokyo making a request to a US-East server experiences 150ms round-trip time before any processing begins. (2) Availability — if the US-East region goes down, all users worldwide lose service. Multi-region architecture addresses both: serve users from the closest region (< 20ms RTT) and survive complete region failure. The challenge: keeping data consistent across geographically distributed databases separated by hundreds of milliseconds of network latency.
The CAP Theorem Applied
CAP Theorem: in the presence of a network partition, a distributed system can provide either Consistency (all nodes see the same data) or Availability (all requests receive a response), but not both simultaneously. For multi-region systems: the network between regions is effectively always “partitioned” — a message from us-east to eu-west takes 80ms one way. Choosing consistency: writes block until all regions acknowledge → every write has 80ms+ additional latency. Choosing availability: serve reads locally with potentially stale data → users in different regions may see different states. Most global systems choose availability + eventual consistency for reads, with strong consistency only where business-critical (payment balances, inventory counts).
Active-Passive (Primary-Secondary) Replication
One region is the primary (handles all writes); other regions are replicas (handle reads, replicate from primary). Write flow: write goes to primary → primary commits → asynchronously replicates to replicas → replicas apply the change (eventually consistent). Read flow from a replica: may return stale data if replication lag is non-zero. Use cases: read-heavy systems where slight staleness is acceptable (product catalog, blog posts, user profiles). Failover: if the primary fails, promote a replica to primary. Replica may be slightly behind — possible data loss (RPO > 0). Synchronous replication (write ack after all replicas confirm) eliminates data loss but adds cross-region write latency.
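The write/read flow above can be sketched as a toy primary-replica pair. This is a minimal illustration, assuming a simple pull-based change log; the `Primary`/`Replica` names are illustrative, not any real database API:

```python
class Primary:
    """Accepts all writes and appends them to an ordered change log."""
    def __init__(self):
        self.data = {}
        self.log = []  # shipped to replicas asynchronously

    def write(self, key, value):
        self.data[key] = value
        self.log.append((key, value))


class Replica:
    """Replays the primary's log; reads may lag behind (eventual consistency)."""
    def __init__(self, primary):
        self.primary = primary
        self.data = {}
        self.applied = 0  # position in the primary's log

    def replicate(self, batch=1):
        # Real systems stream changes continuously (e.g. WAL shipping);
        # pulling a bounded batch here simulates replication lag.
        for key, value in self.primary.log[self.applied:self.applied + batch]:
            self.data[key] = value
            self.applied += 1

    def read(self, key):
        return self.data.get(key)  # possibly stale if replication lags


primary = Primary()
replica = Replica(primary)
primary.write("price", 100)
primary.write("price", 120)
replica.replicate(batch=1)
print(replica.read("price"))  # 100 (stale: one log entry behind)
replica.replicate(batch=1)
print(replica.read("price"))  # 120 (caught up)
```

The gap between the primary's log position and the replica's `applied` counter is exactly the replication lag that determines RPO on failover.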
Active-Active (Multi-Primary) Replication
Multiple regions accept writes simultaneously. Writes from different regions are replicated to all other regions and merged. Challenge: conflict resolution when the same record is updated in two regions simultaneously. Conflict resolution strategies:
- Last-write-wins (LWW): accept the write with the latest timestamp. Simple but can lose data — a write with a slightly earlier timestamp is silently discarded. Suitable for user profile updates where overwriting is acceptable.
- Vector clocks: each write carries a version vector (one counter per region). Concurrent writes (neither dominates the other’s vector) are flagged as conflicts for application-level resolution. Used by Amazon’s original Dynamo system (the design behind DynamoDB); DynamoDB Global Tables itself resolves conflicts with last-write-wins.
- CRDTs: design the data structure to support automatic conflict-free merging — counters, sets, registers with specific semantics.
- Application-level: detect conflicts and present to the user (Dropbox conflict copies, Google Docs version history).
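As a concrete example of the CRDT approach, here is a minimal G-Counter sketch: one counter slot per region, with merge defined as the per-slot maximum. Merge is commutative, associative, and idempotent, so replicas converge regardless of delivery order. The region names are illustrative:

```python
class GCounter:
    """Grow-only counter CRDT: one slot per region, merge = per-slot max."""
    def __init__(self, regions):
        self.counts = {r: 0 for r in regions}

    def increment(self, region, n=1):
        # Each region only ever increments its own slot.
        self.counts[region] += n

    def value(self):
        return sum(self.counts.values())

    def merge(self, other):
        # Taking the max per slot is commutative, associative, and
        # idempotent, so any replication order converges to the same state.
        for region, count in other.counts.items():
            self.counts[region] = max(self.counts.get(region, 0), count)


REGIONS = ["us-east", "eu-west"]
a = GCounter(REGIONS)  # replica in us-east
b = GCounter(REGIONS)  # replica in eu-west
a.increment("us-east", 3)
b.increment("eu-west", 2)
a.merge(b)
b.merge(a)
print(a.value(), b.value())  # 5 5 (both replicas converge)
```

Note the restriction that makes this conflict-free: the counter can only grow. Supporting decrement requires a PN-Counter (two G-Counters, one for increments and one for decrements).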
Global Load Balancing
Route users to the nearest healthy region via: (1) GeoDNS — DNS server returns different IP addresses based on the client’s geographic origin (resolved via IP geolocation). Client queries DNS → receives IP of the nearest region → connects to that region. Propagation delay: DNS TTL must be short (e.g., 60 seconds) to quickly redirect traffic during failover, but a short TTL increases DNS query volume. (2) Anycast — same IP address is advertised from multiple regions. BGP routing automatically directs packets to the nearest advertisement. Used by Cloudflare for its network (single IP, routed to nearest PoP). (3) Application-layer redirect — the application itself detects the user’s location and redirects to the nearest region’s endpoint. Adds one redirect hop but is more flexible.
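The GeoDNS routing decision can be sketched as a nearest-region picker. This toy version assumes we already have the client's coordinates; real GeoDNS resolvers derive location from an IP geolocation database, and the region coordinates below are approximate and purely illustrative:

```python
import math

# Approximate (lat, lon) per region; illustrative values, not exact
# datacenter locations.
REGIONS = {
    "us-east-1": (38.9, -77.0),       # N. Virginia area
    "eu-west-1": (53.3, -6.2),        # Dublin area
    "ap-northeast-1": (35.7, 139.7),  # Tokyo area
}


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2)
         * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371 * math.asin(math.sqrt(h))


def nearest_region(client, healthy=None):
    """Pick the closest healthy region, like a GeoDNS answer would."""
    candidates = healthy if healthy is not None else set(REGIONS)
    return min(candidates, key=lambda r: haversine_km(client, REGIONS[r]))


tokyo_user = (35.6, 139.6)
print(nearest_region(tokyo_user))  # ap-northeast-1
# Failover: with Tokyo marked unhealthy, traffic shifts to the next-closest
# healthy region.
print(nearest_region(tokyo_user, healthy={"us-east-1", "eu-west-1"}))
```

The `healthy` parameter is where the short-TTL trade-off shows up: the resolver can only shift traffic this way once clients' cached DNS answers expire.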
Database Global Patterns
Google Spanner: globally distributed SQL database with external consistency (stronger than serializable). Uses TrueTime (GPS + atomic clocks) to assign globally ordered timestamps with bounded uncertainty. Achieves strong consistency across regions at the cost of write latency (commits must wait out the TrueTime uncertainty interval, typically a few milliseconds, plus cross-region Paxos). Used internally by F1, the database behind Google’s advertising systems. Available as Cloud Spanner on GCP.
CockroachDB: open-source distributed SQL database with serializable isolation across regions. Uses Raft consensus per range (contiguous key spans, 512 MiB by default). Write latency in multi-region mode: Raft quorum requires acknowledgment from a majority of replicas (e.g., 2 of 3 regions) → latency ≈ RTT to the nearest non-local region. Configurable home regions per table or row — data locality reduces latency for region-specific data.
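The quorum-latency point can be made concrete with a back-of-the-envelope calculation. The RTT numbers below are illustrative, not measured:

```python
# Illustrative round-trip times (ms) between three hypothetical regions.
RTT_MS = {
    ("us-east", "us-west"): 60,
    ("us-east", "eu-west"): 80,
    ("us-west", "eu-west"): 140,
}


def rtt(a, b):
    """Symmetric RTT lookup."""
    if a == b:
        return 0
    return RTT_MS.get((a, b), RTT_MS.get((b, a)))


def quorum_write_latency(leader, regions, quorum):
    # The leader's own vote is free; the write commits once the
    # (quorum - 1)-th fastest follower acknowledges.
    follower_rtts = sorted(rtt(leader, r) for r in regions if r != leader)
    return follower_rtts[quorum - 2]


regions = ["us-east", "us-west", "eu-west"]
# 3 replicas, quorum of 2: latency is the RTT to the nearest peer region.
print(quorum_write_latency("us-east", regions, quorum=2))  # 60
# Requiring all 3 replicas: latency is the RTT to the farthest region.
print(quorum_write_latency("us-east", regions, quorum=3))  # 80
```

This is why placing two of three replicas on the same continent (quorum reachable locally) is a common latency optimization, at the cost of weaker survivability if that continent is lost.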
DynamoDB Global Tables: fully managed multi-active replication across multiple AWS regions. Last-write-wins conflict resolution. Typically sub-second replication between regions. Cross-region reads are eventually consistent; strongly consistent reads only reflect writes made in the local region.
Data Residency and Compliance
GDPR restricts transfers of EU users’ personal data outside the EU/EEA unless safeguards apply (an adequacy decision or standard contractual clauses); many organizations therefore keep EU personal data in EU regions. Multi-region architecture must respect data residency: route EU users to EU regions, store PII in EU-only databases, apply geo-fencing to replication (EU data replicated only within the EU). Operational challenge: support queries that join EU and US data (analytics, fraud detection) without violating residency — anonymize/aggregate before cross-region transfer.
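Geo-fenced replication can be expressed as a simple policy check, assuming a region-to-jurisdiction mapping maintained in configuration. The mapping below is illustrative:

```python
# Illustrative region -> jurisdiction mapping; in practice this lives in
# replication topology configuration.
JURISDICTION = {
    "eu-west-1": "EU",
    "eu-central-1": "EU",
    "us-east-1": "US",
    "us-west-2": "US",
}


def replication_targets(source_region, all_regions):
    """Geo-fencing policy: data may only replicate within its jurisdiction."""
    fence = JURISDICTION[source_region]
    return [r for r in all_regions
            if r != source_region and JURISDICTION[r] == fence]


regions = list(JURISDICTION)
print(replication_targets("eu-west-1", regions))  # ['eu-central-1']
print(replication_targets("us-east-1", regions))  # ['us-west-2']
```

In a real system this check would gate the replication stream itself (and ideally be enforced again at the receiving side), so a misconfigured topology fails closed rather than leaking PII across the fence.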
Interview Checklist
- Latency requirement → multi-region; single region for simpler deployments
- Read-heavy + tolerate staleness → active-passive (primary-secondary)
- High availability for writes globally → active-active (multi-primary)
- Conflict resolution: LWW (simple, can lose writes), vector clocks (precise conflict detection), CRDTs (merge-friendly)
- Global routing: GeoDNS (flexibility) or Anycast (low overhead)
- Strong consistency globally: Spanner / CockroachDB (at cost of write latency)
- Compliance: data residency constraints drive region assignments
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "What does the CAP theorem mean in practice for multi-region systems?",
"acceptedAnswer": {
"@type": "Answer",
"text": "The CAP theorem states that a distributed system can only guarantee two of three properties: Consistency (all nodes see the same data simultaneously), Availability (every request receives a response), and Partition Tolerance (the system continues operating despite network partitions). In practice, network partitions are unavoidable in distributed systems — the network between two datacenters in different countries will occasionally be slow or unreliable. So the real choice is between Consistency and Availability during a partition. CP systems (choose consistency over availability): if a network partition prevents a region from confirming the latest data, it stops serving requests rather than risk returning stale data. Example: a bank's balance query waits until it can confirm the current balance, even if this means a timeout for the user. AP systems (choose availability over consistency): continue serving requests even during partitions, accepting that different regions may temporarily see different data. Example: a product catalog shows slightly stale prices during a partition — better than a 503 error. Most web-scale systems choose AP for user-facing features (show stale data rather than errors) and CP for financial operations (balance, inventory count). The PACELC extension adds: even in the absence of partitions, there is a trade-off between Latency (respond fast) and Consistency (wait for replication)."
}
},
{
"@type": "Question",
"name": "How do active-active multi-region systems handle write conflicts?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Write conflicts occur when the same data is modified in two regions before the changes replicate to each other. Resolution strategies: (1) Last-Write-Wins (LWW): each write includes a timestamp. When conflicting writes are detected, the one with the later timestamp wins. Problem: clock skew between regions means 'later' is unreliable — NTP typically synchronizes clocks only to within a few milliseconds, so a write in us-east 500 microseconds after a write in eu-west might carry an earlier clock time. Solution: use logical clocks (Lamport timestamps or hybrid logical clocks) instead of wall clocks. (2) Vector clocks: each write carries a vector of counters (one per region). Two writes with non-comparable vectors (neither dominates) are flagged as concurrent conflicts — the application decides how to merge. Amazon's original Dynamo design used this approach. (3) CRDTs (Conflict-free Replicated Data Types): design the data structure so any two concurrent operations can be automatically merged without conflict. Examples: G-Counter (increment only, merge = max per region), LWW-Element-Set (set with timestamps), OR-Set (supports add and remove). CRDTs eliminate conflicts entirely for their supported operations. (4) Operational Transformation (the Google Docs approach): serialize all operations through a single coordinator, transforming concurrent operations so they commute. Requires a central authority — limits availability during a partition."
}
},
{
"@type": "Question",
"name": "How does Google Cloud Spanner achieve global strong consistency?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Google Spanner provides external consistency — a stronger guarantee than serializable isolation — across globally distributed databases. The key innovation is TrueTime: Google's globally synchronized clock system using GPS receivers and atomic clocks in every datacenter. TrueTime provides timestamps with a known error bound: TT.now() returns (earliest, latest) with the guarantee that the true current time is within this interval. Typically the interval is 1-7ms wide. Spanner uses this to assign globally consistent commit timestamps: when a transaction commits, it picks a timestamp T such that T is after all previously committed transactions. Spanner waits for the TrueTime uncertainty interval to pass before returning to the client ('commit wait'), ensuring that any subsequent transaction will have a later TrueTime reading. This wait (1-7ms) is the price of external consistency. Why it works: because TrueTime gives real-time guarantees (not just logical ordering), Spanner can assign globally monotonic timestamps without central coordination. A read at timestamp T is guaranteed to see all transactions committed before T, regardless of which replica serves the read. Practical implications: Spanner multi-region writes take ~50-100ms (cross-region Paxos round trip) versus ~10ms for single-region. Use Spanner when data correctness across regions is non-negotiable (financial systems, global inventory); use eventually consistent systems when performance matters more than perfect consistency."
}
}
]
}