Question 1

What is the difference between linearizability and serializability?

Accepted Answer

Linearizability is a single-object (for example, single-key) consistency guarantee: each operation on an object appears to take effect atomically at some point between its invocation and its response, so operations respect real-time order. Serializability is a multi-object (transaction) isolation guarantee: transactions appear to execute in some serial order, but that order need not respect real time. Strict serializability combines both: transactions are serializable AND the serial order respects real-time order. Linearizability says nothing about multi-key atomicity; serializability says nothing about real-time ordering.
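The real-time requirement can be made concrete with a brute-force check over a tiny history. This is a minimal sketch, not a production checker: the tuple layout `(invoke_time, complete_time, op, value)` and the function name `is_linearizable` are assumptions made for illustration, and the search is exponential in the number of operations.

```python
from itertools import permutations


def is_linearizable(history, initial=0):
    """Check a single-register history by trying every possible ordering.

    history: list of (invoke_time, complete_time, op, value) tuples, where
    op is "write" or "read" and value is the value written or returned.
    """
    for order in permutations(history):
        # Real-time constraint: an operation that completed before another
        # was invoked may not be placed after it.
        violates_real_time = any(
            order[j][1] < order[i][0]
            for i in range(len(order))
            for j in range(i + 1, len(order))
        )
        if violates_real_time:
            continue
        # Register semantics: every read returns the latest written value.
        value = initial
        consistent = True
        for _, _, op, v in order:
            if op == "write":
                value = v
            elif v != value:
                consistent = False
                break
        if consistent:
            return True
    return False


# The read overlaps the write, so it may legally observe the new value.
overlapping = [(0, 10, "write", 1), (2, 4, "read", 1)]
# The read starts after the write completed but still returns the old value.
stale = [(0, 2, "write", 1), (3, 5, "read", 0)]

overlapping_ok = is_linearizable(overlapping)
stale_ok = is_linearizable(stale)
```

The second history fails only because of the real-time constraint. If the two operations were single-operation transactions, a merely serializable system could order the read before the write and accept it.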

Question 2

What is the fencing token pattern and when is it needed?

Accepted Answer

A fencing token is a monotonically increasing number issued with each distributed lock grant. The lock holder includes the token in every write to the protected resource. The storage server guarding the resource tracks the highest token it has seen and rejects any write carrying a lower one. This prevents a paused or delayed lock holder (e.g., after a GC pause or network partition) from issuing writes after it has lost the lock to a new holder. Without fencing tokens, process pauses can let two holders write concurrently even when a distributed lock is in use.
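The check on the storage side can be sketched as follows. This is a toy in-memory model under stated assumptions: the class name `FencedStore`, its `write` method, and the token values 33 and 34 are hypothetical and chosen only for illustration.

```python
class FencedStore:
    """Toy storage server that rejects writes carrying a stale fencing token."""

    def __init__(self):
        self.highest_token = 0  # highest fencing token seen so far
        self.data = {}

    def write(self, token, key, value):
        # Reject any write whose token is lower than the highest one seen.
        if token < self.highest_token:
            return False
        self.highest_token = token
        self.data[key] = value
        return True


store = FencedStore()
# Client A holds the lock with token 33 and writes successfully.
first_write = store.write(33, "file", "from client A")
# A pauses, its lease expires, and client B is granted the lock with token 34.
second_write = store.write(34, "file", "from client B")
# A resumes, unaware that it lost the lock, and retries with its old token.
late_write = store.write(33, "file", "late write from client A")
```

Here the late write is refused and the value written by client B survives, which is the behavior the pattern is meant to guarantee.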

Question 3

When should CAS be used instead of a mutex?

Accepted Answer

CAS is preferred over a mutex when contention is low and the update is short. CAS avoids the overhead of lock acquisition and is immune to lock-holder failures: no thread ever holds a lock, so a stalled or crashed thread cannot block the others, and a failed CAS simply retries. Under high contention, repeated CAS retries waste CPU and individual threads can starve, so a mutex is often more efficient. CAS is ideal for optimistic locking in databases, atomic counter increments, and state machine transitions where the new state depends on the current state.
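The read, compute, CAS, retry loop looks roughly like this. Python has no native CAS instruction, so the `AtomicCell` class below is an assumption that emulates one; its internal lock only stands in for hardware atomicity and is not part of the pattern itself.

```python
import threading


class AtomicCell:
    """Emulates a single CAS-capable word (hypothetical helper)."""

    def __init__(self, value):
        self._value = value
        self._guard = threading.Lock()  # stands in for hardware atomicity

    def load(self):
        with self._guard:
            return self._value

    def compare_and_swap(self, expected, new):
        # Install `new` only if the cell still holds `expected`.
        with self._guard:
            if self._value != expected:
                return False
            self._value = new
            return True


def increment(cell):
    """Optimistic increment; returns how many retries were needed."""
    retries = 0
    while True:
        current = cell.load()
        if cell.compare_and_swap(current, current + 1):
            return retries
        retries += 1  # another thread won the race; re-read and try again


counter = AtomicCell(0)


def worker():
    for _ in range(1000):
        increment(counter)


threads = [threading.Thread(target=worker) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
final_value = counter.load()
```

No increment is lost because a CAS that races with another update fails and is retried against the fresh value. The retry count returned by `increment` is a rough proxy for contention.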

Question 4

What is the latency and availability cost of linearizability vs eventual consistency?

Accepted Answer

Linearizability requires every read and write to coordinate with a leader or a quorum, adding at least one network round-trip per operation. This means write latency includes replication time (typically 1-10ms within a datacenter, 30-100ms cross-region). In leader-based systems, linearizable operations are unavailable during leader election. Eventual consistency serves reads from any local replica with no coordination, giving sub-millisecond read latency. The tradeoff is stale reads and the need for conflict resolution. Many production systems use linearizability for coordination primitives (locks, sequences) and eventual consistency for data reads.
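The replication cost can be estimated with simple arithmetic: a leader commits once a majority of the cluster, counting itself, has the entry, so commit latency is set by the slowest follower it still has to wait for. The sketch below assumes a leader-based majority quorum, and the round-trip times are made-up illustrative numbers.

```python
def quorum_commit_latency_ms(follower_rtts_ms):
    """Latency until a majority (leader included) has acknowledged a write."""
    cluster_size = len(follower_rtts_ms) + 1
    majority = cluster_size // 2 + 1
    acks_needed = majority - 1  # the leader counts as one vote
    if acks_needed == 0:
        return 0.0
    # The write commits when the acks_needed-th fastest follower replies.
    return sorted(follower_rtts_ms)[acks_needed - 1]


# Three nodes in one datacenter: one follower ack is enough.
same_dc = quorum_commit_latency_ms([0.5, 0.7])
# Five nodes, only one follower nearby: the second ack must cross regions.
cross_region = quorum_commit_latency_ms([1.0, 70.0, 85.0, 90.0])
```

With these assumed numbers the single-datacenter write waits about 0.5 ms for replication, while the five-node cluster waits about 70 ms because the majority cannot be formed from nearby replicas alone.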

Question 5

What is linearizability and how does it differ from serializability?

Accepted Answer

Linearizability is a single-object consistency model guaranteeing that every operation appears to take effect atomically at some point between its invocation and completion, making the system behave like a single copy of the data. Serializability is a multi-object transaction model that guarantees transactions appear to execute in some serial order but does not require that order to match the real-time order in which the transactions ran. Linearizability imposes that real-time constraint on individual operations; applying both guarantees to transactions gives strict serializability.

Question 6

How do fencing tokens enforce linearizable writes?

Accepted Answer

A fencing token is a monotonically increasing number (e.g., the Raft term or ZooKeeper epoch) issued to a leader when it acquires a lock; the leader includes this token on every write to the storage node, which rejects any write carrying a token lower than the highest it has seen. This prevents a slow or network-partitioned former leader from completing stale writes after a new leader has been elected, ensuring linearizable semantics even under split-brain scenarios.

Question 7

How does Raft achieve linearizable reads?

Accepted Answer

Raft achieves linearizable reads by having the leader record its current commit index, send a heartbeat, and wait for acknowledgment from a quorum of followers before serving the read. The acknowledgments confirm that it is still the authoritative leader and has not been superseded; the read is then served once the state machine has applied every entry up to the recorded index. Alternatively, the read can be appended to the Raft log as a no-op read entry and executed only after it is committed, ensuring it observes all previously committed writes.
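The heartbeat-based approach can be sketched with a toy in-process model. Nothing here is a real Raft library: the `Follower` and `Leader` classes, their fields, and the method names are all assumptions for illustration, and networking, elections, and log replication are omitted.

```python
class Follower:
    def __init__(self, current_term, reachable=True):
        self.current_term = current_term
        self.reachable = reachable

    def ack_heartbeat(self, leader_term):
        # Acknowledge only a leader whose term is not behind this follower's.
        return self.reachable and leader_term >= self.current_term


class Leader:
    def __init__(self, term, followers):
        self.term = term
        self.followers = followers
        self.log = []          # committed entries as (key, value) pairs
        self.commit_index = 0  # number of committed entries
        self.last_applied = 0
        self.state = {}

    def has_quorum(self):
        acks = 1 + sum(f.ack_heartbeat(self.term) for f in self.followers)
        return acks > (1 + len(self.followers)) // 2

    def linearizable_read(self, key):
        read_index = self.commit_index  # remember what must be visible
        if not self.has_quorum():
            raise RuntimeError("leadership not confirmed; refusing read")
        # Apply committed entries up to read_index before answering.
        while self.last_applied < read_index:
            k, v = self.log[self.last_applied]
            self.state[k] = v
            self.last_applied += 1
        return self.state.get(key)


followers = [Follower(current_term=1), Follower(current_term=1)]
leader = Leader(term=1, followers=followers)
leader.log.append(("x", 42))
leader.commit_index = 1
fresh_value = leader.linearizable_read("x")

# A new leader is elected in term 2; the old leader has not noticed yet.
for f in followers:
    f.current_term = 2
try:
    leader.linearizable_read("x")
    stale_read_blocked = False
except RuntimeError:
    stale_read_blocked = True
```

The deposed leader fails the quorum check and refuses the read instead of answering from possibly stale state, which is the property the heartbeat round exists to provide.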

Question 8

What is the performance cost of linearizability?

Accepted Answer

Linearizability generally requires at least one round trip to a quorum of replicas for every read and write, adding network latency proportional to the distance between nodes and reducing throughput compared to eventually consistent or read-local alternatives. Under the CAP theorem, a linearizable system gives up availability during network partitions: replicas that cannot reach a quorum must reject or stall requests rather than serve potentially stale data.
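The availability cost during a partition follows directly from the majority rule. The sketch below assumes a majority-quorum system; the function names are hypothetical.

```python
def majority(cluster_size):
    """Smallest number of nodes that forms a majority."""
    return cluster_size // 2 + 1


def can_serve_linearizable(side_size, cluster_size):
    """A partition side may serve requests only if it holds a majority."""
    return side_size >= majority(cluster_size)


# A five-node cluster splits into groups of three and two.
majority_side = can_serve_linearizable(3, 5)
minority_side = can_serve_linearizable(2, 5)
# A four-node cluster split evenly leaves neither side with a majority.
even_split = can_serve_linearizable(2, 4)
```

In the five-node split only the three-node side keeps serving, and in the even four-node split neither side can, so the whole system stalls until the partition heals.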

Linearizability Low-Level Design: Single-Object Semantics, Fencing Tokens, and Atomic Register Implementation

What Is Linearizability?

Single-Object Atomic Register Model

Implementation: Single Leader with Synchronous Replication

Fencing Tokens

Compare-and-Swap (CAS)

ZooKeeper's Linearizability Model

PACELC: The Latency Cost of Linearizability

SQL Schema

Python Implementation Sketch