What Is a Distributed Lock?
A distributed lock ensures that only one node in a cluster executes a critical section at a time. Unlike a mutex within a single process, a distributed lock must work across multiple machines that can fail independently. The core challenge: what happens if the lock holder dies without releasing the lock? The system must detect failure and release the lock automatically.
Use cases: singleton job scheduling (only one node runs the cron job), leader election (one node is primary for writes), distributed rate limiting across API gateway nodes, and preventing double payments in financial systems.
Key Properties of a Distributed Lock
- Mutual exclusion: at most one holder at any time
- Deadlock-free: a lock must be releasable even if the holder crashes (use TTL)
- Fault-tolerant: lock service stays available if some nodes fail
- No false releases: a lock holder must not accidentally release another holder’s lock (use fencing tokens)
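The four properties above can be made concrete with a toy single-process model — a sketch of the state machine a lock service implements, not a real distributed lock. All names here (`ToyLock`, `acquire`, `release`) are hypothetical; time is passed in explicitly so the TTL behavior is easy to follow:

```python
import uuid
from typing import Optional, Tuple

class ToyLock:
    """Single-process model of the lock-service state machine."""

    def __init__(self) -> None:
        self._holder: Optional[Tuple[str, float]] = None  # (token, expiry)

    def acquire(self, ttl: float, now: float) -> Optional[str]:
        # Mutual exclusion: refuse while an unexpired holder exists.
        if self._holder and now < self._holder[1]:
            return None
        # Deadlock-free: an expired holder is silently displaced (TTL).
        token = str(uuid.uuid4())
        self._holder = (token, now + ttl)
        return token

    def release(self, token: str) -> bool:
        # No false releases: only the matching token may release.
        if self._holder and self._holder[0] == token:
            self._holder = None
            return True
        return False
```

A holder that crashes simply never calls `release`; once `now` passes its expiry, the next `acquire` succeeds — which is exactly the deadlock-free property the TTL buys.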
Redis-based Lock
Use Redis SET NX EX (set if not exists, with expiry):
import uuid
import redis

r = redis.Redis()
lock_key = "locks:report-job"
token = str(uuid.uuid4())
acquired = r.set(lock_key, token, nx=True, ex=30)  # 30s TTL
# Release atomically via Lua script executed on the Redis server
lua = "if redis.call('get', KEYS[1]) == ARGV[1] then return redis.call('del', KEYS[1]) else return 0 end"
r.eval(lua, 1, lock_key, token)
The UUID token prevents a crashed holder’s lock from being released by a new holder. The Lua script makes check-and-delete atomic.
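The check-and-delete logic the Lua script performs can be modeled in plain Python to show what it guarantees — a sketch with a hypothetical dict standing in for the Redis key space (TTL omitted for brevity):

```python
import uuid

store = {}  # stands in for the Redis key space

def acquire(key: str):
    """SET key token NX: succeed only if the key is absent."""
    token = str(uuid.uuid4())
    if key in store:
        return None
    store[key] = token
    return token

def release(key: str, token: str) -> int:
    """Mirrors the Lua script: GET, compare, DEL as one step.
    Redis runs the Lua script atomically server-side; a separate
    GET followed by DEL would let another client's lock slip in
    between the two calls and be deleted by mistake."""
    if store.get(key) == token:
        del store[key]
        return 1
    return 0
```

A second acquirer is refused while the key exists, and a stale token (from a previous, expired holder) cannot release the current holder's lock.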
Redlock Algorithm (Multi-Node Redis)
Redlock uses N=5 independent Redis nodes. To acquire:
- Record current time T1
- Try to acquire the lock on all N nodes sequentially, using the same key and token, with a short per-node timeout (e.g., 50ms)
- If majority (N/2+1=3) grant the lock and elapsed time < lock TTL, lock is acquired
- Effective TTL = initial TTL minus elapsed time minus clock drift margin
- If no majority: release on all nodes and retry with backoff
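The acquisition rule above reduces to a small decision function. This is a sketch of the quorum-and-validity arithmetic only — node grants and timings are passed in as plain values rather than issued as real SET NX PX calls, and all names are hypothetical:

```python
def redlock_decision(grants, elapsed_ms, ttl_ms, drift_ms=10):
    """Apply the Redlock acquisition rule to one attempt.

    grants     -- one boolean per Redis node: did it grant the lock?
    elapsed_ms -- time spent acquiring across all nodes (T2 - T1)
    ttl_ms     -- the TTL requested on each node
    drift_ms   -- clock-drift safety margin

    Returns (acquired, effective_validity_ms).  On failure the
    caller should release on all nodes and retry with backoff.
    """
    n = len(grants)
    quorum = n // 2 + 1                      # e.g., 3 of 5
    validity = ttl_ms - elapsed_ms - drift_ms
    if sum(grants) >= quorum and validity > 0:
        return True, validity
    return False, 0
```

Note that a majority alone is not enough: if acquisition took so long that the effective validity is gone, the lock may already have expired on the first nodes that granted it.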
Controversy: Redlock has safety issues with clock drift and GC pauses. Use Redlock for efficiency (preventing duplicate work), not for correctness (preventing data corruption). For correctness, use ZooKeeper or etcd.
ZooKeeper Ephemeral Sequential Nodes
- Each client creates an ephemeral sequential node: /locks/mylock-0000000001, etc.
- Client checks if its node has the lowest sequence number. If yes: lock acquired.
- If not: watch the node with the next-lowest number. When deleted, re-check.
- Ephemeral nodes deleted automatically when the client session expires. No TTL management needed.
Fair queue: clients acquire in order of arrival. Watching the previous node prevents the thundering herd problem.
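The "lowest sequence number wins, otherwise watch your predecessor" step can be sketched as a pure function over the child names — a model of one round of the recipe, with hypothetical names, not a kazoo/ZooKeeper client call:

```python
def zk_lock_step(my_node: str, children: list):
    """One round of the ZooKeeper lock recipe.

    my_node  -- this client's node name, e.g. 'mylock-0000000003'
    children -- all child names under /locks (unordered)

    Returns ('acquired', None) if my_node has the lowest sequence
    number, else ('watch', predecessor) naming the node to watch.
    """
    def seq(name: str) -> int:
        return int(name.rsplit('-', 1)[1])

    ordered = sorted(children, key=seq)
    if ordered[0] == my_node:
        return 'acquired', None
    # Watch only the node immediately ahead of us; when it is
    # deleted, re-run this step.  If every waiter instead watched
    # the current lock holder, each release would wake all clients
    # at once (thundering herd).
    me = ordered.index(my_node)
    return 'watch', ordered[me - 1]
```

Because each waiter watches exactly one predecessor, a release notifies exactly one client, and leadership passes in arrival order.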
etcd Leader Election
- All nodes attempt to create /election/leader with a lease (TTL), using a transaction that succeeds only if the key does not yet exist. Exactly one succeeds; Raft consensus replicates the result to a quorum.
- The winner is leader. It must renew the lease before expiry (keepalive).
- Followers watch /election/leader. On deletion, all race to acquire. One wins.
- Fencing token: etcd returns a monotonically increasing revision number. Protected resources reject writes with a lower revision than already seen.
Fencing Tokens
Problem: leader holds lock, gets GC-paused for 40s, lock expires, new leader elected, old leader resumes and thinks it still holds the lock. Without fencing, two leaders operate simultaneously.
Solution: every lock grant includes a fencing token (epoch number). Protected resources reject writes from any holder with a lower token. Old leader writes are rejected automatically.
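The enforcement lives in the protected resource, not the lock service. A sketch, assuming tokens are handed out by some strictly increasing source (etcd revision, ZooKeeper zxid, or a counter bumped on every grant); `FencedStore` is a hypothetical name:

```python
class FencedStore:
    """Protected resource that enforces fencing tokens."""

    def __init__(self) -> None:
        self.highest_token = 0
        self.data = {}

    def write(self, token: int, key: str, value) -> bool:
        # Reject any writer whose token is lower than one already
        # seen: a holder that was paused past its TTL carries a
        # stale (lower) token than the current holder's.
        if token < self.highest_token:
            return False
        self.highest_token = token
        self.data[key] = value
        return True
```

In the GC-pause scenario: the old leader held token 33, the new leader writes with token 34, and when the old leader resumes its token-33 write is rejected — no split-brain write reaches the store.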
Interview Framework
- What is the lock protecting? Correctness (etcd/ZooKeeper) vs. efficiency (Redis)?
- What is the lock TTL? How long can the operation take?
- What happens if the holder fails? TTL-based release vs. ephemeral session expiry.
- Fencing tokens: how does the protected resource reject stale lock holders?
- What if the lock service fails? Redlock vs. single Redis.
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "What is the difference between Redis locks and ZooKeeper/etcd for distributed locking?",
"acceptedAnswer": { "@type": "Answer", "text": "Redis distributed locks (SET NX EX) are simple and fast but provide weaker guarantees: single-node Redis is a single point of failure; clock drift and GC pauses can cause a lock holder to believe it holds the lock after TTL expiry. Redlock (5-node Redis quorum) addresses availability but remains controversial for correctness. ZooKeeper and etcd use consensus protocols (ZAB and Raft) with stronger guarantees: lock state is replicated to a quorum before acknowledging; ZooKeeper ephemeral nodes auto-delete on session expiry without TTL drift; etcd provides monotonically increasing revision numbers as fencing tokens. Rule: use Redis for efficiency (preventing duplicate work where occasional failures are tolerable); use ZooKeeper/etcd for correctness (preventing data corruption — financial systems, database leader election)." }
},
{
"@type": "Question",
"name": "What is a fencing token and why is it critical for distributed locks?",
"acceptedAnswer": { "@type": "Answer", "text": "A fencing token is a monotonically increasing number returned with each lock grant. When the lock holder writes to a protected resource, the resource checks that the token exceeds the last seen value and rejects stale requests. The problem it solves: a lock holder gets GC-paused, its TTL expires, a new holder gets a higher token and proceeds, and then the original resumes believing it still holds the lock. Without fencing, two processes write simultaneously (split-brain). With fencing, the old holder is rejected because its token is lower. ZooKeeper zxid and etcd revision numbers serve as fencing tokens. Redis does not natively provide one — you must atomically increment an external counter and include it in the lock value." }
},
{
"@type": "Question",
"name": "How does leader election work using ZooKeeper ephemeral sequential nodes?",
"acceptedAnswer": { "@type": "Answer", "text": "Each candidate creates an ephemeral sequential znode under a common path, e.g., /election/candidate-0000000001. ZooKeeper guarantees globally unique monotonically increasing sequence numbers. Each client reads all children, checks if its node has the smallest sequence number — if yes, it is leader. If not, it watches the next-lowest node and waits. When the watched node is deleted (holder crashed, ephemeral node auto-deleted on session expiry), the watcher is notified and re-checks. This creates a fair queue — leadership passes in arrival order. Key properties: (1) ephemeral nodes auto-delete on crash — no stale locks, (2) watching the previous node prevents thundering herd, (3) sequence number serves as a natural fencing token." }
}
]
}