Query Cache Low-Level Design
A query cache stores database query results in fast storage (typically Redis) so repeated identical queries can be served without hitting the database. Done correctly it reduces database load significantly for read-heavy workloads. Done incorrectly it serves stale data or makes invalidation impossible to reason about.
Cache Key Generation
The cache key must uniquely identify a query and its parameters. The generation process:
- Normalize the query string: lowercase, collapse whitespace, sort parameter names alphabetically
- Serialize bound parameter values deterministically
- If results are user-scoped or tenant-scoped: include `user_id` or `tenant_id` in the key material
- SHA-256 hash the combined string → use as the Redis key
Normalization is critical: two queries that differ only in whitespace or parameter ordering should produce the same cache key and share a cache entry.
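The steps above can be sketched in Python. The function name, `qc:` prefix, and `tenant_id` handling are illustrative assumptions; `sort_keys=True` provides the deterministic parameter serialization:

```python
import hashlib
import json
import re

def make_cache_key(sql, params, tenant_id=None):
    """Build a deterministic cache key for a query plus its bound parameters."""
    # Normalize: lowercase, collapse runs of whitespace to a single space
    normalized = re.sub(r"\s+", " ", sql.strip()).lower()
    # Serialize parameters deterministically: sorted keys, compact separators
    serialized = json.dumps(params, sort_keys=True, separators=(",", ":"))
    material = normalized + "|" + serialized
    # Scope the key when results are per-tenant or per-user
    if tenant_id is not None:
        material += "|tenant:" + str(tenant_id)
    return "qc:" + hashlib.sha256(material.encode("utf-8")).hexdigest()
```

Two formattings of the same query hash to the same key, so they share one cache entry.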
Cache Storage
Store results in Redis. The value is the serialized query result (list of rows). Use `SET key value EX ttl_seconds`. Choose serialization format based on access patterns: MessagePack or Protocol Buffers are more compact than JSON and faster to deserialize. For large result sets, compress before storing — gzip typically reduces JSON result size by 60–80%.
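A sketch of the encode/decode pair, using JSON + gzip because both are in the standard library (MessagePack or Protocol Buffers would slot in the same way with a different serializer; the function names are illustrative):

```python
import gzip
import json

def encode_result(rows):
    # Serialize rows (list of dicts), then compress before storing
    return gzip.compress(json.dumps(rows).encode("utf-8"))

def decode_result(blob):
    # Inverse of encode_result: decompress, then deserialize
    return json.loads(gzip.decompress(blob).decode("utf-8"))
```

The compressed blob is what goes into `SET key value EX ttl_seconds`.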
TTL Selection
TTL should match data freshness requirements, not be a uniform global setting:
- Reference/config data (countries, categories, settings): 1 hour
- Product catalog: 5 minutes
- Real-time inventory or pricing: 30 seconds
- Per-second metrics or live counts: no cache
Per-query TTL configuration is stored alongside the query definition or set by the calling code via a cache hint.
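One way to express that configuration is a lookup table keyed by query class; the class names and the fallback value here are illustrative assumptions:

```python
# Per-class TTLs in seconds; None means "do not cache".
TTL_BY_QUERY_CLASS = {
    "reference": 3600,     # countries, categories, settings
    "catalog": 300,        # product catalog
    "inventory": 30,       # real-time inventory or pricing
    "live_metrics": None,  # per-second metrics or live counts
}

def ttl_for(query_class):
    # Unknown classes fall back to a conservative short TTL
    return TTL_BY_QUERY_CLASS.get(query_class, 30)
```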
Cache Lookup Flow
- Compute cache key from query + parameters
- Redis `GET key`
- Hit: decompress and deserialize → return result; record cache hit metric
- Miss: execute query against database → serialize and compress result → Redis `SET key value EX ttl NX` (`NX` prevents overwriting a value another thread just set) → return result; record cache miss metric
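The lookup flow above, sketched with a dict-backed stand-in for the two Redis commands used (`FakeRedis`, `cached_query`, and the metrics dict are illustrative, not a fixed API; compression is elided for brevity):

```python
import json

class FakeRedis:
    """Dict-backed stand-in for the Redis commands used below;
    redis-py's get/set(ex=, nx=) have the same shape."""
    def __init__(self):
        self.kv = {}
    def get(self, key):
        return self.kv.get(key)
    def set(self, key, value, ex=None, nx=False):
        if nx and key in self.kv:
            return None          # NX: refuse to overwrite an existing value
        self.kv[key] = value
        return True

def cached_query(r, key, run_query, ttl, metrics):
    blob = r.get(key)
    if blob is not None:
        metrics["hit"] += 1       # record cache hit metric
        return json.loads(blob)
    rows = run_query()            # miss: go to the database
    r.set(key, json.dumps(rows), ex=ttl, nx=True)  # NX keeps a racing writer's value
    metrics["miss"] += 1
    return rows
```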
Event-Based Invalidation
TTL-based expiry alone causes stale reads for the TTL duration. Event-based invalidation removes cache entries immediately on write:
- Maintain a mapping from table name to the set of cache keys that include data from that table
- On any write to table `orders`: delete all cache keys in the `table:orders` set
- Implement with Redis sets: `SADD table:orders {key1}` on cache write; `SMEMBERS table:orders` + `DEL` on table write
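A dict/set sketch of the table-to-keys reverse index (class and method names are illustrative; in production the `keys_by_table` sets live in Redis via `SADD`/`SMEMBERS`/`DEL`):

```python
from collections import defaultdict

class TableInvalidator:
    """Reverse index from table name to the cache keys derived from it."""
    def __init__(self, cache):
        self.cache = cache                      # cache key -> cached blob
        self.keys_by_table = defaultdict(set)   # "orders" -> {cache keys}

    def register(self, cache_key, tables):
        for t in tables:                        # SADD table:{t} {cache_key}
            self.keys_by_table[t].add(cache_key)

    def on_table_write(self, table):
        # SMEMBERS table:{table}, DEL each member, then drop the set
        for k in self.keys_by_table.pop(table, set()):
            self.cache.pop(k, None)
```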
Tag-Based Invalidation
For entity-level granularity, tag cache entries with entity identifiers. When order #123 is updated, invalidate only entries tagged order:123, not all order queries:
- On cache write: `SADD tag:order:123 {cache_key}`
- On update of order 123: `SMEMBERS tag:order:123` → `DEL` each key → `DEL tag:order:123`
More targeted than table-level invalidation; preserves cache entries for unrelated orders.
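A minimal sketch of the tag index, with plain dicts and sets standing in for the Redis sets (function names are illustrative):

```python
from collections import defaultdict

def tag_entry(tags, tag, cache_key):
    tags[tag].add(cache_key)          # SADD tag:{tag} {cache_key}

def invalidate_tag(cache, tags, tag):
    # SMEMBERS tag:{tag} → DEL each cache key → DEL the tag set itself
    for k in tags.pop(tag, set()):
        cache.pop(k, None)
```

Invalidating `order:123` removes only the entries tagged with it; queries over other orders stay cached.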
Cache Bypass Cases
Never cache:
- Queries that contain non-deterministic functions: `NOW()`, `CURRENT_TIMESTAMP`, `RAND()`, `RANDOM()` — different result on each call
- Queries that are user-specific where the `user_id` was not included in the cache key
- Queries in a write transaction — the transaction may not be committed yet
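These bypass rules can be expressed as a predicate checked before the cache lookup. The regex is an illustrative approximation, not a full SQL parser, and the parameter names are assumptions:

```python
import re

# Functions whose output changes on every call
NON_DETERMINISTIC = re.compile(
    r"\b(now|random|rand)\s*\(|\bcurrent_timestamp\b", re.IGNORECASE
)

def is_cacheable(sql, in_write_transaction=False,
                 user_scoped=False, key_has_user_id=True):
    if in_write_transaction:          # uncommitted data must not be cached
        return False
    if user_scoped and not key_has_user_id:
        return False                  # would leak one user's rows to another
    return NON_DETERMINISTIC.search(sql) is None
```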
Negative Caching
An empty result set is still a valid result. Cache [] (empty result) with a short TTL (30 seconds). Without negative caching, repeated requests for a non-existent entity hit the database on every request. A short TTL ensures freshness if the entity is later created.
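A tiny helper capturing this rule; the 30-second constant mirrors the suggestion above and the names are illustrative:

```python
NEGATIVE_TTL = 30  # seconds: short, so a later-created entity appears quickly

def ttl_for_result(rows, base_ttl):
    # Empty results are cached too, but only briefly
    return NEGATIVE_TTL if not rows else base_ttl
```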
Thundering Herd Prevention
When a popular cache key expires, many concurrent requests will all miss and query the database simultaneously. Prevent this with a distributed lock:
- On cache miss: attempt to acquire a Redis lock for the cache key (`SET lock:{key} 1 EX 5 NX`)
- If lock acquired: execute query, populate cache, release lock
- If lock not acquired: wait briefly and retry the cache GET — the first holder will have populated it
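The lock flow above, sketched with a dict-backed client exposing redis-py-style `get`/`set(ex=, nx=)`/`delete` (all names here are illustrative; a real deployment would pass a redis-py client instead):

```python
import json
import time

class LockingCache:
    """Dict-backed stand-in mirroring the redis-py calls used below."""
    def __init__(self):
        self.kv = {}
    def get(self, k):
        return self.kv.get(k)
    def set(self, k, v, ex=None, nx=False):
        if nx and k in self.kv:
            return None
        self.kv[k] = v
        return True
    def delete(self, k):
        self.kv.pop(k, None)

def get_with_lock(r, key, run_query, ttl, max_retries=50):
    for _ in range(max_retries):
        blob = r.get(key)
        if blob is not None:
            return json.loads(blob)                   # hit, first try or after a wait
        if r.set("lock:" + key, "1", ex=5, nx=True):  # SET lock:{key} 1 EX 5 NX
            try:
                rows = run_query()                    # only the lock holder queries
                r.set(key, json.dumps(rows), ex=ttl)
                return rows
            finally:
                r.delete("lock:" + key)               # release lock
        time.sleep(0.01)  # another worker is computing; retry the GET shortly
    return run_query()    # safety valve: query directly rather than spin forever
```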
Cache Warming and Metrics
- Cache warming: on deploy, pre-populate cache with results for the most frequently requested queries (identified from query analytics). Avoids cold-start latency where the first minutes after deploy have low hit rates.
- Metrics: hit rate, miss rate, eviction rate (Redis memory pressure), cache latency by key pattern, invalidation count per table/tag — alert if hit rate drops significantly (may indicate an invalidation bug or query pattern change)
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "How do you generate a stable cache key for a database query result?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Normalize the query before hashing to avoid key misses from semantically identical but textually different queries. Normalization steps: parse the SQL into an AST, canonicalize whitespace and keyword casing, sort commutative clauses (e.g., IN list elements, AND predicates with no order dependency), then serialize back to a canonical string. Append the parameter values in order to the canonical string, then SHA-256 hash the result. Prefix the hash with the database name and schema version to namespace keys across environments and invalidate all keys on DDL changes by bumping the schema version. For ORM-generated queries, implement normalization at the ORM layer rather than the cache layer to intercept queries before they reach the wire."
      }
    },
    {
      "@type": "Question",
      "name": "Compare tag-based invalidation and TTL-based expiry for query cache invalidation — when do you use each?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "TTL-based expiry is simple: every cached result expires after N seconds regardless of whether the underlying data changed. Use it when the result is naturally time-bounded (e.g., a leaderboard refreshed every 60 seconds) or when approximate freshness is acceptable and the invalidation cost outweighs consistency benefit. Tag-based invalidation associates each cached result with one or more entity tags (e.g., 'user:42', 'product:99'). When a write touches a tagged entity, invalidate all cache entries sharing that tag. This gives strong consistency but requires tracking which queries depend on which entities — either via a reverse index in Redis (tag → set of cache keys) or by having the application declare dependencies at cache-store time. Use tag-based invalidation for OLTP workloads where data changes frequently and stale reads cause visible correctness bugs. Combine both: short TTL as a safety net, tag invalidation for immediate consistency on writes."
      }
    },
    {
      "@type": "Question",
      "name": "How do you handle stale reads in a query cache during high write throughput without making every read synchronously wait for invalidation?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Use a stale-while-revalidate pattern: serve the cached (potentially stale) result immediately, then asynchronously dispatch a background job to re-execute the query and update the cache. The background job uses a per-key lock (Redis SETNX) to ensure only one revalidation runs at a time per key, preventing a thundering herd of revalidation goroutines. Set two TTLs per entry: a 'fresh' TTL (e.g., 5 seconds) and a 'stale' TTL (e.g., 60 seconds). Within the fresh window, serve from cache directly. Between fresh and stale TTL expiry, serve stale while revalidating in background. After the stale TTL, block and recompute synchronously. For write-heavy workloads, implement write-through caching: on each write, update both the database and the cache in the same transaction (using a two-phase approach or best-effort with short TTL fallback) so reads always find a warm cache entry."
      }
    },
    {
      "@type": "Question",
      "name": "How do you prevent a cache stampede when a popular query's TTL expires simultaneously for many callers?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Use three complementary techniques: (1) Probabilistic early expiration (XFetch) — each cache reader, when checking a key, computes a 'virtual expiry' time that randomly triggers recomputation before the actual TTL expires, with probability inversely proportional to remaining TTL. This spreads recomputation over a window rather than letting all readers hit the cache miss at exactly the same time. (2) Request coalescing — when a cache miss is detected, one designated leader (selected via Redis SETNX on a 'recomputing' key) executes the database query, while other concurrent readers either wait on a channel/condition variable or serve the stale result if available. (3) Jittered TTLs — when populating the cache, add a random jitter (e.g., ±10% of the base TTL) to the expiry so that entries loaded together do not all expire at the same wall-clock second."
      }
    }
  ]
}