Question 1

What is the difference between cache-aside, write-through, and write-behind caching?

Accepted Answer

Cache-aside (lazy loading): application checks cache first; on miss, loads from DB and populates cache. Application manages cache explicitly. Pros: cache only what is read, resilient to cache failures. Cons: cold start — first request always misses; cache can be stale if DB is written directly. Write-through: every write goes to cache AND DB synchronously. Cache is always warm and consistent. Cons: write latency doubles; cache fills with data that may never be read. Write-behind (write-back): writes go to cache immediately, then async to DB. Lowest write latency. Cons: data loss if cache fails before flushing. Choose cache-aside for read-heavy workloads (most web apps), write-through for small frequently-read/written datasets (user settings), write-behind for high-write workloads that tolerate small loss windows (like counts, view increments).

Question 2

How do you prevent a cache stampede (thundering herd)?

Accepted Answer

A cache stampede occurs when a popular cache key expires and thousands of concurrent requests all miss simultaneously, all querying the DB at once. Three solutions: (1) Probabilistic early expiration: instead of expiring at exactly T seconds, each request checks if NOW() > expiry - random(0, delta) and proactively refreshes. Only one request refreshes at a time on average. (2) Mutex lock: first cache miss acquires a distributed lock (SETNX in Redis), fetches from DB, populates cache, releases lock. Subsequent misses wait for the lock, then find the cache populated. (3) Stale-while-revalidate: serve the stale value immediately and refresh asynchronously in the background. The stale value is better than N concurrent DB queries.

Question 3

How do you handle cache invalidation for complex objects?

Accepted Answer

Three strategies: (1) TTL-only: set a short TTL (30-60 seconds) and accept brief staleness. Simple, no invalidation logic. Works for read-heavy data that can tolerate lag. (2) Event-driven invalidation: when the source DB row changes, publish an event (via CDC or outbox) that deletes the cache key. Immediate consistency, complex to implement. (3) Versioned cache keys: embed a version number in the cache key (user:{id}:v{version}). On write, increment the version in a separate counter key. Reads always fetch the current version key. Old version keys expire via TTL. Avoids explicit deletes. The version counter itself can be in Redis: INCR user:{id}:version. Use versioned keys for objects with complex invalidation dependencies.

Question 4

What data is worth caching and what should never be cached?

Accepted Answer

Worth caching: user profiles (read 100x more than written), product catalog (changes rarely, queried constantly), session data (read on every request), computed aggregates (follower counts, rating averages). Determine cache value as: (cache_hit_rate * read_qps * db_read_time) - (write_qps * invalidation_overhead). Cache is worth it when reads far outnumber writes and DB read latency is significant. Never cache: user authentication tokens (security), payment records (must be authoritative), real-time data that must be current (live auction prices), anything with regulatory requirements for consistency. Also avoid caching large objects (>1MB) — serialization overhead and Redis memory waste.

Question 5

How do you size your Redis cache and handle eviction?

Accepted Answer

Size the cache to hold the working set: the data accessed by 80% of requests. Use Redis INFO keyspace to see hit/miss ratios. A hit rate below 90% means the cache is too small or the keys are too varied to benefit from caching. Common eviction policies: allkeys-lru (evict least recently used across all keys — use this for general caches), volatile-lru (evict LRU among keys with TTL — use when some keys must never be evicted), allkeys-lfu (evict least frequently used — better for Zipfian distributions with a small hot set). Set maxmemory and maxmemory-policy in redis.conf. Monitor used_memory_rss and evicted_keys metrics; alert if eviction rate is high (means the cache is undersized).

Caching Strategy Low-Level Design

Caching Strategy — Low-Level Design

Cache-Aside (Lazy Loading)

Write-Through Cache

Write-Behind (Write-Back) Cache

Cache Invalidation Strategies

Cache Stampede (Thundering Herd)

Key Interview Points