Read-heavy systems serve many more reads than writes — often at ratios of 100:1 or higher. Optimizing for reads requires layered caching, read replicas, denormalization, and CQRS (Command Query Responsibility Segregation) patterns. The goal is to serve reads from memory or local cache while keeping writes consistent, without over-engineering for write throughput that isn’t needed.
Layered Caching
Build a cache hierarchy: L1 in-process memory cache (sub-microsecond, bounded by process memory), L2 shared Redis cache (sub-millisecond, shared across service instances), L3 CDN edge cache (for public content, serves from edge PoP near the user). Cache hits at L1 avoid network calls entirely. L2 hits avoid database queries. L3 hits avoid origin server load. A 95% overall cache hit rate with a 5% miss rate hitting the database reduces database load by 20x. Profile which objects are most frequently requested to prioritize what to cache.
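The L1/L2 lookup order can be sketched as a read-through wrapper. This is a minimal in-memory sketch: the L2 store is a plain dict standing in for a shared Redis instance, and the `loader` callback stands in for the database query on a full miss.

```python
import time

class LayeredCache:
    """Read-through cache: L1 in-process dict, L2 shared store.
    L2 here is a plain dict standing in for Redis; in production it
    would be a Redis client with the same get/set shape."""

    def __init__(self, l2_store, l1_ttl=60):
        self.l1 = {}            # key -> (value, expires_at)
        self.l2 = l2_store      # shared across service instances
        self.l1_ttl = l1_ttl

    def get(self, key, loader):
        entry = self.l1.get(key)
        if entry and entry[1] > time.monotonic():
            return entry[0]                 # L1 hit: no network call
        value = self.l2.get(key)            # L2 hit: no database query
        if value is None:
            value = loader(key)             # full miss: hit the database
            self.l2[key] = value
        self.l1[key] = (value, time.monotonic() + self.l1_ttl)
        return value
```

On the second request for the same key the loader is never invoked: the value is served from L1 without touching L2 or the database.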
Read Replicas
Distribute read queries across multiple database replicas. The primary handles writes; read replicas replicate from the primary asynchronously. Route read-only queries (SELECT) to replicas; write queries (INSERT, UPDATE, DELETE) to the primary. Replica lag is typically milliseconds — acceptable for most read use cases. Not acceptable for: reading immediately after a write (read-your-own-write consistency), financial balances, inventory counts. For these, read from the primary or use synchronous replication.
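The routing rule above can be expressed as a small dispatcher. In this sketch the "connections" are just string labels (real code would hold connection pools), and `require_fresh` is an illustrative flag for the read-your-own-write cases that must go to the primary.

```python
import itertools

class QueryRouter:
    """Route read-only statements to replicas (round-robin), everything
    else to the primary. Connections are stand-in labels here."""

    def __init__(self, primary, replicas):
        self.primary = primary
        self._replicas = itertools.cycle(replicas)

    def route(self, sql, require_fresh=False):
        is_read = sql.lstrip().upper().startswith("SELECT")
        if is_read and not require_fresh:
            return next(self._replicas)   # tolerate replica lag
        return self.primary               # writes, and reads that need
                                          # read-your-own-write consistency
```

A balance check or a read immediately after a write would pass `require_fresh=True` and land on the primary despite being a SELECT.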
Denormalization
Normalized databases join multiple tables to assemble a response. Joins are expensive at scale. Denormalization pre-computes and materializes joined data: a user_profile_view table stores a user's data plus their account tier, preference settings, and follower count in one row. Reads hit one table instead of joining five. Writes must update both the normalized source and the denormalized view — an eventual consistency tradeoff. For read-heavy systems where latency matters more than write complexity, denormalization is often the right choice.
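The write-path tradeoff can be shown with the follower-count example. A hypothetical sketch with dicts standing in for tables: the normalized source of truth and the denormalized `user_profile_view` row are both updated on write, so reads hit one row.

```python
def update_follower_count(db, user_id, delta):
    """Write path for a denormalized follower count. Update the
    normalized source first, then the denormalized view row; in a real
    system the view update may lag (eventual consistency)."""
    db["followers"][user_id] = db["followers"].get(user_id, 0) + delta
    profile = db["user_profile_view"].setdefault(
        user_id, {"follower_count": 0})
    profile["follower_count"] = db["followers"][user_id]
```

The read path is then a single-row fetch from `user_profile_view` instead of a join across follower, account, and preference tables.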
CQRS Pattern
CQRS (Command Query Responsibility Segregation) uses separate data models for reads and writes. Write model: normalized, ACID, optimized for consistency. Read model: denormalized, eventually consistent, optimized for query patterns. Write a new order → update the orders table (write model) and publish an event → the read model consumer updates the order_summary table (optimized for order listing queries). This allows the read model to use a different database technology (Elasticsearch for search, Redis for leaderboards, Cassandra for time-series) that matches the query pattern.
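The order example can be sketched end to end. This is an in-memory stand-in: the write model is a list, the "event bus" is a direct method call, and the read model is a per-user summary dict; a real system would use a database and a message broker between the two.

```python
class OrderService:
    """Minimal CQRS sketch: normalized write model plus a denormalized
    read model kept up to date by an event consumer."""

    def __init__(self):
        self.orders = []          # write model: one row per order
        self.order_summary = {}   # read model: per-user count and total

    def place_order(self, user_id, amount):
        order = {"user_id": user_id, "amount": amount}
        self.orders.append(order)          # command side: ACID write
        self._on_order_placed(order)       # publish event (direct call here)

    def _on_order_placed(self, event):
        # Read-model consumer: updates the summary optimized for listings.
        s = self.order_summary.setdefault(
            event["user_id"], {"count": 0, "total": 0})
        s["count"] += 1
        s["total"] += event["amount"]
```

Because the consumer is decoupled behind an event, the read model could just as well live in Elasticsearch or Redis without touching the write path.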
Query Result Caching
Cache the results of expensive database queries. Key the cache entry on the query parameters: cache_key = "top_posts:" + user_id + ":" + category. Set TTL based on acceptable staleness: 60 seconds for a news feed, 300 seconds for a user profile, 3600 seconds for a product catalog. On cache miss, execute the query and populate the cache. Use the cache-aside pattern (application manages the cache) rather than write-through (database writes trigger cache updates) to avoid coupling the write path to the cache.
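The cache-aside flow can be captured in one helper. A minimal sketch: the cache is a dict of key to (result, expires_at) standing in for Redis, and `run_query` is whatever executes the underlying SQL.

```python
import time

def cached_query(cache, key, ttl, run_query):
    """Cache-aside: check the cache, run the query on miss, populate
    with a TTL chosen from the acceptable staleness for this key."""
    entry = cache.get(key)
    if entry and entry[1] > time.monotonic():
        return entry[0]                       # hit: skip the database
    result = run_query()                      # miss: execute the query
    cache[key] = (result, time.monotonic() + ttl)
    return result
```

Usage mirrors the key scheme above: `cached_query(cache, "top_posts:" + user_id + ":" + category, 60, fetch_top_posts)`.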
Materialized Views
A materialized view is a pre-computed query result stored as a physical table. Unlike a regular view (a named query that executes on access), a materialized view stores data and requires periodic refresh. Use for expensive aggregation queries that run frequently: total order count per user, daily revenue by product category, top 100 most-viewed articles. Refresh strategies: full refresh (recompute the entire result), incremental refresh (apply only changes since last refresh). Incremental refresh is faster but more complex to implement.
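The incremental strategy can be illustrated with the "total order count per user" view. A hypothetical in-memory sketch: `orders` is an append-only list standing in for the source table, and a high-water mark in `state` tracks which rows the last refresh already applied.

```python
def incremental_refresh(view, orders, state):
    """Incremental refresh: apply only rows appended since the last
    refresh, instead of recomputing the whole aggregate."""
    start = state.get("last_row", 0)
    for row in orders[start:]:              # only the new rows
        uid = row["user_id"]
        view[uid] = view.get(uid, 0) + 1
    state["last_row"] = len(orders)         # advance the high-water mark
    return view
```

A full refresh would instead recompute the counts over the entire `orders` table each time; the incremental version does work proportional to the changes, at the cost of tracking the mark (and handling updates/deletes, omitted here).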
Connection Pool Tuning for Read-Heavy Load
Read-heavy workloads issue many short queries concurrently. By Little's Law, the number of in-flight queries — and therefore the pool size needed — is arrival rate × average query duration. For 1,000 reads per second at 5 ms average duration: pool size = 1000 × 0.005 = 5; a small pool is sufficient if queries are fast. Over-provisioning the pool wastes memory and database connection slots. Monitor pool wait time (time requests spend waiting for an available connection) and pool utilization. Increase pool size only when wait time is high, not preemptively.
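The sizing arithmetic is small enough to encode directly. The `headroom` multiplier is an assumption added here (not from the text) to absorb bursts; at its default of 1.0 the function reproduces the worked example.

```python
import math

def pool_size(reads_per_sec, avg_query_seconds, headroom=1.0):
    """Little's Law: in-flight queries = arrival rate x duration.
    headroom is an assumed burst multiplier; round up to whole
    connections."""
    return math.ceil(reads_per_sec * avg_query_seconds * headroom)
```

For example, `pool_size(1000, 0.005)` yields 5 connections, and doubling the headroom to 2.0 yields 10; either way the pool stays far below the 1,000 requests per second it serves.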