Load Shedding: Low-Level Design
6 min read Load shedding is the deliberate rejection of requests when a system is overloaded — sacrificing some requests to protect the […] Read article
6 min read Load shedding is the deliberate rejection of requests when a system is overloaded — sacrificing some requests to protect the […] Read article
6 min read Cache warming is the process of proactively loading data into a cache before it is needed by real traffic. Without Read article
7 min read Deleting data is irreversible by default. Soft delete marks records as deleted without removing them from the database; hard delete Read article
6 min read The thundering herd problem occurs when many processes or threads simultaneously wake up to compete for the same scarce resource Read article
4 min read Database indexes dramatically accelerate query performance by providing direct access paths to rows, avoiding full table scans. However, indexes consume Read article
6 min read Pagination enables clients to retrieve large result sets page by page instead of in one response. Two primary approaches exist: Read article
6 min read The Write-Ahead Log (WAL) is the foundation of database durability and crash recovery. Every change to database state is first Read article
6 min read CRDTs (Conflict-Free Replicated Data Types) are data structures that can be updated independently and concurrently on multiple nodes without coordination, Read article
4 min read Multi-Version Concurrency Control (MVCC) is the concurrency mechanism used by PostgreSQL, MySQL InnoDB, Oracle, and most modern databases to enable Read article
6 min read OAuth2 is an authorization framework that allows users to grant third-party applications limited access to their accounts without sharing passwords. Read article
6 min read GraphQL is a query language for APIs where clients specify exactly the data they need and receive nothing more. Unlike Read article
4 min read Geo-spatial indexing enables efficient queries on geographic data: finding restaurants within 1km, matching drivers to riders within 500m, showing events Read article
6 min read WebSocket provides full-duplex, low-latency communication between a browser and server over a single persistent TCP connection. Unlike HTTP polling (client Read article
6 min read Full-text search finds documents matching natural language queries across large corpora. Unlike exact-match database queries (WHERE name = ‘foo’), full-text Read article
3 min read gRPC is a high-performance RPC framework built on HTTP/2 and Protocol Buffers. It provides strongly-typed service contracts, efficient binary serialization, Read article