Low Level Design: Real-Time Leaderboard
A real-time leaderboard ranks users or entities by a score that updates continuously. The core challenge is serving low-latency rank […]
A real-time leaderboard ranks users or entities by a score that updates continuously. The core challenge is serving low-latency rank […]
SLIs (Service Level Indicators), SLOs (Service Level Objectives), and error budgets are the quantitative framework for reliability engineering. An SLI
Feature flags (feature toggles) decouple code deployment from feature release. Code ships to production with a feature disabled; the flag
Read-heavy systems serve many more reads than writes — often 100:1 or higher ratios. Optimizing for reads requires layered caching,
Binary protocols encode messages as compact byte sequences, achieving lower overhead, faster parsing, and smaller payloads than text-based formats (JSON,
Platform engineering builds an Internal Developer Platform (IDP) that provides self-service infrastructure capabilities to application teams. Instead of every team
Tail latency (p99, p999 latency) is the response time experienced by the slowest few percent of requests. While average latency
Graceful shutdown ensures a service stops cleanly: completing in-flight requests, draining connections, flushing buffers, and releasing resources before the process
Stream processing applies computations to unbounded data streams in real time. Windowing divides the infinite stream into finite chunks so
Site Reliability Engineering (SRE) formalizes reliability using three measurements: Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level
Search relevance ranking determines the order in which results are presented for a given query. Poor ranking makes a search
A booking system manages time-bounded reservations for finite resources: hotel rooms, airline seats, restaurant tables, or appointment slots. The core
Secret management secures sensitive credentials (API keys, database passwords, TLS certificates, encryption keys) throughout their lifecycle: generation, storage, distribution, rotation,
A log aggregation system collects log data from distributed services, centralizes it for storage and search, and provides query and
A media processing pipeline ingests raw media (video, image, audio), transforms it into multiple output formats optimized for different devices