Inverted Index Low-Level Design: Tokenization, Posting Lists, TF-IDF Scoring, and Index Updates
What Is an Inverted Index? An inverted index maps each term (word or token) to the list of documents that […]
What Is an Inverted Index? An inverted index maps each term (word or token) to the list of documents that […]
Why B-Trees Cannot Index 2D Space A B-tree sorts data along a single dimension. Given a point (lat, lng), there
What Is a Bitmap Index? A bitmap index represents the presence or absence of a column value for each row
What Is a Write-Ahead Log and Why Does It Exist? A write-ahead log (WAL) is the foundation of durability in
LSM Tree Structure and the Need for Compaction A Log-Structured Merge-tree (LSM-tree) accepts all writes into an in-memory buffer (MemTable).
B-Tree Structure and Page Layout A B-tree index is a balanced tree where every node is a fixed-size disk page
Raft is the consensus algorithm designed to be understandable. It powers etcd (Kubernetes backbone), CockroachDB, TiKV, and many other systems
Snapshot isolation (SI) is the transaction isolation level used by PostgreSQL (as its default), Oracle, SQL Server, and many distributed
MVCC (Multi-Version Concurrency Control) is the foundation of concurrent transaction processing in PostgreSQL, MySQL InnoDB, Oracle, and most modern distributed
A bloom filter is a space-efficient probabilistic data structure that answers membership queries with certainty about absence and high confidence
Counting the number of distinct elements in a large stream — cardinality estimation — seems to require storing every unique
The gossip protocol is the distributed systems equivalent of rumor spreading: each node periodically shares information with a small random
A TLS certificate manager automates the full lifecycle of digital certificates: issuance via ACME protocol, storage with encrypted private keys,
LSM-tree (Log-Structured Merge-tree) is the storage engine architecture behind RocksDB, LevelDB, Cassandra, and HBase. It trades read amplification for dramatically
A service registry is the backbone of dynamic service discovery in microservice architectures. Rather than hardcoding service endpoints, every instance