Distributed Counter System: Low-Level Design
A distributed counter tracks a numeric value that many servers increment or decrement concurrently — page views, likes, inventory quantities, API rate limits, and active user counts all depend on counters that must be correct under high write contention. This design covers the trade-off spectrum from exact Postgres counters to approximate probabilistic structures, and shows how to choose the right approach based on required accuracy, write throughput, and read latency.
Core Data Model
-- Exact counter for critical values (inventory, payment limits)
CREATE TABLE Counter (
    counter_id VARCHAR(200) PRIMARY KEY,  -- "inventory:SKU-123", "likes:post:456"
    value      BIGINT NOT NULL DEFAULT 0,
    updated_at TIMESTAMPTZ NOT NULL DEFAULT NOW()
);

-- Sharded counter for high-throughput approximate values (view counts, likes)
CREATE TABLE CounterShard (
    counter_id VARCHAR(200) NOT NULL,
    shard_id   SMALLINT NOT NULL,  -- 0..N_SHARDS-1
    value      BIGINT NOT NULL DEFAULT 0,
    updated_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    PRIMARY KEY (counter_id, shard_id)
);

-- Delta buffer table for batched increments (very high write rates)
CREATE TABLE CounterDelta (
    delta_id   BIGSERIAL,
    counter_id VARCHAR(200) NOT NULL,
    delta      BIGINT NOT NULL,  -- +1, -1, or bulk adjustment
    created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    -- Postgres requires the partition key in the primary key
    PRIMARY KEY (delta_id, created_at)
) PARTITION BY RANGE (created_at);

CREATE INDEX ON CounterDelta (counter_id, created_at);
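The CounterDelta buffer is drained by a periodic job that groups pending rows per counter and applies one summed update each. The grouping step can be sketched in isolation; the function name and the in-memory row list are illustrative, standing in for what a SELECT ... GROUP BY over the table would return:

```python
from collections import defaultdict

def aggregate_deltas(rows):
    """Collapse buffered (counter_id, delta) rows into one net delta per
    counter, mirroring what the flush job's
    SELECT counter_id, SUM(delta) FROM CounterDelta GROUP BY counter_id
    would produce before the net values are applied to Counter."""
    totals = defaultdict(int)
    for counter_id, delta in rows:
        totals[counter_id] += delta
    return dict(totals)
```

The flush job would then run one atomic UPDATE per counter with its net delta and delete (or drop the partition containing) the consumed rows.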
Strategy 1: Exact Postgres Counter (Low Write Rate)
def increment(counter_id: str, delta: int = 1) -> int:
    """
    Atomic increment using INSERT ... ON CONFLICT DO UPDATE.
    Correct under concurrent writes; suitable for up to ~5K writes/second.
    Returns the new value.
    """
    row = db.fetchone("""
        INSERT INTO Counter (counter_id, value, updated_at)
        VALUES (%s, %s, NOW())
        ON CONFLICT (counter_id) DO UPDATE
        SET value = Counter.value + EXCLUDED.value,
            updated_at = NOW()
        RETURNING value
    """, (counter_id, delta))
    return row['value']
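The same upsert pattern can be exercised locally against SQLite as a stand-in for Postgres, which is a convenient way to sanity-check the increment logic without a server. Note the SQLite dialect differences: ? placeholders and a lowercase excluded alias (this helper setup is illustrative, not part of the design above):

```python
import sqlite3

def make_counter_db():
    # In-memory stand-in for the Postgres Counter table
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE Counter (counter_id TEXT PRIMARY KEY, value INTEGER NOT NULL DEFAULT 0)")
    return conn

def increment(conn, counter_id, delta=1):
    # Upsert: insert the row if new, otherwise add delta to the existing value
    conn.execute("""
        INSERT INTO Counter (counter_id, value) VALUES (?, ?)
        ON CONFLICT (counter_id) DO UPDATE SET value = value + excluded.value
    """, (counter_id, delta))
    return conn.execute(
        "SELECT value FROM Counter WHERE counter_id = ?", (counter_id,)).fetchone()[0]
```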
def decrement_with_floor(counter_id: str, delta: int = 1, floor: int = 0) -> int:
    """
    Decrement but never go below floor. Used for inventory: can't sell what you don't have.
    The guard WHERE value >= floor + delta makes the check-and-decrement a single
    atomic statement, so the value can never drop below floor.
    """
    row = db.fetchone("""
        UPDATE Counter
        SET value = value - %s, updated_at = NOW()
        WHERE counter_id = %s AND value >= %s
        RETURNING value
    """, (delta, counter_id, floor + delta))
    if not row:
        raise InsufficientInventoryError(
            f"Counter {counter_id} cannot be decremented by {delta} without dropping below {floor}")
    return row['value']
def get_count(counter_id: str) -> int:
    row = db.fetchone("SELECT value FROM Counter WHERE counter_id = %s", (counter_id,))
    return row['value'] if row else 0
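The guarded-decrement pattern can likewise be checked against SQLite. Because older SQLite builds lack UPDATE ... RETURNING, this sketch detects a refused decrement via rowcount instead; the setup helper and exception class are illustrative:

```python
import sqlite3

class InsufficientInventoryError(Exception):
    pass

def make_inventory_db(counter_id, initial):
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE Counter (counter_id TEXT PRIMARY KEY, value INTEGER NOT NULL)")
    conn.execute("INSERT INTO Counter VALUES (?, ?)", (counter_id, initial))
    return conn

def decrement_with_floor(conn, counter_id, delta=1, floor=0):
    # The WHERE guard makes check-and-decrement one atomic statement:
    # the row is updated only when the result would stay at or above floor.
    cur = conn.execute(
        "UPDATE Counter SET value = value - ? WHERE counter_id = ? AND value >= ?",
        (delta, counter_id, floor + delta))
    if cur.rowcount == 0:
        raise InsufficientInventoryError(counter_id)
    return conn.execute(
        "SELECT value FROM Counter WHERE counter_id = ?", (counter_id,)).fetchone()[0]
```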
Strategy 2: Redis Counter (High Write Rate, Eventual Persistence)
import redis

r = redis.Redis(decode_responses=True)
PERSIST_INTERVAL = 60  # seconds

def redis_increment(counter_id: str, delta: int = 1) -> int:
    """
    Redis INCRBY is atomic and handles >1M increments/second.
    Returns the new value. Persistence to Postgres happens asynchronously.
    """
    return r.incrby(f"counter:{counter_id}", delta)

def redis_get(counter_id: str) -> int:
    val = r.get(f"counter:{counter_id}")
    return int(val) if val is not None else get_count(counter_id)  # fall back to DB
def persist_redis_counters():
    """
    Run every PERSIST_INTERVAL seconds via cron. Flush Redis counter values to Postgres.
    Pattern: read the Redis value, write it to the DB, do NOT delete the Redis key.
    Redis is the source of truth for the current value; the DB is the durable backup.
    """
    cursor = 0
    while True:
        cursor, keys = r.scan(cursor, match='counter:*', count=200)
        for key in keys:
            counter_id = key.removeprefix('counter:')
            value = r.get(key)
            if value is None:
                continue
            db.execute("""
                INSERT INTO Counter (counter_id, value, updated_at)
                VALUES (%s, %s, NOW())
                ON CONFLICT (counter_id) DO UPDATE
                SET value = EXCLUDED.value, updated_at = NOW()
            """, (counter_id, int(value)))
        if cursor == 0:
            break
Strategy 3: Sharded Counter (Extremely High Write Throughput)
import hashlib

N_SHARDS = 16

def sharded_increment(counter_id: str, shard_key: str, delta: int = 1):
    """
    Distribute writes across N_SHARDS rows to eliminate single-row contention.
    shard_key is any per-request identifier (user_id, request_id); hashing it
    routes each request to a consistent shard.
    Reads must SUM all shards; each write touches only one row.
    """
    shard_id = int(hashlib.md5(shard_key.encode()).hexdigest()[:4], 16) % N_SHARDS
    db.execute("""
        INSERT INTO CounterShard (counter_id, shard_id, value)
        VALUES (%s, %s, %s)
        ON CONFLICT (counter_id, shard_id) DO UPDATE
        SET value = CounterShard.value + EXCLUDED.value,
            updated_at = NOW()
    """, (counter_id, shard_id, delta))

def sharded_get(counter_id: str) -> int:
    """Sum all shards. Slightly stale results are fine for view counts / likes."""
    row = db.fetchone("""
        SELECT COALESCE(SUM(value), 0) AS total
        FROM CounterShard WHERE counter_id = %s
    """, (counter_id,))
    return row['total']

# With N_SHARDS=16, write contention on any single row is reduced by 16x.
# Postgres sustains roughly 5K updates/second per hot row, so sharding raises
# the per-counter ceiling to roughly 80K/second.
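Because sharded_get runs a SUM over all shard rows, hot counters benefit from a short-lived read cache in front of it. This sketch uses a process-local dict and an injectable clock so the logic is testable; the 10-second TTL is an illustrative choice, and a multi-server deployment would keep the cache in Redis (SETEX) instead:

```python
import time

CACHE_TTL = 10.0   # seconds a summed value may be served stale
_cache = {}        # counter_id -> (total, fetched_at)

def cached_sharded_get(counter_id, fetch_sum, now=time.monotonic):
    """Serve reads from the cache while fresh; otherwise run the SUM query
    (fetch_sum) once and cache the result for CACHE_TTL seconds."""
    entry = _cache.get(counter_id)
    t = now()
    if entry is not None and t - entry[1] < CACHE_TTL:
        return entry[0]
    total = fetch_sum(counter_id)
    _cache[counter_id] = (total, t)
    return total
```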
Strategy 4: HyperLogLog for Unique Counts (Cardinality Estimation)
# Counting distinct users who viewed a post with an exact COUNT(DISTINCT user_id)
# needs O(N) memory. Redis HyperLogLog gives ~0.81% standard error with at most
# 12 KB of memory per counter.

def hll_add(counter_id: str, element: str):
    """Add an element to the HyperLogLog. Redis PFADD is O(1)."""
    r.pfadd(f"hll:{counter_id}", element)

def hll_count(counter_id: str) -> int:
    """Return the estimated cardinality. Standard error ~0.81%."""
    return r.pfcount(f"hll:{counter_id}")
# Usage: count unique viewers per post
# hll_add(f"post_views:{post_id}", str(user_id))
# unique_viewers = hll_count(f"post_views:{post_id}")
# Merge multiple HLLs: count unique viewers across all posts by author
# r.pfmerge("hll:author_views:42", *[f"hll:post_views:{pid}" for pid in post_ids])
# total_unique = r.pfcount("hll:author_views:42")
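The leading-zeros idea behind PFADD/PFCOUNT can be reproduced in a few lines. This toy estimator uses 256 registers rather than Redis's 16,384, so its standard error is roughly 6.5% instead of 0.81%; it is a sketch for intuition, not a replacement for the Redis implementation:

```python
import hashlib
import math

P = 8                              # register-index bits
M = 1 << P                         # 256 registers -> std error ~1.04/sqrt(M) ~ 6.5%
ALPHA = 0.7213 / (1 + 1.079 / M)   # bias-correction constant for m >= 128

def hll_estimate(elements):
    """Toy HyperLogLog: the top P hash bits pick a register; the register
    stores the longest run of leading zeros (+1) seen in the remaining bits."""
    regs = [0] * M
    for e in elements:
        h = int.from_bytes(hashlib.sha256(str(e).encode()).digest()[:8], "big")
        idx = h >> (64 - P)                       # which register
        rest = h & ((1 << (64 - P)) - 1)          # remaining 56 bits
        rank = (64 - P) - rest.bit_length() + 1   # leading zeros + 1
        regs[idx] = max(regs[idx], rank)
    z = sum(2.0 ** -r for r in regs)
    est = ALPHA * M * M / z                       # harmonic-mean raw estimate
    zeros = regs.count(0)
    if est <= 2.5 * M and zeros:                  # small-range (linear counting) fix
        est = M * math.log(M / zeros)
    return int(est)
```

Each register keeps only the maximum rank seen, which is why memory stays fixed and why re-adding a duplicate element never changes the estimate.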
Key Design Decisions
- Choose the right strategy by write rate and accuracy requirement:
- <5K writes/sec + must be exact (inventory, billing) → Postgres atomic UPDATE
- 5K–500K writes/sec + can be slightly stale (likes, views) → Redis INCRBY + periodic persist
- >500K writes/sec → Redis Cluster; sharded Postgres (16 shards × ~5K/shard ≈ 80K/sec per counter) covers the middle ground but not this tier
- Unique counts (distinct visitors) → HyperLogLog (~0.81% error, fixed ~12 KB memory)
- Inventory decrement floor: the single UPDATE … WHERE value >= floor + delta is atomic, so no explicit transaction is needed. If rowcount=0, raise a domain error. Never use SELECT then UPDATE for inventory (TOCTOU race).
- Redis persistence gap: if the Redis server crashes between persist cycles, up to 60 seconds of increments are lost. Acceptable for view counts; not for rate-limit counters or inventory. For rate limits, enable AOF persistence (appendonly yes in redis.conf): with appendfsync always every write is fsynced before acknowledgment, while the default appendfsync everysec bounds the loss window to about one second.
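The fixed-window rate-limit counter mentioned above maps naturally onto a keyed counter. This sketch keeps the counters in a plain dict with an injectable clock so the windowing logic is visible and testable; a production version would use Redis INCR plus EXPIREAT on a key like user:ratelimit:{user_id}:{window_start}, and the 100-per-60s limit is an illustrative choice:

```python
import time

WINDOW = 60    # window length, seconds
LIMIT = 100    # max requests per user per window

def allow_request(store, user_id, now=None):
    """Fixed-window limiter: one counter per (user, window-start) pair.
    Returns True if the request is within the limit for the current window."""
    now = time.time() if now is None else now
    window_start = int(now // WINDOW) * WINDOW
    key = (user_id, window_start)
    store[key] = store.get(key, 0) + 1   # Redis equivalent: INCR + EXPIREAT
    return store[key] <= LIMIT
```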
Frequently Asked Questions

When should you use a sharded counter instead of a Redis counter?
Redis handles over 1 million INCRBY operations per second on a single node, so it is the right choice for most high-throughput counters (view counts, likes, session activity). Use sharded Postgres counters when: (1) the data must be durable without a separate Redis persistence layer (AOF/RDB); (2) the counter must participate in a Postgres transaction alongside other writes (e.g., an inventory decrement that also updates an order row); (3) you need audit history per increment (a ledger of who changed the count, when, and by how much). Sharded Postgres counters with N=16 shards achieve approximately 80K updates/second per counter, sufficient for most single-counter hotspots. For counters updated by millions of users simultaneously (a global like count on a viral post), Redis is the only practical option.

How does HyperLogLog achieve constant memory regardless of cardinality?
HyperLogLog (HLL) estimates set cardinality using a probabilistic algorithm. Instead of storing every distinct element, it hashes each element and tracks the position of the leading zero bits in the hash. Intuitively, the probability of seeing k leading zeros is 1/2^k, so if the maximum number of leading zeros observed is k, the cardinality is approximately 2^k. HLL aggregates many such estimates (16,384 registers in Redis's implementation) and applies a harmonic-mean correction, achieving ~0.81% standard error. Memory: at most 12 KB per HLL, whether you have 100 or 100 billion distinct elements. Trade-off: you cannot enumerate the distinct elements (no membership query), only estimate their count. For sets where you need both the count and the ability to check membership, use a Bloom filter (approximate membership) alongside a separate counter.

How do you handle counter resets or adjustments (e.g., correcting a fraudulent like count)?
A Redis INCRBY counter has no built-in audit trail: you can SET the counter to any value but cannot explain why. For counters that need correction capability: (1) maintain a separate adjustment ledger (the CounterDelta table) with an adjustment_reason column, and apply deltas via the INSERT-based batching pattern rather than direct INCR; (2) for Redis counters, SET the corrected value directly and log the change, e.g. r.set("counter:post:42:likes", corrected_value) with a corresponding audit record. Never silently adjust counters; always write an audit record with the before value, after value, changed_by user, and reason. For fraud-related adjustments (removing fake likes), batch the removal in an offline job: compute the fraudulent delta, apply a negative adjustment, log the fraud event. Do not remove individual increments retroactively; the ledger pattern makes corrections additive (a negative entry), not destructive.

How do you implement a global request counter for rate limiting without a single hotspot?
Rate limiting requires a per-user, per-time-window counter: "user 42 has made 47 requests in the last 60 seconds." Redis is the natural fit: INCRBY user:ratelimit:42:1735689600 1 with EXPIREAT set to the window end. This is O(1) and Redis handles millions of such counters. The hotspot risk is not per-user (each user has their own key) but per-Redis-shard if one shard holds too many active keys. Solution: partition rate-limit keys across a Redis cluster by user_id hash, so the key space is automatically distributed. For sliding-window rate limits, use Redis sorted sets (ZADD with the timestamp as score, ZREMRANGEBYSCORE to evict old entries, ZCARD to count). The sorted-set approach is O(log N) per request but provides exact sliding-window semantics versus the approximate fixed-window INCR approach.

What consistency guarantees does a sharded counter provide for reads?
A sharded counter read (SELECT SUM(value) FROM CounterShard WHERE counter_id = X) is a snapshot read: it sees all committed increments up to the moment of the SELECT. It does NOT see in-flight transactions (increments that started but have not yet committed). This is correct for most use cases; inventory counts should not include uncommitted reservations. Stale-read window: if shards are spread across Postgres nodes with replication lag, the SUM may combine some shards' committed values with other shards' older values. Avoid cross-node sharding for counters that need strong consistency; keep all shards on the primary. For approximate counters (view counts, like counts), cache the last SUM result in Redis for 10 seconds: the read is at most 10 seconds stale, which is imperceptible to users and removes the SUM query from the hot path entirely.