Search Suggestions (Autocomplete) System Low-Level Design

Requirements

  • Return top-k (k=10) query suggestions as a user types, within 100ms
  • Suggestions ranked by search frequency or relevance score
  • Handle 10M daily active users, 1B queries/day
  • Update suggestions as queries trend (near-real-time, not necessarily instant)
  • Personalized suggestions (recent searches, user-specific boosting)

Trie-Based Architecture

The core data structure is a prefix trie where each node stores the top-k suggestions for that prefix. On each query, traverse from root using the typed prefix — the suggestions are pre-computed at each node, so lookup is O(L) where L is the prefix length.

class TrieNode:
    def __init__(self):
        self.children = {}
        self.top_k = []  # (score, query) tuples, sorted descending by score

class SearchTrie:
    def __init__(self):
        self.root = TrieNode()

    def search(self, prefix):
        node = self.root
        for c in prefix:
            if c not in node.children:
                return []
            node = node.children[c]
        return [q for score, q in node.top_k]

Pre-computing top-k at every node means suggestion retrieval is O(L) — just traverse the prefix. The tradeoff: updates require recomputing top-k at every node along the path from root to the updated term (O(L * k) per update).
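The update path can be sketched as follows. This is a self-contained variant of the classes above; the class-level K and the heap-based merge helper are illustrative choices, not the only way to maintain the per-node lists.

```python
import heapq

class TrieNode:
    def __init__(self):
        self.children = {}
        self.top_k = []  # (score, query) tuples, sorted descending by score

class SearchTrie:
    K = 10  # suggestions kept per node

    def __init__(self):
        self.root = TrieNode()

    def update(self, query, score):
        """Walk root-to-leaf for `query`, merging (score, query) into each
        node's top-k list along the way -- O(L * k) per update."""
        node = self.root
        for c in query:
            node = node.children.setdefault(c, TrieNode())
            self._merge(node, score, query)

    def _merge(self, node, score, query):
        # Drop any stale entry for the same query, then keep the best K.
        entries = [(s, q) for s, q in node.top_k if q != query]
        entries.append((score, query))
        node.top_k = heapq.nlargest(self.K, entries)

    def search(self, prefix):
        node = self.root
        for c in prefix:
            if c not in node.children:
                return []
            node = node.children[c]
        return [q for _, q in node.top_k]
```

Because every ancestor node is touched on each update, this is exactly the cost that pushes production systems toward the batch-rebuild strategy described below.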

Data Flow

User types → Suggestion API → Redis (prefix → top-k list, TTL=10min)
                            → Trie Service (if Redis miss)

Search query submitted → Kafka (query log)
                       → Frequency Aggregator (Spark/Flink, hourly batch)
                       → Trie Builder (rebuilds trie from aggregated frequencies)
                       → Trie stored in Redis + replicated to Suggestion API servers

Redis Caching Layer

Store precomputed suggestions in Redis: key=suggest:{prefix}, value=JSON array of the top-10 queries, TTL=10 minutes. The suggestion API checks Redis first (O(1) lookup) and falls back to the trie service on a miss. Cache hit rate is very high for common prefixes (short prefixes cover 80%+ of queries). Prefixes of 1-3 characters are cached permanently (high request frequency, small key space); longer prefixes use the 10-minute TTL.
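A cache-aside sketch of this lookup, assuming a redis-py-style client (`get`, and `set` with `ex` for TTL); the constants and function name are illustrative:

```python
import json

PERMANENT_PREFIX_LEN = 3   # prefixes this short are cached without expiry
CACHE_TTL_SECONDS = 600    # 10 minutes for longer prefixes

def get_suggestions(redis_client, trie, prefix):
    """Cache-aside lookup: Redis first, trie on miss, then backfill Redis."""
    key = f"suggest:{prefix}"
    cached = redis_client.get(key)
    if cached is not None:
        return json.loads(cached)
    suggestions = trie.search(prefix)
    if len(prefix) <= PERMANENT_PREFIX_LEN:
        redis_client.set(key, json.dumps(suggestions))  # no TTL: cache forever
    else:
        redis_client.set(key, json.dumps(suggestions), ex=CACHE_TTL_SECONDS)
    return suggestions
```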

Trie Update Strategy

Option A — Real-time updates: on each completed search, update the query frequency and recompute top-k at each trie node along the path. O(L * k log k) per query. At 10K QPS, this is too expensive for a single trie instance.

Option B — Periodic batch rebuild (recommended for scale): aggregate query frequencies over 1-hour windows using Spark. Rebuild the full trie from the aggregated data. Swap the trie (blue-green deploy): all suggestion API servers hot-reload the new trie from shared storage (S3). Suggestions lag by up to 1 hour but the trie is always consistent. For trending queries (last 10 minutes), apply a small real-time boost using a separate counter (Redis ZINCRBY on a sorted set of recent queries).
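The serving-time trending merge can be sketched as below. It assumes the trending counts have already been read out of the Redis sorted set (e.g. via ZREVRANGEBYSCORE) into a dict; the boost multiplier is an illustrative tuning knob.

```python
def apply_trending_boost(base, trending, prefix, k=10, boost=1.25):
    """Merge trie-derived suggestions with recent trending counts.

    base:     list of (score, query) pairs from the hourly-built trie
    trending: dict of query -> recent count from the Redis sorted set
    Trending queries matching the prefix get their count added with a
    multiplier, so viral queries can surface before the next trie rebuild.
    """
    scores = {q: s for s, q in base}
    for query, count in trending.items():
        if query.startswith(prefix):
            scores[query] = scores.get(query, 0) + count * boost
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [q for q, _ in ranked[:k]]
```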

Personalization

Two layers: (1) Global suggestions from the trie (population-level frequency). (2) Personal layer: store each user’s recent searches in Redis (key=user_recents:{user_id}, sorted set by timestamp, max 20 entries). Merge: start with personal recent searches that match the prefix, then fill remaining slots from global suggestions. Personal results are fetched from Redis in O(log n + k), merged client-side or in the suggestion API.
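The merge step might look like this pure-Python sketch; the function name and the k default are illustrative, and personal recents are assumed to arrive newest-first (as returned from the timestamp-sorted set):

```python
def merge_suggestions(personal_recents, global_suggestions, prefix, k=10):
    """Personal recents matching the prefix rank first (newest first),
    then global suggestions fill the remaining slots, deduplicated."""
    results = []
    seen = set()
    for query in personal_recents:
        if query.startswith(prefix) and query not in seen:
            results.append(query)
            seen.add(query)
    for query in global_suggestions:
        if len(results) >= k:
            break
        if query not in seen:
            results.append(query)
            seen.add(query)
    return results[:k]
```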

Query Normalization

Before inserting into the trie: lowercase, trim whitespace, strip punctuation, handle spelling variants. Spell correction: maintain a dictionary of common misspellings → canonical forms. On a suggestion miss, retry with the normalized/corrected prefix. Phonetic matching (Soundex, Metaphone) for fuzzy suggestions is expensive at scale — prefer exact prefix match plus spell correction for speed.
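A minimal normalization sketch; the misspelling dictionary entries here are illustrative placeholders for a real corrections table:

```python
import string

MISSPELLINGS = {"pyhton": "python", "recieve": "receive"}  # illustrative entries

def normalize_query(raw):
    """Lowercase, trim, strip punctuation, collapse whitespace,
    then map known misspellings to their canonical forms."""
    q = raw.strip().lower()
    q = q.translate(str.maketrans("", "", string.punctuation))
    q = " ".join(q.split())  # collapse internal runs of whitespace
    return " ".join(MISSPELLINGS.get(tok, tok) for tok in q.split())
```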

Key Design Decisions

  • Pre-compute top-k at each trie node — O(L) lookup, not O(L + n) search
  • Redis prefix cache — eliminates trie traversal for 90%+ of requests
  • Batch trie rebuild hourly — consistent data without per-query update overhead
  • Real-time trending boost via Redis sorted set — catches viral queries within minutes
  • Personal layer merged at serving time — no per-user trie, shared infrastructure


