The Feed Problem
A social network feed shows a user the recent posts from people they follow, ranked by relevance or recency. The challenge: at scale (Twitter/X: 300M users, Instagram: 2B users), computing each user’s feed from scratch on every request is too slow. The solution involves pre-computing and caching feeds. Two fundamental approaches: fan-out on write (push) and fan-out on read (pull).
Fan-out on Write (Push Model)
When a user creates a post: immediately write its ID to every follower's feed cache. Each follower has a pre-materialized feed (a list of post IDs) stored in Redis. On feed read: just fetch the user's pre-computed feed, an O(1) lookup. Pros: fast reads; works well for most users. Cons: when a celebrity with 100M followers posts, that one post triggers 100M write operations (the fan-out). This write spike can overwhelm the system. Used by: early Twitter architecture.
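A minimal sketch of the push path, using an in-memory dict as a stand-in for Redis sorted sets (the names and data are hypothetical; comments show the corresponding Redis commands):

```python
import time
from collections import defaultdict

# In-memory stand-in for Redis: one sorted set (post_id -> score) per user.
feeds = defaultdict(dict)                 # user_id -> {post_id: timestamp}
followers = {"alice": ["bob", "carol"]}   # author -> follower list
FEED_CAP = 1000                           # keep only the newest 1000 entries

def fan_out_on_write(author, post_id, ts=None):
    """Push the new post ID into every follower's pre-materialized feed."""
    ts = ts or time.time()
    for follower in followers.get(author, []):
        feeds[follower][post_id] = ts          # ZADD feed:{follower} ts post_id
        if len(feeds[follower]) > FEED_CAP:    # trim, like ZREMRANGEBYRANK
            oldest = min(feeds[follower], key=feeds[follower].get)
            del feeds[follower][oldest]

def read_feed(user, limit=50):
    """Read is just a slice of the already-materialized feed (ZREVRANGE)."""
    return sorted(feeds[user], key=feeds[user].get, reverse=True)[:limit]

fan_out_on_write("alice", "post:1", ts=100)
fan_out_on_write("alice", "post:2", ts=200)
print(read_feed("bob"))  # ['post:2', 'post:1']
```

In production the two dict writes become a single pipelined ZADD per follower, but the shape of the trade-off is the same: one post, N writes.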
Mitigation: use a queue (Kafka). The post-created event goes to a Kafka topic; worker consumers read from it and fan out in parallel. Still 100M writes, but spread across workers over seconds rather than a single spike. Also apply an activity filter: only fan out to followers who were active in the last 7 days; inactive users can pull on demand when they return.
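A sketch of one worker's share of the fan-out, including the 7-day activity filter (the event shape and the in-memory stores are hypothetical stand-ins for Kafka and Redis):

```python
ACTIVE_WINDOW = 7 * 24 * 3600  # only fan out to followers active in 7 days

# Hypothetical stand-ins; a real worker consumes a Kafka topic and writes Redis.
last_active = {"bob": 1_000_000, "carol": 100}   # follower -> last-seen ts
followers_of = {"alice": ["bob", "carol"]}
feeds = {}                                       # follower -> list of post IDs

def handle_post_event(event, now):
    """Process one post-created event: fan out only to active followers."""
    author, post_id = event["author"], event["post_id"]
    for f in followers_of.get(author, []):
        if now - last_active.get(f, 0) > ACTIVE_WINDOW:
            continue  # dormant follower: they pull their feed on next login
        feeds.setdefault(f, []).append(post_id)

handle_post_event({"author": "alice", "post_id": "post:9"}, now=1_000_500)
print(feeds)  # only "bob" is within the activity window
```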
Fan-out on Read (Pull Model)
No pre-computation on write. On feed read: query all the accounts the user follows, fetch their recent posts, merge and sort. Pros: no write amplification; posting is cheap even for celebrities. Cons: reads are slow and expensive at scale (query N accounts, merge N result sets). Works for: celebrity accounts (avoids the massive fan-out) and inactive users (no point pre-computing a feed nobody reads).
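The pull-model read is a k-way merge of per-author timelines. A sketch using `heapq.merge`, with hypothetical in-memory timelines (each sorted newest-first) standing in for the post store:

```python
import heapq

# Hypothetical per-author timelines: (timestamp, post_id), newest first.
timelines = {
    "alice": [(300, "a3"), (100, "a1")],
    "bob":   [(250, "b2"), (50,  "b1")],
}

def read_feed_pull(following, limit=50):
    """Merge each followed account's recent posts at read time (k-way merge)."""
    merged = heapq.merge(*(timelines[u] for u in following),
                         key=lambda p: p[0], reverse=True)
    return [post_id for _, post_id in list(merged)[:limit]]

print(read_feed_pull(["alice", "bob"]))  # ['a3', 'b2', 'a1', 'b1']
```

The merge itself is cheap; the expensive part at scale is the N timeline fetches it hides behind `timelines[u]`.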
Hybrid Approach (Industry Standard)
Combine both: push for regular users (&lt;10K followers), pull for celebrities (&gt;10K followers). When reading the feed: retrieve the pre-computed feed from cache (push model), then fetch recent posts from followed celebrities separately and merge them in (pull model). This is how Twitter and Instagram actually work. The threshold (~10K followers) is configurable and trades write cost against read complexity.
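A sketch of the hybrid read path (names, data, and the 10K threshold are illustrative): merge the cached push-model feed with posts pulled live from followed celebrity accounts.

```python
CELEB_THRESHOLD = 10_000  # configurable push/pull cutoff

# Hypothetical stores: (timestamp, post_id) pairs.
precomputed = {"me": [(200, "friend_post")]}        # pushed at write time
celeb_posts = {"celebrity": [(300, "celeb_post")]}  # pulled at read time
follower_counts = {"celebrity": 5_000_000}

def read_hybrid(user, celeb_follows, limit=50):
    """Merge the pre-computed feed with live pulls from celebrity follows."""
    candidates = list(precomputed.get(user, []))
    for c in celeb_follows:
        if follower_counts.get(c, 0) > CELEB_THRESHOLD:
            candidates.extend(celeb_posts.get(c, []))  # pull path
    candidates.sort(reverse=True)                      # newest first
    return [post_id for _, post_id in candidates[:limit]]

print(read_hybrid("me", ["celebrity"]))  # ['celeb_post', 'friend_post']
```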
Feed Ranking
Chronological: simple, but users miss important posts from less-active accounts they follow. Ranked: ML model scores each candidate post by predicted engagement (likes, comments, shares, view time). Features: post age, poster relationship strength (how often you interact), content type (video, photo, text), past engagement with similar content. Two-stage ranking: Stage 1 — retrieve candidate posts (recent posts from followed accounts + sponsored content, ~1000 candidates). Stage 2 — ML ranking model scores all candidates and returns top-50. Heavy ML ranking on 1000 candidates per request at scale requires GPU inference servers or optimized CPU inference.
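A toy version of the two-stage pipeline. The linear score function is an illustrative stand-in for the ML model; the feature names and weights are invented for the sketch:

```python
def retrieve(posts, now, days=7, cap=1000):
    """Stage 1 (retrieval): cheap recency filter down to ~1000 candidates."""
    cutoff = now - days * 86400
    recent = [p for p in posts if p["ts"] >= cutoff]
    recent.sort(key=lambda p: p["ts"], reverse=True)
    return recent[:cap]

def score(post, now):
    """Stage 2 (ranking): linear stand-in for the engagement-prediction model."""
    age_hours = (now - post["ts"]) / 3600
    return 2.0 * post["affinity"] + 1.5 * post["engagement"] - 0.05 * age_hours

def rank_feed(posts, now, top_k=50):
    candidates = retrieve(posts, now)
    candidates.sort(key=lambda p: score(p, now), reverse=True)
    return [p["id"] for p in candidates[:top_k]]

now = 1_000_000
posts = [
    {"id": "old",  "ts": now - 30 * 86400, "affinity": 1.0, "engagement": 1.0},
    {"id": "meh",  "ts": now - 3600,       "affinity": 0.1, "engagement": 0.1},
    {"id": "good", "ts": now - 7200,       "affinity": 0.9, "engagement": 0.8},
]
print(rank_feed(posts, now))  # ['good', 'meh'] -- 'old' filtered in stage 1
```

The structural point survives the toy weights: stage 1 keeps the candidate set small enough that the expensive stage-2 model only runs ~1000 times per request.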
Pagination and Cursor-Based Feeds
Never use OFFSET for feed pagination: OFFSET scans and discards N rows before returning results, so large offsets are very slow. Use cursor-based pagination: the client sends the ID of the last post it has seen, and the server returns posts older than that ID. Cursor = post_id, or a (timestamp, post_id) tuple for stability. New posts inserted at the top never shift cursor positions, preventing duplicates or gaps. Feed freshness: on pull-to-refresh, show new posts above the cursor (new content) rather than resetting the entire feed, and show a “You have 12 new posts” banner before revealing them.
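The cursor mechanics can be sketched over an in-memory feed sorted newest-first (data is hypothetical); Python's tuple comparison gives the (timestamp, post_id) ordering for free:

```python
# Feed sorted newest-first; cursor = (timestamp, post_id) of last seen item.
feed = [(400, "p4"), (300, "p3"), (200, "p2"), (100, "p1")]

def load_more(cursor, page_size=2):
    """Infinite scroll: posts strictly older than the cursor."""
    older = [p for p in feed if p < cursor]
    return older[:page_size]

def refresh(top_cursor):
    """Pull-to-refresh: posts strictly newer than the newest seen post."""
    return [p for p in feed if p > top_cursor]

print(load_more((300, "p3")))  # [(200, 'p2'), (100, 'p1')]
print(refresh((300, "p3")))    # [(400, 'p4')]
```

Because both queries are strict inequalities against a stable key, new inserts at the top can never duplicate or skip items below the cursor.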
Interview Tips
- Push vs pull is the core trade-off. The interviewer expects you to discuss both, explain the celebrity problem, and propose the hybrid solution.
- Feed storage: Redis Sorted Set (ZADD with timestamp as score, post_id as member) is the standard implementation. ZREVRANGE fetches the most recent posts. Trim with ZREMRANGEBYRANK to cap feed length (keep the newest 1000 entries) or ZREMRANGEBYSCORE to drop entries older than a time cutoff.
- The feed is eventually consistent — a new post may take seconds to appear in all followers’ feeds. This is acceptable for social media.
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is the difference between fan-out on write and fan-out on read for news feeds?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Fan-out on write (push model): when a user posts, immediately write the post ID to every follower's feed cache. Feed reads are instant (O(1) Redis read). Cost: write amplification — one post causes N writes (N = follower count). For a user with 10M followers, one post = 10M Redis writes. Fan-out on read (pull model): when a user opens their feed, fetch recent posts from each account they follow, merge and sort. No write overhead. Cost: read is slow (query N accounts, merge N result sets). For users following 500 accounts, each feed load queries 500 sources. Industry solution: hybrid. Use push for users with fewer than ~10K followers (fast writes, fast reads). Use pull for celebrities (>10K followers). When reading, merge the pre-computed pushed feed with freshly pulled celebrity posts."
      }
    },
    {
      "@type": "Question",
      "name": "How does Instagram store and retrieve its news feed at scale?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Instagram uses a pre-computed feed cache in Redis. Each user has a feed stored as a Redis Sorted Set with post IDs scored by timestamp. On post creation: fan out to followers via a distributed task queue (Celery workers consuming from a Kafka topic). Each worker writes the post ID into a batch of follower feed caches. For users with millions of followers (Kylie Jenner): fan-out is skipped; followers pull celebrity posts at read time. Feed read: ZREVRANGE user:{id}:feed 0 49 (top 50 posts by score). On cache miss (cold start or evicted): pull from the posts database, reconstruct the feed, repopulate cache. Feed cap: keep only the most recent 1000 post IDs per user in Redis — older content is paginated from the database."
      }
    },
    {
      "@type": "Question",
      "name": "How do you implement feed ranking with a machine learning model?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Two-stage retrieval-ranking architecture: Stage 1 (retrieval): gather ~1000 candidate posts. Sources: recent posts from followed accounts, sponsored/promoted posts, suggested content from accounts you might like. Use simple recency filter: posts from the last 7 days from accounts you follow. Stage 2 (ranking): score all 1000 candidates with a ranking model. Features: post age (recency decay), relationship strength (how often you interact with the poster), content type affinity (do you engage more with videos or photos?), predicted engagement probability (likes, comments, shares). Model: gradient boosting (XGBoost) or a two-tower neural network. Serve via a low-latency inference server (TensorFlow Serving, Triton). Return the top 50 posts by score. The ranking model is retrained daily on new engagement data."
      }
    },
    {
      "@type": "Question",
      "name": "How do you handle new posts appearing at the top of a paginated feed?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Offset-based pagination breaks with new content: page 2 (offset 20) shifts when 5 new posts are added to page 1, causing posts 16-20 to appear on both page 1 and page 2. Solution: cursor-based pagination. The cursor is the ID (or timestamp) of the last seen post. 'Load more' request: return posts older than cursor_post_id. New posts added above the cursor never shift the position of posts below it. For 'pull to refresh' (load newer content): return posts newer than top_cursor_post_id (the ID of the newest post the user has seen). Show a 'You have 8 new posts' banner. This separates the refresh (top of feed) from the infinite scroll (bottom of feed) and prevents the jarring experience of content jumping around."
      }
    },
    {
      "@type": "Question",
      "name": "How do you prevent a single celebrity post from overloading your fan-out workers?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "When a user with 100M followers posts, the fan-out job (write to 100M feed caches) is massive. Mitigation: (1) Async processing: the post creation API returns immediately; fan-out happens asynchronously via Kafka workers. Users may see a delay of seconds to minutes before followers see the post — acceptable for social media. (2) Rate limiting fan-out workers: spread the 100M writes over 60-120 seconds to avoid a Redis write spike. (3) Skip inactive followers: only fan-out to followers who were active in the last 7 days. A dormant user can pull their feed on login. Reduces fan-out by 50-80% for most celebrity accounts. (4) Hybrid model: for accounts above a follower threshold (1M), skip fan-out entirely and serve their posts via pull at read time. This is the fundamental solution to the celebrity problem."
      }
    }
  ]
}