System Design: Instagram/Photo Sharing — Image Upload, News Feed, Stories, Explore Page, CDN, Image Processing

Designing a photo-sharing platform like Instagram tests your ability to handle media-heavy workloads, social graph queries, feed generation, and content discovery. Instagram processes over 100 million photo uploads per day and serves billions of feed requests. This guide covers the end-to-end architecture from image upload to feed delivery, with the depth expected in senior engineering interviews.

Image Upload Pipeline

Upload flow:
(1) The client requests a presigned S3 upload URL from the backend. The backend generates the URL with constraints (max file size 10MB; allowed content types: JPEG, PNG, HEIC).
(2) The client uploads the image directly to S3 using the presigned URL. This bypasses the application server — no bandwidth or CPU is consumed on the backend for the raw upload.
(3) An S3 event notification triggers an image processing Lambda function.
(4) The Lambda function validates the image (checks for corruption, runs content moderation via AWS Rekognition or a custom ML model), generates multiple resized versions (150×150 thumbnail, 640×640 feed, 1080×1080 full), converts to WebP/AVIF for modern browsers while keeping JPEG as a fallback, strips EXIF data (privacy: remove GPS location unless the user explicitly adds a location), and stores all versions in S3 with a deterministic key pattern: images/{user_id}/{post_id}/{size}.webp.
(5) After processing completes, the Lambda publishes a "post ready" event to Kafka.
(6) The post service consumes the event, creates the post record in the database (post_id, user_id, image_urls, caption, location, created_at), and triggers feed fanout.
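The presigned-URL step can be sketched as follows. This is a minimal illustration using a plain HMAC signature rather than AWS Signature V4; the bucket name, storage domain, and signing key are placeholders:

```python
import hashlib
import hmac
import time
from urllib.parse import urlencode

SECRET = b"demo-signing-key"  # placeholder; real presigned URLs use AWS SigV4

def presign(bucket, key, expires_in=300, now=None):
    """Sketch of a presigned upload URL: embed an expiry and an HMAC
    signature over (key, expiry) in the URL, so the storage service can
    verify the backend authorized this exact upload without a lookup."""
    expires = int((now if now is not None else time.time()) + expires_in)
    payload = f"{key}:{expires}".encode()
    sig = hmac.new(SECRET, payload, hashlib.sha256).hexdigest()
    query = urlencode({"expires": expires, "signature": sig})
    return f"https://{bucket}.example-storage.com/{key}?{query}"

url = presign("photo-uploads", "images/42/1001/original.jpg", now=1_700_000_000)
```

The storage side recomputes the HMAC and rejects the PUT if the signature does not match or the expiry has passed, which is why the backend never needs to see the image bytes.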

News Feed Generation

Instagram uses a hybrid fanout approach similar to Twitter. When a user with fewer than 10,000 followers posts, the fanout service pushes the post_id to each follower's timeline cache (a Redis sorted set, score = timestamp). When a celebrity (more than 10,000 followers) posts, fanout is skipped; their posts are fetched at read time and merged with the pre-computed timeline.
Feed loading: the client requests GET /feed?cursor=last_post_id. The backend reads the pre-computed timeline from Redis (ZREVRANGEBYSCORE for the next page of post_ids), fetches celebrity posts from the celebrity post cache, merges and ranks them with the ML ranking model, hydrates the post_ids into full post objects (image URLs, captions, like counts, author info, fetched from cache or database), and returns the hydrated feed.
Pagination is cursor-based using the post_id (which is time-sorted via Snowflake). The client sends the last_post_id from the previous page, and the backend returns posts older than that ID. This is stable under concurrent inserts (unlike offset pagination).
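The read-time merge of the pre-computed timeline with celebrity posts, plus cursor pagination, can be sketched in memory. The function name and page size are illustrative; in production the two inputs would come from Redis and the celebrity post cache:

```python
import heapq

def feed_page(timeline, celeb_posts, cursor=None, page_size=3):
    """Merge the precomputed timeline with celebrity posts fetched at
    read time. post_ids are Snowflake-style (time-ordered), so sorting
    by id descending sorts by recency; the cursor is the last post_id
    of the previous page."""
    merged = heapq.merge(sorted(timeline, reverse=True),
                         sorted(celeb_posts, reverse=True),
                         reverse=True)
    page = []
    for post_id in merged:
        if cursor is not None and post_id >= cursor:
            continue  # skip everything at or above the cursor
        page.append(post_id)
        if len(page) == page_size:
            break
    return page
```

Because the cursor is a value, not an offset, a new post arriving mid-scroll shifts nothing: the next page is still "everything strictly older than the last id I saw."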

Stories Architecture

Stories are ephemeral content that disappears after 24 hours. Architecture differences from the feed:
(1) TTL-based storage — stories are stored with a 24-hour TTL. After expiration they are deleted from the active store (and moved to an archive if the user has "Highlights" enabled).
(2) Stories tray — the horizontal list of story circles at the top of the feed. This is a separate data structure: for each user the viewer follows, check whether they have active stories (posted within 24 hours). Sort unseen stories first, then by recency. Pre-compute the stories tray per user and cache it; invalidate when a followed user posts a new story or a story expires.
(3) Viewing order — within one user's stories, show them in chronological order (oldest first). Between users, show unseen stories first, then stories from users the viewer engages with most.
(4) View tracking — when a user views a story, record the view (viewer_id, story_id, timestamp). The story creator sees the view count and viewer list. This generates massive write volume (a celebrity story with 10M views means 10M write operations). Batch the writes and use a counter service (Redis INCR for the real-time count, Kafka plus batch writes for the view list).
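The batched view-tracking idea can be sketched with in-memory stand-ins for Redis, Kafka, and the durable store (class and field names are illustrative):

```python
from collections import defaultdict

class ViewTracker:
    """Sketch of batched story-view tracking: bump a real-time counter
    per story immediately (Redis INCR in production), buffer the
    (viewer_id, story_id, ts) rows, and flush them to durable storage
    in batches to absorb celebrity-scale write spikes."""

    def __init__(self, flush_size=1000):
        self.counts = defaultdict(int)   # stands in for Redis counters
        self.buffer = []                 # stands in for a Kafka topic
        self.flushed = []                # stands in for the durable view list
        self.flush_size = flush_size

    def record_view(self, viewer_id, story_id, ts):
        self.counts[story_id] += 1                     # real-time count
        self.buffer.append((viewer_id, story_id, ts))  # queued for batch write
        if len(self.buffer) >= self.flush_size:
            self.flush()

    def flush(self):
        self.flushed.extend(self.buffer)  # one batch write instead of N writes
        self.buffer.clear()
```

The counter is always current for the "N views" badge, while the viewer list tolerates a short delay: the trade that makes 10M views per story affordable.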

Explore Page and Content Discovery

The Explore page shows personalized content from accounts the user does not follow. This is a recommendation system. Architecture:
(1) Candidate generation — generate a pool of thousands of candidate posts from: posts liked by users similar to the viewer (collaborative filtering), posts popular in the viewer's geographic region, and posts with high engagement rates in topics the viewer has interacted with. Use an embedding model to represent users and posts in the same vector space, then retrieve posts whose embeddings are close to the user embedding (approximate nearest neighbor search using FAISS or Pinecone).
(2) Ranking — an ML model scores each candidate for the specific user. Features: post engagement rate, author-viewer affinity, content type preference, and recency. The model predicts the probability of engagement (like, comment, save, share).
(3) Filtering — remove posts from blocked users, posts violating community guidelines (content moderation), and posts the user has already seen.
(4) Diversification — ensure the Explore page shows varied content (not all food photos, even if the user likes food) by injecting posts from different categories.
The Explore page is computationally expensive (ML inference per user), so pre-compute candidate pools and cache rankings with a 15-30 minute refresh cycle.
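The embedding-retrieval step in candidate generation can be illustrated with an exact cosine-similarity scan; a real system swaps this linear scan for an ANN index such as FAISS once the corpus grows. All names here are illustrative:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def candidates(user_vec, post_vecs, k=2):
    """Return the k post ids whose embeddings are closest (by cosine
    similarity) to the user embedding -- exact nearest neighbors."""
    ranked = sorted(post_vecs,
                    key=lambda pid: cosine(user_vec, post_vecs[pid]),
                    reverse=True)
    return ranked[:k]
```

The retrieved ids then flow into the ranking model; retrieval only needs to be roughly right, which is why approximate indexes are acceptable here.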

CDN and Image Serving

Images are served via CDN (CloudFront, Fastly, or Akamai). When a client requests an image, the CDN edge server checks its cache. On a cache hit (90%+ of requests for popular content), it returns immediately from the edge with sub-10ms latency. On a cache miss, the CDN fetches from the S3 origin, caches the image, and returns it to the client.
Image URL format: cdn.instagram.com/images/{user_id}/{post_id}/640.webp. The URL encodes the size, allowing the client to request the appropriate size for the device (150px thumbnail for the grid, 640px for the feed, 1080px for full screen).
Format negotiation: the CDN or a Cloudflare Worker checks the Accept header. If the browser supports AVIF, serve AVIF (50% smaller than JPEG). If WebP, serve WebP (25% smaller). Otherwise, serve JPEG. A Vary: Accept header ensures correct per-format caching.
Bandwidth savings: Instagram serves approximately 1 billion images per hour. WebP/AVIF saves 25-50% of bandwidth compared to JPEG, saving petabytes of transfer per day and reducing page load times for users on slow connections.
Cache invalidation: images are immutable (a new post gets a new URL). For deleted posts, the origin object is removed, and the CDN serves 404 after its cache TTL expires.
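The format-negotiation logic at the edge reduces to a preference check on the Accept header. A minimal sketch (the function name is made up, and real negotiation may also weigh q-values):

```python
def pick_format(accept_header):
    """Serve the smallest image format the client advertises support
    for, falling back to universally supported JPEG."""
    accept = accept_header.lower()
    if "image/avif" in accept:
        return "avif"   # roughly 50% smaller than JPEG
    if "image/webp" in accept:
        return "webp"   # roughly 25% smaller than JPEG
    return "jpeg"       # universal fallback
```

Because the response body now depends on the Accept header, the edge must cache per format (Vary: Accept), otherwise an AVIF response cached for one client could be served to a browser that cannot decode it.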

Social Graph and Interactions

The social graph stores follow relationships. Query patterns: "who does user A follow?" (following list), "who follows user A?" (follower list), and "does user A follow user B?" (relationship check). Storage: a wide-column store (Cassandra) or a graph data layer (TAO at Meta). Partition by both follower_id and followee_id to serve bi-directional queries. Cache hot relationships in Redis.
Likes: each post has a like count and a set of users who liked it. Like count: Redis INCR for real-time updates, periodically flushed to the database. Like check ("did I like this post?"): a Redis SET per post containing the user_ids who liked it. For posts with millions of likes, use a Bloom filter for the "did I like it?" check and store the full list in Cassandra.
Comments: stored per post in a database, paginated by timestamp. Cache the first N comments per post (those displayed in the feed).
Notifications: likes, comments, follows, and mentions generate notifications. The notification service consumes events from Kafka and delivers them via push notification (APNs/FCM) and the in-app notification feed.
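A minimal Bloom filter for the "did I like it?" check might look like this. The bit-array size and hash count are illustrative; a production filter would be sized for the expected number of likers and target false-positive rate:

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter: no false negatives, small tunable
    false-positive rate, far cheaper than keeping millions of user_ids
    per post in hot cache."""

    def __init__(self, size_bits=1 << 16, hashes=3):
        self.size = size_bits
        self.hashes = hashes
        self.bits = bytearray(size_bits // 8)

    def _positions(self, item):
        # Derive k independent bit positions by salting the hash input.
        for i in range(self.hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item):
        return all(self.bits[p // 8] & (1 << (p % 8))
                   for p in self._positions(item))
```

A negative answer is definitive (the heart stays unfilled with no further lookup); a positive answer is only probable, so it can be confirmed against the full list in Cassandra when precision matters.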
