Q: How do you handle URL expiry and code reuse?

Store an expires_at timestamp on each URL. On redirect, check expiry inline (Redis cached value includes expiry metadata). A background daily job soft-deletes expired URLs (is_active=false) and evicts them from Redis. Expired short codes enter a cooldown period (default 30 days) before being eligible for reuse, preventing users with cached links from being redirected to the wrong destination after reuse.

Q: How do you scale a URL shortener to handle 4000 redirects per second?

Read path scaling: CDN caches popular short codes at edge nodes globally. Redis cluster caches all active codes (hot set). Database is only hit on cold cache misses (rare for popular codes). Write path: URL creation at 40/sec is trivial for any relational database. Analytics ingestion via Kafka decouples write amplification from the hot path. Horizontal scaling: stateless redirect service behind a load balancer, Redis cluster with read replicas, sharded Kafka topics by short_code prefix.

Question 1

How do you generate short codes for a URL shortener at scale?

Accepted Answer

The recommended approach is base62 encoding of a distributed auto-increment ID (e.g., Snowflake ID). Base62 uses characters a-z, A-Z, 0-9. Seven characters give 62^7 = 3.5 trillion unique codes. The ID is generated by a distributed ID service, then base62-encoded. This avoids collision checks and produces non-sequential, non-guessable codes. For simplest implementation, use a database sequence with base62 encoding.

Question 2

Why is the redirect path the most critical optimization in a URL shortener?

Accepted Answer

URL shorteners are extremely read-heavy: typically 100:1 to 1000:1 reads (redirects) to writes (URL creation). Every redirect adds latency to the user's navigation experience, so P99 under 10ms is the target. The optimization stack: CDN edge caching (2ms, no origin hit for cached codes), Redis in-memory cache (0.5ms), then database as last resort. Click tracking is async via Kafka to avoid adding latency to the redirect response.

Question 3

How do you track click analytics without slowing down redirects?

Accepted Answer

Use a fire-and-forget async pattern: on redirect, publish a click event to Kafka (non-blocking, < 1ms). A separate stream processing pipeline (Flink or Spark Streaming) consumes events to: parse user-agent for device/OS/browser, geolocate IP to country/city, filter bots, and write to the clicks table and pre-aggregated counters. The redirect endpoint returns immediately; analytics lag by seconds, which is acceptable for dashboards.

Question 4

How do you handle URL expiry and code reuse?

Accepted Answer

Store an expires_at timestamp on each URL. On redirect, check expiry inline (Redis cached value includes expiry metadata). A background daily job soft-deletes expired URLs (is_active=false) and evicts them from Redis. Expired short codes enter a cooldown period (default 30 days) before being eligible for reuse, preventing users with cached links from being redirected to the wrong destination after reuse.

Question 5

How do you scale a URL shortener to handle 4000 redirects per second?

Accepted Answer

Read path scaling: CDN caches popular short codes at edge nodes globally. Redis cluster caches all active codes (hot set). Database is only hit on cold cache misses (rare for popular codes). Write path: URL creation at 40/sec is trivial for any relational database. Analytics ingestion via Kafka decouples write amplification from the hot path. Horizontal scaling: stateless redirect service behind a load balancer, Redis cluster with read replicas, sharded Kafka topics by short_code prefix.

System Design: URL Shortener and Click Analytics Platform (2025)

Requirements and Scale

URL Shortening and ID Generation

Data Model

Redirect Architecture – The Hot Path

Click Analytics Pipeline

Custom Aliases and Expiry