Introduction
Live streaming requires low-latency ingest, real-time transcoding to multiple bitrates, and global CDN distribution to millions of concurrent viewers. The system must handle unpredictable stream starts, bursty viewer spikes, and sub-second segment availability while remaining cost-efficient for the long tail of low-viewer streams.
Ingest Pipeline
Streamers push an RTMP stream to the ingest edge server geographically closest to them. The ingest server validates the stream key and maps it to a channel_id. It then splits the incoming stream into 2-second GOPs (Groups of Pictures), where each GOP begins with a keyframe to allow independent decoding. Completed GOPs are published to a stream broker — either an internal Kafka cluster or a media server cluster such as SRS — for downstream consumption by transcoding workers.
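The GOP-cutting step above can be sketched as follows. This is a minimal illustration, assuming decoded frames arrive as simple dicts with a timestamp and keyframe flag; `publish` stands in for the Kafka/SRS producer, and the 2-second target comes from the design above.

```python
# Sketch of GOP splitting on the ingest server: cut the frame stream
# into groups that start on a keyframe and span roughly GOP_SECONDS,
# handing each completed GOP to publish() (a broker stand-in here).
GOP_SECONDS = 2.0

def split_gops(frames, publish):
    current, gop_start = [], None
    for frame in frames:
        if frame["keyframe"] and current and frame["ts"] - gop_start >= GOP_SECONDS:
            publish(current)          # completed GOP goes to the broker
            current, gop_start = [], None
        if gop_start is None:
            gop_start = frame["ts"]
        current.append(frame)
    if current:
        publish(current)              # flush the tail when the stream ends
```

Because each GOP opens with a keyframe, a transcoding worker can pick up any GOP from the broker and decode it independently, with no state from earlier GOPs.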
Transcoding
A dedicated transcoding cluster picks up each GOP and re-encodes it to multiple renditions: 1080p at 6 Mbps, 720p at 3 Mbps, 480p at 1.5 Mbps, and 360p at 0.8 Mbps. Encoding is GPU-accelerated using NVENC (NVIDIA) or AMD VCE to minimize latency and cost per stream. Output segments are written to S3 and the HLS manifest (.m3u8) is updated every 2 seconds. End-to-end latency from capture to viewer is 10–30 seconds with standard HLS, or 2–4 seconds with Low-Latency HLS or a WebRTC relay path.
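The rendition ladder above can be expressed as data that drives one encoder invocation per rung. The sketch below builds illustrative ffmpeg command lines; the exact flags (the `h264_nvenc` encoder, HLS muxer options, keyframe expression) are assumptions, and a production pipeline would additionally tune rate control, GOP alignment across renditions, and audio.

```python
# Sketch: one ffmpeg command per rendition in the ladder described above.
LADDER = [  # (name, width, height, bitrate)
    ("1080p", 1920, 1080, "6M"),
    ("720p", 1280, 720, "3M"),
    ("480p", 854, 480, "1.5M"),
    ("360p", 640, 360, "0.8M"),
]

def ffmpeg_cmd(input_url, out_dir, name, w, h, bitrate):
    return [
        "ffmpeg", "-i", input_url,
        "-c:v", "h264_nvenc",                            # GPU encode via NVENC
        "-b:v", bitrate,
        "-vf", f"scale={w}:{h}",
        "-force_key_frames", "expr:gte(t,n_forced*2)",   # 2 s keyframe cadence
        "-f", "hls", "-hls_time", "2",
        f"{out_dir}/{name}/index.m3u8",
    ]

cmds = [ffmpeg_cmd("rtmp://ingest/live/ch1", "/tmp/out", *r) for r in LADDER]
```

Keeping the keyframe cadence identical across renditions is what makes segment boundaries line up, which ABR switching (below) depends on.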
HLS Adaptive Bitrate
The viewer’s player requests the master .m3u8, which lists all available renditions. The player selects a rendition based on measured available bandwidth, then requests 2-second .ts media segments sequentially. If bandwidth degrades, the player switches to a lower-bitrate rendition automatically without interrupting playback. Media segments are served from CDN edge nodes; the CDN pulls each segment from the S3 origin on first request. Because a completed segment is immutable and never changes, it can be cached with a long TTL; only the frequently updated media playlist needs a short one.
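The client-side selection logic can be sketched as picking the highest-bitrate rendition that fits under measured throughput times a safety factor. The ladder mirrors the one above; the 0.8 factor is an illustrative choice, not a spec value.

```python
# Sketch of ABR rendition selection, sorted high to low bitrate.
RENDITIONS = [("1080p", 6_000_000), ("720p", 3_000_000),
              ("480p", 1_500_000), ("360p", 800_000)]

def pick_rendition(measured_bps, safety=0.8):
    budget = measured_bps * safety        # headroom against bandwidth jitter
    for name, bitrate in RENDITIONS:
        if bitrate <= budget:
            return name
    return RENDITIONS[-1][0]              # floor at the lowest rung
```

Real players also factor in buffer occupancy and recent throughput variance, but the bitrate-budget comparison is the core of the decision.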
Chat and Interactions
Live chat messages are delivered via WebSocket connections to chat servers. Each channel has a logical chat room. When a user sends a message, it is published to a Redis pub/sub channel for that room. All chat servers subscribed to that channel receive the message and fan it out to their connected viewers. Per-user rate limiting (e.g., 2 messages per second) is enforced at the chat server using a sliding window counter in Redis. Incoming messages pass through an ML-based toxicity filter before being broadcast. Emotes and cheers are handled as distinct structured event types alongside plain text messages.
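The per-user sliding-window limiter can be sketched with an in-memory deque standing in for the Redis counter mentioned above (the real implementation would use Redis so all chat servers share state). The 2-messages-per-second limit is taken from the text.

```python
# Sketch of the sliding-window rate limiter (in-memory Redis stand-in).
import time
from collections import defaultdict, deque

class SlidingWindowLimiter:
    def __init__(self, limit=2, window=1.0):
        self.limit, self.window = limit, window
        self.sent = defaultdict(deque)    # user_id -> recent send timestamps

    def allow(self, user_id, now=None):
        now = time.monotonic() if now is None else now
        q = self.sent[user_id]
        while q and now - q[0] >= self.window:
            q.popleft()                   # evict events outside the window
        if len(q) < self.limit:
            q.append(now)
            return True
        return False
```

Unlike a fixed-window counter, the sliding window prevents a user from sending a double burst straddling a window boundary.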
Viewer Count Estimation
Exact counting of concurrent viewers is expensive at scale due to the cardinality of active sessions. Instead, use Redis HyperLogLog: viewer players send a heartbeat every 30 seconds, and each heartbeat calls PFADD channel:{id}:viewers {viewer_id}. PFCOUNT returns the estimated unique viewer count with 0.81% standard error. Stale viewers (no heartbeat in the last 60 seconds) are handled by rolling HyperLogLog keys with a time bucket (e.g., per-minute keys merged for the display count).
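The rolling-bucket scheme can be sketched as below. Python sets stand in for Redis HyperLogLogs here (`PFADD` ≈ `set.add`, `PFMERGE` + `PFCOUNT` ≈ union size); with real Redis each key would cost at most 12 KB regardless of audience size.

```python
# Sketch of per-minute viewer buckets merged for the display count.
from collections import defaultdict

buckets = defaultdict(set)                # key "channel:{id}:viewers:{minute}"

def heartbeat(channel_id, viewer_id, ts):
    minute = int(ts // 60)
    buckets[f"channel:{channel_id}:viewers:{minute}"].add(viewer_id)

def viewer_count(channel_id, ts, lookback_minutes=2):
    # Merge the current and previous minute, so a viewer stays counted
    # for up to ~60 s after their last heartbeat, then ages out.
    minute = int(ts // 60)
    merged = set()
    for m in range(minute - lookback_minutes + 1, minute + 1):
        merged |= buckets.get(f"channel:{channel_id}:viewers:{m}", set())
    return len(merged)
```

Expiring old bucket keys (e.g. via Redis TTLs) keeps memory bounded without ever needing to delete individual viewers from a counter.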
VOD Replay
After a stream ends, a post-processing job concatenates all stored segments into a single VOD file and generates a complete HLS manifest for replay. The VOD is stored in S3 Standard for the first 30 days, then transitioned to S3 Glacier for cost efficiency. A thumbnail generation job extracts keyframes at 10-second intervals from the VOD to produce a sprite sheet used by the player’s scrubbing preview feature.
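The sprite-sheet layout can be sketched as mapping the VOD duration to thumbnail timestamps at 10-second intervals and grid coordinates on the sheet. The 10-column grid and 160×90 thumbnail size are illustrative assumptions.

```python
# Sketch: timestamps and sprite-sheet coordinates for scrub previews.
def sprite_layout(duration_s, interval=10, cols=10, thumb_w=160, thumb_h=90):
    times = list(range(0, int(duration_s), interval))
    return [
        {"t": t, "x": (i % cols) * thumb_w, "y": (i // cols) * thumb_h}
        for i, t in enumerate(times)
    ]
```

The player maps a hover position on the seek bar to the nearest timestamp, then crops the matching `(x, y)` tile out of the single sprite image, avoiding one HTTP request per thumbnail.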
Frequently Asked Questions: Live Video Streaming Platform
What are the latency tradeoffs between HLS, DASH, and LL-HLS for live video streaming?
Standard HLS targets 15-30 seconds of end-to-end latency because the player must buffer multiple full segments (typically 6-10 seconds each) before playback. DASH (Dynamic Adaptive Streaming over HTTP) achieves similar latency but is codec-agnostic and offers more flexible segment templates. Low-Latency HLS (LL-HLS, specified in the second edition of the HLS spec, draft-pantos-hls-rfc8216bis) reduces latency to 2-4 seconds by splitting segments into partial segments of a few hundred milliseconds and delivering them via blocking playlist requests and preload hints (early drafts relied on HTTP/2 push, which was later dropped). The tradeoff is increased origin load and more complex CDN configuration; LL-HLS also requires all cache layers to support partial-segment caching to avoid defeating the low-latency guarantee.
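For concreteness, an LL-HLS media playlist advertises partial segments alongside full ones. The fragment below is illustrative (segment names and part durations are invented); the `EXT-X-PART` and `EXT-X-PRELOAD-HINT` tags are the mechanism that lets a player fetch media before the full 2-second segment is finalized.

```
#EXTM3U
#EXT-X-VERSION:9
#EXT-X-TARGETDURATION:2
#EXT-X-PART-INF:PART-TARGET=0.2
#EXT-X-MEDIA-SEQUENCE:100
#EXTINF:2.0,
seg100.mp4
#EXT-X-PART:DURATION=0.2,URI="seg101.part0.mp4"
#EXT-X-PART:DURATION=0.2,URI="seg101.part1.mp4"
#EXT-X-PRELOAD-HINT:TYPE=PART,URI="seg101.part2.mp4"
```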
How does GPU acceleration improve live video transcoding throughput?
Software transcoding with x264 or x265 on CPU cores is compute-bound and typically yields 1-4x real-time speed per core for 1080p. GPU-based encoders such as NVIDIA NVENC or AMD AMF offload the motion estimation and entropy coding stages to dedicated silicon, achieving 8-30x real-time speed at a fraction of the CPU cost. This matters for live streams where the encode must complete faster than real time — a single NVENC instance can handle multiple concurrent 1080p streams that would otherwise require an entire CPU server. The quality-per-bit is slightly lower than a slow CPU encode, but for live streaming the latency constraint makes the quality tradeoff acceptable.
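A back-of-envelope version of the throughput comparison above, using the quoted speed ranges; the specific per-device numbers are illustrative assumptions, not benchmarks.

```python
# Capacity sketch: an encoder running at Nx real time sustains ~N
# concurrent live 1080p streams.
def concurrent_1080p_streams(realtime_speed_x, devices=1):
    return int(realtime_speed_x * devices)

cpu_core = concurrent_1080p_streams(2)    # ~2x real time per x264 core
nvenc    = concurrent_1080p_streams(15)   # mid-range of the 8-30x figure
```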
How do you implement WebSocket chat with Redis pub/sub fan-out at scale?
Each chat server maintains long-lived WebSocket connections to its local clients and subscribes to a Redis channel per stream (e.g. chat:stream:{stream_id}). When a viewer sends a message, their chat server publishes it to Redis. Redis broadcasts the message to all subscribed chat servers, each of which fans it out to their local WebSocket connections. This decouples horizontal scaling of the WebSocket tier from the fan-out logic. At very high subscriber counts (millions of concurrent viewers), replace the per-stream Redis channel with a tiered fan-out: a small number of relay servers subscribe to Redis and maintain WebSocket pools to downstream edge servers, which hold the actual viewer connections.
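The fan-out path can be sketched with an in-memory pub/sub object standing in for Redis; each `ChatServer` plays the role of one node in the WebSocket tier, and a plain list stands in for a client connection.

```python
# Sketch of chat fan-out: broker -> subscribed servers -> local clients.
from collections import defaultdict

class Broker:                             # stand-in for Redis pub/sub
    def __init__(self):
        self.subs = defaultdict(list)     # channel -> subscriber callbacks
    def subscribe(self, channel, cb):
        self.subs[channel].append(cb)
    def publish(self, channel, msg):
        for cb in self.subs[channel]:
            cb(msg)

class ChatServer:
    def __init__(self, broker, stream_id):
        self.clients = []                 # local "WebSocket" connections
        broker.subscribe(f"chat:stream:{stream_id}", self.fan_out)
    def fan_out(self, msg):
        for client in self.clients:
            client.append(msg)            # deliver to each local viewer
```

Note that the broker only does one delivery per subscribed server, not per viewer; the expensive last-hop fan-out is spread across the horizontally scaled WebSocket tier.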
Why use HyperLogLog for live viewer count estimation instead of exact counting?
Exact counting with a Redis Set requires O(N) memory proportional to the number of distinct viewer IDs, which becomes gigabytes for popular streams. HyperLogLog (HLL) provides a cardinality estimate with a standard error of 0.81% using at most 12 KB of memory per key, regardless of cardinality. Redis exposes PFADD to record a viewer session and PFCOUNT to read the estimate. Multiple HLL keys can be merged with PFMERGE for aggregated counts across regions. For a live viewer ticker that updates every few seconds, the sub-1% error is imperceptible to users and the fixed memory cost makes it operationally safe to maintain one HLL per active stream.
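The memory claim above is easy to verify with arithmetic. The sketch below compares the floor for an exact set of 64-bit viewer IDs against the fixed 12 KB HLL bound, for a hypothetical 5 million concurrent viewers; real Redis Sets carry additional per-entry overhead, so the true gap is larger.

```python
# Worked memory comparison: exact Set vs HyperLogLog.
viewers = 5_000_000
set_bytes = viewers * 8                   # 8 bytes per raw 64-bit ID, minimum
hll_bytes = 12 * 1024                     # Redis dense HLL upper bound
ratio = set_bytes / hll_bytes             # how many times smaller the HLL is
```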
What is the optimal CDN segment caching strategy for live HLS streams?
Set a short TTL (equal to segment duration, e.g. 6 seconds) on the live playlist manifest (*.m3u8) so players always fetch a fresh segment list, but set a long or infinite TTL on completed media segments (*.ts or *.mp4 fMP4 chunks) because segment content is immutable once written. Use surrogate keys or cache tags to allow instant purge of the manifest without touching segment objects. For LL-HLS, configure the CDN to hold (block) playlist and partial-segment requests until the object is available rather than returning stale data or errors. Place CDN PoPs close to viewer clusters and configure an origin shield to collapse the thundering herd of manifest requests from millions of simultaneous players into a single upstream fetch per PoP.
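The TTL split above might look like this at an nginx-based edge; the directive values are illustrative assumptions to tune per deployment, and a commercial CDN would express the same rules through its cache-policy configuration.

```nginx
# Illustrative edge-cache rules: short TTL for playlists, long for segments.
location ~ \.m3u8$ {
    proxy_pass http://origin;
    proxy_cache live;
    proxy_cache_valid 200 6s;        # manifest: one segment duration
    proxy_cache_lock on;             # collapse concurrent misses upstream
}
location ~ \.(ts|mp4|m4s)$ {
    proxy_pass http://origin;
    proxy_cache live;
    proxy_cache_valid 200 365d;      # completed segments are immutable
}
```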