Question 1

Why must video processing be asynchronous and queue-based rather than synchronous?

Accepted Answer

Transcoding a 1-hour 4K video to multiple HLS variants takes 15-45 minutes of CPU time. A synchronous HTTP request would time out long before that (typical limit: 30-60 seconds). The upload endpoint must instead return immediately with a job_id (202 Accepted), and the client polls or registers a webhook to learn the result. The job queue also decouples the upload rate from the processing rate: if 100 videos are uploaded simultaneously and you have 10 worker instances, the queue absorbs the spike. Workers process 10 videos at a time while the other 90 wait. Without a queue, a traffic spike would either crash the workers (OOM from 100 concurrent FFmpeg processes) or require over-provisioning for peak load.
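The queueing behaviour described above can be sketched with an in-memory queue and a fixed pool of worker threads. This is only an illustration: TranscodeQueue, submit, and the status values are hypothetical names, and a production system would use a durable queue and a database rather than process memory.

```python
import queue
import threading
import uuid


class TranscodeQueue:
    """In-memory stand-in for a durable job queue plus a fixed worker pool."""

    def __init__(self, num_workers, transcode):
        self._jobs = queue.Queue()
        self._transcode = transcode      # callable that does the real work
        self._lock = threading.Lock()
        self.status = {}                 # job_id -> QUEUED | RUNNING | DONE | FAILED
        for _ in range(num_workers):
            threading.Thread(target=self._worker, daemon=True).start()

    def submit(self, video_path):
        """What the upload endpoint does before answering 202 Accepted."""
        job_id = str(uuid.uuid4())
        self._set(job_id, "QUEUED")
        self._jobs.put((job_id, video_path))
        return job_id                    # returned immediately; no transcoding yet

    def _worker(self):
        while True:
            job_id, video_path = self._jobs.get()   # blocks until a job is available
            self._set(job_id, "RUNNING")
            try:
                self._transcode(video_path)
                self._set(job_id, "DONE")
            except Exception:
                self._set(job_id, "FAILED")
            finally:
                self._jobs.task_done()

    def _set(self, job_id, state):
        with self._lock:
            self.status[job_id] = state

    def wait(self):
        """Block until every submitted job has been processed."""
        self._jobs.join()
```

With num_workers=10 and 100 submissions, at most 10 transcodes run at once; the remaining jobs simply sit in the queue until a worker is free.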

Question 2

What is HLS adaptive bitrate streaming and why is it the standard format?

Accepted Answer

HLS (HTTP Live Streaming) splits a video into small segments (2-6 seconds each) encoded at multiple bitrates: 1080p/8Mbps, 720p/4Mbps, 480p/2Mbps, 360p/1Mbps. A master playlist (.m3u8) lists all available variants; each variant has its own playlist listing its segments. The player typically starts at a low bitrate, monitors download speed, and automatically switches to a higher bitrate when bandwidth allows. This keeps rebuffering to a minimum: if a user's connection drops from 10Mbps to 2Mbps, the player switches to the 480p variant at the next segment boundary instead of stalling. HLS is supported natively by Safari, iOS, Android, and most smart TVs; other desktop browsers such as Chrome and Firefox play it through Media Source Extensions with a JavaScript player such as hls.js. Alternative: DASH (similar concept, different manifest format). HLS has broader native support, particularly on Apple devices.
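As a rough illustration of the two pieces described above, the sketch below writes a master playlist for the four variants and picks a variant for a measured bandwidth. The Variant class, the per-variant path layout (name/playlist.m3u8), and the headroom parameter are assumptions made for the example; real players use more elaborate heuristics, such as buffer level and a safety margin below the measured throughput.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    name: str
    width: int
    height: int
    bandwidth: int  # bits per second


VARIANTS = [
    Variant("1080p", 1920, 1080, 8_000_000),
    Variant("720p", 1280, 720, 4_000_000),
    Variant("480p", 854, 480, 2_000_000),
    Variant("360p", 640, 360, 1_000_000),
]


def master_playlist(variants):
    """Return master .m3u8 text: one EXT-X-STREAM-INF entry per variant."""
    lines = ["#EXTM3U"]
    for v in variants:
        lines.append(
            f"#EXT-X-STREAM-INF:BANDWIDTH={v.bandwidth},"
            f"RESOLUTION={v.width}x{v.height}"
        )
        lines.append(f"{v.name}/playlist.m3u8")  # hypothetical path layout
    return "\n".join(lines) + "\n"


def pick_variant(variants, measured_bps, headroom=1.0):
    """Pick the highest-bitrate variant that fits within the measured bandwidth.

    headroom < 1.0 leaves a safety margin; 1.0 matches the example in the text.
    Falls back to the lowest variant when nothing fits.
    """
    budget = measured_bps * headroom
    affordable = [v for v in variants if v.bandwidth <= budget]
    if affordable:
        return max(affordable, key=lambda v: v.bandwidth)
    return min(variants, key=lambda v: v.bandwidth)
```

Calling pick_variant(VARIANTS, 2_000_000) selects the 480p entry, which mirrors the 10Mbps-to-2Mbps example in the answer.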

Question 3

How do you generate multiple quality variants in parallel?

Accepted Answer

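A single FFmpeg command can write multiple variants, as long as each output gets its own stream mapping, scaling, and bitrate options: ffmpeg -i input.mp4 -map 0:v -map 0:a -vf scale=-2:720 -c:v libx264 -b:v 4000k -c:a aac output_720p.mp4 -map 0:v -map 0:a -vf scale=-2:360 -c:v libx264 -b:v 1000k -c:a aac output_360p.mp4. Per-output options apply only to the output file that follows them, and -crf should not be combined with -b:v: with libx264 the CRF setting takes over and the target bitrate is ignored. This form decodes the input once while encoding multiple outputs. For very large files, split the job: one worker handles 1080p and 720p (the most CPU-intensive variants, GPU-accelerated if available), another handles 480p and 360p. Hardware acceleration on NVIDIA GPUs (-hwaccel cuda for decoding, -c:v h264_nvenc for encoding) can reduce transcoding time several-fold, by roughly 5-10x depending on the GPU, preset, and source. Distribute variant jobs to a worker pool, track completion per variant in the DB, and publish the master playlist only when all variants are complete.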
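A minimal sketch of assembling a multi-output FFmpeg invocation as an argument list, with per-output mapping, scaling, and bitrate. The function name build_ffmpeg_args and the (height, bitrate, output path) tuple shape are assumptions for illustration; the block only builds the arguments and does not run FFmpeg.

```python
def build_ffmpeg_args(input_path, variants):
    """Build one ffmpeg command that decodes once and encodes every variant.

    variants: iterable of (height, video_bitrate, output_path) tuples,
    for example (720, "4000k", "output_720p.mp4").
    """
    args = ["ffmpeg", "-y", "-i", input_path]
    for height, video_bitrate, output_path in variants:
        # Per-output options must be repeated before each output file.
        args += [
            "-map", "0:v:0",              # first video stream of the input
            "-map", "0:a:0",              # first audio stream of the input
            "-vf", f"scale=-2:{height}",  # keep aspect ratio, force even width
            "-c:v", "libx264",
            "-b:v", video_bitrate,
            "-c:a", "aac",
            output_path,
        ]
    return args
```

The resulting list can be passed to subprocess.run(args, check=True) on a machine where ffmpeg is installed; passing a list rather than a shell string avoids quoting problems with file names.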

Question 4

How do you track transcoding progress and notify the client?

Accepted Answer

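FFmpeg writes a status line to stderr, for example: frame= 1234 fps= 24 time=00:01:23.45 bitrate=4000.0kbits/s. That line is refreshed with carriage returns rather than newlines, so reading stderr line by line is unreliable; it is easier to run FFmpeg with -progress pipe:1, which emits newline-terminated key=value pairs (out_time, speed, progress=continue or progress=end) on stdout. Convert the elapsed output time into a percentage by dividing by the source duration (obtained up front, e.g. with ffprobe), and update the Job table whenever the percentage changes rather than on every line: UPDATE Job SET progress=45 WHERE job_id=:id. For client notification: (1) Polling: the client polls GET /jobs/{id} every 5 seconds and checks the progress field. Simple, works for any client. (2) WebSocket: the client subscribes to job updates and the server pushes progress events as they arrive. Better UX for upload flows. (3) Webhook: on completion, POST to a pre-registered callback URL. Best for server-to-server integrations. Implement polling first; add WebSocket if the UX requires real-time progress bars.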
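One way to turn FFmpeg output into a percentage is sketched below. It assumes FFmpeg is started with -progress pipe:1, so stdout carries key=value lines such as out_time=00:01:23.000000 and progress=end, and that the source duration is already known. The function names are hypothetical, and the database update is left to the caller.

```python
def parse_out_time(value):
    """Convert 'HH:MM:SS.ffffff' to seconds; return None for 'N/A' or junk."""
    try:
        hours, minutes, seconds = value.split(":")
        return int(hours) * 3600 + int(minutes) * 60 + float(seconds)
    except ValueError:
        return None


def progress_events(lines, duration_seconds):
    """Yield an integer percentage each time it increases.

    lines: iterable of text lines from ffmpeg's -progress output.
    duration_seconds: total duration of the source video.
    """
    last = -1
    for line in lines:
        key, _, value = line.strip().partition("=")
        if key == "out_time":
            elapsed = parse_out_time(value)
            if elapsed is None:
                continue
            # Cap at 99 so 100 is only reported once ffmpeg says it is done.
            percent = min(99, int(100 * elapsed / duration_seconds))
        elif key == "progress" and value == "end":
            percent = 100
        else:
            continue
        if percent > last:
            last = percent
            yield percent  # caller would run the UPDATE Job ... statement here
```

Because the generator only yields when the percentage changes, a job produces at most about a hundred database writes regardless of how often FFmpeg reports.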

Question 5

How do you handle transcoding failures and partial outputs?

Accepted Answer

Mark the job as FAILED with an error_message built from FFmpeg's exit code and the tail of its stderr. Implement retry logic with a limit (max 3 attempts) and exponential backoff. Before retrying, check that the source file is still in S3 (it may have been garbage-collected in the meantime). For partial outputs (the transcoder crashed mid-job): detect them by checking whether all expected variant playlists exist in S3 and every segment they reference is present. Clean up partial S3 objects before retrying to avoid serving corrupt HLS playlists. Add a dead-letter queue: after 3 failed attempts, move the job to a DLQ for manual inspection. Alert on DLQ depth > 0, because transcoding failures are often caused by corrupt source files that need human review.
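The retry policy above might look roughly like the following sketch. The transcode and cleanup callables, the dead_letters list, and the 30-second base delay are all assumptions for illustration; in practice the delay would usually be implemented by re-enqueueing the job with a visibility timeout rather than sleeping in the worker.

```python
import time


def backoff_delay(attempt, base_seconds=30.0, cap_seconds=600.0):
    """Exponential backoff: 30s, 60s, 120s, ... capped at cap_seconds."""
    return min(cap_seconds, base_seconds * 2 ** (attempt - 1))


def process_with_retries(job_id, transcode, cleanup, dead_letters,
                         max_attempts=3, sleep=time.sleep):
    """Run transcode(job_id) up to max_attempts times.

    After each failure, cleanup(job_id) removes partial outputs so a corrupt
    playlist is never served. When all attempts fail, the job and its last
    error are appended to dead_letters for manual inspection.
    """
    error = ""
    for attempt in range(1, max_attempts + 1):
        try:
            transcode(job_id)
            return "COMPLETED"
        except Exception as exc:
            error = str(exc)
            cleanup(job_id)
            if attempt < max_attempts:
                sleep(backoff_delay(attempt))
    dead_letters.append((job_id, error))
    return "FAILED"
```

Injecting sleep as a parameter keeps the function testable without real delays, and an alert on len(dead_letters) > 0 corresponds to the DLQ depth check described above.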

Video Transcoding Pipeline Low-Level Design

End-to-End Pipeline

Upload and Job Creation

Transcoding Worker (FFmpeg)

Master HLS Playlist

Adaptive Bitrate Playback

Key Interview Points