Designing a chat application like WhatsApp or Facebook Messenger is a classic system design question that tests real-time communication, message delivery guarantees, presence tracking, and encryption. WhatsApp handles over 100 billion messages per day across 2 billion users. This guide covers the architecture for a production-scale chat system — from message send to message delivery — with the depth expected at senior engineering interviews.
High-Level Architecture
Core services: (1) Connection Service — maintains persistent WebSocket connections with all online users. When a user opens the app, it establishes a WebSocket connection to the nearest connection server. (2) Message Service — handles message creation, validation, storage, and routing. (3) Presence Service — tracks which users are online and their last-seen time. (4) Group Service — manages group membership, group messages, and fanout. (5) Notification Service — sends push notifications to offline users. (6) Media Service — handles image, video, and document upload, storage, and delivery. Message flow for 1-to-1 chat: User A sends a message -> WebSocket to connection server -> message service validates and stores it -> checks whether User B is online -> if online, routes it to B's connection server -> delivers via WebSocket -> B receives the message -> sends a delivery acknowledgment. If B is offline: store the message for later delivery and send a push notification.
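The 1-to-1 flow above can be sketched as a few lines of Python. This is a minimal in-memory model — the dicts stand in for the real message store, Redis mapping, and push gateway, and all names here are illustrative, not from any specific codebase.

```python
# In-memory sketch of the 1-to-1 send path: store, then deliver or queue.
import itertools

message_store = {}   # message_id -> message record
online_users = {}    # user_id -> connection_server_id (stands in for Redis)
pending = {}         # user_id -> list of undelivered messages
_ids = itertools.count(1)

def send_message(sender_id, recipient_id, content):
    msg = {"message_id": next(_ids), "sender": sender_id,
           "recipient": recipient_id, "content": content, "status": "sent"}
    message_store[msg["message_id"]] = msg       # store before delivery
    server = online_users.get(recipient_id)
    if server is not None:
        deliver_via_websocket(server, msg)       # recipient is online
    else:
        pending.setdefault(recipient_id, []).append(msg)
        send_push_notification(recipient_id)     # offline fallback
    return msg["message_id"]

def deliver_via_websocket(server, msg):
    # In a real system, "delivered" is set only after the client ACKs.
    msg["status"] = "delivered"

def send_push_notification(user_id):
    pass  # would call APNs/FCM in a real system
```

Note the ordering: the message is persisted before any delivery attempt, so a crash between store and delivery loses nothing — the message is still in the pending path when the recipient reconnects.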
WebSocket Connection Management
WebSocket provides full-duplex communication between client and server. Each user maintains one persistent WebSocket connection. With 500 million concurrent users, the connection layer must handle 500M persistent connections. Each connection server handles 50,000-500,000 concurrent WebSocket connections (depending on memory — each connection uses 10-50KB), so 500M users require roughly 1,000-10,000 connection servers. Connection routing: when User A sends a message to User B, the message service needs to know which connection server holds B's WebSocket. Use a distributed mapping: a Redis hash mapping user_id -> connection_server_id. When a user connects, register the mapping; on disconnect, remove it. Connection heartbeats: clients send a ping every 30 seconds. If no ping arrives for 60 seconds, the connection is considered dead and cleaned up. Mobile optimization: on mobile, the OS may kill background WebSocket connections to save battery. Use push notifications (APNs/FCM) as a fallback for offline delivery. When the user opens the app, re-establish the WebSocket and pull any pending messages.
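The registry-plus-heartbeat logic can be sketched as follows. A plain dict stands in for the Redis mapping, and the `now` parameters let the expiry logic be exercised deterministically; the class and method names are illustrative.

```python
# Sketch of the connection registry with heartbeat-based expiry.
import time

HEARTBEAT_TIMEOUT = 60  # seconds without a ping before a connection is reaped

class ConnectionRegistry:
    def __init__(self):
        self._conns = {}  # user_id -> (connection_server_id, last_ping_time)

    def register(self, user_id, server_id, now=None):
        now = now if now is not None else time.monotonic()
        self._conns[user_id] = (server_id, now)

    def heartbeat(self, user_id, now=None):
        now = now if now is not None else time.monotonic()
        if user_id in self._conns:
            server_id, _ = self._conns[user_id]
            self._conns[user_id] = (server_id, now)

    def lookup(self, user_id, now=None):
        """Return the server holding this user's WebSocket, or None."""
        now = now if now is not None else time.monotonic()
        entry = self._conns.get(user_id)
        if entry is None:
            return None
        server_id, last_ping = entry
        if now - last_ping > HEARTBEAT_TIMEOUT:
            del self._conns[user_id]  # stale connection: lazy cleanup
            return None
        return server_id

    def disconnect(self, user_id):
        self._conns.pop(user_id, None)
```

In Redis this pattern is typically expressed with a key TTL refreshed on each ping, so expiry happens server-side rather than via the lazy check shown here.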
Message Storage and Delivery Guarantees
Message storage: each message is stored in a database with message_id (Snowflake-style time-sorted ID), conversation_id, sender_id, content (encrypted), created_at, and delivery_status (sent, delivered, read). Database choice: Cassandra or HBase for high write throughput with time-series access patterns. Partition key: conversation_id. Clustering key: message_id (time-sorted). This enables efficient retrieval of a conversation's messages in chronological order. Delivery guarantee: at-least-once. The server sends the message to the recipient and waits for a delivery ACK. If no ACK arrives within 5 seconds, it retries. The client deduplicates by message_id (if it receives the same message_id twice, it ignores the duplicate). Message statuses: sent (server received from sender — single checkmark), delivered (recipient device received — double checkmark), read (recipient opened the conversation — blue checkmarks). Delivery and read status changes are reported by the recipient's client to the server, which relays them to the sender. Offline message delivery: messages for offline users are stored in a per-user pending-messages queue. When the user comes online, the connection server pulls the pending messages and delivers them in order.
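The client side of the at-least-once guarantee is the deduplication step: because the server retries when an ACK is lost, the client must tolerate receiving the same message twice. A minimal sketch (class and field names are illustrative):

```python
# Client-side dedup for at-least-once delivery: apply each message_id once,
# but always ACK, so a retrying server eventually stops.
class ChatClient:
    def __init__(self):
        self.seen_ids = set()
        self.conversation = []  # messages in display order

    def on_message(self, msg):
        """Handle a message pushed over the WebSocket; return an ACK."""
        if msg["message_id"] in self.seen_ids:
            # Duplicate from a server retry: re-ACK but do not re-apply.
            return {"type": "ack", "message_id": msg["message_id"]}
        self.seen_ids.add(msg["message_id"])
        self.conversation.append(msg)
        return {"type": "ack", "message_id": msg["message_id"]}
```

The important detail is that duplicates are still ACKed: if the client ignored them silently, the server (whose retry was triggered by a lost ACK, not a lost message) would retry forever.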
Group Chat Architecture
Group messages require fanout: one message from the sender must be delivered to all group members. Small groups (up to 256 members, like WhatsApp): the sender sends the message once to the server. The server fans out by sending the message to each group member individually; with a 100-member group, this creates 99 deliveries. The fanout happens on the server side — the sender does not need to send 99 copies. For each member, the server checks whether they are online (deliver via WebSocket) or offline (store for later + push notification). Large groups / channels (thousands or millions of members, like Telegram channels): server-side fanout is too expensive for millions of members. Use a pull model: the message is stored once, and members pull new messages when they open the channel. A push notification tells them there are new messages. Hybrid approach: push to the N most recently active online members, pull for the rest. Group metadata: group_id, name, avatar, member list, admins. Store it in a relational database (PostgreSQL). A member limit per group keeps the fanout manageable. Message ordering in groups: use the message_id (time-sorted Snowflake ID) as the ordering key. Messages arrive at the server in roughly chronological order; the server assigns the message_id, ensuring a consistent order for all members.
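The push-vs-pull decision above can be sketched as a single routing function. The threshold constant, helper structure, and return values here are illustrative stand-ins, not any real implementation:

```python
# Sketch of the small-group fanout decision: push fanout up to a member
# limit, pull model beyond it.
FANOUT_LIMIT = 256  # e.g. WhatsApp-style small-group cap

def fan_out(group_members, sender_id, msg, online_users, pending):
    """Return ("push", n_recipients) or ("pull", 0) for a group message."""
    if len(group_members) > FANOUT_LIMIT:
        # Too large for per-member push: store once, members pull on open,
        # and a push notification nudges them to check.
        return ("pull", 0)
    recipients = [m for m in group_members if m != sender_id]
    for member in recipients:
        if member in online_users:
            pass  # would deliver immediately via member's WebSocket
        else:
            pending.setdefault(member, []).append(msg)
    return ("push", len(recipients))
```

A hybrid variant would, in the large-group branch, still push to the most recently active online members before falling back to pull for the rest.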
End-to-End Encryption
End-to-end encryption (E2E) ensures that only the sender and recipient can read the message — the server sees only ciphertext. WhatsApp uses the Signal Protocol. Key exchange: each user generates a key pair (public and private). Public keys are uploaded to the server. When User A wants to message User B, A downloads B's public key, derives a shared secret via Diffie-Hellman key exchange, and encrypts the message with the shared secret. The server relays the ciphertext to B. B derives the same shared secret from their own private key and A's public key, then decrypts. The server never has access to the plaintext. For group E2E encryption: the group creator generates a group encryption key and shares it with each member individually (encrypted with each member's public key). Messages are encrypted with the group key. When a member is removed, a new group key is generated and distributed. Tradeoff: E2E encryption prevents server-side features like search (the server cannot index encrypted content), content moderation (it cannot scan for prohibited content), and server-side backup (backups must be encrypted on the client side). These are deliberate tradeoffs for privacy.
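The key-exchange step can be illustrated with a toy Diffie-Hellman over modular arithmetic. This is for intuition only: the parameters below are NOT secure, and the Signal Protocol actually uses X25519 elliptic-curve DH plus a ratchet — what carries over is only the shape of the exchange (each side combines its own private key with the other's public key and arrives at the same secret, which the server never sees).

```python
# Toy Diffie-Hellman key agreement -- illustrative only, NOT secure.
import hashlib
import secrets

P = 2**127 - 1  # a Mersenne prime; far too small for real use
G = 5           # toy generator

def keypair():
    priv = secrets.randbelow(P - 2) + 1  # private key stays on the device
    pub = pow(G, priv, P)                # public key is uploaded to the server
    return priv, pub

def shared_secret(my_priv, their_pub):
    # (G^a)^b mod P == (G^b)^a mod P, so both sides derive the same value.
    secret = pow(their_pub, my_priv, P)
    # Hash the shared value down to a symmetric key for message encryption.
    return hashlib.sha256(str(secret).encode()).digest()

# Alice and Bob derive the same key; the server only ever saw a_pub and b_pub.
a_priv, a_pub = keypair()
b_priv, b_pub = keypair()
assert shared_secret(a_priv, b_pub) == shared_secret(b_priv, a_pub)
```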
Presence and Typing Indicators
Presence: tracking whether a user is online or offline, and their “last seen” time. Implementation: when a user's WebSocket connection is active and they interact with the app, they are “online.” Store presence in Redis: SET presence:{user_id} online EX 60. The client sends a heartbeat every 30 seconds that refreshes the TTL. If the TTL expires (no heartbeat for 60 seconds), the user is “offline.” Last seen: when the user goes offline, record the timestamp; other users see “last seen at 3:45 PM.” Privacy settings: allow users to hide their presence and last seen (WhatsApp supports this). Typing indicators: when User A starts typing in a conversation with User B, A's client sends a “typing” event to the server. The server forwards it to B via WebSocket. B's app shows “typing…” for a few seconds (with a TTL — if no new typing event arrives, the indicator disappears). Typing indicators are fire-and-forget — no delivery guarantee is needed. If the event is lost, the indicator simply does not show. Do not store typing events — they are transient.
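The `SET presence:{user_id} online EX 60` pattern boils down to a key with an expiry that each heartbeat pushes forward. A small sketch with an in-memory dict in place of Redis (class and field names are illustrative; the `now` parameters exist only to make expiry testable):

```python
# Sketch of TTL-based presence: a heartbeat refreshes the expiry;
# "online" just means the expiry has not passed yet.
import time

PRESENCE_TTL = 60  # seconds; clients heartbeat every 30s, well inside it

class PresenceStore:
    def __init__(self):
        self._expires = {}    # user_id -> expiry timestamp
        self.last_seen = {}   # user_id -> time of most recent heartbeat

    def heartbeat(self, user_id, now=None):
        now = now if now is not None else time.monotonic()
        self._expires[user_id] = now + PRESENCE_TTL  # SET ... EX 60
        self.last_seen[user_id] = now

    def is_online(self, user_id, now=None):
        now = now if now is not None else time.monotonic()
        return self._expires.get(user_id, 0) > now
```

Because every heartbeat also records `last_seen`, the “last seen at 3:45 PM” display falls out for free: once the TTL lapses, the stored timestamp is the last moment the user was online.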