System Design: Network Protocols — TCP, UDP, HTTP/2, HTTP/3, WebSocket, DNS, TLS Handshake, Connection Pooling

Understanding network protocols is fundamental to designing performant distributed systems. Every API call, database query, and cache lookup traverses the network stack. This guide covers the protocols that backend engineers need to know — TCP, UDP, HTTP versions, WebSocket, DNS, and TLS — with a focus on how they affect system design decisions and performance.

TCP: Reliable, Ordered Delivery

TCP (Transmission Control Protocol) provides reliable, ordered, error-checked delivery of a byte stream. The three-way handshake establishes a connection: the client sends SYN, the server responds SYN-ACK, and the client sends ACK. This costs one round-trip time (RTT) before any data is sent; for a server 50ms away, the handshake adds 50ms to the first request. TCP guarantees that every byte sent is received in order, that data integrity is verified with checksums, and that lost packets are automatically retransmitted.

- Flow control: TCP uses a sliding window to prevent the sender from overwhelming the receiver. The receiver advertises how much buffer space it has; the sender sends at most that much before waiting for acknowledgment.
- Congestion control: TCP reduces its sending rate when it detects network congestion (signaled by packet loss). Algorithms like CUBIC and BBR adapt the sending rate to the available bandwidth.
- Head-of-line blocking: if one packet is lost, all subsequent packets (even from unrelated streams) are held back until the lost packet is retransmitted. This is a fundamental TCP limitation that HTTP/3 (QUIC) solves.
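The handshake and reliable delivery are visible directly in the sockets API. A minimal sketch using Python's stdlib socket module: connect() blocks for one RTT while SYN / SYN-ACK / ACK complete, after which the byte stream flows in order.

```python
import socket
import threading

def echo_server(sock):
    """Accept one connection and echo whatever it receives."""
    conn, _ = sock.accept()          # completes the three-way handshake
    with conn:
        while True:
            data = conn.recv(1024)
            if not data:
                break
            conn.sendall(data)       # TCP retransmits lost segments for us

# Listen on an ephemeral port on localhost.
server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
server.bind(("127.0.0.1", 0))
server.listen(1)
port = server.getsockname()[1]
threading.Thread(target=echo_server, args=(server,), daemon=True).start()

# connect() returns only after SYN, SYN-ACK, and ACK have traversed the network.
client = socket.create_connection(("127.0.0.1", port))
client.sendall(b"hello")
reply = client.recv(1024)
client.close()
print(reply)  # b'hello'
```

On localhost the handshake RTT is negligible; against a remote server, that same connect() call is where the 50ms from the text is paid.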

HTTP/1.1, HTTP/2, and HTTP/3

HTTP/1.1: one request per TCP connection at a time. Browsers open six parallel connections per domain to work around this, and each connection requires its own TCP and TLS handshakes. Keep-alive (the HTTP/1.1 default) reuses the same connection for multiple sequential requests, avoiding the handshake overhead after the first one.

HTTP/2: multiplexing — multiple requests and responses share a single TCP connection, interleaved as binary frames. There is no head-of-line blocking at the HTTP level (though TCP head-of-line blocking still exists). Header compression (HPACK) reduces repetitive header overhead by 90%+. Server push lets the server send resources proactively, but it is deprecated in practice — caching headers work better.

HTTP/3: replaces TCP with QUIC, a UDP-based transport with built-in encryption. It eliminates TCP head-of-line blocking: if one QUIC stream loses a packet, other streams on the same connection are unaffected. It offers 0-RTT connection establishment for repeat visitors (QUIC remembers the previous session) and faster connection migration when the client changes networks (WiFi to cellular), because QUIC connections are identified by a connection ID rather than IP+port. HTTP/3 is supported by all major browsers and CDNs (Cloudflare, Google, AWS CloudFront).
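Keep-alive is easy to observe with a stdlib HTTP client: the sketch below (using a throwaway local server for illustration) sends two sequential requests over one HTTPConnection, so only the first pays the TCP handshake.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

class Handler(BaseHTTPRequestHandler):
    protocol_version = "HTTP/1.1"    # keep-alive is the default in HTTP/1.1

    def do_GET(self):
        body = b"ok"
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))  # lets the client reuse the socket
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):    # silence per-request logging
        pass

server = HTTPServer(("127.0.0.1", 0), Handler)
threading.Thread(target=server.serve_forever, daemon=True).start()

# One TCP connection, two sequential requests: the second request skips
# the TCP (and, over HTTPS, TLS) handshake entirely.
conn = http.client.HTTPConnection("127.0.0.1", server.server_port)
for _ in range(2):
    conn.request("GET", "/")
    resp = conn.getresponse()
    assert resp.read() == b"ok"      # must drain the body before the next request
conn.close()
server.shutdown()
```

HTTP/2 goes further by interleaving the two requests on the connection instead of serializing them, but the connection-reuse principle is the same.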

WebSocket: Bidirectional Real-Time Communication

WebSocket provides full-duplex communication over a single TCP connection. After an HTTP Upgrade handshake, the connection switches from HTTP to the WebSocket protocol, and both client and server can send messages at any time without polling. Use cases: real-time chat, live notifications, collaborative editing, live sports scores, stock price tickers, multiplayer games.

WebSocket vs HTTP polling: polling sends a request every N seconds, wasting bandwidth when there is no new data and adding up to N seconds of latency. WebSocket delivers messages instantly with no wasted requests.

WebSocket vs Server-Sent Events (SSE): SSE is server-to-client only (unidirectional) over a standard HTTP connection. It is simpler than WebSocket, works with HTTP/2 multiplexing, and reconnects automatically. Use SSE when you only need server-to-client push (notifications, live feeds); use WebSocket when you need bidirectional communication (chat, collaborative editing).

Scaling WebSocket: each WebSocket connection is a persistent TCP connection that consumes server memory. A server with 10GB of RAM can handle approximately 100K-1M concurrent WebSocket connections, depending on per-connection memory. For millions of connections, use a pub/sub system (e.g. Redis Pub/Sub) to fan out messages to the WebSocket servers.
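The Upgrade handshake itself is small enough to sketch: per RFC 6455, the server proves it speaks WebSocket by concatenating the client's Sec-WebSocket-Key with a fixed GUID, hashing with SHA-1, and base64-encoding the digest into the Sec-WebSocket-Accept response header.

```python
import base64
import hashlib

# Fixed GUID defined by RFC 6455 for the opening handshake.
WS_MAGIC = "258EAFA5-E914-47DA-95CA-C5AB0DC85B11"

def websocket_accept(client_key: str) -> str:
    """Derive the Sec-WebSocket-Accept value for a 101 Switching Protocols response."""
    digest = hashlib.sha1((client_key + WS_MAGIC).encode("ascii")).digest()
    return base64.b64encode(digest).decode("ascii")

# The example Sec-WebSocket-Key given in RFC 6455 itself.
print(websocket_accept("dGhlIHNhbXBsZSBub25jZQ=="))
# s3pPLMBiTxaQ9kYGzzhZRbK+xOo=
```

After the server returns this header with status 101, the TCP connection stops speaking HTTP and carries WebSocket frames in both directions.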

TLS Handshake and Performance

TLS (Transport Layer Security) encrypts HTTP traffic (HTTPS). The TLS 1.3 handshake requires one RTT: the client sends ClientHello (supported cipher suites, key share); the server responds with ServerHello (chosen cipher suite, key share) plus its encrypted certificate and Finished message. After one RTT, encrypted application data can flow. TLS 1.2 required two RTTs. TLS 1.3 also supports 0-RTT resumption for repeat visitors: the client sends encrypted data in its first packet using a pre-shared key from a previous session, eliminating handshake latency entirely for returning users. Security trade-off: 0-RTT data is vulnerable to replay attacks (an attacker can capture and resend the first packet), so use it only for idempotent requests (GET), never for state-changing operations (POST).

Performance optimizations:
1. Use TLS 1.3 (faster handshake).
2. Enable OCSP stapling (the server includes the certificate revocation status, avoiding a separate OCSP query by the client).
3. Use connection pooling and keep-alive to amortize the handshake cost across multiple requests.
4. Terminate TLS at the edge (CDN or load balancer) to minimize the RTT to the user.
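In application code, insisting on the one-RTT handshake is a one-line policy. A sketch with Python's ssl module, configuring a client context that refuses anything older than TLS 1.3 (the wrap_socket usage in the comment is illustrative):

```python
import ssl

# Build a client context that negotiates only TLS 1.3,
# so every full handshake completes in a single round trip.
ctx = ssl.create_default_context()
ctx.minimum_version = ssl.TLSVersion.TLSv1_3

# create_default_context keeps certificate and hostname verification on.
assert ctx.check_hostname
assert ctx.verify_mode == ssl.CERT_REQUIRED

# To use it for a real connection (sketch):
#   with socket.create_connection((host, 443)) as raw:
#       with ctx.wrap_socket(raw, server_hostname=host) as tls:
#           print(tls.version())   # "TLSv1.3"
```

Pinning the minimum version is a policy decision: servers that only speak TLS 1.2 will fail the handshake rather than silently fall back to the slower two-RTT exchange.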

DNS Resolution and Optimization

DNS (Domain Name System) translates domain names to IP addresses. Resolution path: browser cache -> OS cache -> DNS resolver (ISP or 8.8.8.8) -> root nameserver -> TLD nameserver (.com) -> authoritative nameserver (your domain). A cold DNS lookup can take 50-200ms across these hops.

Optimizations:
1. DNS prefetching — browsers proactively resolve domains of links on the page. Add <link rel="dns-prefetch" href="https://api.example.com"> to the HTML.
2. Low TTL for failover — set the DNS TTL to 60 seconds for services that need fast failover. When a server fails, update the DNS record and traffic shifts within 60 seconds.
3. High TTL for stability — set the TTL to 3600 seconds (1 hour) for stable services to reduce DNS query volume and speed up resolution via cached results.
4. DNS round-robin — return multiple A records. Clients try them in order, providing basic load balancing and failover.
5. Anycast DNS — deploy DNS servers at multiple geographic locations sharing the same IP address; the network routes each query to the nearest server. All major DNS providers (Cloudflare, Route 53, Google Cloud DNS) use anycast.
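The TTL mechanics in points 2 and 3 can be sketched as a small resolver-side cache. This is a toy model (the resolver and the 192.0.2.10 address are stand-ins, not real infrastructure): within the TTL, lookups are answered from memory; after it expires, the next lookup goes back to the network.

```python
import socket
import time

class DnsCache:
    """Cache resolved addresses for `ttl` seconds, mimicking resolver behavior."""

    def __init__(self, ttl, resolver=socket.gethostbyname, clock=time.monotonic):
        self.ttl = ttl
        self.resolver = resolver      # swappable, so tests need no real DNS traffic
        self.clock = clock
        self._cache = {}              # hostname -> (address, expiry)

    def resolve(self, hostname):
        entry = self._cache.get(hostname)
        if entry and self.clock() < entry[1]:
            return entry[0]           # cache hit: no network round trip
        address = self.resolver(hostname)
        self._cache[hostname] = (address, self.clock() + self.ttl)
        return address

# With a fake resolver we can count upstream queries (hostname and IP are examples).
calls = []
fake = lambda host: calls.append(host) or "192.0.2.10"
cache = DnsCache(ttl=60, resolver=fake)
cache.resolve("api.example.com")
cache.resolve("api.example.com")      # second lookup served from cache
print(len(calls))  # 1
```

A 60-second TTL means at most one upstream query per minute per hostname; a 3600-second TTL cuts query volume further but delays failover by up to an hour.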

Connection Pooling

Establishing a new connection (TCP handshake + TLS handshake) is expensive: 1-3 RTTs before any application data flows. Connection pooling reuses established connections across multiple requests.

Database connection pooling: a pool of pre-established database connections (PgBouncer for PostgreSQL, HikariCP for JDBC). When the application needs a connection, it borrows one from the pool; after the query completes, the connection is returned to the pool rather than closed. This amortizes the handshake cost and caps the number of database connections (PostgreSQL typically allows 100-500 concurrent connections).

HTTP connection pooling: HTTP clients maintain a pool of keep-alive connections to frequently accessed servers. The Go http.Client, Java HttpClient, and Python requests.Session all pool connections by default. In microservices, pooling between services eliminates the per-request handshake overhead: a warm pool of 10 connections to the auth service can handle 1000 requests per second without opening any new connections.

Pool sizing: a pool that is too small makes requests queue waiting for a connection, while one that is too large wastes server resources and may exceed the upstream connection limit. Start with pool_size = 2 * num_cpu_cores + 1 and tune based on latency percentiles.
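The borrow/return cycle can be sketched with a thread-safe queue. This is a minimal illustration, not PgBouncer or HikariCP: connections are created once up front, and a stand-in connect function counts how many handshakes actually happen.

```python
import os
import queue

class ConnectionPool:
    """Fixed-size pool: borrow a connection, use it, return it (never close it)."""

    def __init__(self, size, connect):
        self._pool = queue.Queue(maxsize=size)
        for _ in range(size):                 # pay the handshake cost up front
            self._pool.put(connect())

    def acquire(self, timeout=5.0):
        # Blocks (up to `timeout`) when all connections are checked out,
        # which is the "requests queue" failure mode of an undersized pool.
        return self._pool.get(timeout=timeout)

    def release(self, conn):
        self._pool.put(conn)

# Heuristic starting point from the text; tune against latency percentiles.
pool_size = 2 * os.cpu_count() + 1

# Stand-in connect function; a real one would perform the TCP/TLS handshake.
handshakes = []
pool = ConnectionPool(size=4, connect=lambda: handshakes.append(1) or object())

for _ in range(100):                          # 100 requests, only 4 handshakes
    conn = pool.acquire()
    pool.release(conn)
print(len(handshakes))  # 4
```

The queue doubles as the admission limit: the pool never holds more than `size` upstream connections, which is exactly how PgBouncer protects PostgreSQL's connection ceiling.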
