Binary protocols encode messages as compact byte sequences, achieving lower overhead, faster parsing, and smaller payloads than text-based formats (JSON, XML). Designing a binary protocol requires decisions about framing (how to delimit messages), encoding (how to represent data types), versioning (how to evolve the protocol), and error detection. Binary protocols power gRPC, Kafka, Redis RESP, and database wire protocols.
Framing
Framing delimits message boundaries in a byte stream. Length-prefixed framing: each message is prefixed with its length (4-byte big-endian int32). The receiver reads the length first, then reads exactly that many bytes. Simple, efficient, and the dominant approach in binary protocols (gRPC HTTP/2 DATA frames, Kafka message protocol). Fixed-size messages: simpler but inflexible. Delimiter-based (scan for sentinel bytes): fragile if payload contains the delimiter. Always prefer length-prefixed framing.
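A minimal sketch of length-prefixed framing in Python (function names are illustrative; a real receiver would read from a socket rather than a byte buffer):

```python
import struct

def encode_frame(payload: bytes) -> bytes:
    """Prefix the payload with its length as a 4-byte big-endian int32."""
    return struct.pack(">I", len(payload)) + payload

def decode_frames(buffer: bytes):
    """Yield complete payloads from a buffer of concatenated frames."""
    offset = 0
    while offset + 4 <= len(buffer):
        (length,) = struct.unpack_from(">I", buffer, offset)
        if offset + 4 + length > len(buffer):
            break  # partial frame: wait for more bytes before parsing
        yield buffer[offset + 4 : offset + 4 + length]
        offset += 4 + length
```

Note that a partial frame is simply left in the buffer until more bytes arrive, which is why length-prefixing composes cleanly with TCP's arbitrary segmentation.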
Type Encoding
Primitive types: integers encoded as fixed-width little-endian or big-endian (4 bytes for int32, 8 bytes for int64, big-endian is network standard). Variable-length integers (varint): LEB128 or Protocol Buffer varint encoding uses 1 byte for values 0-127, 2 bytes for 128-16383, etc. Varints reduce payload size for small integers at the cost of variable-size parsing. Strings: length-prefixed UTF-8 bytes. Booleans: 1 byte (0 or 1). Floating-point: IEEE 754 binary32/binary64.
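The varint scheme can be sketched in a few lines of Python (7 payload bits per byte, high bit as a continuation flag):

```python
def encode_varint(n: int) -> bytes:
    """Unsigned LEB128 / Protocol Buffers varint: 7 bits per byte."""
    out = bytearray()
    while True:
        byte = n & 0x7F
        n >>= 7
        if n:
            out.append(byte | 0x80)  # high bit set: another byte follows
        else:
            out.append(byte)
            return bytes(out)

def decode_varint(data: bytes) -> tuple:
    """Decode a varint from the front of data; return (value, bytes_consumed)."""
    result, shift = 0, 0
    for i, b in enumerate(data):
        result |= (b & 0x7F) << shift
        if not (b & 0x80):
            return result, i + 1
        shift += 7
    raise ValueError("truncated varint")
```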
Protocol Versioning
Include a version field in the message header: magic number (identifies the protocol) + version byte + message type byte + length + payload. The magic number distinguishes your protocol from random bytes on a port. The version field allows receivers to handle multiple protocol versions simultaneously during upgrades. Reserve a range of message types for future use. Use Protobuf or Thrift for schema-based binary encoding that handles versioning automatically through field tagging.
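A header with that layout might look like the following sketch (the magic value, message type, and reserved range are hypothetical choices, not a standard):

```python
import struct

MAGIC = 0xCAFE      # hypothetical 2-byte protocol identifier
VERSION = 1
MSG_PING = 0x01     # types 0x80-0xFF reserved for future use (by convention)

# Header layout: magic (2 bytes) | version (1) | msg_type (1) | length (4)
HEADER = struct.Struct(">HBBI")

def encode_message(msg_type: int, payload: bytes) -> bytes:
    return HEADER.pack(MAGIC, VERSION, msg_type, len(payload)) + payload

def decode_header(data: bytes):
    magic, version, msg_type, length = HEADER.unpack_from(data)
    if magic != MAGIC:
        raise ValueError("not our protocol")  # random bytes or wrong port
    return version, msg_type, length
```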
Protocol Buffers (Protobuf)
Protobuf encodes messages as a sequence of key-value pairs: each key is a single varint packing the field tag and wire type ((field_tag << 3) | wire_type), followed by the value. Field tags are integers defined in the .proto schema. Missing fields are omitted from the encoding (default values are not transmitted). Unknown fields (from a newer schema version) are preserved by receivers, enabling forward compatibility. Adding new optional fields with new tags is safe. Removing fields requires marking them as reserved to prevent tag reuse. Protobuf is more compact than JSON and typically parses several times faster.
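The wire format can be reproduced by hand to see the tag/wire-type packing; this sketch encodes a hypothetical message `{ int32 id = 1; string name = 2; }` (wire type 0 = varint, 2 = length-delimited):

```python
def encode_varint(n: int) -> bytes:
    """Protocol Buffers varint: 7 bits per byte, high bit = continuation."""
    out = bytearray()
    while n > 0x7F:
        out.append((n & 0x7F) | 0x80)
        n >>= 7
    out.append(n)
    return bytes(out)

def encode_field(tag: int, wire_type: int, value_bytes: bytes) -> bytes:
    """The key is a single varint packing (tag << 3) | wire_type."""
    return encode_varint((tag << 3) | wire_type) + value_bytes

# id = 150 (varint field), name = "hello" (length-delimited field)
payload = (
    encode_field(1, 0, encode_varint(150))
    + encode_field(2, 2, encode_varint(5) + b"hello")
)
```

Encoding `id = 150` yields the bytes `08 96 01`: key `0x08` is (1 << 3) | 0, then 150 as a two-byte varint.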
Error Detection
TCP provides error detection for in-transit bit errors (TCP checksum). For storage protocols and protocols over unreliable transports, add checksums: CRC32 (fast, good error detection), CRC32C (hardware-accelerated on modern CPUs), or xxHash (faster than CRC, no hardware acceleration required). Append the checksum to each message frame. The receiver computes the checksum over the received bytes and compares to the appended checksum. Mismatches indicate data corruption.
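A checksum-sealed frame might look like this sketch, using Python's stdlib CRC32 (function names are illustrative):

```python
import struct
import zlib

def seal(payload: bytes) -> bytes:
    """Append a 4-byte big-endian CRC32 of the payload to the frame."""
    return payload + struct.pack(">I", zlib.crc32(payload))

def verify(frame: bytes) -> bytes:
    """Recompute the checksum over the received bytes and compare."""
    payload, (expected,) = frame[:-4], struct.unpack(">I", frame[-4:])
    if zlib.crc32(payload) != expected:
        raise ValueError("checksum mismatch: data corruption")
    return payload
```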
Request-Response Correlation
In multiplexed protocols (multiple in-flight requests on one connection), correlation IDs match responses to requests. Include a request_id (int32 or UUID) in every request header. The response carries the same request_id. The client maps request_id → pending future/callback and resolves it on response receipt. This is how Kafka (correlation_id) and gRPC over HTTP/2 (stream IDs) work; Redis pipelining, by contrast, has no correlation IDs and relies on strict in-order responses. Without correlation IDs, requests must be processed strictly in order (head-of-line blocking).
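The client-side bookkeeping can be sketched as a small map from request_id to callback (a hypothetical `Correlator` class; a production client would use futures and handle timeouts):

```python
import itertools

class Correlator:
    """Maps request_id -> pending callback for multiplexed responses."""

    def __init__(self):
        self._next_id = itertools.count(1)
        self._pending = {}

    def register(self, callback) -> int:
        request_id = next(self._next_id)    # goes into the request header
        self._pending[request_id] = callback
        return request_id

    def resolve(self, request_id: int, response) -> None:
        # Responses may arrive in any order; the id routes each one
        # back to the caller that issued the matching request.
        self._pending.pop(request_id)(response)
```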
Compression
Compress message payloads when network bandwidth is the bottleneck (large payloads, low-bandwidth links). Common algorithms: LZ4 (fastest compression/decompression, moderate ratio), Snappy (fast, moderate ratio, Google), Zstd (best ratio at moderate speed, preferred for new systems). Indicate compression in the message header: a compression_type field (0=none, 1=lz4, 2=snappy, 3=zstd). Do not compress already-compressed data (images, videos, encrypted content) — it grows the payload. Compress only above a minimum payload size threshold (e.g., 1KB).
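Putting the header field and the size threshold together, a sketch of conditional compression (using stdlib zlib as a stand-in for LZ4/Snappy/Zstd, which need third-party bindings):

```python
import struct
import zlib

COMPRESS_NONE, COMPRESS_ZLIB = 0, 1   # zlib stands in for lz4/zstd here
MIN_COMPRESS_SIZE = 1024              # below this, overhead dominates

def encode(payload: bytes) -> bytes:
    """Prefix payload with a 1-byte compression_type; compress only if it pays."""
    if len(payload) >= MIN_COMPRESS_SIZE:
        compressed = zlib.compress(payload)
        if len(compressed) < len(payload):  # keep only if it actually shrank
            return struct.pack(">B", COMPRESS_ZLIB) + compressed
    return struct.pack(">B", COMPRESS_NONE) + payload

def decode(frame: bytes) -> bytes:
    ctype, body = frame[0], frame[1:]
    return zlib.decompress(body) if ctype == COMPRESS_ZLIB else body
```

The "keep only if it actually shrank" guard is what protects already-compressed payloads: incompressible bytes fall through to the uncompressed path instead of growing.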