Question 1

What is Operational Transformation (OT) and how does it solve concurrent edits?

Accepted Answer

OT resolves conflicts between concurrent edits by transforming operations before applying them. When two users edit simultaneously, each submits an operation with the document revision they edited against. The server receives Op-A at revision 5 and Op-B also at revision 5 (concurrent). To apply Op-B: transform it against Op-A (which was applied first). The transform adjusts Op-B's position to account for Op-A's insertion or deletion. The result: both operations are applied and the document converges to the same state on all clients. Google Docs uses OT with a central server as the serialization point.

Question 2

What is the difference between OT and CRDT for collaborative editing?

Accepted Answer

OT requires a central server to serialize and transform operations - it cannot work peer-to-peer without coordination. CRDTs (Conflict-free Replicated Data Types) are designed to converge to the same state regardless of operation order, without transformation. For text: CRDT assigns each character a globally unique ID. Operations reference character IDs (not positions), so they remain valid regardless of what other edits happened. CRDT supports peer-to-peer collaboration and offline editing with sync on reconnect. Trade-off: CRDT accumulates tombstoned (deleted) characters and is more complex to implement. Notion and Figma use CRDTs; Google Docs uses OT.

Question 3

How do you handle offline edits in a collaborative document system?

Accepted Answer

Clients buffer operations locally while offline (store in IndexedDB or similar). Each operation is tagged with the client's current revision number at time of creation. On reconnect, the client submits all buffered operations to the server in order. The server transforms each against all operations that happened during the offline period (from the client's last-known revision to current). After transformation, operations are applied to the authoritative document. The client receives all server operations it missed and applies them locally. Final state converges. CRDT-based systems handle this more naturally since operations are order-independent.

Question 4

How do you implement cursor presence in a collaborative editor?

Accepted Answer

Each client broadcasts cursor position and selection on every keystroke via WebSocket: {user_id, position, color, username}. The server stores current cursor positions in Redis as a hash: HSET doc:{doc_id}:cursors {user_id} {position_json} with a 30-second TTL per field (refreshed on each cursor update). On each cursor update, the server fans out to all other connected clients. Clients render remote cursors as colored carets with user labels. Cursor positions must be transformed against incoming operations: if a user deletes text before your cursor, shift your cursor left accordingly.

Question 5

How do you persist collaborative document history efficiently?

Accepted Answer

Store every operation in an append-only operations table: (doc_id, revision, op_json, user_id, created_at). The current document state can be reconstructed by replaying all operations from revision 0. To bound replay time, take periodic snapshots: every 1000 revisions, store the full document text as a snapshot with its revision number. Cold start: load the latest snapshot, then replay only the operations since that snapshot. For undo/redo: operations are already in the history - undo creates a reverse operation. For version history display: replay to any target revision.

System Design: Collaborative Document Editing — Operational Transformation and CRDT (2025)

Requirements and Core Challenge

Operational Transformation (OT)

CRDT (Conflict-free Replicated Data Type)

Architecture: Server-Side OT with WebSocket

Presence, Cursors, and Persistence