Question 1

What is the core difference between Operational Transformation and CRDTs for collaborative editing?

Accepted Answer

Operational Transformation (OT): each operation is transformed against concurrent operations before it is applied. A central server serializes all operations and performs the transformation, which ensures that all clients converge to the same state. This requires a reliable central server, and a client must be connected to it to send its own operations and receive transformed ones. Used by: Google Docs, Apache Wave.

CRDTs (Conflict-free Replicated Data Types): data structures designed so that concurrent operations merge automatically, without conflicts and without central coordination. In a text CRDT each character has a globally unique ID, so "insert after character X" is unambiguous even under concurrent edits. Merging is associative, commutative, and idempotent, so replicas converge regardless of the order in which updates arrive or how often they are redelivered. CRDTs can work peer-to-peer. Widely used libraries: Yjs and Automerge. CRDT-style designs are reported in products such as Figma, Linear, and Notion.

Many newer systems prefer CRDTs for simpler server-side logic and better offline support.
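To make the OT half of the answer concrete, here is a minimal sketch of transforming one insert against a concurrent insert. The Insert type, the site field used as a tiebreaker, and the transform/apply helpers are all hypothetical names chosen for illustration; production OT also has to handle deletes and longer operation histories.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insert:
    pos: int    # index in the document version the author was looking at
    text: str
    site: int   # hypothetical client id, used only to break position ties


def transform(op: Insert, against: Insert) -> Insert:
    """Rewrite `op` so it can be applied after `against` has been applied."""
    earlier = against.pos < op.pos
    tie_lost = against.pos == op.pos and against.site < op.site
    if earlier or tie_lost:
        # `against` added text at or before our position: shift right.
        return Insert(op.pos + len(against.text), op.text, op.site)
    return op


def apply(doc: str, op: Insert) -> str:
    return doc[:op.pos] + op.text + doc[op.pos:]


base = "helo"
a = Insert(3, "l", site=1)   # Alice fixes the typo
b = Insert(4, "!", site=2)   # Bob appends punctuation

# Each side applies its own op first, then the transformed remote op.
alice_view = apply(apply(base, a), transform(b, a))
bob_view = apply(apply(base, b), transform(a, b))
```

Both views end up as the same string even though the two sides applied the operations in different orders, which is the convergence property the transformation exists to provide.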

Question 2

How does a CRDT handle two users simultaneously typing at the same position?

Accepted Answer

In a CRDT text representation (e.g., LSEQ, RGA, or the YATA algorithm used by Yjs), each character is assigned a globally unique identifier, and its position is encoded in a way that stays stable under concurrent edits. When Alice inserts "A" and Bob inserts "B" at the same position simultaneously, both characters get unique IDs. The order between them is decided by a deterministic tiebreaker: compare author IDs, timestamps (for example from Hybrid Logical Clocks), or random tie-breaker values. Every replica applies the same ordering rule, so all replicas converge to the same final document, either "AB" or "BA", consistently across all peers. Neither insertion is lost, and the result is the same on every replica.
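The tiebreaking rule can be sketched with a simplified RGA-style structure. This is an illustration under assumptions, not how Yjs is implemented: each character stores the ID of the character it was inserted after, IDs are hypothetical (counter, site) pairs, and characters that share an anchor are ordered by ID, highest first.

```python
from dataclasses import dataclass
from typing import Iterable, Optional

ID = tuple[int, str]  # (logical counter, site id); both fields are assumptions


@dataclass(frozen=True)
class Item:
    id: ID
    after: Optional[ID]  # id of the character this one follows (None = start)
    char: str


def render(items: Iterable[Item]) -> str:
    """Build the visible text; the result does not depend on arrival order."""
    children: dict[Optional[ID], list[Item]] = {}
    for item in items:
        children.setdefault(item.after, []).append(item)

    out: list[str] = []

    def walk(parent: Optional[ID]) -> None:
        # Deterministic tiebreak for items inserted after the same anchor:
        # higher (counter, site) comes first on every replica.
        siblings = sorted(children.get(parent, []), key=lambda i: i.id, reverse=True)
        for item in siblings:
            out.append(item.char)
            walk(item.id)

    walk(None)
    return "".join(out)


h = Item((1, "seed"), None, "H")
i = Item((2, "seed"), h.id, "i")
a = Item((3, "alice"), h.id, "A")  # Alice types after "H"
b = Item((3, "bob"), h.id, "B")    # Bob types after "H" at the same time

alice_replica = [h, i, a, b]  # Alice sees her own insert first
bob_replica = [h, i, b, a]    # Bob sees his own insert first
```

Rendering either replica gives the same text, with both "A" and "B" present. Which of the two comes first is decided only by the ID comparison, so every peer agrees.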

Question 3

Why must delete operations use tombstoning in collaborative editing?

Accepted Answer

Suppose Alice deletes the character at position 5 while Bob simultaneously inserts at position 5, based on a version that still had that character. Bob's insert references the character that Alice deleted. If the character is physically removed from the sequence, Bob's reference is dangling and the insert position is ambiguous. Tombstoning marks the deleted character with a deleted=True flag without removing it from the position sequence. Bob's insert still refers to a valid (though tombstoned) character and is placed correctly relative to it. The deleted character is invisible in the rendered document but remains in the CRDT structure as a position anchor. Periodically, an offline garbage-collection pass can remove tombstones that no live character uses as its insertion point, once every replica is known to have received the delete.
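A minimal sketch of the tombstone idea, assuming characters are addressed by ID rather than by index. The TombstoneText class and its method names are hypothetical, and the sketch deliberately ignores tiebreaking between concurrent inserts at the same anchor.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Char:
    id: str
    value: str
    deleted: bool = False  # tombstone flag; the entry stays in the sequence


class TombstoneText:
    def __init__(self) -> None:
        self.chars: list[Char] = []

    def _index(self, char_id: str) -> int:
        for i, c in enumerate(self.chars):
            if c.id == char_id:
                return i
        raise KeyError(char_id)

    def insert_after(self, anchor_id: Optional[str], new_id: str, value: str) -> None:
        # The anchor may be tombstoned; it is still a valid position.
        idx = 0 if anchor_id is None else self._index(anchor_id) + 1
        self.chars.insert(idx, Char(new_id, value))

    def delete(self, char_id: str) -> None:
        self.chars[self._index(char_id)].deleted = True

    def text(self) -> str:
        return "".join(c.value for c in self.chars if not c.deleted)


def seeded() -> TombstoneText:
    doc = TombstoneText()
    doc.insert_after(None, "c1", "c")
    doc.insert_after("c1", "c2", "a")
    doc.insert_after("c2", "c3", "t")
    return doc


# Replica 1 receives Alice's delete of "a" first, then Bob's insert after "a".
r1 = seeded()
r1.delete("c2")
r1.insert_after("c2", "b1", "r")

# Replica 2 receives the same two operations in the opposite order.
r2 = seeded()
r2.insert_after("c2", "b1", "r")
r2.delete("c2")
```

Because the deleted "a" is still present as an anchor, Bob's insert lands between "c" and "t" on both replicas. With physical removal, the lookup of c2 on replica 1 would fail.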

Question 4

How do you persist collaborative document state efficiently?

Accepted Answer

Two strategies:

(1) Operation log: store every insert and delete as a row in DocumentOperation. The current document state is derived by replaying the log. This is simple and provides full history and undo. Problem: replaying 1M operations on load is too slow.

(2) Snapshots with delta log: store a full state snapshot (the Yjs document binary or the full text) every N operations, and replay only the operations recorded since the last snapshot. Load = fetch snapshot + replay delta. With N=100, at most 100 operations are replayed on load.

Combine both: keep the full operation log for history and conflict resolution, and use snapshots for fast loading. For Yjs: Y.encode_state_as_update(ydoc) (the spelling in the y-py Python bindings; Y.encodeStateAsUpdate in JavaScript) produces a binary encoding of the full document state that can be stored in Postgres or S3.
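The combined strategy can be sketched with an in-memory stand-in for the database. DocumentStore, apply_op, and the tuple-shaped operations are hypothetical; a real system would keep the log in DocumentOperation rows and the snapshots in a separate table or object store.

```python
from dataclasses import dataclass, field

SNAPSHOT_EVERY = 100  # N from the answer above


def apply_op(text: str, op: tuple) -> str:
    """Apply one ("ins", pos, string) or ("del", pos, length) operation."""
    kind, pos, arg = op
    if kind == "ins":
        return text[:pos] + arg + text[pos:]
    if kind == "del":
        return text[:pos] + text[pos + arg:]
    raise ValueError(f"unknown op kind: {kind}")


@dataclass
class DocumentStore:
    ops: list = field(default_factory=list)        # full log, never truncated
    snapshots: list = field(default_factory=list)  # (ops_applied, text) pairs

    def append(self, op: tuple) -> None:
        self.ops.append(op)
        if len(self.ops) % SNAPSHOT_EVERY == 0:
            # Snapshot the state after exactly len(self.ops) operations.
            self.snapshots.append((len(self.ops), self.load()))

    def replay_cost(self) -> int:
        """Number of operations load() has to replay right now."""
        last_seq = self.snapshots[-1][0] if self.snapshots else 0
        return len(self.ops) - last_seq

    def load(self) -> str:
        seq, text = self.snapshots[-1] if self.snapshots else (0, "")
        for op in self.ops[seq:]:  # only the delta since the snapshot
            text = apply_op(text, op)
        return text


store = DocumentStore()
for n in range(250):
    store.append(("ins", n, "x"))  # append one character at the end each time
```

After 250 appends the store holds two snapshots (taken at 100 and 200 operations), so a load replays only the last 50 operations while the full log remains available for history.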

Question 5

How do you show collaborators' cursors and selections in real time?

Accepted Answer

Cursor/presence data is ephemeral and only needs to be eventually consistent; it does not need the durability guarantees of document operations. Use a separate Redis Pub/Sub channel per document (doc:room:{doc_id}) for presence updates. When a user moves their cursor, publish {"type": "cursor", "user_id": X, "position": Y, "color": "#ff0000"} to the room channel. All connected clients receive the message and render the remote cursor. Frequency: publish on keypress or mouse move, throttled to at most one update per ~50ms to avoid flooding the channel. When a user disconnects, publish {"type": "leave", "user_id": X}. For late joiners: store each user's last-known cursor position in a Redis hash (doc:cursors:{doc_id}) so that a newly joined or reconnecting collaborator can see existing cursors without waiting for the next update.
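The throttling step can be sketched without a Redis dependency by injecting the publish function and the clock. CursorPublisher and its parameters are hypothetical names; in a real deployment the publish argument would wrap the Redis client's publish call, and flush would be driven by a timer.

```python
import json
from typing import Callable, Optional


class CursorPublisher:
    """Sends at most one cursor update per min_interval seconds."""

    def __init__(
        self,
        doc_id: str,
        user_id: str,
        publish: Callable[[str, str], None],  # (channel, message), assumed shape
        clock: Callable[[], float],
        min_interval: float = 0.05,           # ~50ms, as in the answer above
        color: str = "#ff0000",
    ) -> None:
        self.channel = f"doc:room:{doc_id}"
        self.user_id = user_id
        self.publish = publish
        self.clock = clock
        self.min_interval = min_interval
        self.color = color
        self._last_sent = float("-inf")
        self._pending: Optional[int] = None

    def _send(self, position: int) -> None:
        message = {"type": "cursor", "user_id": self.user_id,
                   "position": position, "color": self.color}
        self.publish(self.channel, json.dumps(message))
        self._last_sent = self.clock()
        self._pending = None

    def move(self, position: int) -> None:
        if self.clock() - self._last_sent >= self.min_interval:
            self._send(position)
        else:
            self._pending = position  # coalesce: keep only the latest position

    def flush(self) -> None:
        """Send the latest coalesced position, if any (call from a timer)."""
        if self._pending is not None:
            self._send(self._pending)


sent: list = []
now = [0.0]  # fake clock so the example is deterministic
pub = CursorPublisher("d1", "u1", lambda ch, msg: sent.append((ch, msg)), lambda: now[0])

for t, pos in [(0.00, 1), (0.01, 2), (0.02, 3), (0.10, 4), (0.11, 5)]:
    now[0] = t
    pub.move(pos)
pub.flush()
positions = [json.loads(msg)["position"] for _, msg in sent]
```

Five cursor moves produce three published messages: the intermediate positions inside each 50ms window are dropped, and the final flush delivers the last position so remote cursors do not lag behind.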

Document Collaboration Low-Level Design: OT, CRDTs, and Real-Time Sync

Core Data Model

Operational Transformation (OT)

CRDT Approach (Yjs / Automerge)

Cursor and Presence Tracking

Key Interview Points