Question 1

Why use a materialized path over a recursive CTE for threaded comments?

Accepted Answer

A recursive CTE (WITH RECURSIVE) over an adjacency list fetches a thread level by level: either N round trips to the database for a thread N levels deep if the application walks the tree itself, or a single recursive query that PostgreSQL evaluates iteratively against an internal worktable. A materialized path (each comment stores its full ancestry path, e.g. '/1/5/23/') allows fetching the entire subtree with a single non-recursive query: WHERE path LIKE '/1/%'. With a suitable B-tree index on path this is a prefix range scan, costing roughly O(log N + subtree size); in PostgreSQL the index needs the C collation or the text_pattern_ops operator class for LIKE to use it. Trade-off: moving a comment means rewriting the path of every descendant, which is expensive, but comments are rarely moved.
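The prefix query can be tried out with a minimal sketch like the one below. It uses an in-memory SQLite database purely for illustration; the table layout and the helper names make_db, build_path, and fetch_subtree are assumptions, not part of the original design.

```python
import sqlite3


def make_db() -> sqlite3.Connection:
    """Create an in-memory comment table with a few sample paths (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE comment (id INTEGER PRIMARY KEY, path TEXT NOT NULL)")
    conn.executemany(
        "INSERT INTO comment (id, path) VALUES (?, ?)",
        [(1, "/1/"), (5, "/1/5/"), (23, "/1/5/23/"), (2, "/2/"), (10, "/10/")],
    )
    return conn


def build_path(parent_path: str, comment_id: int) -> str:
    """Append a comment id to its parent's path; use '/' as the parent of a top-level comment."""
    return f"{parent_path}{comment_id}/"


def fetch_subtree(conn: sqlite3.Connection, root_path: str) -> list:
    """Return (id, path) for the comment at root_path and all of its descendants."""
    # The trailing slash in root_path keeps '/1/' from matching '/10/'.
    return conn.execute(
        "SELECT id, path FROM comment WHERE path LIKE ? ORDER BY path",
        (root_path + "%",),
    ).fetchall()
```

Ordering by path places each parent before its children, because a prefix sorts before any longer string that starts with it. Sibling order is lexicographic rather than numeric, so a real system would likely sort siblings by another column.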

Question 2

How do you implement cursor pagination for comments sorted by top votes?

Accepted Answer

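Sort by (upvote_count DESC, created_at DESC): two columns because many comments share the same vote count. The cursor encodes both values from the last row of the current page as a tuple. Next page query: WHERE (upvote_count, created_at) < (cursor_votes, cursor_ts), with the same ORDER BY and a LIMIT. With a composite index on (upvote_count, created_at), the keyset seek is O(log N) plus the page size regardless of page depth, unlike OFFSET, which reads and discards every skipped row. Without the second column, comments with equal vote counts have no deterministic order, so items can appear on multiple pages or be skipped entirely. If two comments can share a created_at value, append the comment id as a final tiebreaker. Encode the cursor as base64(JSON) in the API response so clients treat it as opaque.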
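A rough sketch of the keyset query and the opaque cursor follows, using SQLite row-value comparison for illustration. The schema, the integer created_at column, and the function names are assumptions. The sketch also adds id as a third sort key, which goes beyond the two columns described in the answer, so that rows with identical votes and timestamps still page deterministically.

```python
import base64
import json
import sqlite3


def make_db() -> sqlite3.Connection:
    """In-memory table with sample comments for one post (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE comment (id INTEGER PRIMARY KEY, content_id INTEGER, "
        "upvote_count INTEGER, created_at INTEGER)"
    )
    conn.executemany(
        "INSERT INTO comment VALUES (?, 1, ?, ?)",
        [(1, 10, 100), (2, 5, 200), (3, 5, 300), (4, 5, 300), (5, 1, 50)],
    )
    return conn


def encode_cursor(votes: int, created_at: int, comment_id: int) -> str:
    """Pack the sort key of the last row into an opaque base64(JSON) string."""
    raw = json.dumps([votes, created_at, comment_id]).encode()
    return base64.urlsafe_b64encode(raw).decode()


def decode_cursor(cursor: str) -> tuple:
    """Inverse of encode_cursor."""
    return tuple(json.loads(base64.urlsafe_b64decode(cursor.encode())))


def fetch_page(conn, post_id: int, limit: int, cursor=None):
    """Return (rows, next_cursor); next_cursor is None when the page is not full."""
    where = "content_id = ?"
    params = [post_id]
    if cursor is not None:
        # Row-value comparison: strictly after the last row already seen.
        where += " AND (upvote_count, created_at, id) < (?, ?, ?)"
        params.extend(decode_cursor(cursor))
    params.append(limit)
    rows = conn.execute(
        f"SELECT id, upvote_count, created_at FROM comment WHERE {where} "
        "ORDER BY upvote_count DESC, created_at DESC, id DESC LIMIT ?",
        params,
    ).fetchall()
    next_cursor = None
    if len(rows) == limit:
        last_id, last_votes, last_ts = rows[-1]
        next_cursor = encode_cursor(last_votes, last_ts, last_id)
    return rows, next_cursor
```

A client would call fetch_page(conn, post_id, limit) for the first page and pass the returned cursor back unchanged to get the next one.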

Question 3

How do you handle deleting a comment that has replies?

Accepted Answer

Never physically remove a comment that has child replies: doing so would orphan the replies and break the thread structure. Instead, soft-delete it by setting body='[deleted]', status='deleted', and author_id=NULL. The comment node remains in the tree as a placeholder, preserving the path for its children. Only fully hide a comment (and remove its path entry) if it has zero replies. Track reply_count on each comment as a denormalized counter so this check is an O(1) column read rather than a COUNT(*) query.
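The branch between the two cases might look like the sketch below. It is illustrative only: the schema and the delete_comment helper are hypothetical, "fully hide" is interpreted here as deleting the row, and decrementing the parent's reply_count after a hard delete is an added assumption.

```python
import sqlite3


def make_db() -> sqlite3.Connection:
    """In-memory comment table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE comment ("
        "id INTEGER PRIMARY KEY, parent_id INTEGER, author_id INTEGER, "
        "body TEXT, status TEXT NOT NULL DEFAULT 'visible', "
        "reply_count INTEGER NOT NULL DEFAULT 0)"
    )
    return conn


def delete_comment(conn: sqlite3.Connection, comment_id: int) -> str:
    """Soft-delete a comment that has replies, remove it otherwise.

    Returns 'soft', 'hard', or 'missing'.
    """
    with conn:  # commit on success, roll back on error
        row = conn.execute(
            "SELECT parent_id, reply_count FROM comment WHERE id = ?",
            (comment_id,),
        ).fetchone()
        if row is None:
            return "missing"
        parent_id, reply_count = row
        if reply_count > 0:
            # Keep the node as a placeholder so its children stay attached.
            conn.execute(
                "UPDATE comment SET body = '[deleted]', status = 'deleted', "
                "author_id = NULL WHERE id = ?",
                (comment_id,),
            )
            return "soft"
        # Leaf comment: nothing depends on it, so the row can go.
        conn.execute("DELETE FROM comment WHERE id = ?", (comment_id,))
        if parent_id is not None:
            # Assumption: keep the parent's denormalized counter in step.
            conn.execute(
                "UPDATE comment SET reply_count = reply_count - 1 WHERE id = ?",
                (parent_id,),
            )
        return "hard"
```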

Question 4

How do you prevent excessive nesting in threaded comments?

Accepted Answer

Cap nesting depth at a maximum (3-5 levels is typical for UX). Store the depth on each comment and check it at creation time. When a user replies to a comment at the maximum depth, attach the new comment as a sibling of that comment rather than as a child. Example: with max depth=3, a reply to a depth-3 comment gets its parent_id set to the depth-3 comment's parent (the depth-2 comment) instead. In the UI, show "Replying to @username" so the context is clear even though the nesting was flattened.
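The creation-time check can be sketched as a small pure function. The Comment dataclass, the resolve_parent name, and the convention that top-level comments have depth 1 are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

MAX_DEPTH = 3  # assumed cap; the answer suggests 3-5


@dataclass
class Comment:
    id: int
    parent_id: Optional[int]
    depth: int  # assumption: top-level comments have depth 1
    author: str


def resolve_parent(target: Comment, max_depth: int = MAX_DEPTH) -> Tuple[Optional[int], int, str]:
    """Decide where a reply to `target` is attached.

    Returns (parent_id, depth, reply_to_author) for the new comment.
    """
    if target.depth >= max_depth:
        # Flatten: the reply becomes a sibling of the target, not a child.
        return target.parent_id, target.depth, target.author
    # Normal case: nest one level below the target.
    return target.id, target.depth + 1, target.author
```

The third element of the result is what the UI would use for the "Replying to @username" label, since the stored parent_id no longer identifies the comment being answered once nesting is flattened.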

Question 5

How do you denormalize comment counts on a post?

Accepted Answer

Maintain a comment_count column on the Post table, updated with UPDATE Post SET comment_count=comment_count+1 WHERE id=%(id)s every time a comment is inserted. For deletions, decrement by 1 only if the comment was visible (don't decrement again for already-deleted comments). Run the insert and the counter update in the same transaction where possible. Accept occasional drift otherwise: if the two writes are committed separately and the process fails after inserting the comment but before updating the count, the count will be off by one. Run a nightly reconciliation job to correct it: UPDATE Post SET comment_count=(SELECT COUNT(*) FROM Comment WHERE content_id=Post.id AND status='visible').
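As a rough illustration, the counter maintenance and the reconciliation query can be exercised against an in-memory SQLite database. The Post and Comment table names and the content_id and status columns come from the answer; the rest of the schema and the function names are assumptions. SQLite uses ? placeholders where the answer's driver uses %(id)s.

```python
import sqlite3


def make_db() -> sqlite3.Connection:
    """In-memory Post and Comment tables with one post (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE Post (id INTEGER PRIMARY KEY, "
        "comment_count INTEGER NOT NULL DEFAULT 0)"
    )
    conn.execute(
        "CREATE TABLE Comment (id INTEGER PRIMARY KEY, content_id INTEGER, "
        "body TEXT, status TEXT NOT NULL DEFAULT 'visible')"
    )
    conn.execute("INSERT INTO Post (id) VALUES (1)")
    return conn


def add_comment(conn: sqlite3.Connection, post_id: int, body: str) -> int:
    """Insert a comment and bump the counter in one transaction."""
    with conn:  # both statements commit together or roll back together
        cur = conn.execute(
            "INSERT INTO Comment (content_id, body) VALUES (?, ?)",
            (post_id, body),
        )
        conn.execute(
            "UPDATE Post SET comment_count = comment_count + 1 WHERE id = ?",
            (post_id,),
        )
    return cur.lastrowid


def soft_delete_comment(conn: sqlite3.Connection, comment_id: int) -> bool:
    """Mark a visible comment deleted; decrement only if the status actually changed."""
    with conn:
        cur = conn.execute(
            "UPDATE Comment SET status = 'deleted' WHERE id = ? AND status = 'visible'",
            (comment_id,),
        )
        changed = cur.rowcount == 1
        if changed:
            conn.execute(
                "UPDATE Post SET comment_count = comment_count - 1 "
                "WHERE id = (SELECT content_id FROM Comment WHERE id = ?)",
                (comment_id,),
            )
    return changed


def reconcile(conn: sqlite3.Connection) -> None:
    """Recompute every post's counter from the visible comments."""
    with conn:
        conn.execute(
            "UPDATE Post SET comment_count = (SELECT COUNT(*) FROM Comment "
            "WHERE content_id = Post.id AND status = 'visible')"
        )
```

Guarding the delete with status = 'visible' makes a repeated delete a no-op, which is what keeps the counter from being decremented twice.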

Comments System Low-Level Design

Core Data Model

Nested Comments: Materialized Path

Fetching a Thread

Upvoting

Soft Delete

Key Interview Points