A bookmark (save) system lets users collect and organize content they want to revisit — articles, products, listings, posts. Core design challenges: supporting folders and collections, syncing across devices in real time, fast retrieval by folder, handling very large bookmark counts (power users with 50,000+ bookmarks), and enabling full-text search across saved content metadata.
Core Data Model
CREATE TABLE BookmarkFolder (
folder_id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
user_id UUID NOT NULL,
name TEXT NOT NULL,
parent_id UUID REFERENCES BookmarkFolder(folder_id), -- nested folders
sort_order INT NOT NULL DEFAULT 0,
created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    UNIQUE (user_id, name, parent_id)
);
-- Postgres treats NULLs as distinct under a UNIQUE constraint, so the
-- constraint above does not deduplicate root-level folder names
-- (parent_id IS NULL); cover the root level with a partial unique index:
CREATE UNIQUE INDEX idx_folder_root_name ON BookmarkFolder (user_id, name)
    WHERE parent_id IS NULL;
CREATE INDEX idx_folder_user ON BookmarkFolder (user_id, parent_id);
CREATE TABLE Bookmark (
bookmark_id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
user_id UUID NOT NULL,
folder_id UUID REFERENCES BookmarkFolder(folder_id) ON DELETE SET NULL,
content_type TEXT NOT NULL, -- 'article', 'product', 'post', 'video'
content_id TEXT NOT NULL,
-- Snapshot metadata for display without re-fetching
title TEXT,
thumbnail_url TEXT,
url TEXT,
-- Sync
sort_order INT NOT NULL DEFAULT 0,
created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
updated_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
UNIQUE (user_id, content_type, content_id) -- one bookmark per content per user
);
CREATE INDEX idx_bookmark_user_folder ON Bookmark (user_id, folder_id, sort_order);
CREATE INDEX idx_bookmark_user_recent ON Bookmark (user_id, created_at DESC);
Bookmark CRUD
from uuid import uuid4
from datetime import datetime, timezone
import psycopg2
MAX_BOOKMARKS_PER_USER = 50_000
def add_bookmark(conn, user_id: str, content_type: str, content_id: str,
folder_id: str | None = None, metadata: dict | None = None) -> str:
"""
Add a bookmark. Returns existing bookmark_id if already bookmarked (idempotent).
Enforces per-user limit.
"""
    meta = metadata or {}
    with conn.cursor() as cur:
        # Enforce the per-user limit. At scale, replace COUNT(*) with a
        # maintained counter column (see Key Interview Points).
        cur.execute("SELECT COUNT(*) FROM Bookmark WHERE user_id=%s", (user_id,))
        if cur.fetchone()[0] >= MAX_BOOKMARKS_PER_USER:
            raise ValueError(f"Bookmark limit ({MAX_BOOKMARKS_PER_USER}) reached")
        # ON CONFLICT DO NOTHING makes the insert idempotent and avoids the
        # race inherent in a separate check-then-insert.
        cur.execute("""
            INSERT INTO Bookmark
                (bookmark_id, user_id, folder_id, content_type, content_id,
                 title, thumbnail_url, url)
            VALUES (%s,%s,%s,%s,%s,%s,%s,%s)
            ON CONFLICT (user_id, content_type, content_id) DO NOTHING
            RETURNING bookmark_id
        """, (str(uuid4()), user_id, folder_id, content_type, content_id,
              meta.get('title'), meta.get('thumbnail_url'), meta.get('url')))
        row = cur.fetchone()
        if row is None:
            # Conflict: already bookmarked; return the existing id.
            cur.execute(
                "SELECT bookmark_id FROM Bookmark "
                "WHERE user_id=%s AND content_type=%s AND content_id=%s",
                (user_id, content_type, content_id))
            row = cur.fetchone()
    conn.commit()
    return str(row[0])
def move_bookmark(conn, bookmark_id: str, user_id: str, new_folder_id: str | None):
"""Move a bookmark to a different folder (or to root if folder_id=None)."""
with conn.cursor() as cur:
cur.execute(
"UPDATE Bookmark SET folder_id=%s, updated_at=NOW() WHERE bookmark_id=%s AND user_id=%s",
(new_folder_id, bookmark_id, user_id)
)
if cur.rowcount == 0:
raise ValueError("Bookmark not found or not owned by user")
conn.commit()
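The CRUD above has no remove. A remove should also append to the BookmarkChangeLog table (defined in the sync section below) in the same transaction as the DELETE, so a syncing client can never observe the deletion without its log row. A minimal runnable sketch of that transaction shape — in-memory sqlite stands in for Postgres here, and the trimmed-down tables mirror only the columns the example touches:

```python
import sqlite3
from uuid import uuid4

# sqlite (stdlib) stands in for Postgres; the point is the transaction
# shape, not the engine. Table names mirror the schema in this article.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Bookmark (
        bookmark_id TEXT PRIMARY KEY, user_id TEXT NOT NULL, folder_id TEXT);
    CREATE TABLE BookmarkChangeLog (
        change_id INTEGER PRIMARY KEY AUTOINCREMENT,
        user_id TEXT NOT NULL, operation TEXT NOT NULL, bookmark_id TEXT);
""")

def remove_bookmark(conn, bookmark_id: str, user_id: str) -> None:
    """Delete a bookmark and log the 'remove' event atomically."""
    with conn:  # one transaction: commits on success, rolls back on exception
        cur = conn.execute(
            "DELETE FROM Bookmark WHERE bookmark_id=? AND user_id=?",
            (bookmark_id, user_id))
        if cur.rowcount == 0:
            raise ValueError("Bookmark not found or not owned by user")
        conn.execute(
            "INSERT INTO BookmarkChangeLog (user_id, operation, bookmark_id) "
            "VALUES (?, 'remove', ?)",
            (user_id, bookmark_id))

bid = str(uuid4())
conn.execute("INSERT INTO Bookmark VALUES (?, 'u1', NULL)", (bid,))
remove_bookmark(conn, bid, "u1")
```

The same pattern applies to add and move: write the mutation and its log row inside one transaction.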
def get_bookmarks(conn, user_id: str, folder_id: str | None = None,
limit: int = 50, before: datetime | None = None) -> list[dict]:
"""Cursor-paginated bookmark list for a folder (or root/all)."""
with conn.cursor() as cur:
params = [user_id]
        folder_clause = "AND folder_id IS NULL" if folder_id is None else "AND folder_id = %s"
        if folder_id is not None:
            params.append(folder_id)
cursor_clause = ""
if before:
cursor_clause = "AND created_at < %s"
params.append(before)
cur.execute(f"""
SELECT bookmark_id, content_type, content_id, title, thumbnail_url, url,
folder_id, created_at
FROM Bookmark
WHERE user_id = %s {folder_clause} {cursor_clause}
ORDER BY created_at DESC
LIMIT %s
""", params + [limit])
cols = [d[0] for d in cur.description]
return [dict(zip(cols, row)) for row in cur.fetchall()]
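A client walks this cursor by passing the oldest created_at it has seen as the next before value, stopping on a short page. A minimal sketch of that loop — the fetch_page callback is hypothetical and stands in for a call to get_bookmarks:

```python
from datetime import datetime, timedelta, timezone

def paginate_all(fetch_page, limit: int = 50) -> list[dict]:
    """Collect every row by walking the created_at cursor backwards.

    fetch_page(limit, before) must return rows sorted by created_at DESC,
    mirroring get_bookmarks; `before` is None on the first call.
    """
    rows, before = [], None
    while True:
        page = fetch_page(limit=limit, before=before)
        rows.extend(page)
        if len(page) < limit:            # short page: no more data
            return rows
        before = page[-1]["created_at"]  # oldest row seen becomes the cursor

# Demo against an in-memory "table" of 7 rows, newest first.
base = datetime(2024, 1, 1, tzinfo=timezone.utc)
table = [{"bookmark_id": i, "created_at": base - timedelta(hours=i)}
         for i in range(7)]

def fake_fetch(limit, before):
    rows = [r for r in table if before is None or r["created_at"] < before]
    return rows[:limit]

all_rows = paginate_all(fake_fetch, limit=3)
```

Keyset pagination like this stays O(page) even at 50,000 bookmarks, where OFFSET-based paging would degrade linearly.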
Sync Across Devices (Last-Write-Wins with Change Log)
-- Change log for sync (replicated to client on reconnect)
CREATE TABLE BookmarkChangeLog (
change_id BIGSERIAL PRIMARY KEY,
user_id UUID NOT NULL,
operation TEXT NOT NULL, -- 'add', 'remove', 'move', 'rename_folder'
bookmark_id UUID,
folder_id UUID,
payload JSONB,
occurred_at TIMESTAMPTZ NOT NULL DEFAULT NOW()
);
CREATE INDEX idx_changelog_user ON BookmarkChangeLog (user_id, change_id ASC);
def get_changes_since(conn, user_id: str, since_change_id: int) -> list[dict]:
"""
Return all bookmark changes since the client's last sync point.
Client stores last seen change_id; sends it on reconnect.
"""
with conn.cursor() as cur:
cur.execute("""
SELECT change_id, operation, bookmark_id, folder_id, payload, occurred_at
FROM BookmarkChangeLog
WHERE user_id = %s AND change_id > %s
ORDER BY change_id ASC
LIMIT 1000
""", (user_id, since_change_id))
cols = [d[0] for d in cur.description]
return [dict(zip(cols, row)) for row in cur.fetchall()]
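On the client, the returned changes replay in change_id order against the local store. A minimal in-memory sketch — the local store is a dict keyed by bookmark_id, and the exact payload shape per operation is an assumption:

```python
def apply_changes(local: dict[str, dict], changes: list[dict]) -> int:
    """Replay server changes onto a local bookmark store.
    Returns the last applied change_id, which the client persists
    as its next sync cursor."""
    last_id = 0
    for ch in changes:
        op, bid = ch["operation"], ch["bookmark_id"]
        if op == "add":
            local[bid] = dict(ch["payload"])   # full bookmark snapshot
        elif op == "remove":
            local.pop(bid, None)               # tolerate already-gone rows
        elif op == "move":
            if bid in local:
                local[bid]["folder_id"] = ch["folder_id"]
        last_id = ch["change_id"]
    return last_id

store: dict[str, dict] = {}
changes = [
    {"change_id": 1, "operation": "add", "bookmark_id": "b1",
     "folder_id": None, "payload": {"title": "Post", "folder_id": None}},
    {"change_id": 2, "operation": "move", "bookmark_id": "b1", "folder_id": "f1"},
    {"change_id": 3, "operation": "add", "bookmark_id": "b2",
     "folder_id": None, "payload": {"title": "Article", "folder_id": None}},
    {"change_id": 4, "operation": "remove", "bookmark_id": "b2"},
]
cursor = apply_changes(store, changes)
```

Because each operation is idempotent-ish (remove tolerates missing rows, add overwrites), replaying an overlapping batch after a dropped connection is safe.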
Key Interview Points
- Snapshot metadata for offline display: When a user views their bookmarks, the bookmarked URLs may be dead (404), paywalled, or slow. Store a snapshot of the title and thumbnail at bookmark time in the Bookmark row. This enables fast, reliable list display without re-fetching the original URL. Refresh snapshots in a background job (weekly) — detect dead links and mark them.
- UNIQUE constraint deduplication: UNIQUE (user_id, content_type, content_id) prevents the same content from being bookmarked twice. Inserting with ON CONFLICT DO NOTHING on this key makes add_bookmark idempotent — double-tapping the bookmark icon on the client won't create two rows; return the existing bookmark_id on conflict instead of an error.
- Change log for sync: Mobile apps go offline and come back. On reconnect, send the last seen change_id; receive all changes since then. This is cheaper than comparing full bookmark lists (O(changes since disconnect) vs O(total bookmarks)). Prune the change log after 30 days — devices offline longer than 30 days do a full sync instead. change_id is a BIGSERIAL (monotonically increasing), so ordering by it gives a reliable event sequence.
- Folder hierarchy depth: Cap nesting depth at 3–5 levels (similar to browser bookmarks). Deep hierarchies add complexity to tree queries without user benefit. Validate on folder creation: SELECT COUNT(*) traversing ancestors via parent_id until NULL — reject if depth would exceed limit. Use the materialized path pattern (from lld-organization-hierarchy) if deeper hierarchies are needed.
- Full-text search across bookmarks: CREATE INDEX ON Bookmark USING gin(to_tsvector('english', coalesce(title,'') || ' ' || coalesce(url,''))). Query with the same expression so the GIN index is actually used: SELECT * FROM Bookmark WHERE user_id = X AND to_tsvector('english', coalesce(title,'') || ' ' || coalesce(url,'')) @@ plainto_tsquery('english', $query). This covers saved content titles and URLs. For richer search (inside the saved content body), index via Elasticsearch, storing the document text at bookmark time.
Frequently Asked Questions

Why snapshot metadata at bookmark time instead of fetching it live?
At display time, fetching metadata (title, thumbnail) for 50 bookmarked URLs would require 50 HTTP requests — slow and unreliable. Worse: URLs go dead (404), get paywalled, or change their titles. If metadata is stored at bookmark time, the list always renders instantly from the database — no external requests needed. The snapshot also keeps bookmarks useful after the original content is deleted or moved. Trade-off: the stored title may be slightly outdated (article renamed). Mitigate with a background refresh job that re-fetches metadata weekly for bookmarks viewed in the last 30 days. Mark dead links (HTTP 404/410) with a broken_link = TRUE flag and show a warning icon in the UI.

How does the change log enable reliable multi-device sync?
Mobile apps go offline for hours or days. On reconnect, the app sends its last seen change_id and the server returns everything since: SELECT * FROM BookmarkChangeLog WHERE user_id = X AND change_id > last_seen ORDER BY change_id ASC LIMIT 1000. The client applies each change in sequence — add inserts a bookmark, remove deletes one, move changes folder_id. change_id is a BIGSERIAL, monotonically increasing per insert, so ordering by it gives a reliable event sequence. This delta sync costs O(changes since last sync) instead of the O(total bookmarks) of comparing full lists. Prune the log after 30 days; clients offline longer fall back to a full sync (download all bookmark data).

How do you implement nested folders without deep recursion?
BookmarkFolder.parent_id forms a tree. Querying "all bookmarks in folder X and its subfolders" naively requires a recursive CTE or application-level traversal. Limit depth to 3–4 levels (enforced in create_folder validation: traverse the parent_id chain to the root and reject if the new depth would exceed the cap). With that constraint the recursive CTE stays fast: WITH RECURSIVE sub AS (SELECT folder_id FROM BookmarkFolder WHERE folder_id = $1 UNION ALL SELECT f.folder_id FROM BookmarkFolder f JOIN sub ON f.parent_id = sub.folder_id) SELECT bookmark_id FROM Bookmark WHERE folder_id IN (SELECT folder_id FROM sub). For deeper hierarchies (rarely needed), use a materialized path as in the org hierarchy design.

How do you efficiently implement bookmark full-text search?
PostgreSQL full-text search over bookmarks: CREATE INDEX ON Bookmark USING GIN(to_tsvector('english', coalesce(title,'') || ' ' || coalesce(url,''))). Query: SELECT * FROM Bookmark WHERE user_id = X AND to_tsvector('english', coalesce(title,'') || ' ' || coalesce(url,'')) @@ plainto_tsquery('english', $query). The GIN index keeps this fast per user (each user has at most 50K bookmarks). For richer search inside saved article text, store the extracted body (truncated to 5,000 characters) at bookmark time and include it in the tsvector. Rank results: SELECT *, ts_rank(to_tsvector(…), query) AS rank ORDER BY rank DESC. PostgreSQL's built-in FTS is sufficient at this scale — no Elasticsearch needed.

How do you handle the 50,000 bookmark limit for power users?
The limit exists for performance (unlimited bookmarks would make pagination and sync expensive) and storage. Enforcement: before inserting, compare the user's bookmark count with MAX_BOOKMARKS_PER_USER. If at the limit: offer an upgrade to a "Power User" tier with a higher cap, or suggest archiving old bookmarks (delete the oldest 1,000 and export them to CSV). A soft-limit approach works well: warn at 50,000 ("You're approaching your bookmark limit") and hard-stop at 60,000. To keep the count check fast, maintain a bookmark_count column in UserStats (increment/decrement on add/remove) instead of running COUNT(*) on every add. The UNIQUE constraint on (user_id, content_type, content_id) also bounds the count by preventing duplicates.