API Pagination Low-Level Design: Cursor, Offset, and Keyset Strategies

Pagination is not optional for any endpoint that returns a list — without it, a single request can return millions of rows, exhaust server memory, and time out the client. There are three meaningful strategies: offset pagination (simple, but broken at scale), cursor pagination (scalable and consistent), and keyset pagination (the fastest for very large datasets). The right choice depends on whether your clients need random page access and how large the dataset is.

Offset Pagination (and Why It Breaks)

-- Client sends: GET /posts?page=5&per_page=20
SELECT * FROM Post
ORDER BY created_at DESC
LIMIT 20 OFFSET 100;  -- skip 100 rows, return 20

Problems: (1) Performance degrades with depth — OFFSET 100000 scans and discards 100,000 rows before returning 20. At page 5000, a single query takes seconds. (2) Data drift — if a new post is inserted between page 1 and page 2 fetches, the user sees a duplicate (the post that was on the boundary shifts to the next page). (3) No stable cursor — if rows are deleted, page numbers shift.

Use offset pagination only for small datasets (<10,000 rows) or admin interfaces where users rarely go past page 10.
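
The data-drift failure in (2) is easy to reproduce without a database. A minimal in-memory sketch (hypothetical post names, with list slicing standing in for LIMIT/OFFSET) shows a row duplicated across the page boundary when an insert lands between two fetches:

```python
# Simulate a newest-first feed paginated with LIMIT/OFFSET.
posts = [f"post-{i}" for i in range(30, 0, -1)]  # post-30 (newest) .. post-1

def fetch_page(items, page_num, per_page=10):
    start = (page_num - 1) * per_page            # OFFSET
    return items[start:start + per_page]         # LIMIT

page1 = fetch_page(posts, 1)    # post-30 .. post-21
posts.insert(0, "post-31")      # a new post arrives before the user requests page 2
page2 = fetch_page(posts, 2)    # every row has shifted down by one position

assert page1[-1] == page2[0] == "post-21"  # duplicate across the page boundary
```

A cursor that pins the position to the (created_at, id) of the last row, as in the next section, is immune to this shift.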

Cursor Pagination (Standard for APIs)

-- Client sends: GET /posts?limit=20&cursor=eyJpZCI6MTIzNH0=
-- Server decodes cursor: {"id": 1234, "created_at": "2024-01-15T10:00:00Z"}

SELECT * FROM Post
WHERE (created_at, id) < ('2024-01-15T10:00:00Z', 1234)  -- keyset condition
ORDER BY created_at DESC, id DESC
LIMIT 20;

-- Server encodes next cursor from last row of results:
next_cursor = base64(json({"id": last_row.id, "created_at": last_row.created_at}))

import base64
import json

def paginate_posts(cursor_token: str | None, limit: int = 20) -> dict:
    limit = min(limit, 100)  # cap at 100
    if cursor_token:
        cursor = json.loads(base64.urlsafe_b64decode(cursor_token))
        rows = db.fetchall("""
            SELECT id, title, created_at FROM Post
            WHERE (created_at, id) < (%(created_at)s, %(id)s)
            ORDER BY created_at DESC, id DESC
            LIMIT %(limit)s
        """, {**cursor, 'limit': limit + 1})  # fetch limit+1 to detect a next page
    else:  # first page: no keyset condition
        rows = db.fetchall("""
            SELECT id, title, created_at FROM Post
            ORDER BY created_at DESC, id DESC
            LIMIT %(limit)s
        """, {'limit': limit + 1})
    has_next = len(rows) > limit  # the extra row signals another page exists
    rows = rows[:limit]
    next_cursor = None
    if has_next and rows:
        last = rows[-1]
        next_cursor = base64.urlsafe_b64encode(
            json.dumps({'id': last['id'], 'created_at': last['created_at']}).encode()
        ).decode()
    return {'data': rows, 'next_cursor': next_cursor, 'has_next': has_next}

Required Index for Cursor Pagination

-- Composite index matching the ORDER BY and WHERE clause
CREATE INDEX idx_post_pagination ON Post(created_at DESC, id DESC);

-- This makes the keyset condition (created_at, id) < (ts, id) an index scan
-- bounded by the cursor position — O(log N + page_size), not O(N)

Bi-directional Pagination

Some UIs need both forward and backward navigation (e.g., a chat thread). Encode direction in the cursor and flip the inequality:

// Cursor encodes: {id, created_at, direction}
// Forward:  WHERE (created_at, id) < (cursor.created_at, cursor.id) ORDER BY created_at DESC, id DESC
// Backward: WHERE (created_at, id) > (cursor.created_at, cursor.id) ORDER BY created_at ASC, id ASC,
//           then reverse the result in application code to maintain descending display order

// Also return a prev_cursor from the first row and next_cursor from the last row
return {
    data: rows,
    next_cursor: encode({...last_row, direction: 'forward'}),
    prev_cursor: encode({...first_row, direction: 'backward'})
}
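
The two branches can be sketched as a direction-aware query builder (a sketch only; build_keyset_query and the needs_reverse flag are hypothetical names, and the SQL assumes the same Post table as above):

```python
def build_keyset_query(direction: str) -> tuple[str, bool]:
    """Return (sql, needs_reverse) for a newest-first (DESC) feed."""
    if direction == 'forward':
        # Next page: older rows, already in display order.
        sql = ("SELECT id, title, created_at FROM Post "
               "WHERE (created_at, id) < (%(created_at)s, %(id)s) "
               "ORDER BY created_at DESC, id DESC LIMIT %(limit)s")
        return sql, False
    # Previous page: newer rows, fetched ascending, then reversed by the
    # caller so the page still renders newest-first.
    sql = ("SELECT id, title, created_at FROM Post "
           "WHERE (created_at, id) > (%(created_at)s, %(id)s) "
           "ORDER BY created_at ASC, id ASC LIMIT %(limit)s")
    return sql, True
```

Both branches hit the same composite index; the backward query scans it in the opposite direction.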

Filtering with Cursor Pagination

When filters are applied, the cursor is only valid for the filter set it was issued under: it encodes a position within the filtered result set, not within the full table. The filter columns must also lead the composite index:

-- Filter by user_id (filtered feed)
CREATE INDEX idx_post_user_pagination ON Post(user_id, created_at DESC, id DESC);

SELECT * FROM Post
WHERE user_id = :user_id
  AND (created_at, id) < (:cursor_ts, :cursor_id)
ORDER BY created_at DESC, id DESC
LIMIT 20;
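
One common way to enforce that a cursor stays tied to its filters (a sketch, not the only scheme; encode_cursor and decode_cursor are hypothetical helpers) is to embed the filter values in the opaque token and reject a cursor issued under a different filter set:

```python
import base64
import json

def encode_cursor(position: dict, filters: dict) -> str:
    payload = {'pos': position, 'filters': filters}
    return base64.urlsafe_b64encode(json.dumps(payload).encode()).decode()

def decode_cursor(token: str, current_filters: dict) -> dict:
    payload = json.loads(base64.urlsafe_b64decode(token))
    if payload['filters'] != current_filters:
        # Filter set changed since the cursor was issued; the encoded
        # position is meaningless in the new result set.
        raise ValueError("cursor does not match current filter set")
    return payload['pos']

token = encode_cursor({'id': 1234, 'created_at': '2024-01-15T10:00:00Z'},
                      {'user_id': 42})
position = decode_cursor(token, {'user_id': 42})   # same filters: accepted
```

Changing the filter (say, user_id=7) makes decode_cursor raise, which forces the client back to a clean first-page request under the new filters.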

Key Interview Points

  • Always return has_next (boolean) — computed by fetching limit+1 rows and checking if the extra row exists. Never make the client guess.
  • Cursors must be opaque to clients — base64-encoded JSON. Never expose raw IDs or offsets; that couples clients to your DB structure.
  • For total count: avoid COUNT(*) on large tables. Instead, return an approximate count from pg_class.reltuples or a separate counter table, or omit the count entirely (most mobile feeds don’t show “page 42 of 8,293”).
  • The cursor is a position, not a page number — it’s valid only for the current sort order and filter set. Changing the sort order requires a new first-page request.
  • Keyset pagination (the WHERE clause above) is O(log N) — performance is identical whether you’re on page 1 or page 10,000.
  • For search results (Elasticsearch), use search_after instead of from/size — it’s the ES equivalent of keyset pagination.

Frequently Asked Questions

How do you implement total page count in a paginated API without COUNT(*)?

COUNT(*) on a large table is a full index scan — on a table with 50 million rows, it takes 500ms-2s and adds that cost to every paginated request. Three alternatives: (1) Omit the total count. Most mobile feeds (Instagram, Twitter) show no total count — users infinite-scroll without knowing the total. (2) Approximate count from pg_class: SELECT reltuples::BIGINT FROM pg_class WHERE relname='post'. Updated by ANALYZE, accurate within ~1-5%, and fast — a single metadata lookup. (3) Maintain a counter table: a separate table that stores the pre-computed count, updated via triggers or application code on insert/delete. O(1) lookup, exact, but adds write overhead. Choose based on how critical exact counts are — most UIs work fine with an approximate count or none at all.

What is the difference between cursor pagination and keyset pagination?

These terms are often used interchangeably, but they describe different layers. Keyset pagination is the SQL technique (WHERE (col1, col2) < (val1, val2)) that makes pagination efficient by using an index range scan instead of OFFSET. Cursor pagination is the API contract — the server encodes a position into an opaque token (cursor) and returns it to the client, who passes it back on the next request. Cursor pagination is typically implemented using keyset conditions under the hood. You can implement cursor pagination without keyset conditions (using OFFSET internally with a cursor that encodes the offset number), but this loses the performance benefit. The correct implementation: cursor pagination at the API level, keyset conditions in the SQL, and a composite index matching the ORDER BY.

How do you handle a cursor becoming invalid after the referenced row is deleted?

If the row encoded in the cursor is deleted, the keyset condition WHERE (created_at, id) < (cursor_ts, cursor_id) still works correctly — it returns the rows after the deleted row's position without error. The deleted row is simply absent from results. This is a feature, not a bug: cursor pagination is inherently resilient to row deletions. Contrast with offset pagination, where deleting a row shifts all subsequent rows one position, causing offset-based cursors to skip or duplicate a row. For soft deletes (WHERE deleted_at IS NULL), the deleted row is excluded by the filter and the cursor position is still valid — results naturally skip the soft-deleted row. Document this behavior explicitly in API docs so clients understand they may see gaps after deletions.

How does Elasticsearch implement cursor pagination and what is search_after?

Elasticsearch's from/size pagination (equivalent to SQL OFFSET/LIMIT) degrades severely past from=10000 — Elasticsearch must fetch and sort from+size documents across all shards, then discard the first from. The hard limit is index.max_result_window=10000. search_after is the ES equivalent of keyset pagination: it uses the sort values of the last document as a cursor, fetching only the next page of results. Usage: sort: [{created_at: "desc"}, {_id: "desc"}], search_after: [cursor_timestamp, cursor_id]. This is O(log N) regardless of depth. Use the point-in-time (PIT) API with search_after for consistent pagination across index refreshes — the PIT snapshots the index state so concurrent indexing doesn't affect page results.

How do you paginate a feed that requires real-time updates between page fetches?

The core challenge: the user is on page 3 of a feed, new items are added, and page 4 is now shifted. With cursor pagination this is handled correctly — the cursor encodes the exact position, not a page number. New items inserted before the cursor's position don't affect what comes after it. New items inserted after the cursor's position are not returned in forward pagination (you're moving backward in time). To show new items, implement a separate "new items" endpoint that returns items inserted after the most recent item the user has seen (tracked by the first item's id from the first page fetch). The main feed is paginated forward; new items appear as a banner ("5 new posts — tap to refresh") and are loaded separately.

