System Design: API Design Patterns — REST, GraphQL, gRPC, Versioning, Pagination, Rate Limiting, Idempotency

API design is the interface contract between your service and its consumers. Poor API design creates friction, breaking changes, and performance problems that compound over time. This guide covers production-proven API design patterns for REST, GraphQL, and gRPC, including versioning strategies, pagination, rate limiting, and idempotency — essential knowledge for system design interviews and real-world architecture.

REST API Design Principles

REST (Representational State Transfer) organizes APIs around resources. Core principles: (1) Resources are nouns, not verbs. Use /orders, not /getOrders or /createOrder. (2) HTTP methods define the action: GET (read), POST (create), PUT (full replace), PATCH (partial update), DELETE (remove). (3) Use plural nouns for collection endpoints: /users, /orders. (4) Nest resources to express relationships: /users/123/orders (orders belonging to user 123). Limit nesting to two levels — deeper nesting creates brittle URLs. (5) Use HTTP status codes correctly: 200 (success), 201 (created), 204 (no content — successful delete), 400 (bad request — client error), 401 (unauthorized), 403 (forbidden), 404 (not found), 409 (conflict), 422 (unprocessable entity — validation error), 429 (rate limited), 500 (server error). (6) Return consistent response shapes: always wrap responses in a predictable structure with data, error, and pagination fields.
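The consistent response shape in point (6) can be sketched as a tiny envelope helper. The field names (data, error, pagination) follow the text above; the make_response function and the sample order are illustrative, not a standard.

```python
# Minimal sketch of a consistent response envelope: every endpoint wraps
# its payload the same way, so clients parse success and error cases
# uniformly. Field names mirror the article; the helper is illustrative.

def make_response(data=None, error=None, pagination=None):
    """Wrap any endpoint result in a predictable envelope."""
    return {
        "data": data,              # the resource(s), or None on error
        "error": error,            # machine-readable error info, or None
        "pagination": pagination,  # cursor info for collection endpoints
    }

# A collection endpoint (GET /v1/orders) might return:
orders = [{"id": "order_1", "total": 4200}]
resp = make_response(data=orders, pagination={"next_cursor": "order_1"})
```

Because the envelope is identical everywhere, client code can check resp["error"] first and branch, instead of guessing the shape per endpoint.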

API Versioning Strategies

Breaking changes are inevitable. Versioning strategies: (1) URL path versioning: /v1/users, /v2/users. Pros: explicit, easy to route, easy to deprecate. Cons: duplicates route definitions, clients must update URLs. This is the most common approach (Stripe, GitHub, Twilio use it). (2) Header versioning: Accept: application/vnd.api+json;version=2. Pros: clean URLs. Cons: harder to test in browsers, easy to forget. (3) Query parameter: /users?version=2. Rarely used in production. Best practice: version your API from day one (/v1/). Maintain backward compatibility within a version. When breaking changes are necessary, release a new version and provide a migration guide. Deprecate old versions with a sunset header (Sunset: Sat, 01 Jan 2028 00:00:00 GMT) and a 12-month deprecation window. Monitor usage of deprecated versions and notify consumers before removal.
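URL-path versioning can be sketched as a version-keyed handler table: /v1 and /v2 route to independent handlers, so each version can evolve (or be sunset) on its own. The handlers and response shapes below are dummies for illustration.

```python
# Sketch of URL-path versioning: the version prefix selects a handler,
# so /v1/users and /v2/users can return different shapes independently.

def list_users_v1():
    return {"users": [{"name": "Ada"}]}        # original v1 shape

def list_users_v2():
    return {"data": [{"full_name": "Ada"}]}    # v2 made a breaking rename

ROUTES = {
    ("v1", "users"): list_users_v1,
    ("v2", "users"): list_users_v2,
}

def dispatch(path):
    # "/v2/users" -> ("v2", "users") -> the v2 handler
    version, resource = path.strip("/").split("/")
    return ROUTES[(version, resource)]()
```

Deprecating v1 then means adding the Sunset header to v1 responses and, after the deprecation window, deleting its entries from the table without touching v2.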

Pagination Patterns

Never return unbounded lists. Pagination patterns: (1) Offset-based: GET /orders?offset=20&limit=10. Simple to implement (SQL OFFSET/LIMIT). Problem: offset pagination is O(offset + limit) in the database — OFFSET 10000 scans and discards 10,000 rows before returning any. Also unstable: if a new item is inserted while paginating, items shift and the client may see duplicates or miss items. (2) Cursor-based (keyset pagination): GET /orders?after=order_xyz&limit=10. The cursor is an opaque token encoding the last seen item position (typically the ID or a timestamp). The query uses WHERE id > cursor_value ORDER BY id LIMIT 10. This is O(limit) regardless of page depth — an index seek jumps straight to the cursor position. Stable under concurrent inserts. GitHub, Slack, and Stripe use cursor-based pagination. (3) Page-based: GET /orders?page=3&per_page=10. Simpler API but has the same O(offset) database problem as offset pagination. Best practice: use cursor-based pagination for any endpoint that may return large result sets. Return next_cursor in the response and let the client pass it as the after parameter.
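The keyset query described above can be demonstrated end to end with SQLite from the standard library. The orders table and the integer cursor are illustrative; a production API would base64-encode the cursor so clients treat it as opaque.

```python
import sqlite3

# Keyset (cursor) pagination sketch: WHERE id > ? walks the primary-key
# index, so each page costs O(limit) at any depth. Table is illustrative.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, total INTEGER)")
conn.executemany("INSERT INTO orders VALUES (?, ?)",
                 [(i, i * 100) for i in range(1, 26)])

def page(after_id, limit=10):
    rows = conn.execute(
        "SELECT id, total FROM orders WHERE id > ? ORDER BY id LIMIT ?",
        (after_id, limit)).fetchall()
    # In a real API the cursor would be base64-encoded before returning.
    next_cursor = rows[-1][0] if rows else None
    return rows, next_cursor

first, cursor = page(after_id=0)      # ids 1..10, next_cursor = 10
second, _ = page(after_id=cursor)     # ids 11..20
```

Contrast with OFFSET 10: the keyset form never re-reads the first page's rows, which is why it stays fast as clients page deeper.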

Idempotency for Safe Retries

Network failures cause retries. Without idempotency, retrying a payment request charges the customer twice. Idempotency key pattern: the client generates a unique key (UUID) and includes it in the request header: Idempotency-Key: 550e8400-e29b-41d4-a716-446655440000. Server behavior: (1) On first request: process normally, store the result keyed by the idempotency key with a 24-hour TTL. (2) On duplicate request (same idempotency key): return the stored result without re-processing. Implementation: use a Redis hash or PostgreSQL table to store idempotency records. Before processing, check if the key exists. If it does and the previous request is still in progress, return 409 Conflict. If it completed, return the stored response. Stripe supports an Idempotency-Key header on POST requests and recommends sending one with every mutating call. GET and DELETE are naturally idempotent (repeating them produces the same result), so they do not need idempotency keys. PUT is idempotent by definition (replace the resource entirely). Only POST and PATCH need explicit idempotency handling.
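The server-side flow can be sketched with a plain dict standing in for Redis or PostgreSQL. The record also stores a hash of the request body, so reusing a key with different parameters is rejected; handle, charge, and the status codes mirror the flow above but are illustrative.

```python
import hashlib
import json

# Idempotency-key sketch. A dict stands in for Redis/PostgreSQL; the
# stored record binds the key to a hash of the request body.
store = {}

def handle(idem_key, body, process):
    body_hash = hashlib.sha256(
        json.dumps(body, sort_keys=True).encode()).hexdigest()
    record = store.get(idem_key)
    if record:
        if record["hash"] != body_hash:
            # Same key, different request body: client error.
            return 422, {"error": "key reused with different parameters"}
        if record["status"] == "IN_PROGRESS":
            return 409, {"error": "request already in progress"}
        return 200, record["response"]        # replay the stored result
    store[idem_key] = {"status": "IN_PROGRESS", "hash": body_hash}
    response = process(body)                  # e.g. charge the card once
    store[idem_key] = {"status": "COMPLETED", "hash": body_hash,
                       "response": response}
    return 201, response

calls = []
def charge(body):
    calls.append(body)                        # count actual charges
    return {"charged": body["amount"]}

status1, r1 = handle("key-1", {"amount": 500}, charge)   # processed
status2, r2 = handle("key-1", {"amount": 500}, charge)   # retry: replayed
status3, _  = handle("key-1", {"amount": 900}, charge)   # 422: key reuse
```

In production the IN_PROGRESS check and record creation must be one atomic operation (e.g. Redis SET NX or an INSERT with a unique constraint), or two concurrent retries could both start processing.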

Rate Limiting Design

Rate limiting protects your API from abuse and ensures fair usage. Algorithms: (1) Token bucket — a bucket holds N tokens, refilled at rate R per second. Each request consumes one token. If the bucket is empty, the request is rejected (429 Too Many Requests). Allows bursts up to N. (2) Sliding window counter — count requests in the past N seconds using a Redis sorted set. ZADD with the current timestamp, ZREMRANGEBYSCORE to remove old entries, ZCARD to count. More precise than fixed windows. (3) Fixed window — count requests per minute/hour in a counter. Simple but allows burst at window boundaries (99 requests at 11:59:59 and 100 at 12:00:01 = 199 in 2 seconds). Rate limit headers: return X-RateLimit-Limit (max requests), X-RateLimit-Remaining (requests left), X-RateLimit-Reset (when the window resets, Unix timestamp) in every response. Rate limit by API key, user ID, or IP address depending on the use case. Use a tiered approach: unauthenticated requests get 60/hour, authenticated get 5000/hour, premium plans get 15000/hour.
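The token bucket algorithm described in (1) can be sketched in a few lines. The capacity and refill rate below are arbitrary demo values, and a fake clock makes the refill behavior deterministic; a production limiter would run this logic atomically in Redis (e.g. via a Lua script) keyed by API key.

```python
import time

# Token-bucket sketch: capacity N allows bursts; refill rate R bounds
# sustained throughput. Parameters and the fake clock are illustrative.
class TokenBucket:
    def __init__(self, capacity, refill_per_sec, now=time.monotonic):
        self.capacity = capacity
        self.refill = refill_per_sec
        self.tokens = float(capacity)   # start full: full burst allowed
        self.now = now
        self.last = now()

    def allow(self):
        t = self.now()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (t - self.last) * self.refill)
        self.last = t
        if self.tokens >= 1:
            self.tokens -= 1
            return True                 # serve the request
        return False                    # reject: 429 Too Many Requests

# Deterministic demo with a controllable clock:
clock = [0.0]
bucket = TokenBucket(capacity=3, refill_per_sec=1, now=lambda: clock[0])
burst = [bucket.allow() for _ in range(4)]   # burst of 3 passes, 4th fails
clock[0] = 1.0                               # one second later: 1 token back
later = bucket.allow()
```

On a rejection, the server would return 429 with Retry-After derived from how long until the next token accrues (here, 1/refill_per_sec seconds).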

GraphQL vs REST vs gRPC

When to use each: REST is the default for public APIs consumed by external developers. Well-understood, cacheable (HTTP caching works naturally with GET requests), and tooling is mature. GraphQL is ideal for client-driven APIs where different clients need different data shapes. A mobile app needs a subset of fields; a web app needs more. GraphQL lets the client specify exactly which fields to fetch, solving the over-fetching problem. Best for internal APIs between a frontend team and a backend team. Downsides: caching is harder (POST requests are not cached by HTTP), query complexity attacks (deeply nested queries consuming server resources), and the N+1 query problem, which requires the DataLoader batching pattern. gRPC is optimal for internal service-to-service communication. Its binary protocol (Protocol Buffers) is significantly more compact and faster to parse than JSON. Streaming support (server streaming, client streaming, bidirectional). Strong typing with code generation. Not suitable for browser clients without a proxy (gRPC-Web). Use REST for public APIs, GraphQL for complex frontend-backend interactions, and gRPC for internal microservice communication.
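The N+1 problem and the DataLoader fix mentioned above can be sketched without any GraphQL library: resolvers enqueue the keys they need, and one batched fetch replaces N individual queries. The Loader class and fetch_users "database" are illustrative stand-ins, not a real library's API.

```python
# Sketch of the DataLoader batching pattern: resolvers record keys, and
# a single batched fetch replaces one query per related entity.

class Loader:
    def __init__(self, batch_fn):
        self.batch_fn = batch_fn
        self.queue = []

    def load(self, key):
        self.queue.append(key)          # a resolver records what it needs

    def dispatch(self):
        keys = list(dict.fromkeys(self.queue))   # dedupe, preserve order
        self.queue.clear()
        return self.batch_fn(keys)               # ONE query for all keys

db_queries = []
def fetch_users(ids):
    db_queries.append(ids)                       # count database round trips
    return {i: {"id": i, "name": f"user-{i}"} for i in ids}

loader = Loader(fetch_users)
for order_user_id in [1, 2, 1, 3]:   # naive resolvers would run 4 queries
    loader.load(order_user_id)
users = loader.dispatch()            # one batched query for users 1, 2, 3
```

Real DataLoader implementations also cache per request and batch asynchronously at the end of each event-loop tick, but the core idea is the same: collect keys, fetch once.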

Error Handling and Response Design

Consistent error responses are critical for API usability. Error response format: include an error code (machine-readable, stable across versions), a message (human-readable), and optionally a details array with field-level validation errors. Example: {"error": {"code": "VALIDATION_ERROR", "message": "Invalid request parameters", "details": [{"field": "email", "message": "must be a valid email address"}, {"field": "age", "message": "must be at least 18"}]}}. Use error codes, not HTTP status codes, for programmatic error handling — multiple error conditions may share the same HTTP status (400). Document all error codes in your API reference. For 5xx errors, return a generic message ("Internal server error") and log the details server-side — never expose stack traces, database errors, or internal paths to the client.
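A small builder keeps the error shape above consistent across endpoints. The VALIDATION_ERROR code and field messages come from the example in the text; the validation_error helper itself is illustrative.

```python
# Sketch of a shared error builder: one function produces the error shape
# described above, so every endpoint emits identical structure.

def validation_error(field_errors):
    """field_errors: list of (field, message) pairs from validation."""
    return {
        "error": {
            "code": "VALIDATION_ERROR",      # machine-readable, stable
            "message": "Invalid request parameters",  # human-readable
            "details": [{"field": f, "message": m}
                        for f, m in field_errors],
        }
    }

resp = validation_error([
    ("email", "must be a valid email address"),
    ("age", "must be at least 18"),
])
```

Clients branch on resp["error"]["code"], never on the HTTP status alone, since many distinct error conditions share a 400.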

Frequently Asked Questions

How does cursor-based pagination work, and why is it better than offset pagination?

Offset pagination uses OFFSET and LIMIT in SQL: SELECT * FROM orders ORDER BY id LIMIT 10 OFFSET 1000. The database must scan and discard 1,000 rows before returning 10 — O(offset + limit) per query. At page 1000, the database scans 10,000 rows to return 10. Additionally, if new rows are inserted while paginating, rows shift and the client may see duplicates or miss rows. Cursor-based pagination uses a pointer to the last seen item: SELECT * FROM orders WHERE id > last_seen_id ORDER BY id LIMIT 10. This uses an index seek — O(limit) regardless of how deep into the result set you are. The cursor (last_seen_id) is encoded as an opaque string (base64) and returned in the response as next_cursor; the client passes it back as the after parameter. Stability: new inserts do not affect pagination because the cursor is positional, not offset-based. Limitation: cursor pagination does not support jumping to an arbitrary page (e.g. page 50). If random page access is required, consider a hybrid approach: cursors for sequential navigation and a search endpoint for jumping to specific ranges.

How do you implement idempotency keys for safe payment API retries?

The client generates a UUID and includes it in the request header (Idempotency-Key: uuid). Server flow: (1) Check if the idempotency key exists in the store (Redis or PostgreSQL). (2) If not found: create a record with status IN_PROGRESS and a hash of the request, process the request, update the record with the response and status COMPLETED, and return the response. (3) If found with status COMPLETED: return the stored response without re-processing. (4) If found with status IN_PROGRESS: return 409 Conflict (another request with the same key is being processed). (5) If found but the request body differs from the stored request hash: return 422 Unprocessable Entity (reusing an idempotency key with different parameters is an error). Storage: use Redis with a 24-hour TTL for the idempotency records; for financial operations, also store in PostgreSQL for durability. The request hash binds the idempotency key to a specific request, so the client cannot accidentally reuse a key for a different operation. Stripe implements this pattern and supports idempotency keys on its mutating API calls.

When should you use GraphQL instead of REST, and what are the pitfalls?

Use GraphQL when: (1) Multiple clients need different data shapes — a mobile app needs a subset of fields, a web app needs more, and an admin dashboard needs everything. With REST, you either over-fetch (return all fields) or maintain multiple endpoints; GraphQL lets each client request exactly the fields it needs. (2) The frontend team iterates faster than the backend team — GraphQL allows frontend developers to change their data requirements without backend API changes. (3) You have deeply nested relationships — a user has orders, each order has items, each item has a product with reviews. REST requires multiple requests or complex include parameters; GraphQL fetches the entire graph in one request. Pitfalls: (1) The N+1 query problem — a naive GraphQL resolver fetches each related entity individually. Solution: use DataLoader to batch and cache database queries within a single request. (2) Query complexity attacks — a malicious client can send a deeply nested query that consumes excessive server resources. Solution: implement query depth limiting and query cost analysis. (3) Caching is harder — REST GET requests are cached by HTTP caches (CDN, browser), while GraphQL uses POST requests, which are not cached by default. Solution: use persisted queries (pre-registered query strings) with GET requests, or application-level caching.

How should you design API rate limiting for different tiers of users?

Tiered rate limiting applies different limits based on the consumer's plan or authentication status. Implementation: (1) Identify the consumer — extract the API key or OAuth token from the request and look up the associated plan (free, starter, enterprise); unauthenticated requests are rate-limited by IP address. (2) Apply per-tier limits — free: 60 requests per hour, starter: 1,000 per hour, enterprise: 10,000 per hour. Store limits in a configuration service so they can be adjusted without a deployment. (3) Use the token bucket algorithm in Redis with one bucket key per consumer, running the read-refill-decrement sequence in a Lua script for atomicity. (4) Return rate limit headers in every response: X-RateLimit-Limit (the limit for this tier), X-RateLimit-Remaining (requests left in the current window), X-RateLimit-Reset (Unix timestamp when the window resets), and Retry-After (seconds to wait, included with 429 responses). (5) Differentiate by endpoint — write endpoints (POST, PUT, DELETE) may have stricter limits than read endpoints (GET), and a search endpoint may have a separate, lower limit due to its computational cost. Document all limits clearly in your API reference.