What is an API Marketplace?
An API marketplace (like Stripe, Twilio, RapidAPI, or AWS API Gateway) lets external developers discover, subscribe to, and consume APIs, with metered billing, rate limiting, and analytics.

Core components:
- Developer Portal: API discovery, documentation, interactive sandbox, API key management.
- API Gateway: authenticates requests, enforces rate limits, routes to backend services, collects usage metrics.
- Billing Engine: meters API calls, applies pricing tiers, generates invoices.
- Analytics: per-API, per-consumer usage dashboards.

The key system design challenges: rate limiting at scale (millions of requests per second), accurate metered billing (no missed or double-counted calls), and low-latency gateway processing (< 5 ms of overhead per request).
API Key Authentication and Routing
import hashlib
import json
from datetime import datetime, timezone

class APIGateway:
    def handle_request(self, request: Request) -> Response:
        # 1. Extract and validate API key
        api_key = request.headers.get("X-API-Key")
        if not api_key:
            return Response(401, "Missing API key")

        # 2. Look up the key in Redis (cached from the DB).
        # Key: apikey:{sha256(api_key)} -> JSON {consumer_id, plan_id, is_active}.
        # Use a stable cryptographic hash: Python's built-in hash() is
        # randomized per process and unsuitable as a Redis key.
        key_hash = hashlib.sha256(api_key.encode()).hexdigest()
        raw = self.redis.get(f"apikey:{key_hash}")
        key_data = json.loads(raw) if raw else None
        if not key_data or not key_data["is_active"]:
            return Response(401, "Invalid or inactive API key")

        # 3. Rate limit check (see below)
        if not self.rate_limiter.allow(key_data["consumer_id"],
                                       key_data["plan_id"]):
            return Response(429, "Rate limit exceeded")

        # 4. Route to backend
        backend = self.router.get_backend(request.path)
        response = backend.forward(request)

        # 5. Async usage logging (fire and forget)
        self.usage_logger.log_async(UsageEvent(
            consumer_id=key_data["consumer_id"],
            api_id=backend.api_id,
            plan_id=key_data["plan_id"],
            endpoint=request.path,
            method=request.method,
            status_code=response.status_code,
            latency_ms=response.latency_ms,
            timestamp=datetime.now(timezone.utc),
        ))
        return response
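The lookup above assumes API keys are stored and cached by hash rather than in plaintext. As a minimal sketch under that assumption (the function names and the `sk_live_` prefix are illustrative, not part of the original design): generate keys with a CSPRNG and persist only a SHA-256 digest, so a leaked key table cannot be replayed.

```python
import hashlib
import secrets

def generate_api_key() -> tuple[str, str]:
    """Return (plaintext_key, key_hash). Only the hash is persisted."""
    plaintext = "sk_live_" + secrets.token_urlsafe(32)  # shown to the developer once
    key_hash = hashlib.sha256(plaintext.encode()).hexdigest()
    return plaintext, key_hash

def redis_cache_key(api_key: str) -> str:
    """Stable cache key for gateway lookups (never Python's hash())."""
    return "apikey:" + hashlib.sha256(api_key.encode()).hexdigest()

plaintext, stored_hash = generate_api_key()
assert redis_cache_key(plaintext) == "apikey:" + stored_hash
```

On login-less machine-to-machine auth like this, the plaintext key exists only in the developer's hands and in the request header; the gateway and portal compare digests.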
Rate Limiting: Token Bucket with Redis
Token bucket algorithm in Redis Lua (atomic, no race conditions): each consumer has a bucket with a capacity and refill rate (defined by their plan). The Lua script runs atomically on Redis — no other commands can interleave.
-- Redis Lua script: token_bucket.lua
local key = KEYS[1]
local capacity = tonumber(ARGV[1])
local refill_rate = tonumber(ARGV[2])  -- tokens per second
local now = tonumber(ARGV[3])          -- current time in seconds, passed in by the caller
local requested = tonumber(ARGV[4])

local bucket = redis.call("HMGET", key, "tokens", "last_refill")
local tokens = tonumber(bucket[1]) or capacity
local last_refill = tonumber(bucket[2]) or now

-- Refill tokens based on elapsed time, capped at capacity
local elapsed = math.max(0, now - last_refill)
tokens = math.min(capacity, tokens + elapsed * refill_rate)

if tokens >= requested then
  redis.call("HMSET", key, "tokens", tokens - requested, "last_refill", now)
  redis.call("EXPIRE", key, 86400)
  return 1  -- allowed
else
  redis.call("HMSET", key, "tokens", tokens, "last_refill", now)
  redis.call("EXPIRE", key, 86400)
  return 0  -- denied
end
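For intuition, the same refill-and-consume logic can be mirrored in plain Python. This is a single-process sketch only: in production the atomicity comes from running the Lua script on Redis, not from this class.

```python
class TokenBucket:
    """In-memory mirror of the Lua script's refill-and-consume logic (illustrative only)."""

    def __init__(self, capacity: float, refill_rate: float):
        self.capacity = capacity
        self.refill_rate = refill_rate  # tokens per second
        self.tokens = capacity          # bucket starts full, like the Lua default
        self.last_refill = 0.0

    def allow(self, now: float, requested: float = 1.0) -> bool:
        # Refill based on elapsed time, capped at capacity
        elapsed = max(0.0, now - self.last_refill)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_rate)
        self.last_refill = now
        if self.tokens >= requested:
            self.tokens -= requested
            return True
        return False

bucket = TokenBucket(capacity=5, refill_rate=1.0)  # 5-token burst, 1 token/sec
results = [bucket.allow(now=0.0) for _ in range(6)]
# results == [True, True, True, True, True, False]: the burst drains the bucket
later = bucket.allow(now=1.0)  # one token has refilled after a second -> True
```

Note how the burst behavior described in the FAQ falls out of the math: a full bucket absorbs `capacity` back-to-back requests, after which throughput is bounded by `refill_rate`.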
Metered Billing Pipeline
Every API call is a billing event. Pipeline:
- The API gateway logs usage events to Kafka (fire-and-forget, < 1 ms).
- A Kafka consumer aggregates events in 1-minute micro-batches and writes to a usage_events table in ClickHouse (or BigQuery).
- Monthly invoice generation: at billing cycle end, query total calls per (consumer, api, tier) from ClickHouse.
- Apply pricing rules: first N calls free, next M calls at $0.001 each, calls over that at $0.0005 each (tiered pricing).
- Generate an Invoice record in Postgres and charge via Stripe.

For high-volume APIs: pre-aggregate usage counters in Redis (INCR usage:{consumer_id}:{api_id}:{day}) for real-time usage dashboards. Reconcile the Redis counters against ClickHouse aggregates at billing time to catch any Kafka lag discrepancies.
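The tiered pricing rule above ("first N free, next M at $0.001, the rest at $0.0005") can be sketched with concrete breakpoints — here 1M free, the next 9M at $0.001, and everything above 10M at $0.0005, numbers chosen for illustration. Using Decimal avoids float rounding errors on invoices.

```python
from decimal import Decimal

FREE_TIER = 1_000_000        # first 1M calls/month free
MID_TIER_CAP = 10_000_000    # calls 1M..10M billed at MID_RATE
MID_RATE = Decimal("0.001")  # dollars per call in the middle tier
HIGH_RATE = Decimal("0.0005")

def monthly_charge(total_calls: int) -> Decimal:
    """Apply tiered pricing to a month's total call count."""
    mid_calls = max(0, min(total_calls, MID_TIER_CAP) - FREE_TIER)
    high_calls = max(0, total_calls - MID_TIER_CAP)
    return mid_calls * MID_RATE + high_calls * HIGH_RATE

# 12M calls: 9M * $0.001 + 2M * $0.0005 = $9,000 + $1,000 = $10,000
assert monthly_charge(12_000_000) == Decimal("10000")
```

Each tier is computed independently from the total, so the function is a pure map from the reconciled call count to a charge: the same input always produces the same invoice line, which matters for billing-dispute audits.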
API Analytics and Developer Dashboard
Real-time metrics per consumer per API: requests per minute, error rate (4xx/5xx), p50/p99 latency, top endpoints, geographic distribution.

Architecture:
- ClickHouse stores raw usage events with sub-second ingestion lag (columnar, optimized for analytical queries).
- Pre-aggregated materialized views serve common queries: requests_per_hour, error_rate_by_endpoint, latency_percentiles_daily.
- The developer dashboard queries the pre-aggregated tables for fast response (< 100 ms).

SLA monitoring: per-API uptime and latency guarantees are computed from the same usage events; alert when an SLA breaches its threshold. API deprecation: track usage of deprecated API versions per consumer, and notify high-usage consumers before the sunset date.
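As a toy illustration of the dashboard metrics (in production these come from ClickHouse aggregates, not in-process Python), here is how error rate and nearest-rank latency percentiles might be computed over a batch of usage events:

```python
import math

def percentile(sorted_vals: list, p: float):
    """Nearest-rank percentile over a pre-sorted, non-empty list."""
    idx = max(0, math.ceil(p / 100 * len(sorted_vals)) - 1)
    return sorted_vals[idx]

def summarize(events: list) -> dict:
    """events: list of (status_code, latency_ms) tuples from the gateway."""
    latencies = sorted(lat for _, lat in events)
    errors = sum(1 for status, _ in events if status >= 400)
    return {
        "error_rate": errors / len(events),
        "p50_ms": percentile(latencies, 50),
        "p99_ms": percentile(latencies, 99),
    }

events = [(200, 10), (200, 12), (500, 300), (200, 11)]
stats = summarize(events)
# error_rate = 0.25, p50_ms = 11, p99_ms = 300
```

This also shows why p99 is the SLA metric of choice: a single slow error (300 ms) barely moves the median but dominates the tail.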
Frequently Asked Questions

Q: How does token bucket rate limiting differ from fixed window rate limiting?
A: Fixed window counts requests in discrete time windows (e.g., 100 requests per minute) and resets the counter at the start of each window. Problem: a burst of 100 requests at 12:00:59 and another 100 at 12:01:01 both pass the limit, yet 200 requests arrive within 2 seconds (the boundary effect). Token bucket: a bucket accumulates tokens at a fixed rate (e.g., 100 tokens per minute, about 1.67 tokens/second); each request consumes one token, and tokens cap at the bucket capacity. If no requests arrive for a while, tokens accumulate up to capacity, allowing a burst; the burst then drains the bucket, and requests are rate-limited until tokens refill. Token bucket accurately models "N requests per time period" without boundary effects, and allowing short bursts (up to capacity) is more user-friendly than fixed window's abrupt resets.

Q: Why is a Redis Lua script used for rate limiting instead of a regular Redis transaction?
A: A rate limit check-and-update involves multiple Redis commands: GET the current token count, compute the new count, SET the updated value. Between the GET and SET, another request from the same consumer could read the same (pre-decrement) token count, causing both requests to pass even if only one token remains. Redis MULTI/EXEC with WATCH (optimistic concurrency) aborts if the key changes between WATCH and EXEC, forcing the client to retry; under high concurrency this can loop many times. A Lua script runs atomically on the Redis server: all commands in the script execute as a single unit, with no other command interleaving. The script reads, computes, and writes in one atomic operation, eliminating the race condition without client-side retry loops.

Q: How do you implement tiered pricing for API calls accurately at scale?
A: Tiered pricing example: first 1M calls/month free, next 9M at $0.001 each, above 10M at $0.0005 each. Accurate billing requires counting every call and applying tier breakpoints. At scale (billions of calls), counting in real time in a relational DB is too slow. Architecture: (1) the API gateway increments a Redis counter (INCR usage:{consumer}:{api}:{month}) on each call, which is O(1) and sub-millisecond; (2) usage events are also sent to Kafka for durability (Redis could lose data on crash); (3) a monthly batch job reads the authoritative totals from the Kafka-fed aggregates (ClickHouse or BigQuery) for billing accuracy: Redis for real-time display, the event log for billing; (4) apply tiered pricing: if total_calls <= 1M, charge $0; if 1M < calls <= 10M, charge (calls - 1M) * $0.001; if calls > 10M, charge 9M * $0.001 + (calls - 10M) * $0.0005.

Q: How do you handle API key rotation without downtime?
A: API key rotation flow: (1) the developer generates a new API key in the portal; (2) old and new keys are both active simultaneously (grace period: 24-48 hours); (3) the developer updates their application to use the new key; (4) the developer revokes the old key. The system must support multiple active keys per consumer during the grace period. Database: a consumer_api_keys table (consumer_id, key_hash, status, created_at, revoked_at) that allows multiple ACTIVE keys per consumer; on revocation, set status=REVOKED and update revoked_at. Redis cache: cache key lookups with a short TTL (60 seconds); on revocation, explicitly invalidate the cache entry (DEL apikey:{hash}). This ensures revoked keys stop working within 60 seconds without waiting for cache expiry.

Q: What is an API gateway and how does it differ from a reverse proxy?
A: A reverse proxy (Nginx, HAProxy) routes HTTP requests to backend servers: load balancing, SSL termination, connection pooling. It is protocol-level and does not understand API concepts. An API gateway sits on top of a reverse proxy layer and adds API-specific functionality: authentication (API key, OAuth JWT validation), per-consumer rate limiting, request/response transformation (field filtering, protocol translation), API versioning (routing /v1 and /v2 to different backends), usage metering, developer portal integration, circuit breaking, and API analytics. In practice, API gateways (Kong, AWS API Gateway, Apigee) often use Nginx or Envoy as the underlying HTTP layer and add the API-specific logic on top. The key distinction: a reverse proxy routes; an API gateway enforces API policies.
See also: Stripe Interview Prep
See also: Cloudflare Interview Prep
See also: Shopify Interview Prep