System Design: Real-Time Bidding (RTB) Platform — Ad Auction in 100ms (2025)

RTB Architecture Overview

Real-Time Bidding is the programmatic ad auction system that runs every time a user loads a web page with an ad slot. Timeline: user visits a page → publisher sends a bid request to an SSP (Supply-Side Platform) → SSP broadcasts the bid request to 10-50 DSPs (Demand-Side Platforms) → each DSP evaluates and responds with a bid in < 100ms → SSP runs the auction (second-price) → winning DSP is notified → ad is served. The entire cycle must complete in < 100-150ms wall-clock time before the page renders, which means DSP bidding logic has a hard budget of ~50ms including network round-trip. This is one of the most latency-sensitive distributed systems at scale: 5-10 million bid requests per second globally.
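The ~50ms DSP-side budget can be sanity-checked with back-of-envelope arithmetic. The per-stage numbers below are illustrative assumptions, not measurements:

```python
# Back-of-envelope latency ledger for one auction. All numbers are
# illustrative assumptions consistent with the ~50ms DSP budget above.
BUDGET_MS = 50          # DSP budget, including network round-trip
NETWORK_RTT_MS = 15     # assumed SSP <-> DSP round trip
stages_ms = {
    "user_profile_lookup": 5,     # Redis fetch
    "campaign_matching": 10,      # precomputed index scan
    "model_inference": 10,        # CTR / bid-price model
    "response_serialization": 2,
}
compute_ms = sum(stages_ms.values())                  # 27ms of DSP work
headroom_ms = BUDGET_MS - NETWORK_RTT_MS - compute_ms
print(headroom_ms)  # 8ms of slack before the SSP deadline
```

If any stage overruns, the slack is consumed first; the timeout guard in the bidding engine exists precisely because this slack can go negative under load.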

Bid Request and Response

The OpenRTB standard (IAB) defines the bid request/response schema. Key fields in the bid request: id (auction ID), imp (impression array with ad slot specs: size, floor price, placement type), site/app (publisher URL, category, language), user (user ID, plus age and gender if available), device (IP, user agent, geo, device type), and at (auction type: 1 = first-price, 2 = second-price). The DSP responds with id (matching the request ID) and seatbid[].bid entries containing: price (CPM in USD), adid (creative ID), adm (ad markup), nurl (win notification URL), and lurl (loss notification URL).
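A minimal OpenRTB 2.x-style request/response pair, built from the fields above; all values (IDs, prices, URLs) are illustrative:

```python
import json

# Minimal OpenRTB 2.x-style bid request (illustrative values only).
bid_request = {
    "id": "auction-123",
    "at": 2,  # auction type: 2 = second-price
    "imp": [{
        "id": "1",
        "banner": {"w": 300, "h": 250},  # ad slot size
        "bidfloor": 0.50,                # floor price, CPM in USD
    }],
    "site": {"page": "https://news.example.com/article", "cat": ["IAB12"]},
    "device": {"ua": "Mozilla/5.0", "ip": "203.0.113.7",
               "geo": {"country": "USA"}},
    "user": {"id": "u-789"},
}

# Matching bid response. price is CPM in USD; the SSP substitutes the
# clearing price for ${AUCTION_PRICE} when it fires the nurl.
bid_response = {
    "id": bid_request["id"],  # must echo the auction ID
    "seatbid": [{
        "bid": [{
            "impid": "1",
            "price": 2.75,
            "adid": "creative-42",
            "adm": "<div>ad markup</div>",
            "nurl": "https://dsp.example.com/win?aid=auction-123&price=${AUCTION_PRICE}",
        }]
    }],
}

print(json.dumps(bid_response, indent=2))
```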

# Bid request processing (DSP side). user_store, campaign_index, and rank
# are stand-ins for the components described elsewhere in this article.
import time
from typing import Optional

class BidEngine:
    def process_bid_request(self, request: BidRequest) -> Optional[BidResponse]:
        start = time.monotonic_ns()

        # 1. User lookup: fetch user profile from Redis (< 5ms budget)
        profile = self.user_store.get(request.user.id)

        # 2. Campaign matching: precomputed index of campaigns eligible for
        #    this slot, filtered by geo, frequency cap, and remaining budget
        candidates = self.campaign_index.match(request, profile)
        if not candidates:
            return None

        # 3. Pricing: rank candidates by predicted value, compute bid CPM
        best_campaign, bid_price = self.rank(candidates, request, profile)

        # 4. Deadline check: abandon the auction if we are out of time
        elapsed_ms = (time.monotonic_ns() - start) / 1_000_000
        if elapsed_ms > 45:  # timeout guard
            return None

        return BidResponse(
            id=request.id,
            seatbid=[SeatBid(bid=[Bid(
                price=bid_price,
                adid=best_campaign.creative_id,
                nurl=f"https://dsp.example.com/win?aid={request.id}&price=${{AUCTION_PRICE}}"
            )])]
        )

Budget Pacing with Redis

Advertisers set daily budgets. DSPs must pace spend evenly (not blow the budget in the first hour). Token bucket pacing: each campaign has a Redis counter for tokens. A background job refills tokens every second at refill_rate = daily_budget / 86400 in budget units (equivalently, daily_budget / (86400 * avg_cpm / 1000) impressions per second at the average CPM). On each bid: DECRBY the tokens counter by the bid price; if the counter is exhausted, skip the auction. A throttle ratio = actual_spend / ideal_spend adjusts pacing: if the ratio is > 1.0, slow down; if it is < 0.8, speed up.
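A minimal in-memory sketch of the token bucket, with tokens denominated in dollars. The class and method names are illustrative; a production DSP would keep the counter in Redis (DECRBY/INCRBY on integer micro-dollars) rather than in process memory:

```python
class PacingBucket:
    """Token-bucket pacer; tokens are budget dollars. An in-memory
    stand-in for the per-campaign Redis counter described above."""

    def __init__(self, daily_budget: float):
        self.daily_budget = daily_budget
        self.refill_per_sec = daily_budget / 86400  # even spend all day
        self.tokens = 0.0

    def refill(self, seconds: float = 1.0) -> None:
        # Background job: runs every second, capped at the daily budget.
        self.tokens = min(self.tokens + self.refill_per_sec * seconds,
                          self.daily_budget)

    def try_bid(self, bid_price_cpm: float) -> bool:
        # One impression at bid_price_cpm CPM costs price/1000 dollars.
        cost = bid_price_cpm / 1000
        if self.tokens < cost:
            return False         # bucket empty: skip this auction
        self.tokens -= cost      # optimistic deduction at bid time
        return True

bucket = PacingBucket(daily_budget=864.0)  # $864/day -> $0.01/sec refill
bucket.refill(seconds=10)                  # 10s of refill = $0.10
print(bucket.try_bid(50.0))                # costs $0.05 -> True
print(bucket.try_bid(80.0))                # $0.08 > $0.05 left -> False
```

The optimistic deduction at bid time is reconciled against the actual clearing price when the win notification arrives.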

Win/Loss Notification and Attribution

When the SSP selects a winner: the SSP fires the win notification URL (nurl) with the clearing price substituted for ${AUCTION_PRICE}. The DSP receives this HTTP callback and: (1) records the win and clearing price, (2) deducts the actual clearing price from the budget (vs. the bid price deducted optimistically), (3) credits back the difference. For second-price auctions, the clearing price < bid price in most cases. Loss notifications (lurl) are optionally fired for losing bids. Attribution: when the served ad is clicked or viewed, the DSP fires impression/click pixels, logging event data to Kafka. Attribution joins click events to the original bid record to credit conversions to the correct campaign.
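A sketch of the budget reconciliation step of the nurl callback, using plain dicts as stand-ins for the bid log and the Redis budget counters (the function name and dict shapes are illustrative):

```python
def handle_win_notification(bid_id: str, clearing_price_cpm: float,
                            bids: dict, budgets: dict) -> float:
    """Reconcile the optimistic deduction made at bid time against the
    actual clearing price reported on the nurl callback. bids maps
    bid_id -> (campaign_id, bid_price_cpm); budgets maps
    campaign_id -> remaining budget in dollars."""
    campaign_id, bid_price_cpm = bids[bid_id]
    # Second-price auction: clearing price <= bid price, so the
    # difference is credited back to the campaign's budget.
    refund = (bid_price_cpm - clearing_price_cpm) / 1000  # per impression
    budgets[campaign_id] += refund
    return refund

bids = {"auction-123": ("camp-7", 2.75)}   # bid 2.75 CPM at auction time
budgets = {"camp-7": 10.0}
refund = handle_win_notification("auction-123", 1.90, bids, budgets)
# Credits back (2.75 - 1.90) / 1000 dollars for this impression
```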

Frequency Capping

Frequency cap: limit how many times a user sees an ad (e.g., max 3 impressions per day per campaign). Implementation: Redis sorted set per (user_id, campaign_id) with impression timestamps. On each bid: ZRANGEBYSCORE to count impressions in the last 24 hours. If count >= cap: do not bid. ZADD the timestamp on win notification. Expiry: ZREMRANGEBYSCORE to remove entries older than 24 hours. TTL on the key = 25 hours. Cross-device frequency capping: use a device graph to link user IDs across devices and aggregate frequency counts. For very high scale (100M+ users): use a Bloom filter or Count-Min Sketch per campaign to approximate frequency — allows false positives (skip some valid impressions) but prevents over-serving with O(1) operations.
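The exact (sorted set) variant can be sketched as follows, with a plain dict of timestamp lists standing in for Redis (each list mirrors one sorted set; the Redis commands each step corresponds to are noted in comments):

```python
CAP = 3                # max impressions per campaign per user
WINDOW = 24 * 3600     # 24-hour window, in seconds

def should_bid(impressions: dict, user_id: str, campaign_id: str,
               now: float) -> bool:
    """Frequency-cap check. impressions maps (user_id, campaign_id) to
    a list of impression timestamps, an in-memory stand-in for the
    per-(user, campaign) Redis sorted set."""
    key = (user_id, campaign_id)
    ts = impressions.get(key, [])
    # ZREMRANGEBYSCORE: drop entries older than the 24h window
    ts = [t for t in ts if t > now - WINDOW]
    impressions[key] = ts
    # Count entries in window: bid only if the user is under the cap
    return len(ts) < CAP

def record_win(impressions, user_id, campaign_id, now) -> None:
    # ZADD on win notification: record the impression timestamp
    impressions.setdefault((user_id, campaign_id), []).append(now)

imps, now = {}, 1_000_000.0
for _ in range(3):
    assert should_bid(imps, "u1", "c1", now)
    record_win(imps, "u1", "c1", now)
print(should_bid(imps, "u1", "c1", now))               # capped -> False
print(should_bid(imps, "u1", "c1", now + WINDOW + 1))  # expired -> True
```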


Frequently Asked Questions

Why must DSPs respond to bid requests in under 100ms?

The RTB auction must complete before the web page renders the ad slot, typically within 100-150ms of page load. The SSP deducts its own processing time (10-20ms), leaving 80-100ms for DSPs to receive the bid request, evaluate it, and return a response. The SSP sets a timeout and ignores any response after the deadline; DSPs that consistently time out are penalized with reduced bid request volume. This forces DSPs to optimize every component: feature retrieval from Redis in under 5ms, model inference in under 10ms, campaign matching via precomputed indexes rather than database queries, and response serialization within the remaining budget.

How does smooth budget pacing avoid early budget exhaustion?

Smooth pacing distributes ad spend across the day while respecting that traffic volume is not uniform (it peaks in the morning and evening). The ideal spend curve is precomputed from historical traffic data: for each hour, what fraction of daily impressions typically occurs? At any point in the day, ideal_spend_so_far = daily_budget * cumulative_traffic_fraction_so_far. A throttle ratio = actual_spend / ideal_spend controls the bid rate: below 0.8, increase the bid rate (underpacing, risk of not spending the budget); above 1.0, reduce it (overpacing, risk of exhausting the budget early). Implementation: a Redis counter tracks actual spend; a background job recomputes the throttle ratio every second and writes it to a Redis key that the bidding engine reads before each bid decision.

How does frequency capping at scale use probabilistic data structures?

Exact frequency capping requires a Redis sorted set per (user, campaign) to track impression timestamps: O(1) per check but O(users * campaigns) memory, which is prohibitive at billion-user scale. The probabilistic alternative is a Count-Min Sketch (CMS) per campaign: a 2D array of counters with multiple hash functions. On each impression, increment CMS[h_i(user_id)] for each hash function i; the frequency estimate is the minimum of those counters. A CMS overestimates but never underestimates, so it may over-cap a user (show fewer ads than the cap allows) but never under-cap (never exceeds the cap). Space is O(campaigns) instead of O(users * campaigns). A Bloom filter variant answers only the binary question "has this user seen >= N impressions?" and is even more compact.
