Question 1

How does Google Maps find the fastest route on a graph with billions of edges?

Accepted Answer

Basic Dijkstra is too slow for real-time queries on a continent-scale road graph. Google Maps uses Contraction Hierarchies (CH): during preprocessing, the algorithm repeatedly contracts the least important nodes by adding shortcut edges between their neighbors. The augmented graph enables bidirectional Dijkstra that expands very few nodes -- sub-second queries on billions of edges. For traffic-aware routing: edge weights change with real-time traffic from millions of GPS traces. A hybrid approach uses CH for the highway backbone (less affected by traffic) and real-time Dijkstra for last-mile urban routing. Alternative routes are computed by penalizing edges in the optimal route and re-running the search, producing 2-3 meaningfully different options.

Question 2

How does Google Maps render the visual map efficiently?

Accepted Answer

Modern maps use vector tiles: instead of pre-rendered images, the server sends raw geographic data (road geometries, building outlines, labels) as Protocol Buffer-encoded vector tiles. The client renders on the GPU. Benefits: 10x smaller than raster tiles, smooth zooming without pixelation, dynamic styling (day/night mode without re-downloading), and rotation/tilt support. Tiles are keyed by (zoom_level, x, y). At zoom level 18 (street level): approximately 69 billion possible tiles. Most are never requested (oceans, empty areas). Tiles are stored in object storage, served via CDN with long cache TTLs, and only rendered on-demand for unpopular areas. Popular urban areas are pre-rendered. Google Maps, Mapbox, and Apple Maps all use vector tiles.

Question 3

How does Google Maps predict ETA accurately?

Accepted Answer

Google uses an ML model (DeepMind collaboration) combining: real-time GPS traces from millions of Android phones (average speed per road segment, updated every few minutes), historical traffic patterns (Monday rush hour is predictable), live incident data (accidents, construction from user reports), and weather data. A graph neural network operates on the road network: node features include road type, speed limit, lanes; edge features include current speed and historical patterns. The model predicts travel time per segment and sums for the full route. It captures complex interactions like traffic spillover to alternate routes. Training data: billions of historical trips with actual travel times. Display: ranges during planning (25-35 min), single estimate during navigation updated every minute.

Question 4

What geospatial indexing does Google Maps use?

Accepted Answer

Google Maps uses S2 Geometry internally. S2 divides the Earth into hierarchical cells using a space-filling curve (Hilbert curve projected onto a sphere). Each cell has a unique 64-bit ID. Cells at the same level are roughly equal in area -- unlike geohash which distorts near the poles. S2 supports: containment queries (is this point inside this polygon?), proximity queries (find POIs within 5 km), and covering (find the minimum set of cells that cover a given region). Alternatives: Geohash encodes lat/lng into strings where nearby locations share prefixes -- simpler but distorts at poles and has edge artifacts at cell boundaries. R-tree is a balanced tree for spatial indexing used by PostGIS. For system design interviews: mention geohash for simplicity or S2 for accuracy. Both support the key operation: find all items near a given point.

System Design: Design Google Maps — Geospatial Indexing, Routing, Tile Rendering, ETA Prediction, Offline Maps

Geospatial Data Model

Routing and Navigation

Map Tile Rendering

ETA Prediction and Traffic

Location Search and Geocoding