Read Replica Routing — Low-Level Design
Read replica routing directs read queries to replica databases and write queries to the primary, scaling read throughput without adding write capacity. The key challenges are replication lag, connection management, and graceful failover. This design comes up in interviews at Amazon, Netflix, and any high-traffic service with a read-heavy workload.
Replication Architecture
Primary DB (single writer)
↓ async streaming replication (PostgreSQL WAL shipping)
↓
Replica 1 (read-only) ← lag: typically 10-100ms
Replica 2 (read-only) ← lag: typically 10-100ms
Replica 3 (read-only) ← configured as hot standby for failover
Writes: always → Primary
Reads: most queries → Replica 1 or 2 (load balanced)
lag-sensitive reads → Primary
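Proxy-level routers (ProxySQL, Pgpool-II) make this routing decision by inspecting each statement. A naive sketch of that classification, with hypothetical helper names and deliberately simplified rules (real proxies parse SQL properly and also track transaction state):

```python
# Naive read/write classifier, as a proxy-style router might apply.
# Illustrative only: a CTE like "WITH ... INSERT" would be misrouted here.
READ_KEYWORDS = {"select", "show", "explain", "with"}

def route_for(sql: str) -> str:
    """Return 'replica' for plain reads, 'primary' for everything else."""
    first = sql.lstrip().split(None, 1)[0].lower().rstrip(";")
    if first == "select" and "for update" in sql.lower():
        return "primary"  # Locking reads must run on the writer
    return "replica" if first in READ_KEYWORDS else "primary"

print(route_for("SELECT * FROM users"))                 # replica
print(route_for("UPDATE users SET name='x'"))           # primary
print(route_for("SELECT 1 FROM t FOR UPDATE"))          # primary
```

Application-level routing (below) avoids this guesswork entirely: the caller states its intent by choosing a reader or writer connection.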
Application-Level Routing
from sqlalchemy import create_engine

class DatabaseRouter:
    def __init__(self):
        self.primary = create_engine(PRIMARY_DB_URL, pool_size=20)
        self.replicas = [
            create_engine(REPLICA_1_URL, pool_size=20),
            create_engine(REPLICA_2_URL, pool_size=20),
        ]
        self._replica_idx = 0

    def writer(self):
        return self.primary

    def reader(self, require_fresh=False):
        if require_fresh:
            return self.primary  # Bypass replicas for lag-sensitive reads
        # Round-robin across replicas
        engine = self.replicas[self._replica_idx % len(self.replicas)]
        self._replica_idx += 1
        return engine

db = DatabaseRouter()

# Usage:
def get_user_profile(user_id):
    return db.reader().execute("SELECT * FROM User WHERE id=%(id)s", {'id': user_id})

def get_user_after_update(user_id):
    # Must read our own write; use the primary
    return db.reader(require_fresh=True).execute("SELECT * FROM User WHERE id=%(id)s", {'id': user_id})

def update_user(user_id, data):
    return db.writer().execute("UPDATE User SET ...", data)
Read-Your-Writes Consistency
Problem: a user updates their profile (a write to the primary), then immediately fetches it (a read from a replica). Because of replication lag, the replica may not have the update yet, so the user sees their old profile. Confusing.

Solution 1: read from the primary for a short window after a write.

def update_and_read(user_id, updates):
    db.writer().execute("UPDATE User SET ... WHERE id=%(id)s", updates)
    # Flag: "this user wrote recently; route their reads to the primary"
    redis.setex(f'wrote_recently:{user_id}', 10, '1')  # 10-second window

def get_user(user_id):
    if redis.exists(f'wrote_recently:{user_id}'):
        return db.reader(require_fresh=True).execute(...)
    return db.reader().execute(...)

Solution 2: return the primary's LSN (log sequence number) with the write response, and have the reader wait until the replica has replayed past that LSN before querying. In PostgreSQL, call pg_current_wal_lsn() on the primary at write time and compare it against pg_last_wal_replay_lsn() on the replica.
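The LSN comparison can be done client-side. PostgreSQL prints an LSN as two hex words separated by a slash (e.g. 0/16B3748). A minimal sketch with hypothetical helper names, assuming the two LSN strings were fetched via the SQL functions above:

```python
def parse_lsn(lsn: str) -> int:
    """Convert a PostgreSQL text LSN like '0/16B3748' to a byte offset."""
    high, low = lsn.split("/")
    return (int(high, 16) << 32) | int(low, 16)

def replica_caught_up(write_lsn: str, replay_lsn: str) -> bool:
    """True once the replica has replayed up to the write's LSN."""
    return parse_lsn(replay_lsn) >= parse_lsn(write_lsn)

# After a write, store pg_current_wal_lsn(); before a read, poll the
# replica's pg_last_wal_replay_lsn() and fall back to the primary on timeout.
print(replica_caught_up("0/16B3748", "0/16B3747"))  # False, still behind
print(replica_caught_up("0/16B3748", "1/0"))        # True
```

In practice you would bound the wait (e.g. 100ms) and route to the primary if the replica has not caught up, rather than blocking the request indefinitely.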
Lag Monitoring and Replica Health
def check_replica_lag():
    """Monitor replication lag; remove lagging replicas from the pool."""
    primary_lsn = db.writer().execute(
        "SELECT pg_current_wal_lsn()"
    ).scalar()
    for i, replica in enumerate(db.replicas):
        try:
            # pg_wal_lsn_diff returns the byte distance between two LSNs
            lag_bytes = replica.execute(
                "SELECT pg_wal_lsn_diff(%(p)s::pg_lsn, pg_last_wal_replay_lsn())",
                {'p': primary_lsn},
            ).scalar()
            lag_mb = lag_bytes / (1024 * 1024)
            if lag_mb > 100:  # 100MB lag threshold
                log.warning(f'Replica {i} lag: {lag_mb:.1f}MB, marking unhealthy')
                mark_replica_unhealthy(i)
            else:
                mark_replica_healthy(i)
        except Exception:
            mark_replica_unhealthy(i)

The health-aware reader replaces the simple round-robin version above:

def reader(self, require_fresh=False):
    if require_fresh:
        return self.primary
    healthy = [r for i, r in enumerate(self.replicas) if is_healthy(i)]
    if not healthy:
        return self.primary  # Fallback: all replicas unhealthy, use primary
    self._replica_idx += 1
    return healthy[self._replica_idx % len(healthy)]
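The fallback path is worth unit-testing in isolation. A dependency-free sketch of the same selection rules, with hypothetical names and health flags passed in explicitly:

```python
def pick_reader(replicas, healthy, idx, require_fresh=False):
    """Mirror the router's rules: fresh reads and an all-unhealthy
    pool both go to the primary; otherwise round-robin the healthy set."""
    if require_fresh:
        return "primary"
    alive = [r for r, ok in zip(replicas, healthy) if ok]
    if not alive:
        return "primary"  # Graceful degradation: never fail reads outright
    return alive[idx % len(alive)]

replicas = ["replica-1", "replica-2"]
print(pick_reader(replicas, [True, True], 0))                       # replica-1
print(pick_reader(replicas, [False, True], 0))                      # replica-2
print(pick_reader(replicas, [False, False], 0))                     # primary
print(pick_reader(replicas, [True, True], 0, require_fresh=True))   # primary
```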
Connection Pooling Strategy
Each app server opens connections to both the primary and the replicas. Without pooling, that is N app servers × (primary + 2 replicas) × pool_size connections: 100 servers × 3 databases × 20 = 6,000 connections, while PostgreSQL's practical limit is roughly 500-1,000.

Solution: run PgBouncer as a connection pooler in front of each database. App servers connect to PgBouncer (cheap); PgBouncer maintains a small pool of real PostgreSQL connections (~50).

Primary: app_servers → PgBouncer (primary) → PostgreSQL primary (50 conns)
Replica: app_servers → PgBouncer (replica) → PostgreSQL replica (50 conns)

Transaction-mode pooling: PgBouncer assigns a database connection only for the duration of a transaction, then returns it to the pool. This reduces required connections by 10-50×.
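As a sketch, a PgBouncer configuration for this topology might look like the following. Hostnames, database names, and pool sizes are illustrative, not tuned values:

```ini
; pgbouncer.ini (illustrative fragment)
[databases]
app_primary = host=pg-primary.internal port=5432 dbname=app
app_replica1 = host=pg-replica-1.internal port=5432 dbname=app

[pgbouncer]
listen_port = 6432
; Release the server connection back to the pool at transaction end
pool_mode = transaction
; Real PostgreSQL connections per database
default_pool_size = 50
; Cheap client-side connections from app servers
max_client_conn = 5000
```

The application then connects to pgbouncer:6432 and selects app_primary or app_replica1 as its target database, keeping the routing decision in the application while PgBouncer handles multiplexing.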
Key Interview Points
- Replication lag is the fundamental tradeoff: Async replication is fast but introduces eventual consistency. Any read that must see the latest write (read-your-writes, payment confirmation) must go to the primary. Route all other reads to replicas.
- Fallback to primary when all replicas are unhealthy: The router must gracefully degrade. If all replicas are lagging badly or down, route reads to primary rather than returning errors. This trades performance for availability.
- PgBouncer is essential at scale: PostgreSQL has a hard limit on connections (~500 for typical configs). Without a pooler, N app servers × pool_size quickly exhausts this limit. PgBouncer multiplexes thousands of application connections over ~50 database connections.
- Analytics queries must never run on the primary: A full-table-scan analytics query on the primary can spike CPU and slow down production writes. Dedicated analytics replicas (or a data warehouse copy) are the right solution for batch/BI queries.