System Design: Database Connection Pooling — PgBouncer, HikariCP, Pool Sizing, Connection Limits, Performance

Database connection pooling is a critical optimization for any application that uses a relational database. Without pooling, every database query requires a new TCP connection (3-way handshake + TLS handshake + authentication = 10-40ms of overhead). A pool of pre-established connections eliminates this overhead, improving response times and reducing database load. This guide covers why pooling matters, how to size pools correctly, and production tools — essential knowledge for backend engineering interviews.

Why Connection Pooling Matters

Without pooling: each web request opens a new database connection (TCP handshake 1-3ms, TLS handshake 5-30ms, PostgreSQL authentication 2-5ms). Total: 10-40ms per connection setup. For a web server handling 1000 requests per second, that is 1000 new connections per second, each consuming a PostgreSQL backend process (10-20MB of memory). The PostgreSQL default max_connections is 100, so at 1000 requests/second connections are exhausted in 100ms. Requests queue, latency spikes, and the application becomes unresponsive.

With pooling: a pool of 20 pre-established connections is shared across all requests. A request borrows a connection from the pool (~0ms), executes the query, and returns the connection. The 10-40ms connection setup happens once, when the pool initializes, not per request. 20 connections can serve 1000 requests/second because each connection handles ~50 short queries/second (average query time 20ms). Connection reuse is the core value: amortize the expensive connection setup across thousands of requests.
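The arithmetic above can be sketched as a back-of-envelope calculation. All numbers are the illustrative figures from this section, not measurements:

```python
# Back-of-envelope figures from the text above; illustrative assumptions, not measurements.
SETUP_MS = 25      # midpoint of the 10-40ms connection-setup range
QUERY_MS = 20      # average short-query time
RPS = 1000         # request rate the server must sustain

# Without pooling, every request pays the setup cost on top of the query.
per_request_without_pool = SETUP_MS + QUERY_MS          # 45 ms
# With pooling, setup is paid once at pool initialization.
per_request_with_pool = QUERY_MS                        # 20 ms

# Each pooled connection serves 1000 / QUERY_MS queries per second,
# so the pool size needed to sustain RPS is:
queries_per_conn = 1000 / QUERY_MS                      # 50 queries/sec per connection
pool_size = RPS / queries_per_conn                      # 20 connections

print(per_request_without_pool, per_request_with_pool, int(pool_size))
```

Note the sizing logic only holds for short queries; long-running queries tie up connections and shrink the effective per-connection throughput.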

Application-Level Pooling: HikariCP

HikariCP is the fastest Java connection pool (default in Spring Boot). Configuration: (1) maximumPoolSize — the maximum number of connections in the pool. This limits the number of concurrent database sessions. (2) minimumIdle — minimum number of idle connections maintained. Set equal to maximumPoolSize for best performance (pre-create all connections). (3) connectionTimeout — how long a thread waits for a connection from the pool before throwing an exception. Default: 30 seconds. Set lower (5-10 seconds) to fail fast instead of queuing. (4) idleTimeout — how long an idle connection stays in the pool before being closed. Default: 10 minutes. (5) maxLifetime — maximum lifetime of a connection. Set slightly less than the database wait_timeout to prevent the database from closing the connection unexpectedly. Default: 30 minutes. Python: SQLAlchemy uses a built-in connection pool. Configure pool_size, max_overflow, pool_timeout, and pool_recycle. Node.js: the pg library for PostgreSQL supports pooling with max, idleTimeoutMillis, and connectionTimeoutMillis. Go: database/sql has built-in pooling with SetMaxOpenConns, SetMaxIdleConns, and SetConnMaxLifetime.
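The borrow/return and fail-fast semantics described above can be illustrated with a toy pool built on Python's stdlib queue. This is a minimal sketch of the concept, not any real library's implementation; real pools (HikariCP, SQLAlchemy) add health checks, idle/lifetime eviction, and far more careful concurrency handling:

```python
import queue

class MiniPool:
    """Toy pool illustrating maximumPoolSize and connectionTimeout semantics.
    make_conn is a hypothetical connection factory supplied by the caller."""

    def __init__(self, make_conn, max_size=20, timeout_s=5.0):
        self._idle = queue.Queue(maxsize=max_size)
        self.timeout_s = timeout_s
        for _ in range(max_size):          # minimumIdle == maximumPoolSize:
            self._idle.put(make_conn())    # pre-create every connection up front

    def acquire(self):
        try:                               # wait up to connectionTimeout, then
            return self._idle.get(timeout=self.timeout_s)
        except queue.Empty:                # fail fast instead of queuing forever
            raise TimeoutError("pool exhausted")

    def release(self, conn):
        self._idle.put(conn)               # return the connection for reuse

# Usage with a fake connection factory:
pool = MiniPool(make_conn=lambda: object(), max_size=2, timeout_s=0.1)
c1, c2 = pool.acquire(), pool.acquire()
try:
    pool.acquire()                         # a third borrower times out
except TimeoutError as e:
    print(e)                               # prints "pool exhausted"
pool.release(c1)                           # returning c1 makes it available again
```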

External Pooling: PgBouncer

PgBouncer is a lightweight connection pooler that sits between the application and PostgreSQL. The application connects to PgBouncer (which accepts thousands of connections with minimal memory) and PgBouncer maintains a small pool of actual PostgreSQL connections. Pooling modes: (1) Session pooling — a client gets a dedicated PostgreSQL connection for the entire session (from connect to disconnect). Same as application-level pooling but managed externally. (2) Transaction pooling — a client gets a PostgreSQL connection only for the duration of a transaction. Between transactions, the connection is returned to the pool. This is the most efficient mode: 100 PostgreSQL connections can serve thousands of concurrent application connections because most connections are idle between transactions. (3) Statement pooling — a client gets a connection for each individual SQL statement. Most aggressive but breaks multi-statement transactions and prepared statements. Why PgBouncer over application-level pooling: (1) Multiple application instances share the same PostgreSQL connection pool. Without PgBouncer: 10 application instances with pool_size=20 each open 200 PostgreSQL connections. With PgBouncer: 10 instances connect to PgBouncer, which maintains only 30-50 PostgreSQL connections. (2) Kubernetes pod scaling: as pods scale up/down, PgBouncer absorbs the connection churn without spiking PostgreSQL connection count.
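A transaction-pooling setup along these lines might look like the following pgbouncer.ini sketch. The parameter names (pool_mode, max_client_conn, default_pool_size) are real PgBouncer settings, but the host, database name, and numbers are placeholders to adapt; a production config also needs authentication settings (auth_type, auth_file) omitted here:

```ini
[databases]
; placeholder host and database name
appdb = host=10.0.0.5 port=5432 dbname=appdb

[pgbouncer]
listen_addr = 0.0.0.0
listen_port = 6432
pool_mode = transaction      ; connection held only for a transaction's duration
max_client_conn = 2000       ; lightweight client connections PgBouncer accepts
default_pool_size = 40       ; actual PostgreSQL connections per database/user pair
```

With this sketch, 10 application instances with 200 client connections between them would still hold only 40 real PostgreSQL connections.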

Pool Sizing: How Many Connections?

The most common mistake: making the pool too large. More connections do not mean better performance. PostgreSQL performance degrades with too many active connections due to context switching overhead (the OS scheduling hundreds of processes), lock contention (more processes competing for the same locks), and shared buffer contention (more processes evicting each other's cached pages). Formula: pool_size = 2 * number_of_CPU_cores + number_of_disk_spindles. For a server with 8 cores and an SSD: pool_size = 2 * 8 + 1 = 17; round to 20. The formula accounts for CPU-bound queries (which benefit from parallelism up to the core count) and I/O-bound queries (while one query waits for disk, another uses the CPU). With SSDs (near-zero seek time), the disk spindle term is ~1. Practical guidelines: start with 10-20 connections per database server, benchmark with realistic load, and increase only if connections are saturated (requests queuing for connections) while database CPU and I/O are not. For a read replica serving 5000 queries/second with 5ms average query time: each connection handles 200 queries/second, so 5000 / 200 = 25 connections are needed. A pool of 30 provides headroom.
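Both sizing calculations above are short enough to write down directly. The numbers are the section's worked examples:

```python
# Core-count formula from the section above (assumption: SSD, so spindle term ~1).
cores, spindles = 8, 1
pool_size = 2 * cores + spindles          # 2*8 + 1 = 17, rounded up to 20 in practice

# Read-replica sizing is Little's law: connections = arrival rate * service time.
qps, avg_query_s = 5000, 0.005            # 5000 queries/sec at 5ms average
connections_needed = qps * avg_query_s    # 25; provision ~30 for headroom
print(pool_size, connections_needed)
```

The Little's law form makes the tradeoff explicit: halving average query time halves the connections needed, which is why optimizing slow queries is often cheaper than growing the pool.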

Connection Pool Monitoring

Monitor these metrics to detect pool problems: (1) Active connections — connections currently executing a query. If this equals pool_size, the pool is saturated and requests are queuing. (2) Idle connections — connections waiting for a query. A healthy pool has some idle connections available. Zero idle during peak traffic indicates the pool is too small. (3) Waiting threads — threads/requests waiting for a connection. Non-zero means the pool is exhausted. Increase pool size or optimize slow queries to free connections faster. (4) Connection acquisition time — how long a request waits to get a connection from the pool. Should be near zero (microseconds). If this spikes, the pool is saturated. (5) Connection creation time — how long it takes to establish a new connection. Spikes indicate database overload or network issues. (6) Connection lifetime — how long connections live before being recycled. Very short lifetimes may indicate the database is killing connections (check wait_timeout). PostgreSQL: monitor with pg_stat_activity (active connections, their queries, and wait states). HikariCP exposes pool metrics via JMX or Micrometer. PgBouncer exposes SHOW STATS, SHOW POOLS, and SHOW CLIENTS via the admin console. Alert on: pool saturation (active = max for > 1 minute), connection acquisition timeout, and connection creation failures.
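The alert conditions above can be expressed as a small check function. This is a hedged sketch: the metric names and the 100ms acquisition threshold are hypothetical, and you would map the inputs to whatever your pool actually exposes (HikariCP via Micrometer/JMX, PgBouncer via SHOW POOLS):

```python
def pool_alerts(active, max_size, waiting, acquire_ms, saturated_secs):
    """Return which of the alert conditions described above currently fire.
    Argument names are hypothetical; map them to your pool's metrics."""
    alerts = []
    if active >= max_size and saturated_secs > 60:
        alerts.append("pool saturated for over 1 minute")
    if waiting > 0:
        alerts.append("requests waiting for a connection (pool exhausted)")
    if acquire_ms > 100:  # acquisition should be near zero; 100ms is an assumed threshold
        alerts.append("slow connection acquisition")
    return alerts

# A saturated pool trips all three alerts; a healthy one trips none.
print(pool_alerts(active=20, max_size=20, waiting=3, acquire_ms=250, saturated_secs=90))
print(pool_alerts(active=5, max_size=20, waiting=0, acquire_ms=0.2, saturated_secs=0))
```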
