Low Level Design: Connection Pool Design

A connection pool maintains a set of pre-established connections to a resource (database, HTTP service, message broker) that can be reused across requests, avoiding the overhead of establishing a new connection for each operation. Establishing a TCP connection, performing the TLS handshake, and authenticating to the database typically takes 50-200ms. With connection pooling, that cost is paid once at startup, and subsequent requests borrow a connection in microseconds. PgBouncer and HikariCP are production connection poolers used at scale. Understanding connection pool design is essential for any system with a database or external service dependency.

Connection Pool Internals

A connection pool manages a fixed set of connections: idle connections waiting to be borrowed, and active connections in use by requests. Key parameters:

  • min_connections: minimum connections to keep open (warm pool, no cold starts)
  • max_connections: maximum connections (prevents overwhelming the database with too many concurrent connections)
  • acquire_timeout: maximum time to wait for an available connection before failing the request
  • max_lifetime: maximum time a connection stays in the pool before being replaced (prevents stale connections from server-side timeouts)
  • idle_timeout: how long an idle connection stays open before being closed (reduces server-side connection pressure during off-peak hours)

Pool size guideline: if the database can handle C concurrent connections, set each instance's max_connections to roughly C / number_of_app_server_instances.

// Go database/sql connection pool configuration
import (
    "database/sql"
    "log"
    "time"

    _ "github.com/lib/pq" // registers the "postgres" driver
)

db, err := sql.Open("postgres", dsn)
if err != nil {
    log.Fatal(err)
}

// Pool sizing: PostgreSQL default max_connections = 100
// With 4 app server instances: 100/4 = 25 connections per instance
db.SetMaxOpenConns(25)         // max active connections
db.SetMaxIdleConns(10)         // keep 10 idle connections warm
db.SetConnMaxLifetime(30 * time.Minute)  // replace connections older than 30m
db.SetConnMaxIdleTime(5 * time.Minute)   // close idle connections after 5m

// HikariCP (Java) equivalent:
// hikari.maximumPoolSize=25
// hikari.minimumIdle=10
// hikari.connectionTimeout=30000   (30s acquire timeout)
// hikari.idleTimeout=600000        (10m idle timeout)
// hikari.maxLifetime=1800000       (30m max lifetime)

// PgBouncer: proxy-level connection pooler
// Sits between app servers and PostgreSQL
// transaction mode: one PostgreSQL connection per transaction (most efficient)
// session mode: one PostgreSQL connection per client session (compatible with all features)
// [pgbouncer]
// pool_mode = transaction
// max_client_conn = 1000  -- app servers can open 1000 connections to PgBouncer
// default_pool_size = 25  -- PgBouncer opens only 25 real PostgreSQL connections

Connection Pool Sizing and Saturation

Setting max_connections too high: the database server runs out of memory (each PostgreSQL connection is a separate backend process using roughly 5-10MB) and context switching between backend processes becomes a bottleneck. The classic mistake is setting max_connections = 1000 per app server: 10 app servers overwhelm Postgres with 10,000 connections. Setting max_connections too low: requests queue waiting for a free connection (pool saturation), increasing response latency. Detecting saturation: monitor pool wait time (P99 time to acquire a connection), the active vs. idle connection ratio, and the connection error rate. HikariCP exposes these as JMX metrics; Go's database/sql exposes them via db.Stats(). Use PgBouncer in transaction mode between app servers and PostgreSQL to multiplex thousands of app connections over a small PostgreSQL connection pool.

Connection Health Checking

A pooled connection can become invalid: the database server restarted, a network timeout caused a silent disconnect, or the server closed the connection due to idle timeout. Validation strategies:

  • Test on borrow: execute a lightweight query (SELECT 1) before returning the connection to the caller. Catches invalid connections but adds latency to every borrow.
  • Test on return: validate when the connection is returned to the pool. Cheaper for callers, but a connection can still die while sitting idle.
  • Background validation: a pool maintenance thread periodically runs SELECT 1 on idle connections and evicts failures.
  • Error-based eviction: when a query fails with a connection error, remove the connection from the pool and retry on a fresh one.

By default HikariCP validates connections on borrow (via JDBC4 isValid() or a configured connectionTestQuery); the optional keepaliveTime setting additionally pings idle connections at a fixed interval.
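Test-on-borrow combined with error-based eviction can be sketched as a minimal single-goroutine pool (no locking); the conn type and its ping method are hypothetical stand-ins for a real driver connection and a SELECT 1 check:

```go
package main

import (
	"errors"
	"fmt"
)

// conn is a stand-in for a real driver connection; ping plays the
// role of a lightweight validation query such as SELECT 1.
type conn struct {
	id     int
	broken bool
}

func (c *conn) ping() error {
	if c.broken {
		return errors.New("connection lost")
	}
	return nil
}

// pool validates each idle connection on borrow and evicts ones that
// fail, replacing them with fresh connections. Single-goroutine
// sketch: a real pool needs synchronization around nextID.
type pool struct {
	idle   chan *conn
	nextID int
}

func newPool(size int) *pool {
	p := &pool{idle: make(chan *conn, size)}
	for i := 0; i < size; i++ {
		p.nextID++
		p.idle <- &conn{id: p.nextID}
	}
	return p
}

func (p *pool) get() *conn {
	for {
		select {
		case c := <-p.idle:
			if c.ping() == nil {
				return c // healthy idle connection
			}
			// Eviction: drop the dead connection, add a fresh
			// replacement, and try again on the next iteration.
			p.nextID++
			p.idle <- &conn{id: p.nextID}
		default:
			p.nextID++
			return &conn{id: p.nextID} // pool empty: dial a new one
		}
	}
}

func (p *pool) put(c *conn) { p.idle <- c }

func main() {
	p := newPool(2)
	c := p.get()
	c.broken = true // simulate a server-side disconnect
	p.put(c)

	// Both borrows return healthy connections; the broken one is
	// evicted on borrow rather than handed to the caller.
	a, b := p.get(), p.get()
	fmt.Println(a.ping() == nil, b.ping() == nil) // true true
}
```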

Key Interview Discussion Points

  • Thread-per-request vs async: traditional thread-per-request models block threads waiting for connections (pool saturation visible as thread saturation); async models (Node.js, Java Reactor, Go goroutines) allow thousands of logical requests to share a small connection pool without blocking threads
  • PgBouncer transaction mode limitations: PostgreSQL features that require session state (prepared statements, advisory locks, SET LOCAL, LISTEN/NOTIFY) break under transaction-mode connection pooling because the same PostgreSQL session is not guaranteed across transactions
  • Connection pool per database: maintain separate pools for read replicas and the primary — route read-only queries to the replica pool, writes to the primary pool; replica pool can be sized for high read throughput
  • Circuit breaker integration: when the database is overloaded, rapidly failing connections should trigger a circuit breaker that rejects new requests with a 503 immediately rather than queueing indefinitely and exhausting the connection pool
  • Serverless and connection pools: serverless functions create new processes per request, each opening connections directly — use a proxy connection pooler (RDS Proxy, PgBouncer, Neon) to avoid overwhelming the database with thousands of short-lived connections