Question 1

What is service discovery and why is it needed in microservices?

Accepted Answer

In microservices, service instances do not have stable network addresses. Instances scale up and down (auto-scaling), containers restart on different hosts (Kubernetes rescheduling), deployments replace instances (rolling updates), and failures remove instances. Hardcoding IPs is therefore impractical. Service discovery maps service names to network addresses dynamically: each instance registers its name, IP, port, and health status on startup, and clients query the registry by name to get the addresses of healthy instances.

There are two patterns. In client-side discovery, the client queries the registry and selects an instance itself (Netflix Eureka). In server-side discovery, the client calls a stable endpoint and a proxy or load balancer forwards the request to a healthy instance (Kubernetes Services, AWS ELB). Server-side discovery is simpler for clients, which need only one DNS name, but it usually adds a network hop through the proxy.
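The client-side pattern can be sketched in a few lines of Python. This is a toy in-memory model, not the API of Eureka or any other registry: the names Instance, Registry, and Client, the round-robin selection, and the IP addresses are all assumptions made for illustration.

```python
import itertools
from dataclasses import dataclass


@dataclass
class Instance:
    """One registered instance: the (name, IP, port, health) record from the text."""
    name: str
    ip: str
    port: int
    healthy: bool = True


class Registry:
    """Toy in-memory registry; a real registry is a separate networked service."""

    def __init__(self):
        self._instances = {}  # service name -> list of Instance

    def register(self, instance):
        self._instances.setdefault(instance.name, []).append(instance)

    def deregister(self, instance):
        self._instances[instance.name].remove(instance)

    def healthy_instances(self, name):
        return [i for i in self._instances.get(name, []) if i.healthy]


class Client:
    """Client-side discovery: the client queries the registry and picks an instance."""

    def __init__(self, registry):
        self._registry = registry
        self._counter = itertools.count()

    def resolve(self, name):
        candidates = self._registry.healthy_instances(name)
        if not candidates:
            raise LookupError(f"no healthy instance of {name!r}")
        # Round-robin over whatever is healthy at the time of the call.
        chosen = candidates[next(self._counter) % len(candidates)]
        return f"{chosen.ip}:{chosen.port}"


registry = Registry()
a = Instance("orders", "10.0.0.5", 8080)
b = Instance("orders", "10.0.0.6", 8080)
registry.register(a)
registry.register(b)

client = Client(registry)
first = client.resolve("orders")   # picks instance a
second = client.resolve("orders")  # picks instance b
b.healthy = False                  # e.g. a failed health check
third = client.resolve("orders")   # only instance a is left
```

In server-side discovery the same selection logic would live in the proxy or load balancer, and the client would only ever see one stable address.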

Question 2

How does Kubernetes service discovery work?

Accepted Answer

Kubernetes provides built-in service discovery via Services and DNS. A Service selects pods by label and exposes them behind a stable virtual IP (the ClusterIP). CoreDNS resolves service-name.namespace.svc.cluster.local to that ClusterIP. kube-proxy programs iptables or IPVS rules on each node to load-balance traffic from the ClusterIP to ready pod IPs. The Endpoints controller updates the backend pod list automatically as pods start and stop.

Readiness probes gate membership: a pod is added to the Service endpoints only while its readiness probe passes, which prevents traffic from reaching starting or unhealthy pods.

Headless Services (clusterIP: None) skip the virtual IP, and DNS returns the individual pod IPs directly. Use them for stateful services (databases, Kafka) where clients need to connect to specific pods.

For traffic inside a single cluster, no additional service discovery tool (Consul, Eureka) is needed: the platform handles registration, health checking, and DNS resolution natively.
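The selection rule described above (label match plus passing readiness probe) can be modeled in plain Python. This is a simplified simulation, not the Kubernetes API: the Pod class, the function names, and the pod IPs are hypothetical, and the DNS helper assumes the default cluster.local domain.

```python
from dataclasses import dataclass


@dataclass
class Pod:
    ip: str
    labels: dict
    ready: bool  # outcome of the pod's readiness probe


def service_dns_name(service, namespace="default"):
    """Cluster-internal DNS name, assuming the default cluster.local domain."""
    return f"{service}.{namespace}.svc.cluster.local"


def endpoints(selector, pods):
    """IPs of pods whose labels match the selector and whose readiness probe passes."""
    return [
        p.ip
        for p in pods
        if p.ready and all(p.labels.get(k) == v for k, v in selector.items())
    ]


pods = [
    Pod("10.1.0.4", {"app": "orders"}, ready=True),
    Pod("10.1.0.5", {"app": "orders"}, ready=False),  # still starting
    Pod("10.1.0.6", {"app": "billing"}, ready=True),  # belongs to another Service
]

name = service_dns_name("orders", "shop")
backends = endpoints({"app": "orders"}, pods)
```

With a regular Service, clients resolve the name to a single ClusterIP and traffic is spread over the backends list. With a headless Service, the DNS answer would be the backends list itself.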

Question 3

What is the difference between liveness and readiness health checks?

Accepted Answer

A liveness check answers: is the process alive and not stuck? Check: GET /health/live returns 200 if the HTTP server responds. Do NOT check external dependencies (database, cache). Failure action: restart the instance (in Kubernetes, the kubelet restarts the container). A database outage should not cause all application pods to restart, because restarting cannot fix the database and only makes things worse.

A readiness check answers: is the instance ready to serve user traffic? Check: GET /health/ready returns 200 if the database connection pool is established, the cache is connected, and initialization is complete. Failure action: stop routing traffic to this instance (remove it from the Service endpoints) but do NOT restart it. A readiness failure during a database outage correctly stops traffic without unnecessary restarts, and the instance becomes ready again when the database recovers.

Critical mistake: using the same endpoint for both. If the readiness check includes database connectivity and is also used as the liveness check, a database outage causes all pods to restart in a loop.
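A minimal sketch of the two endpoints, using only the Python standard library, might look like the following. The database_ready function is a hypothetical placeholder for a real connection-pool check, and returning 503 when not ready is an assumption (any non-success status tells the prober the instance is not ready).

```python
from http.server import BaseHTTPRequestHandler, HTTPServer


def database_ready():
    """Hypothetical dependency check; replace with a real connection-pool ping."""
    return True


def health_status(path, dependencies_ready):
    """Return the HTTP status code for a health-check path."""
    if path == "/health/live":
        # Liveness: the process answered, so it is alive. No dependency checks.
        return 200
    if path == "/health/ready":
        # Readiness: report ready only when dependencies are usable.
        return 200 if dependencies_ready else 503
    return 404


class HealthHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        self.send_response(health_status(self.path, database_ready()))
        self.end_headers()


def serve(port=8080):
    """Blocks forever; call this from the application's entry point."""
    HTTPServer(("", port), HealthHandler).serve_forever()
```

Keeping the decision in a pure function makes the key property easy to verify: with the database down, /health/live still returns 200 (no restart) while /health/ready returns 503 (no traffic).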

Question 4

When should you use Consul versus Kubernetes built-in service discovery?

Accepted Answer

Use Kubernetes built-in discovery when all services run in the same Kubernetes cluster, the built-in DNS and Service abstraction meet your needs, and you want zero additional infrastructure. This covers most Kubernetes-native applications.

Use Consul when:

(1) Services span multiple environments: some in Kubernetes, some on VMs, some on bare metal. Consul provides a unified service registry across all platforms.
(2) You need a service mesh without Istio. Consul Connect provides mTLS and service-to-service authorization.
(3) You need a distributed key-value store for configuration alongside service discovery.
(4) You operate across multiple Kubernetes clusters or cloud providers. Consul federation connects service registries across clusters and regions.
(5) You need health checks beyond what Kubernetes probes cover, such as TTL checks that a service must refresh itself, or checks for services that run outside Kubernetes.

For a single Kubernetes cluster with only containerized services, Kubernetes DNS is sufficient and simpler. Add Consul when your architecture grows beyond a single cluster or includes non-Kubernetes services.
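To illustrate the mixed-environment case, here is a hedged sketch of how a service on a VM might register itself with a local Consul agent over the agent HTTP API. The agent address, service name, IP, port, check interval, and the /health/ready path are assumptions chosen for the example, and the register function needs a running agent to succeed.

```python
import json
import urllib.request

CONSUL_AGENT = "http://127.0.0.1:8500"  # assumed address of a local Consul agent


def registration_payload(name, address, port):
    """Service definition for the agent's /v1/agent/service/register endpoint."""
    return {
        "Name": name,
        "ID": f"{name}-{address}-{port}",
        "Address": address,
        "Port": port,
        "Check": {
            # Consul polls this URL; the path is an assumption for this example.
            "HTTP": f"http://{address}:{port}/health/ready",
            "Interval": "10s",
            "Timeout": "1s",
        },
    }


def register(payload):
    """Send the registration to the local agent (requires a running Consul agent)."""
    request = urllib.request.Request(
        f"{CONSUL_AGENT}/v1/agent/service/register",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )
    with urllib.request.urlopen(request) as response:
        return response.status


def healthy_instances_url(name):
    """URL that lists only the instances whose health checks are passing."""
    return f"{CONSUL_AGENT}/v1/health/service/{name}?passing"


payload = registration_payload("billing", "192.168.10.20", 9000)
```

A Kubernetes pod and a bare-metal host would register the same way, which is what makes the registry uniform across platforms.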

System Design: Service Discovery — Consul, DNS, etcd, Eureka, Health Checking, Load Balancing, Kubernetes Services

The Service Discovery Problem

DNS-Based Service Discovery

Consul: Full-Featured Service Discovery

Kubernetes Service Discovery

Health Checking Best Practices