Serverless computing abstracts away server management, letting you run code without provisioning or managing infrastructure. AWS Lambda, Google Cloud Functions, and Azure Functions automatically scale from zero to thousands of concurrent executions. This guide covers serverless architecture patterns, cold start optimization, limitations, and when serverless is (and is not) the right choice — essential for system design interviews and cloud architecture decisions.
How AWS Lambda Works
Lambda executes your code in response to events. You provide a function (handler) and configure a trigger (API Gateway HTTP request, S3 object upload, SQS message, DynamoDB stream, scheduled event). When triggered: (1) Lambda provisions a micro-VM (Firecracker) with your runtime (Node.js, Python, Java, Go). (2) Your function code is loaded and initialized. (3) The handler is invoked with the event payload. (4) The function runs until it returns (up to the 15-minute limit); the execution environment is then frozen (kept warm for future invocations). (5) If no invocation occurs for roughly 5-15 minutes, the environment is destroyed.

Pricing: pay per invocation ($0.20 per million) plus duration (billed per ms of compute time, proportional to allocated memory). Zero cost when idle — no invocations, no charge. This is the core value proposition: you do not pay for idle servers.

Concurrency: Lambda scales automatically. Each concurrent request gets its own execution environment. Default limit: 1,000 concurrent executions per region (can be increased). Provisioned concurrency: pre-warm N environments to eliminate cold starts for latency-sensitive functions.
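The handler contract above can be sketched as follows. This is a minimal illustration, not AWS-specific guidance: the function name and the API Gateway proxy-style event/response shapes are assumptions for this example.

```python
import json

def handler(event, context):
    """Entry point that Lambda invokes with the trigger's event payload.

    For an API Gateway proxy integration, `event` carries the HTTP request
    and the returned dict becomes the HTTP response.
    """
    # Query parameters may be absent entirely, so default defensively.
    name = (event.get("queryStringParameters") or {}).get("name", "world")
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"message": f"hello, {name}"}),
    }

# Local invocation with a sample API Gateway-style event:
print(handler({"queryStringParameters": {"name": "lambda"}}, None))
```

Invoking the handler locally with a hand-built event, as in the last line, is also a common unit-testing pattern: the handler is just a function, so no AWS infrastructure is needed to test its logic.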
Cold Starts and Optimization
A cold start occurs when Lambda must create a new execution environment: provision the micro-VM, download and extract the deployment package, initialize the runtime, and run your initialization code. Cold start latency: 100-500ms for Python/Node.js, 500ms-2s for Java (JVM startup), 1-10s for large deployment packages or VPC-attached functions. Optimization strategies: (1) Keep deployment packages small — include only necessary dependencies. Use Lambda layers for shared libraries. (2) Minimize initialization code — move expensive initialization (database connections, SDK clients) outside the handler but inside the execution environment (they persist between warm invocations). (3) Choose lightweight runtimes — Python and Node.js cold-start faster than Java. For Java, use GraalVM native image (ahead-of-time compilation eliminates JVM startup). (4) Provisioned concurrency — pre-warm N environments. Costs more (you pay for idle provisioned environments) but eliminates cold starts for the configured number of concurrent executions. (5) Avoid VPC when possible — VPC-attached Lambdas used to require per-environment ENI creation, adding seconds to cold starts. AWS improved this significantly in 2019 with shared Hyperplane ENIs, but non-VPC Lambdas still start faster. (6) SnapStart (Java) — Lambda snapshots the initialized JVM and restores it on cold start, reducing Java cold starts to under 200ms.
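Strategy (2) — initializing at module level so expensive objects survive between warm invocations — can be demonstrated with a small sketch. The counter and timestamp below are stand-ins for real resources like database connections or SDK clients:

```python
import time

# Module-level ("init") code runs once per execution environment, during
# the cold start. Anything created here is reused by every subsequent
# warm invocation of that same environment.
START = time.time()      # stand-in for an expensive client or connection
invocation_count = 0

def handler(event, context):
    global invocation_count
    invocation_count += 1  # climbs across warm invocations, resets on cold start
    return {
        "env_age_seconds": round(time.time() - START, 3),
        "invocations_in_this_environment": invocation_count,
    }
```

Calling the handler twice in the same process (simulating two warm invocations) shows the counter reach 2 while `START` stays fixed — the same mechanism that lets a database connection opened at module level be reused instead of re-established on every request.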
Serverless Architecture Patterns
Common patterns: (1) API backend — API Gateway + Lambda + DynamoDB. API Gateway routes HTTP requests to Lambda functions. Lambda processes the request and reads/writes DynamoDB. Auto-scales from zero to thousands of concurrent requests. Ideal for: APIs with variable traffic, startups that do not want to manage infrastructure. (2) Event processing — S3 upload triggers Lambda, which processes the file (resize image, parse CSV, transcode video). SQS/SNS messages trigger Lambda for async task processing. DynamoDB Streams trigger Lambda for change data capture. (3) Scheduled tasks — CloudWatch Events triggers Lambda on a cron schedule. Replace cron jobs on EC2 instances. (4) Workflow orchestration — AWS Step Functions coordinates multiple Lambda functions into a workflow with conditional logic, parallel execution, error handling, and retries. Example: an order processing workflow that runs payment, inventory, and shipping Lambda functions in sequence with rollback on failure. (5) Streaming — Lambda processes Kinesis or Kafka records in micro-batches. Each shard is processed by one Lambda instance. Scales with the number of shards.
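Pattern (2) can be sketched as an S3-triggered handler. The record layout follows the standard S3 event notification format; the actual processing step (resize, parse, transcode) is left as a placeholder:

```python
import urllib.parse

def handler(event, context):
    """Triggered by S3 ObjectCreated notifications; handles each record.

    A single event may batch multiple records. Real work (image resize,
    CSV parse, transcode) would replace the append below.
    """
    processed = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in the notification payload.
        key = urllib.parse.unquote_plus(record["s3"]["object"]["key"])
        processed.append(f"{bucket}/{key}")
    return {"processed": processed}

# A trimmed-down sample event in the S3 notification shape:
sample_event = {
    "Records": [
        {"s3": {"bucket": {"name": "uploads"},
                "object": {"key": "photos/cat+1.jpg"}}}
    ]
}
print(handler(sample_event, None))
```

Note the URL-decoding step: keys containing spaces or special characters arrive encoded in the notification, a common source of "object not found" bugs in this pattern.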
Serverless Limitations
Lambda is not suitable for every workload: (1) Execution time limit — 15 minutes maximum. Long-running processes (video transcoding, ML training, large data processing) must be broken into smaller chunks or run on containers/EC2. (2) Cold starts — unpredictable latency spikes. For latency-sensitive services (P99 under 100ms), cold starts are problematic. Provisioned concurrency helps but adds cost. (3) Statelessness — functions are stateless between invocations. State must be stored externally (DynamoDB, S3, ElastiCache). This adds latency for state-heavy operations. (4) Limited compute resources — up to 10 GB memory and 6 vCPUs. CPU-intensive workloads (video encoding, scientific computing) are better on dedicated instances. (5) Vendor lock-in — Lambda code depends on AWS-specific APIs, event formats, and deployment tooling. Migrating to another provider requires significant rewriting. (6) Debugging difficulty — distributed tracing across Lambda invocations is harder than debugging a monolithic application. CloudWatch logs are fragmented across execution environments. (7) Cost at scale — Lambda is cheap at low volume but expensive at sustained high throughput. At 100M invocations per month with 500ms average duration, Lambda costs significantly more than equivalent EC2 instances.
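The cost-at-scale point in (7) can be made concrete with back-of-envelope arithmetic. The per-request and per-GB-second prices below are illustrative of published x86 rates and vary by region; check current pricing before relying on them:

```python
# Illustrative Lambda pricing (verify against current published rates):
PRICE_PER_MILLION_REQUESTS = 0.20   # USD
PRICE_PER_GB_SECOND = 0.0000166667  # USD

invocations = 100_000_000  # per month
duration_s = 0.5           # average execution time
memory_gb = 1.0            # allocated memory

request_cost = invocations / 1_000_000 * PRICE_PER_MILLION_REQUESTS
compute_cost = invocations * duration_s * memory_gb * PRICE_PER_GB_SECOND
total = request_cost + compute_cost

# The same work viewed as sustained load: total compute-seconds spread
# over a ~2.6M-second month gives the average number of busy 1 GB
# environments a fixed fleet would need to cover.
avg_concurrency = invocations * duration_s / (30 * 24 * 3600)

print(f"requests ${request_cost:,.0f} + compute ${compute_cost:,.0f} "
      f"= ${total:,.0f}/month (~{avg_concurrency:.0f} avg concurrent GB)")
```

Under these assumptions the bill lands around $850/month, while the equivalent sustained load averages only about 19 concurrent 1 GB workers — a handful of always-on instances, which typically cost a fraction of that. This is the crossover the limitation describes: per-millisecond pricing is a premium you pay for elasticity, and it stops paying off once utilization is high and steady.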
When to Use Serverless vs Containers vs VMs
Decision framework: (1) Variable, unpredictable traffic with periods of zero traffic — serverless. You pay nothing during idle periods. Scales automatically. Best for: startups, internal tools, webhook handlers, event processors, scheduled jobs. (2) Steady, high-throughput traffic — containers (ECS, EKS, Kubernetes). More cost-effective than Lambda at sustained load. Full control over runtime, networking, and resource allocation. Best for: core application services, APIs with predictable traffic. (3) Specialized hardware or OS requirements — VMs (EC2). GPU instances for ML, custom kernel configurations, legacy applications. (4) Quick prototyping and MVP — serverless. Deploy a working API in hours without infrastructure decisions. Migrate to containers later if needed. Hybrid approach: use Lambda for event-driven and low-traffic workloads (image processing, webhooks, scheduled jobs) and containers for steady-state services (API backends, databases). This combines the cost efficiency of serverless for bursty work with the predictability of containers for core services. In system design interviews: mention serverless as an option for specific components (image processing, notification sending) rather than the entire architecture. This demonstrates nuanced thinking about when serverless fits.