Object storage is the foundation of modern cloud infrastructure. Amazon S3, Google Cloud Storage, and Azure Blob Storage store trillions of objects ranging from user uploads to database backups to ML training datasets. Understanding object storage internals — how data is distributed, protected against loss, and served efficiently — is essential for system design interviews and cloud architecture decisions.
Object Storage vs Block Storage vs File Storage
Block storage (EBS, Azure Disk): raw disk volumes attached to compute instances. Fixed-size blocks (512 bytes to 4 KB). Supports random reads/writes, filesystems, and databases. Low latency (sub-millisecond). Attached to one instance at a time.

File storage (EFS, NFS): a hierarchical filesystem shared across multiple instances. Supports directories, permissions, and file locking. Higher latency than block storage but supports concurrent access.

Object storage (S3): a flat namespace of objects identified by keys (bucket + key path). Each object contains data (bytes), metadata (key-value pairs), and a unique identifier. No hierarchy — the “/” in keys is just a naming convention. No in-place updates: objects are immutable, and overwriting a key creates a new version. High latency to first byte (50-100 ms) but high throughput for large sequential reads. Virtually unlimited capacity and object count.

Use object storage for user uploads (images, videos, documents), static website assets, data lake storage (Parquet, Avro files), backups, and log archives. Use block storage for databases, operating systems, and applications requiring low-latency random I/O.
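The "flat namespace with delimiter convention" point can be made concrete with a small sketch. This is not a real S3 API — the function and bucket contents are invented for illustration — but it mirrors how ListObjects turns plain string keys into apparent "directories" at query time:

```python
# Sketch: an object store's namespace is flat -- keys are plain strings,
# and "directories" are just a prefix/delimiter convention applied at list time.
# All names here are illustrative, not a real S3 API.

bucket = {
    "photos/2024/cat.jpg": b"...",
    "photos/2024/dog.jpg": b"...",
    "photos/2025/bird.jpg": b"...",
    "readme.txt": b"...",
}

def list_objects(bucket, prefix="", delimiter="/"):
    """Emulate ListObjects: return (keys, common_prefixes) under a prefix."""
    keys, common_prefixes = [], set()
    for key in sorted(bucket):
        if not key.startswith(prefix):
            continue
        rest = key[len(prefix):]
        if delimiter in rest:
            # Everything up to the first delimiter looks like a "subdirectory".
            common_prefixes.add(prefix + rest.split(delimiter, 1)[0] + delimiter)
        else:
            keys.append(key)
    return keys, sorted(common_prefixes)

keys, prefixes = list_objects(bucket, prefix="photos/")
print(keys)       # []
print(prefixes)   # ['photos/2024/', 'photos/2025/']
```

Nothing in the store itself knows about folders; "photos/2024/" exists only because two keys happen to share that prefix.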
S3 Architecture Internals
S3 separates metadata from data:

(1) Metadata service — stores object metadata (bucket, key, size, checksum, version, ACL, custom metadata) in a distributed key-value store. Handles ListObjects (enumerate keys in a bucket), HeadObject (retrieve metadata without data), and routes PutObject/GetObject to the correct data nodes.

(2) Data service — stores the actual object bytes. Objects are split into chunks; each chunk is erasure-coded and distributed across multiple storage nodes and availability zones.

A PutObject request: the metadata service validates the request (authentication, authorization, bucket existence), assigns the object to data nodes, and returns a presigned data upload URL. The client uploads chunks to the data nodes. After all chunks are stored and verified, the metadata service commits the object, making it visible to GetObject.

A GetObject request: the metadata service looks up the object's location and returns the data node addresses, and the client reads directly from the data nodes.

This separation lets the metadata service be optimized for small, fast lookups while the data service is optimized for large, sequential I/O.
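The metadata/data split above can be sketched as a toy in-memory model. The class and method names are invented for illustration (real S3 internals are far more involved), but the flow matches: split into chunks, spread across data nodes, commit metadata only after every chunk is stored:

```python
# Toy sketch of the metadata/data separation: a metadata service that maps
# (bucket, key) to chunk locations, plus data nodes holding the bytes.
# Names are illustrative, not a real S3 implementation.
import hashlib

class DataNode:
    def __init__(self):
        self.chunks = {}                     # chunk_id -> bytes
    def put_chunk(self, chunk_id, data):
        self.chunks[chunk_id] = data
    def get_chunk(self, chunk_id):
        return self.chunks[chunk_id]

class MetadataService:
    CHUNK_SIZE = 4                           # tiny, for demonstration only

    def __init__(self, data_nodes):
        self.data_nodes = data_nodes
        self.objects = {}                    # (bucket, key) -> committed metadata

    def put_object(self, bucket, key, data):
        # 1. Split into chunks, 2. spread them round-robin across data nodes,
        # 3. commit metadata only after every chunk is stored.
        locations = []
        for i in range(0, len(data), self.CHUNK_SIZE):
            chunk = data[i:i + self.CHUNK_SIZE]
            node_idx = (i // self.CHUNK_SIZE) % len(self.data_nodes)
            chunk_id = hashlib.sha256(chunk).hexdigest()
            self.data_nodes[node_idx].put_chunk(chunk_id, chunk)
            locations.append((node_idx, chunk_id))
        self.objects[(bucket, key)] = {"size": len(data), "locations": locations}

    def get_object(self, bucket, key):
        # Look up chunk locations, then read each chunk from its data node.
        meta = self.objects[(bucket, key)]
        return b"".join(self.data_nodes[n].get_chunk(c) for n, c in meta["locations"])

svc = MetadataService([DataNode() for _ in range(3)])
svc.put_object("media", "cat.jpg", b"hello object storage")
assert svc.get_object("media", "cat.jpg") == b"hello object storage"
```

Note how the metadata lookup is a small key-value read while the data path is bulk byte transfer — the two workloads the real services are tuned for separately.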
Data Durability with Erasure Coding
S3 promises 99.999999999% (11 nines) durability — the probability of losing a given object in a given year is 0.000000001% (10^-11). This is achieved with erasure coding, not simple replication.

Simple replication (3 copies): store 3 copies of the data across 3 availability zones. 3x storage overhead; tolerates 2 simultaneous failures.

Erasure coding (e.g., Reed-Solomon 6+3): split the data into 6 data chunks and compute 3 parity chunks. Store all 9 chunks across different nodes/AZs. Any 6 of the 9 chunks can reconstruct the original data. Storage overhead: 1.5x (9 chunks for 6 units of data) vs 3x for replication. Tolerates 3 simultaneous failures at half the storage cost. S3 uses a variant of erasure coding optimized for its hardware.

The repair process: when a storage node fails, the system reads surviving chunks from other nodes, recomputes the missing chunks, and stores them on new nodes. This repair happens automatically and continuously. At 11 nines of durability, if you store 10 million objects you can expect to lose a single object roughly once every 10,000 years.
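The "recompute missing chunks from survivors" idea can be demonstrated with the simplest erasure code: k data chunks plus one XOR parity chunk, which survives the loss of any one chunk. Production systems use Reed-Solomon (e.g., 6+3) to survive several simultaneous losses, but the repair principle is the same:

```python
# Minimal erasure-coding sketch: k data chunks plus one XOR parity chunk,
# the simplest (k+1) code -- it survives the loss of any ONE chunk.
# Reed-Solomon generalizes this to multiple parity chunks.
from functools import reduce

def xor_bytes(a, b):
    return bytes(x ^ y for x, y in zip(a, b))

def encode(data, k):
    """Split data into k equal chunks and append one XOR parity chunk."""
    assert len(data) % k == 0
    size = len(data) // k
    chunks = [data[i * size:(i + 1) * size] for i in range(k)]
    parity = reduce(xor_bytes, chunks)       # parity = c0 ^ c1 ^ ... ^ c(k-1)
    return chunks + [parity]

def reconstruct(chunks, lost_index):
    """Rebuild the chunk at lost_index by XOR-ing all surviving chunks."""
    survivors = [c for i, c in enumerate(chunks) if i != lost_index]
    return reduce(xor_bytes, survivors)

data = b"object storage!!"                   # 16 bytes -> 4 data chunks of 4 bytes
stored = encode(data, k=4)                   # 5 chunks total: 1.25x storage overhead
assert reconstruct(stored, 1) == stored[1]   # repair recomputes a lost data chunk
assert reconstruct(stored, 4) == stored[4]   # ...or the lost parity chunk itself
assert b"".join(stored[:4]) == data
```

The XOR works because every data chunk appears exactly twice in the XOR of (survivors + parity), cancelling out, except the missing one. A 6+3 Reed-Solomon code replaces XOR with arithmetic over a finite field so that any 3 losses are recoverable.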
Multipart Upload
Large files (>100 MB) should use multipart upload instead of a single PutObject. The process:

(1) Initiate — the client calls CreateMultipartUpload and receives an upload_id.

(2) Upload parts — the client splits the file into parts (5 MB to 5 GB each) and uploads each part independently with the upload_id and part_number. Parts can be uploaded in parallel for higher throughput, and even from different machines.

(3) Complete — the client calls CompleteMultipartUpload with the list of part ETags. S3 assembles the parts into the final object.

Benefits: (1) Resumability — if a part upload fails, retry only that part, not the entire file; critical for large files over unreliable networks. (2) Parallelism — upload parts simultaneously. For a 10 GB file split into 100 MB parts, 100 parallel uploads saturate the network much faster than a single stream. (3) No size limit — a single PutObject is limited to 5 GB, while multipart upload supports objects up to 5 TB.

If a multipart upload is not completed within a configurable time (via a lifecycle rule), S3 automatically aborts it and deletes the uploaded parts, reclaiming storage.
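The three-step flow above can be sketched against an in-memory stand-in for S3. The FakeS3 class and its method names mirror the real API shape (CreateMultipartUpload / UploadPart / CompleteMultipartUpload, e.g. via boto3) but are invented here; a tiny part size is used so the demo runs instantly:

```python
# Sketch of the multipart-upload flow: initiate, upload parts in parallel,
# complete with the ETag list. FakeS3 is an illustrative in-memory stand-in.
import hashlib
import uuid
from concurrent.futures import ThreadPoolExecutor

class FakeS3:
    def __init__(self):
        self.uploads = {}    # upload_id -> {part_number: (etag, bytes)}
        self.objects = {}    # key -> assembled bytes

    def create_multipart_upload(self, key):
        upload_id = uuid.uuid4().hex
        self.uploads[upload_id] = {}
        return upload_id

    def upload_part(self, upload_id, part_number, body):
        etag = hashlib.md5(body).hexdigest()
        self.uploads[upload_id][part_number] = (etag, body)
        return etag

    def complete_multipart_upload(self, key, upload_id, parts):
        # Verify the client's ETag list, then assemble parts in number order.
        staged = self.uploads.pop(upload_id)
        assert all(staged[n][0] == etag for n, etag in parts)
        self.objects[key] = b"".join(staged[n][1] for n, _ in sorted(parts))

PART_SIZE = 8                                # tiny, for demonstration only
s3, data = FakeS3(), b"a large file uploaded in independent parts"

upload_id = s3.create_multipart_upload("backup.bin")                    # (1) initiate
parts = [(i // PART_SIZE + 1, data[i:i + PART_SIZE])
         for i in range(0, len(data), PART_SIZE)]
with ThreadPoolExecutor(max_workers=4) as pool:                         # (2) parallel parts
    etags = list(pool.map(lambda p: (p[0], s3.upload_part(upload_id, *p)), parts))
s3.complete_multipart_upload("backup.bin", upload_id, etags)            # (3) complete
assert s3.objects["backup.bin"] == data
```

Resumability falls out of this structure: each part is independent, so a failed part is retried with the same upload_id and part_number without touching the others.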
Presigned URLs and Access Control
Presigned URLs grant temporary access to private S3 objects without requiring the requester to have AWS credentials. The bucket owner generates a presigned URL that includes the object key, an expiration time (1 second to 7 days), and a cryptographic signature derived from the owner's credentials. Anyone with the URL can access the object until it expires.

Use cases: (1) Serving private content — generate a presigned URL for each image in a user's photo gallery, expiring in 1 hour; the user can view images without direct S3 access. (2) Direct upload from the browser — generate a presigned PUT URL so the browser uploads directly to S3, bypassing your application server and offloading bandwidth and CPU from your backend. A presigned URL constrains the object key, HTTP method, and expiration; a presigned POST policy can additionally constrain the maximum file size and content type.

Security: presigned URLs should have short expiration times, so that if a URL leaks, the window of unauthorized access is limited. For sensitive content, combine presigned URLs with CloudFront signed URLs (which can also restrict by IP address and support shorter TTLs).

Access control layers: IAM policies (who can call S3 APIs), bucket policies (bucket-level access rules), ACLs (object-level permissions, largely deprecated in favor of bucket policies), and presigned URLs (temporary access tokens).
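The core mechanism — sign (method, key, expiry) with a secret the client never sees, verify on access — can be shown with a simplified HMAC sketch. This illustrates the idea only; real S3 presigning uses AWS Signature Version 4, and all names below are invented:

```python
# Simplified presigned-URL sketch: the owner signs (method, key, expiry) with
# a secret; the server later verifies signature and expiry before serving.
# Illustrative only -- real S3 presigning uses AWS Signature v4.
import hashlib
import hmac
import time
from urllib.parse import urlencode

SECRET = b"owner-secret-key"     # stands in for the owner's credentials

def presign(key, expires_in):
    expires = int(time.time()) + expires_in
    msg = f"GET\n{key}\n{expires}".encode()
    sig = hmac.new(SECRET, msg, hashlib.sha256).hexdigest()
    return f"/{key}?" + urlencode({"Expires": expires, "Signature": sig})

def verify(key, expires, signature):
    if int(expires) < time.time():
        return False                         # URL has expired
    msg = f"GET\n{key}\n{expires}".encode()
    expected = hmac.new(SECRET, msg, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signature)   # constant-time compare

url = presign("photos/cat.jpg", expires_in=3600)
params = dict(p.split("=") for p in url.split("?")[1].split("&"))
assert verify("photos/cat.jpg", params["Expires"], params["Signature"])
# Tampering with the key invalidates the signature:
assert not verify("photos/dog.jpg", params["Expires"], params["Signature"])
```

Because the expiry is inside the signed message, a holder of the URL cannot extend it; changing either the key or the Expires parameter breaks verification.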
Object Storage in System Design
When to use S3 in your design:

(1) User-uploaded media — profile pictures, post images, videos. The application server generates a presigned upload URL, the client uploads directly to S3, a Lambda function triggered by the S3 event processes the upload (resize images, transcode video), and the result is served via the CloudFront CDN.

(2) Static website hosting — HTML, CSS, JS, and images served directly from S3 with CloudFront; no web server needed.

(3) Data lake — store raw data (JSON, CSV, Parquet) in S3. Query with Athena (SQL over S3), process with Spark, or load into a data warehouse.

(4) Backup and archival — database backups, log archives. Use S3 lifecycle policies to transition to cheaper storage classes: S3 Standard (frequent access) -> S3 Infrequent Access (30-day minimum, lower cost) -> S3 Glacier (archival, retrieval in minutes to hours) -> S3 Glacier Deep Archive (lowest cost, retrieval in about 12 hours).

Cost optimization: S3 Standard is $0.023/GB/month; Glacier Deep Archive is $0.00099/GB/month, about 23x cheaper. Lifecycle policies automate the transition based on object age.
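The cost-optimization arithmetic is worth checking with a back-of-the-envelope model. The prices below are the figures quoted in the text (the Infrequent Access price is an assumed illustrative figure; always check current regional pricing):

```python
# Back-of-the-envelope lifecycle cost model for the storage classes above.
# Standard and Deep Archive prices are those quoted in the text; the
# Infrequent Access figure is illustrative. Check current pricing.
PRICE_PER_GB_MONTH = {
    "STANDARD": 0.023,
    "STANDARD_IA": 0.0125,           # assumed illustrative price
    "GLACIER_DEEP_ARCHIVE": 0.00099,
}

def monthly_cost(gb, storage_class):
    return gb * PRICE_PER_GB_MONTH[storage_class]

# Example: 100 TB of year-old log archives.
gb = 100 * 1024
standard = monthly_cost(gb, "STANDARD")
deep = monthly_cost(gb, "GLACIER_DEEP_ARCHIVE")
print(f"Standard:     ${standard:,.2f}/month")   # $2,355.20/month
print(f"Deep Archive: ${deep:,.2f}/month")       # $101.38/month
print(f"Savings:      {standard / deep:.0f}x")   # 23x
```

At 100 TB the lifecycle transition saves over $2,200/month, which is why "transition after N days" rules are near-universal for log and backup buckets.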