Q: How do you ensure data is safely in S3 before deleting from the hot database?

The deletion must only happen after the S3 write is confirmed. Pattern: (1) SELECT the batch of rows to archive; (2) serialize to NDJSON or Parquet; (3) PUT to S3 — wait for the 200 OK response (do not fire-and-forget); (4) verify: GET the S3 object metadata or compute ETag to confirm the write landed; (5) DELETE the rows from the hot table. If the process crashes between step 3 and 5, the rows remain in both S3 and the hot table — duplicated but not lost. The archival job is idempotent: running it again re-uploads the same data to S3 (PUT is idempotent for the same key) and re-deletes the rows (already deleted — 0 rows affected). Never delete first: data loss on S3 failure is unrecoverable.

Q: How do you query archived data without pulling it back into the hot database?

AWS Athena allows SQL queries directly over S3-stored files (JSON, Parquet, CSV) using a serverless query engine. Setup: create an Athena table pointing at the S3 prefix: LOCATION 's3://archive-bucket/orders/'. Query: SELECT * FROM archive.orders WHERE placed_at BETWEEN '2022-01-01' AND '2022-12-31' AND user_id = 'abc'. Athena scans only relevant Parquet files (partition pruning by year/month directory structure). Cost: $5 per TB scanned — a 500GB query costs $2.50. For compliance exports: run the query, download the result CSV, done. For application-level access: federated query (check hot DB first; if not found, query Athena via boto3). Athena query latency is 2–30 seconds — unsuitable for real-time API responses but fine for admin tools and compliance.

Q: How do you handle foreign key constraints when archiving parent rows?

An Order row may be referenced by OrderItem, OrderEvent, and Shipment rows. You cannot DELETE the Order row while child rows reference it (foreign key violation). Correct archive order: archive children before parents. Sequence: (1) archive OrderItem → S3, delete from hot; (2) archive OrderEvent → S3, delete from hot; (3) archive Shipment → S3, delete from hot; (4) archive Order → S3, delete from hot. All steps in one job, transactionally consistent per batch. Alternative: use soft-delete (is_archived flag on the Order row) — the PK remains in the hot table satisfying FK constraints, but the row is excluded from normal queries via a partial index (WHERE is_archived = FALSE). Physical deletion happens in a later cleanup pass after all child rows are archived.

Q: How do you monitor archival job health and detect data loss?

Archival jobs that run silently can have bugs that silently lose data — rows deleted from hot before they were written to S3. Monitoring: (1) row count check: after each batch, verify row_count(S3 batch) == row_count(deleted from DB); (2) reconciliation query: every week, sample 1% of archived row IDs and verify they exist in S3; (3) ArchivalJob table tracking rows_archived, started_at, completed_at — alert if a job doesn't complete within its expected window; (4) compare monthly row counts between hot DB (after archival) and S3 (new data) — total should match expectations based on write rate; (5) set up AWS S3 event notifications to detect unexpected deletes from the archive bucket (should never happen — alert if it does). Never run archival without automated verification.

Question 1

Why is ALTER TABLE DETACH PARTITION the preferred archival method?

Accepted Answer

DETACH PARTITION is a catalog operation — it only modifies the system tables that track partition membership, not the data itself. This means: no rows are scanned, no data is copied, no locks are held on the table data. A partition with 1 billion rows detaches in milliseconds. By contrast, DELETE WHERE created_at < cutoff on a 1B-row table takes hours, generates massive WAL (filling up disk), and holds row locks that block concurrent reads and writes. After detaching, the partition still exists as a standalone table — you can query it, attach it to an archive schema, COPY it to S3 as Parquet, or DROP it. Always design high-volume time-series tables as partitioned from day one specifically to enable this O(1) archival pattern.

Question 2

How do you ensure data is safely in S3 before deleting from the hot database?

Accepted Answer

The deletion must only happen after the S3 write is confirmed. Pattern: (1) SELECT the batch of rows to archive; (2) serialize to NDJSON or Parquet; (3) PUT to S3 — wait for the 200 OK response (do not fire-and-forget); (4) verify: GET the S3 object metadata or compute ETag to confirm the write landed; (5) DELETE the rows from the hot table. If the process crashes between step 3 and 5, the rows remain in both S3 and the hot table — duplicated but not lost. The archival job is idempotent: running it again re-uploads the same data to S3 (PUT is idempotent for the same key) and re-deletes the rows (already deleted — 0 rows affected). Never delete first: data loss on S3 failure is unrecoverable.

Question 3

How do you query archived data without pulling it back into the hot database?

Accepted Answer

AWS Athena allows SQL queries directly over S3-stored files (JSON, Parquet, CSV) using a serverless query engine. Setup: create an Athena table pointing at the S3 prefix: LOCATION 's3://archive-bucket/orders/'. Query: SELECT * FROM archive.orders WHERE placed_at BETWEEN '2022-01-01' AND '2022-12-31' AND user_id = 'abc'. Athena scans only relevant Parquet files (partition pruning by year/month directory structure). Cost: $5 per TB scanned — a 500GB query costs $2.50. For compliance exports: run the query, download the result CSV, done. For application-level access: federated query (check hot DB first; if not found, query Athena via boto3). Athena query latency is 2–30 seconds — unsuitable for real-time API responses but fine for admin tools and compliance.

Question 4

How do you handle foreign key constraints when archiving parent rows?

Accepted Answer

An Order row may be referenced by OrderItem, OrderEvent, and Shipment rows. You cannot DELETE the Order row while child rows reference it (foreign key violation). Correct archive order: archive children before parents. Sequence: (1) archive OrderItem → S3, delete from hot; (2) archive OrderEvent → S3, delete from hot; (3) archive Shipment → S3, delete from hot; (4) archive Order → S3, delete from hot. All steps in one job, transactionally consistent per batch. Alternative: use soft-delete (is_archived flag on the Order row) — the PK remains in the hot table satisfying FK constraints, but the row is excluded from normal queries via a partial index (WHERE is_archived = FALSE). Physical deletion happens in a later cleanup pass after all child rows are archived.

Question 5

How do you monitor archival job health and detect data loss?

Accepted Answer

Archival jobs that run silently can have bugs that silently lose data — rows deleted from hot before they were written to S3. Monitoring: (1) row count check: after each batch, verify row_count(S3 batch) == row_count(deleted from DB); (2) reconciliation query: every week, sample 1% of archived row IDs and verify they exist in S3; (3) ArchivalJob table tracking rows_archived, started_at, completed_at — alert if a job doesn't complete within its expected window; (4) compare monthly row counts between hot DB (after archival) and S3 (new data) — total should match expectations based on write rate; (5) set up AWS S3 event notifications to detect unexpected deletes from the archive bucket (should never happen — alert if it does). Never run archival without automated verification.

Data Archival Low-Level Design: Tiered Storage, Partition Detach, and S3 Cold Storage

Storage Tiers

Partition-Based Archival (Preferred)

Row-by-Row Archival (For Non-Partitioned Tables)

Querying Archived Data

Key Interview Points