Q: How does SKIP LOCKED improve moderator queue throughput?

Without SKIP LOCKED, moderators who query the queue concurrently contend on the same rows: the first moderator's SELECT ... FOR UPDATE locks the top rows, and every other moderator blocks behind that lock until the first transaction commits. With SKIP LOCKED, each query skips rows that are already locked and claims the next available ones. Result: 10 moderators can each claim a disjoint batch of 20 reports simultaneously, so 200 reports are in progress at once with no blocking. The row lock only lasts until the claiming transaction commits, so the assigned_to column is what makes the claim durable: once a moderator claims a report, later queue queries filter it out as assigned. Release stale assignments via a cron job: UPDATE Report SET assigned_to = NULL WHERE assigned_to IS NOT NULL AND updated_at < NOW() - interval '30 minutes' AND status = 'pending'. This returns abandoned reports to the queue.
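The claim step described above can be sketched as a single PostgreSQL statement wrapped in a small Python helper. This is a minimal sketch, not the original author's implementation: the id and created_at columns, the claim_batch function name, and the psycopg2-style named parameters are assumptions. Only Report, assigned_to, updated_at, and status come from the text.

```python
# Hypothetical claim query: column names id and created_at are assumed.
CLAIM_BATCH_SQL = """
UPDATE Report
SET assigned_to = %(moderator_id)s,
    updated_at = NOW()              -- restart the 30-minute staleness clock
WHERE id IN (
    SELECT id
    FROM Report
    WHERE status = 'pending'
      AND assigned_to IS NULL
    ORDER BY created_at             -- oldest reports first (assumed ordering)
    LIMIT %(batch_size)s
    FOR UPDATE SKIP LOCKED          -- skip rows another transaction has locked
)
RETURNING id
"""


def claim_batch(cursor, moderator_id, batch_size=20):
    """Claim up to batch_size unassigned pending reports for one moderator.

    cursor is any DB-API cursor that accepts pyformat named parameters
    (psycopg2 does). The caller is responsible for committing.
    """
    cursor.execute(
        CLAIM_BATCH_SQL,
        {"moderator_id": moderator_id, "batch_size": batch_size},
    )
    return [row[0] for row in cursor.fetchall()]
```

Setting updated_at at claim time matters for the stale-assignment job: without it, a report that sat in the queue for more than 30 minutes before being claimed would be released again on the next cron run.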

Q: How do you handle CSAM reports differently from standard content reports?

CSAM (Child Sexual Abuse Material) requires: (1) immediate content removal, without waiting for the queue; (2) legal hold: preserve the original content for law enforcement (do not delete it; move it to quarantine); (3) a CyberTipline report to NCMEC as soon as reasonably possible after becoming aware of the material, which is the standard set by US law (18 U.S.C. § 2258A), so treat 24 hours as an internal target rather than the statutory deadline; (4) preserve metadata: uploader IP, timestamp, account info; (5) account suspension pending investigation. Technical: CSAM reports bypass the standard Report table entirely. They go to a separate CSAM_Incident table with its own retention policy (records are not deleted and are kept for law enforcement). A dedicated alerting pipeline notifies the Trust & Safety legal team directly (PagerDuty, legal-oncall). All CSAM handling code must be isolated behind strict access controls to minimize the number of employees who can access quarantined content.

Q: How do you measure moderator accuracy and consistency?

Inconsistent moderation (moderator A removes content that moderator B would keep) undermines trust in the policy. Measure it with calibration sets: periodically send the same test cases (with known-correct answers) to multiple moderators and measure the agreement rate. Target: 90%+ inter-rater reliability. Per-moderator accuracy: compare each moderator's decisions against a gold standard (decisions that were overturned on appeal count against the moderator). Dashboard metrics: average review time per category, approval rate by moderator (outliers may be too strict or too lenient), and appeal-overturn rate (the share of a moderator's decisions that the appeals team reversed). Retrain moderators whose accuracy falls below 80%. Use moderator decisions as training data for ML classifiers, since human labels are the ground truth.
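As a rough illustration of the calibration metrics above, the sketch below computes agreement and per-moderator accuracy from calibration-set decisions. It makes several assumptions: the function names and the nested-dict input shape are hypothetical, and "inter-rater reliability" is approximated by simple pairwise percent agreement rather than a chance-corrected statistic such as Cohen's kappa.

```python
from collections import Counter
from itertools import combinations


def pairwise_agreement(decisions):
    """Fraction of moderator pairs that chose the same label, over all items.

    decisions: {item_id: {moderator_id: label}} (assumed input shape).
    Returns None when no item was reviewed by at least two moderators.
    """
    agree = total = 0
    for labels in decisions.values():
        for a, b in combinations(labels.values(), 2):
            total += 1
            agree += a == b
    return agree / total if total else None


def accuracy_by_moderator(decisions, gold):
    """Share of each moderator's calibration decisions that match the gold label."""
    correct, seen = Counter(), Counter()
    for item_id, labels in decisions.items():
        for moderator_id, label in labels.items():
            seen[moderator_id] += 1
            correct[moderator_id] += label == gold[item_id]
    return {m: correct[m] / seen[m] for m in seen}


def needs_retraining(accuracy, threshold=0.80):
    """Moderators whose accuracy is below the retraining threshold."""
    return sorted(m for m, acc in accuracy.items() if acc < threshold)
```

Percent agreement is easy to explain on a dashboard, but it overstates reliability when one label dominates; a chance-corrected measure may be worth adding if most calibration items share the same answer.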

Q: How do you build an appeal system for auto-moderated content?

Auto-moderation false positives (legitimate content incorrectly removed) must have an appeal path, or user trust degrades. Appeal flow: (1) the user receives a removal notice with a reason code and an appeal link; (2) clicking the link creates an Appeal record linked to the EnforcementAction; (3) the appeal goes to a human moderator queue (at higher priority than new reports); (4) a moderator reviews the original content and context; (5) decision: uphold the enforcement (auto-mod was correct; explain the policy violation) or overturn it (remove the enforcement action and restore the content). Track: appeal volume by content type (a high appeal rate suggests the ML model needs retraining), average appeal resolution time (target <72 hours), and overturn rate (the proportion of appeals decided in the user's favor; a high rate means auto-mod is too aggressive).
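The tracking metrics above can be computed from Appeal records along these lines. This is a sketch under stated assumptions: the Appeal fields shown here (content_type, created_at, resolved_at, outcome) and the helper names are hypothetical, chosen only to illustrate the overturn rate and the 72-hour target.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class Appeal:
    # Assumed fields; the source only says an Appeal links to an EnforcementAction.
    content_type: str
    created_at: datetime
    resolved_at: Optional[datetime] = None
    outcome: Optional[str] = None  # "upheld" or "overturned" once resolved


def overturn_rate(appeals):
    """Share of resolved appeals decided in the user's favor."""
    resolved = [a for a in appeals if a.outcome is not None]
    if not resolved:
        return None
    return sum(a.outcome == "overturned" for a in resolved) / len(resolved)


def mean_resolution_hours(appeals):
    """Average hours from appeal creation to resolution, ignoring open appeals."""
    hours = [
        (a.resolved_at - a.created_at).total_seconds() / 3600
        for a in appeals
        if a.resolved_at is not None
    ]
    return sum(hours) / len(hours) if hours else None


def overdue(appeals, now, sla=timedelta(hours=72)):
    """Open appeals that have already exceeded the resolution target."""
    return [a for a in appeals if a.resolved_at is None and now - a.created_at > sla]
```

Grouping the input by content_type before calling overturn_rate gives the per-type view that indicates which classifier needs attention.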

Q: How does reporter reputation prevent coordinated false-flagging attacks?

A harassment campaign can coordinate hundreds of users to report a target's content, hoping auto-enforcement triggers a suspension. Without reputation weighting, 500 coordinated reports carry the same weight as 500 organic reports. With reporter accuracy scoring: if a reporter's past reports are dismissed at a rate of 80% (4 out of 5 are unfounded), their new reports receive 20% weight. A coordinated group of 500 low-accuracy reporters generates an effective signal of 100 reports — below auto-enforcement thresholds. Track per reporter: reports_total, reports_upheld (resulted in action_taken), reports_dismissed. Accuracy = upheld / resolved. New reporters start at 50% accuracy. Update after each resolution. Rate limit low-accuracy reporters: if accuracy < 0.2 for last 30 days, cap at 3 reports/day.
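The weighting arithmetic above can be sketched as follows. The class and function names are hypothetical, and the sketch assumes the counters passed in already cover the relevant window (for the rate limit, the last 30 days); reports_upheld and reports_dismissed are the only names taken from the text.

```python
from dataclasses import dataclass

DEFAULT_ACCURACY = 0.5  # new reporters with no resolved reports start at 50%


@dataclass
class ReporterStats:
    reports_upheld: int = 0
    reports_dismissed: int = 0

    @property
    def accuracy(self):
        """Accuracy = upheld / resolved, falling back to the default for new reporters."""
        resolved = self.reports_upheld + self.reports_dismissed
        if resolved == 0:
            return DEFAULT_ACCURACY
        return self.reports_upheld / resolved


def effective_signal(reporters):
    """Sum of per-reporter weights; each report counts for the reporter's accuracy."""
    return sum(r.accuracy for r in reporters)


def daily_report_cap(stats, low_accuracy=0.2, capped_limit=3):
    """Return the daily cap for low-accuracy reporters, or None for no cap."""
    return capped_limit if stats.accuracy < low_accuracy else None
```

With 500 reporters who each have 1 upheld and 4 dismissed reports, every weight is 0.2 and effective_signal comes out at about 100, matching the figure in the answer. Note that this simple ratio jumps straight from 0.5 to 0 or 1 after a reporter's first resolved report; smoothing that transition would be a separate design decision.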

Report and Abuse System Low-Level Design: Triage, Moderator Queue, and Auto-Enforcement

Core Data Model

Submitting a Report

Moderator Review Queue

Key Interview Points