Low-Level Design: Blog Platform — Content Management, Comments, and SEO-Friendly URLs

Core Entities

User: user_id, username (unique, URL-safe), email, password_hash, display_name, bio, avatar_url, role (READER, AUTHOR, EDITOR, ADMIN), created_at.

Post: post_id, author_id, title, slug (unique, URL-safe: "my-first-post"), body (Markdown or HTML), excerpt (auto-generated or manual), status (DRAFT, PUBLISHED, ARCHIVED), featured_image_url, published_at, created_at, updated_at, view_count, reading_time_minutes.

Tag: tag_id, name, slug.

PostTag: post_id, tag_id.

Comment: comment_id, post_id, author_id (nullable for guests), author_name (for guests), author_email, body, status (PENDING, APPROVED, SPAM), parent_id (nullable for threading), created_at.

PostRevision: revision_id, post_id, author_id, body_snapshot, title_snapshot, created_at, change_summary.
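The Post entity above can be sketched as a dataclass with its status as an enum; the concrete field types and defaults are assumptions, not a prescribed schema:

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum
from typing import Optional

class PostStatus(Enum):
    DRAFT = "DRAFT"
    PUBLISHED = "PUBLISHED"
    ARCHIVED = "ARCHIVED"

@dataclass
class Post:
    post_id: int
    author_id: int
    title: str
    slug: str                           # unique per author, URL-safe
    body: str                           # Markdown or HTML
    status: PostStatus = PostStatus.DRAFT
    excerpt: Optional[str] = None       # auto-generated if not set manually
    featured_image_url: Optional[str] = None
    published_at: Optional[datetime] = None
    view_count: int = 0
    reading_time_minutes: int = 0
```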

SEO-Friendly URLs and Slug Generation

Posts are accessed via /author-username/post-slug (e.g., /john/how-to-design-systems).

Slug generation: convert the title to lowercase, replace spaces with hyphens, strip non-alphanumeric characters except hyphens, and truncate to 100 characters. Ensure uniqueness per author: if the slug already exists, append -2, -3, etc.

Redirect handling: when a post's title (and thus slug) changes, keep the old slug mapping in a PostSlug table: (slug, post_id, is_primary). The old slug returns HTTP 301 to the new slug. Never return 404 for previously valid URLs; search engines penalize broken links.

Canonical URL: set the canonical link tag to the primary slug URL to consolidate SEO credit if multiple paths reach the same post.

RSS feed: generate /feed.xml for each author and globally. Cache the RSS XML (regenerate only when a new post is published).

Sitemap: generate /sitemap.xml with all published posts, tags, and author pages. Refresh daily. Submit to Google Search Console.
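The slug rules above can be sketched as two small helpers; this is a minimal illustration assuming an in-memory set of the author's existing slugs (production code would query the PostSlug table instead):

```python
import re

def slugify(title: str, max_length: int = 100) -> str:
    """Lowercase, hyphenate spaces, keep only [a-z0-9-], truncate to 100 chars."""
    slug = title.lower().strip()
    slug = re.sub(r"\s+", "-", slug)                # spaces -> hyphens
    slug = re.sub(r"[^a-z0-9-]", "", slug)          # drop other characters
    slug = re.sub(r"-{2,}", "-", slug).strip("-")   # collapse/trim hyphens
    return slug[:max_length].rstrip("-")

def unique_slug(title: str, existing: set) -> str:
    """Append -2, -3, ... until the slug is unique for this author."""
    base = slugify(title)
    if base not in existing:
        return base
    n = 2
    while f"{base}-{n}" in existing:
        n += 1
    return f"{base}-{n}"
```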

Draft and Publishing Workflow

from datetime import datetime, timezone

class PostService:
    def publish_post(self, post_id: int, publisher_id: int) -> Post:
        post = self.repo.get_post(post_id)
        if post.status != PostStatus.DRAFT:
            raise InvalidStatusError(f"Cannot publish post in status {post.status}")
        if post.author_id != publisher_id and not self.is_editor(publisher_id):
            raise PermissionError("Only the author or an editor may publish")

        # Compute auto-fields
        post.reading_time_minutes = self._estimate_reading_time(post.body)
        post.excerpt = post.excerpt or self._generate_excerpt(post.body)
        post.published_at = datetime.now(timezone.utc)  # timezone-aware; utcnow() is deprecated
        post.status = PostStatus.PUBLISHED

        self.repo.save(post)

        # Invalidate every cache entry that includes this post
        self.cache.delete(f"post:{post.slug}")
        self.cache.delete("recent_posts")
        self.cache.delete(f"author_posts:{post.author_id}")

        # Trigger async jobs (newsletter, search-engine ping, etc.)
        self.events.publish("post.published", {
            "post_id": post.post_id,
            "author_id": post.author_id,
        })
        return post

The post.published event triggers: notification to subscribers (newsletter send), ping to search engines (IndexNow API for instant indexing), social media auto-post (if configured), CDN cache warmup.
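The fan-out above can be sketched as a minimal in-process handler registry; the registry, decorator, and handler names are illustrative assumptions (a real deployment would publish to a message queue and run consumers separately):

```python
# Sketch of the "post.published" fan-out; names are illustrative, not the
# platform's actual event bus.
POST_PUBLISHED_HANDLERS = []

def on_post_published(handler):
    """Register a handler to run when a post is published."""
    POST_PUBLISHED_HANDLERS.append(handler)
    return handler

def dispatch_post_published(payload: dict) -> None:
    # Handlers run independently: a failing search-engine ping must not
    # block the newsletter send. Production code would log and retry.
    for handler in POST_PUBLISHED_HANDLERS:
        try:
            handler(payload)
        except Exception:
            pass

@on_post_published
def send_newsletter(payload: dict) -> None:
    ...  # enqueue batched subscriber emails

@on_post_published
def ping_search_engines(payload: dict) -> None:
    ...  # submit the new URL to the IndexNow API
```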

Comments with Moderation

Comment workflow: (1) A guest or user submits a comment → status=PENDING. (2) Spam check: run it through the Akismet API (the industry standard for blog spam). If spam: status=SPAM (not shown). (3) Auto-approve trusted commenters: users with 2+ previously approved comments are approved immediately (status=APPROVED). First-time commenters wait for moderation. (4) Author/editor moderation queue: PENDING comments appear in the admin dashboard for approval or marking as spam. Email the post author when a new comment awaits moderation.

Threading: store parent_id on each comment. Load top-level comments first; load replies on expand. Flat representation in the database, tree representation in the UI.

Comment count: cache the approved comment count per post in Redis. Increment on approve, decrement on delete. Avoid counting in SQL on every page load.

Anti-spam: rate limit by IP (max 3 comments per 10 minutes), add a honeypot field as a bot trap (bots fill it in, humans never see it), and validate email format for guests.
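The status decision and the IP rate limit above can be sketched as pure functions; the in-memory `ip_log` dict stands in for whatever store (e.g., Redis) production would use:

```python
from datetime import datetime, timedelta

TRUST_THRESHOLD = 2                  # approved comments needed for auto-approval
RATE_LIMIT = 3                       # max comments per IP per window
RATE_WINDOW = timedelta(minutes=10)

def initial_comment_status(author_id, approved_count: int, is_spam: bool) -> str:
    """Decide the initial status of a newly submitted comment."""
    if is_spam:                      # e.g., flagged by the Akismet check
        return "SPAM"
    if author_id is not None and approved_count >= TRUST_THRESHOLD:
        return "APPROVED"            # trusted returning commenter
    return "PENDING"                 # guests and first-timers await moderation

def allow_comment(ip_log: dict, ip: str, now: datetime) -> bool:
    """Sliding-window rate limit: at most RATE_LIMIT comments per RATE_WINDOW per IP."""
    recent = [t for t in ip_log.get(ip, []) if now - t < RATE_WINDOW]
    if len(recent) >= RATE_LIMIT:
        ip_log[ip] = recent
        return False
    recent.append(now)
    ip_log[ip] = recent
    return True
```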

Post Discovery and Recommendations

Tag-based discovery: posts are tagged with topics. A tag page (/tag/system-design) shows all posts for a tag sorted by published_at DESC. Index: (tag_id, published_at DESC) on the PostTag join with Posts. An author page (/john) shows all of an author's published posts.

Related posts: after reading a post, show 3-5 related posts. Simple approach: posts sharing the most tags with the current post. Query: find posts sharing any tag, rank by number of shared tags, return the top 5. More sophisticated: TF-IDF similarity on post content (computed offline, stored in a related_posts cache). Cache related posts per post_id for 24 hours.

Trending posts: posts with the most views in the last 7 days. Maintain a sorted set in Redis: ZINCRBY trending:posts 1 post_id on each page view. Refresh daily from the database for accuracy (the Redis count is approximate).

Full-text search: an Elasticsearch index for searching across titles, bodies, and tags. Autocomplete suggestions come from a completion suggester on post titles.
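The simple shared-tag ranking can be sketched in memory; `post_tags` here stands in for the PostTag rows (in SQL, the equivalent is a self-join on PostTag, GROUP BY post_id, ORDER BY COUNT(*) DESC LIMIT 5):

```python
from collections import Counter

def related_posts(current_post_id: int, post_tags: dict, limit: int = 5) -> list:
    """Rank other posts by the number of tags shared with the current post.

    post_tags maps post_id -> set of tag_ids.
    """
    current = post_tags.get(current_post_id, set())
    overlap = Counter()
    for pid, tags in post_tags.items():
        if pid == current_post_id:
            continue
        shared = len(current & tags)
        if shared:
            overlap[pid] = shared          # posts with zero shared tags are excluded
    return [pid for pid, _ in overlap.most_common(limit)]
```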

{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "How do you handle URL slug changes without breaking existing links?",
"acceptedAnswer": {
"@type": "Answer",
"text": "When a post title changes, the slug changes (the SEO-friendly URL changes). Existing links from other sites, bookmarks, and search engine indexes should continue to work. Solution: store all historical slugs for a post. Schema: PostSlug table: (slug, post_id, is_primary, created_at). On slug change: set the old slug's is_primary=false and insert the new slug with is_primary=true. URL resolution: when serving /:author/:slug, look up the slug in PostSlug. If is_primary=false: return an HTTP 301 redirect to the current primary slug URL. If is_primary=true: serve the post. Canonical link: in the HTML head, always set the canonical link tag to the current primary slug URL. This prevents duplicate content penalties even if old URLs are still being accessed. Slug uniqueness: enforce unique (author_id, slug) across ALL slugs (not just primary). This prevents a new post from claiming a slug that redirects to an old post. Index: (slug, author_id) for fast resolution. 410 Gone: if a post is deleted (not just archived), return 410 Gone for old slugs (this tells search engines the content is permanently removed, not just moved)."
}
},
{
"@type": "Question",
"name": "How do you implement a rich text editor for blog posts with image uploads?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Rich text editors for blogs typically build on ProseMirror (the foundation of Tiptap and of Notion's editor) or Quill. These serialize content as JSON (a structured document model) or HTML. Recommendation: store as JSON (portable, structured, queryable) and render to HTML at display time. Image upload flow in the editor: (1) User pastes or drags an image into the editor. (2) Editor captures the image file (Blob). (3) Client requests a pre-signed S3 upload URL from the server. (4) Client uploads directly to S3. (5) On upload completion: client inserts an image node in the document with the S3 URL. Server-side image processing, on S3 upload notification: (1) Validate the file (MIME type, max size 10MB, scan for malware with ClamAV; optionally moderate content with AWS Rekognition). (2) Resize to multiple sizes (thumbnail 300px, medium 800px, full size) using Pillow or Sharp. (3) Convert to WebP for smaller file sizes. (4) Store all sizes; use the medium size as the default in blog posts. CDN: serve images through CloudFront or Cloudflare with caching headers (Cache-Control: max-age=31536000, immutable). Image URLs are content-addressed (hash in the filename), so they never need to be invalidated."
}
},
{
"@type": "Question",
"name": "How do you design the email newsletter subscription system for a blog?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Newsletter subscriptions: readers subscribe to an author's blog. On new post publish: send emails to all subscribers. Schema: Subscription: subscription_id, subscriber_email, author_id, status (ACTIVE, UNSUBSCRIBED, BOUNCED), confirmed (boolean for double opt-in), subscribed_at, token (random token for unsubscribe links). Double opt-in: on subscribe, send a confirmation email with a link containing the token. Only set confirmed=true after the link is clicked. Prevents spam sign-ups (someone else's email being subscribed without consent). CAN-SPAM/GDPR compliance: include an unsubscribe link in every email. Unsubscribe = one-click (no login required). Process the unsubscribe via a token link: GET /unsubscribe?token=xxx → set status=UNSUBSCRIBED. NEVER re-subscribe a user who unsubscribed without explicit re-consent. Send time: when a new post is published, enqueue a newsletter job. The job sends emails in batches (50 emails/second via SES API to stay within rate limits). For 10K subscribers: ~3 minutes to send all emails. Track bounces: hard bounces (invalid email) → set status=BOUNCED, never send again. Soft bounces (mailbox full) → retry 3 times, then BOUNCED. Use SES bounce notifications via SNS."
}
},
{
"@type": "Question",
"name": "How do you implement reading time estimation and auto-generated excerpts?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Reading time: based on the average adult reading speed of 200-250 words per minute. Algorithm: strip HTML tags from the post body, split by whitespace, count words. reading_time_minutes = ceil(word_count / 200). Account for images: each image adds ~10 seconds (0.17 minutes) of viewing time. Formula: reading_time_minutes = ceil(word_count / 200 + image_count * 0.17). Display: \"5 min read.\" Store on the Post row so it doesn't need to be recomputed on each page view. Recompute when the post body is updated. Auto-generated excerpt: take the first 150-200 characters of the stripped text (no HTML). Truncate at a word boundary (don't cut mid-word). Append \"…\" if truncated. This excerpt is used: in post listings, in RSS feed summaries, and in the og:description meta tag (for social sharing previews). If the author manually sets an excerpt: use that instead (override). Store in the excerpt column. Excerpt length: 150-160 characters is the target, matching search engine meta description length. Longer excerpts are truncated by search engines anyway."
}
},
{
"@type": "Question",
"name": "How do you handle SEO meta tags and Open Graph for a blog platform?",
"acceptedAnswer": {
"@type": "Answer",
"text": "SEO meta tags: title tag: use post.title | site_name (max 60 characters). meta description: use post.excerpt (max 160 characters). meta robots: published posts = index,follow. Draft and archived posts = noindex,nofollow. Open Graph (og:) tags for social sharing: og:title, og:description (post.excerpt), og:image (post.featured_image_url or a generated social card), og:type = article, og:url = canonical URL, og:article:author = author profile URL, og:article:published_time = ISO-8601 publish date. Twitter Card meta tags: twitter:card = summary_large_image (shows a large preview image), twitter:title, twitter:description, twitter:image, twitter:creator = @author_twitter_handle. JSON-LD structured data: Article schema with author, datePublished, dateModified, image, headline. FAQPage schema for posts with FAQ sections (improves search result display with Q&A rich snippets). Generated at render time and cached. Cache invalidation: when a post is updated (title, excerpt, or featured image changed), purge the CDN page cache for that URL. CDN edge caching: cache the full rendered HTML at the edge (CDN) for 60 seconds. Blog posts are read-heavy; CDN caching absorbs most traffic without hitting the origin."
}
}
]
}
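The reading-time and excerpt rules described in the FAQ above can be sketched as two helpers matching the `_estimate_reading_time` and `_generate_excerpt` calls in PostService; the regex-based tag stripping is a simplification for illustration:

```python
import math
import re

WORDS_PER_MINUTE = 200
SECONDS_PER_IMAGE = 10   # ~0.17 minutes of viewing time per image

def estimate_reading_time(html_body: str) -> int:
    """ceil(word_count / 200 + image_count * 10s), per the formula above."""
    text = re.sub(r"<[^>]+>", " ", html_body)          # strip HTML tags
    words = len(text.split())
    images = len(re.findall(r"<img\b", html_body, re.I))
    return math.ceil(words / WORDS_PER_MINUTE + images * SECONDS_PER_IMAGE / 60)

def generate_excerpt(html_body: str, max_len: int = 160) -> str:
    """First ~160 chars of stripped text, truncated at a word boundary."""
    text = re.sub(r"<[^>]+>", " ", html_body)
    text = " ".join(text.split())                      # collapse whitespace
    if len(text) <= max_len:
        return text
    cut = text[:max_len].rsplit(" ", 1)[0]             # don't cut mid-word
    return cut + "…"
```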
