System Design: Design Pinterest — Image Discovery, Visual Search, Pin/Board System, Feed Generation, Recommendations

Pinterest is a visual discovery platform with 450+ million monthly active users who save, organize, and discover ideas through images (pins). Designing Pinterest combines image processing, visual search (finding similar images by content), recommendation engines, and a unique social-graph structure (boards, followers, interests). This guide covers the key architectural components for a system design interview.

Data Model: Pins, Boards, and Users

Core entities: (1) Pin — an image with metadata: pin_id, image_url, title, description, source_url (external link), creator_id, board_id, created_at, save_count, click_count. (2) Board — a collection of pins organized by theme: board_id, user_id, name, description, category, pin_count, follower_count, is_secret. (3) User — user_id, username, interests (inferred), followers, following, boards. (4) Save — when a user saves a pin to their board: save_id, user_id, pin_id, board_id, created_at. This is the primary engagement signal. The interest graph: unlike Twitter (who you follow) or Facebook (who you know), Pinterest is organized around interests. A user follows boards (topics), not just people. The recommendation engine models user interests based on saved pins, clicked pins, and followed boards. Storage: pin metadata in PostgreSQL (sharded by pin_id). Images in S3. The pin-user-board relationships (saves, follows) in a graph store or wide-column database. Elasticsearch for pin search (text + visual features).
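The entities above can be sketched as Python dataclasses, together with a toy hash-based shard router for the PostgreSQL tier. This is a minimal sketch: the field set mirrors the text, but the 64-shard count and modulo routing are illustrative assumptions, not Pinterest's actual scheme.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class Pin:
    pin_id: int
    image_url: str
    title: str
    description: str
    source_url: str      # external link the pin points to
    creator_id: int
    board_id: int
    created_at: datetime
    save_count: int = 0
    click_count: int = 0

@dataclass
class Board:
    board_id: int
    user_id: int
    name: str
    category: str
    pin_count: int = 0
    follower_count: int = 0
    is_secret: bool = False

@dataclass
class Save:
    """The primary engagement signal: a user saving a pin to a board."""
    save_id: int
    user_id: int
    pin_id: int
    board_id: int
    created_at: datetime

NUM_SHARDS = 64  # assumption: fixed shard count, for illustration only

def shard_for_pin(pin_id: int) -> int:
    """Route a pin to a PostgreSQL shard by its id (naive modulo hashing)."""
    return pin_id % NUM_SHARDS
```

In a real deployment the shard map would live in a config service so shards can be split without rehashing every pin.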

Image Processing Pipeline

When a pin is created or an image is uploaded: (1) Store the original image in S3. (2) Generate multiple resized versions: thumbnail (150px), medium (600px), and full-size (original). Serve via CDN with format negotiation (WebP/AVIF for modern browsers). (3) Extract visual features — a deep learning model (convolutional neural network) processes the image and produces a feature embedding vector (e.g., 2048-dimensional vector). This embedding captures the visual content: colors, objects, style, composition. Store the embedding in a vector database (Pinecone, Milvus, or custom FAISS-based service). (4) Object detection and classification — ML identifies objects in the image: “red dress,” “modern kitchen,” “chocolate cake.” These labels enrich the pin metadata for search and recommendation. (5) Text extraction (OCR) — extract any text visible in the image (recipe instructions, product names, quotes). Add to searchable metadata. (6) Content moderation — ML scans for policy violations (nudity, violence, spam). Flagged images are queued for human review. (7) Duplicate detection — compute a perceptual hash (pHash). Compare against existing pins to detect duplicates and near-duplicates. Prevent the same image from flooding the platform.
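The duplicate-detection step (7) can be illustrated with a simplified average hash. Real pHash applies a DCT to the downscaled image before thresholding; this average-hash variant skips the DCT but shows the same idea: reduce the image to a compact bit fingerprint, then compare fingerprints by Hamming distance. The 8x8 input and the distance threshold of 5 are illustrative assumptions.

```python
def average_hash(gray_8x8):
    """Hash a downscaled 8x8 grayscale image: one bit per pixel,
    set if the pixel is brighter than the image mean.
    (Simplified aHash; real pHash applies a DCT first.)"""
    pixels = [p for row in gray_8x8 for p in row]
    mean = sum(pixels) / len(pixels)
    bits = 0
    for p in pixels:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits

def hamming_distance(h1, h2):
    """Number of differing bits between two hashes."""
    return bin(h1 ^ h2).count("1")

def is_near_duplicate(h1, h2, threshold=5):
    """Small Hamming distance => visually near-identical images."""
    return hamming_distance(h1, h2) <= threshold
```

Because the hash tolerates small pixel changes, re-encoded or lightly cropped re-uploads of the same image still collide, which is exactly what keeps duplicates from flooding the platform.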

Visual Search: Find Similar Images

Pinterest Lens lets users take a photo and find visually similar pins. Architecture: (1) The user takes a photo or selects a region of an existing pin. (2) The image is processed by the same CNN that generates pin embeddings. (3) Approximate nearest neighbor (ANN) search finds the K most similar embeddings in the vector database. With 10+ billion pins, exact nearest-neighbor search is too slow; ANN algorithms (HNSW, IVF-PQ in FAISS) return approximate results in milliseconds. (4) The results are ranked by visual similarity, relevance to the user's interests, and pin quality (save count, freshness). (5) Display results with “Shop the Look” (links to buy identified products). The embedding model is trained on Pinterest data: pins that are frequently saved together are embedded closer together. This captures both visual similarity and semantic relevance (a blue couch and blue cushions are related even though they look different). Crop search: the user draws a box around a specific object in a pin (say, a lamp in a room photo). The cropped region is embedded and searched separately, returning pins of similar lamps. This is object-level visual search, powered by object detection plus per-object embeddings.
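The similarity lookup in step (3) can be sketched with exact cosine k-NN over a tiny in-memory embedding table. This is purely illustrative: at 10+ billion pins this exact scan is what ANN indexes (HNSW, IVF-PQ in FAISS) replace, trading a little recall for millisecond latency.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k_similar(query_embedding, pin_embeddings, k=3):
    """Exact k-NN: score every pin, return ids of the k most similar.
    Production systems swap this scan for an ANN index in a vector DB."""
    scored = [(pin_id, cosine_similarity(query_embedding, emb))
              for pin_id, emb in pin_embeddings.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return [pin_id for pin_id, _ in scored[:k]]
```

In practice the vectors are ~2048-dimensional and L2-normalized at index time so that cosine similarity reduces to a dot product.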

Home Feed Generation

The Pinterest home feed is entirely recommendation-driven (unlike Twitter, where you see posts from accounts you follow). The feed shows pins the user is predicted to find interesting, regardless of who posted them. Pipeline: (1) Candidate generation — produce thousands of candidate pins from: pins similar to recently saved pins (embedding similarity), pins popular in the user's interest categories, pins from followed boards and users, and trending pins in the user's region. (2) Ranking — an ML model scores each candidate. Features: pin quality (save rate, click-through rate), visual appeal (learned from engagement data), relevance to user interests (embedding distance), freshness (new pins get a boost), and diversity (avoid showing 10 kitchen pins in a row). (3) Blending — mix candidates from different sources. Ensure a balanced feed: some from interests, some new discoveries, some trending. (4) Deduplication — remove pins the user has already seen (maintain a seen set per user). The feed is generated in batches (pre-compute the next 100 pins when the user is near the end of the current batch). Each batch request to the recommendation service takes 50-100 ms.
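The dedup, ranking, and diversity steps above can be sketched as one batch-assembly function. Everything here is a simplification: `score_fn` stands in for the ML ranking model, and the "no two consecutive pins from the same category" rule is a deliberately naive diversity heuristic.

```python
def rank_feed(candidates, seen_pins, score_fn, batch_size=100):
    """Assemble one feed batch from scored candidates.

    candidates: list of dicts with at least "pin_id" and "category" keys
    seen_pins:  set of pin_ids the user has already been shown
    score_fn:   stand-in for the ML ranking model's score function
    """
    # Deduplication: drop pins the user has already seen.
    fresh = [p for p in candidates if p["pin_id"] not in seen_pins]
    # Ranking: highest model score first.
    ranked = sorted(fresh, key=score_fn, reverse=True)
    # Naive diversity pass: defer any pin whose category matches the
    # previous pick, then append the deferred pins at the end.
    feed, deferred, prev_category = [], [], None
    for pin in ranked:
        if pin["category"] == prev_category:
            deferred.append(pin)
        else:
            feed.append(pin)
            prev_category = pin["category"]
    feed.extend(deferred)
    return feed[:batch_size]
```

A real blender would also interleave candidates by source (interests, discovery, trending) rather than only by category.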

Search

Pinterest search combines text search and visual search. Text search: Elasticsearch indexes pin titles, descriptions, board names, and ML-generated labels (object detection output). Query processing: spell correction, synonym expansion, and intent classification (is the user looking for ideas, products, or tutorials?). Results are ranked by text relevance (BM25) + engagement (save rate, click rate) + personalization (user interest match). Visual search: when a user searches “modern living room,” the system also retrieves pins whose visual embeddings are similar to those of the “modern living room” pins the user previously saved. This personalizes results beyond text matching. Guided search: after a query, Pinterest shows refinement chips: “modern living room” -> “small space” | “minimalist” | “with fireplace” | “Scandinavian.” These are generated from common query refinements learned from search logs. Shopping: pins linked to products show prices and availability. Product pins are indexed with structured attributes (price, brand, availability). Search results blend organic pins with shoppable pins when commercial intent is detected.
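The guided-search chips can be mined from query logs roughly as follows: when a session follows a query with a longer query containing it, the added words become a candidate refinement, and the most frequent refinements become chips. The session-list log format here is a hypothetical simplification of real search logs.

```python
from collections import Counter

def refinement_chips(query_log, query, top_n=4):
    """Derive guided-search chips for `query` from session logs.

    query_log: list of sessions, each a chronological list of queries
               (hypothetical format, for illustration)
    """
    counts = Counter()
    for session in query_log:
        # Look at consecutive query pairs within a session.
        for prev, nxt in zip(session, session[1:]):
            if prev == query and query in nxt and nxt != query:
                # The words added around the original query form the chip.
                refinement = nxt.replace(query, "").strip()
                if refinement:
                    counts[refinement] += 1
    return [chip for chip, _ in counts.most_common(top_n)]
```

A production version would aggregate across millions of sessions offline and filter chips for quality and policy before serving them.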
