Windows Recall
Semantic search for everything you've seen

Overview
What if your computer could remember everything you've seen?
Recall began with a deceptively simple idea. What if your PC could act as a memory layer across everything you have viewed, allowing you to find past moments just by describing them?
This was not browser history or file indexing. It was an on-device system that continuously captured snapshots of what appeared on screen and made them searchable through semantic understanding. The ambition was large. So were the implications.
From the beginning, we knew the challenge was not only technical. Capturing what users see is powerful, but it also raises immediate concerns about privacy and control. The design problem was to create a system that felt intelligent without feeling invasive, and useful without feeling unpredictable. The feature would only succeed if users trusted it.
My role focused specifically on the semantic search experience: how users search their memories, how results are ranked and displayed, and how relevance is communicated in a way that feels understandable and trustworthy.

Traditional search expects precision. Human memory does not.
People rarely remember exact file names or timestamps. They remember fragments and impressions. They remember the spreadsheet with a blue chart or the slide with a bold quote. Their mental model is contextual and visual.
If Recall required precise input, it would fail. At the same time, if it returned results that felt mysterious or inconsistent, confidence would erode quickly. We needed to design an experience that supported fuzzy queries while clearly explaining what the system was doing.
In early iterations, the underlying AI performed well enough to retrieve relevant results. The interface, however, was not yet helping users understand those results.
Because searching memory is different from searching files, we tested rotating prompts and contextual suggestions based on recent activity. These cues helped teach users that they could describe what they remembered in natural language. This subtle mental model shift was essential to adoption.

Making results feel like moments, not files
Each Recall card contained multiple signals: a screenshot, associated application, timestamp, extracted text, and ranking data. Establishing hierarchy was straightforward from an information design perspective. The more nuanced challenge was shaping how the card felt.
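The bundle of signals a card carries can be sketched as a simple record. This is an illustrative model only; the field names and types here are assumptions, not the actual Recall schema.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class RecallCard:
    """One searchable snapshot 'moment'. All field names are illustrative."""
    screenshot_path: str   # the visual snapshot, the card's primary anchor
    app_name: str          # application associated with the snapshot
    captured_at: datetime  # when the snapshot was taken
    extracted_text: str    # OCR'd on-screen text used for text matching
    relevance: float       # ranking score supplied by the search backend
```

Treating the screenshot as the first field mirrors the design decision: the image anchors the card, and everything else is supporting metadata.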
We anchored the design in the visual snapshot. The screenshot became the primary element, reinforcing that Recall was helping users revisit a moment they had seen. We wanted to avoid the impression that this was just another file search interface.
Interaction design required equal care. Selecting a card could either take users into the Recall memory view or reopen the original window. These paths needed to be clear and predictable. Any ambiguity would undermine trust quickly.
Close collaboration with research and engineering was essential during this phase. Small adjustments in labeling, hierarchy, and interaction timing significantly improved clarity. This was not about visual polish. It was about reinforcing a stable mental model.
Above all, the experience had to feel like browsing moments in time, not launching files. The affordances signaled before each click whether a card would open the detail view or rehydrate the original window, reinforcing that this was memory, not file management.

When everything was merged, clarity suffered
Our first result model blended text matches and visual matches into a single ranked list. In theory, this felt elegant. The system would simply determine the best match and display it at the top.
In practice, this created confusion. Text matches were usually more precise and therefore dominated the ranking. Visual matches, which often aligned more closely with what users remembered, were pushed lower. Users struggled to understand why certain results appeared. Without a clear explanation of relevance, even correct results felt questionable.
Through usability testing, it became clear that relevance needed to be legible. Users did not just want accurate results. They wanted to understand why a result was being shown.
We separated text-based matches and visual matches into distinct sections. This structural change made the system's behavior easier to interpret. It reduced cognitive load and improved trust almost immediately. Instead of competing within one opaque ranking, each type of result had a clear place.
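The structural change amounts to splitting one blended ranking into two labeled sections and ranking within each. A minimal sketch, assuming each result carries a `match_type` and a `score` (both hypothetical keys for illustration):

```python
def group_results(results):
    """Split a flat ranked list into text and visual sections.

    Each result is a dict with a 'match_type' of 'text' or 'visual'
    and a numeric 'score'; both keys are assumptions for this sketch.
    """
    sections = {"text": [], "visual": []}
    for r in results:
        sections[r["match_type"]].append(r)
    # Rank within each section rather than across the whole list,
    # so precise text hits no longer bury fuzzier visual matches.
    for matches in sections.values():
        matches.sort(key=lambda r: r["score"], reverse=True)
    return sections
```

Because scores are only compared within a section, a strong text match and a weaker visual match can each lead their own group instead of competing in one opaque ordering.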
The broader lesson held throughout the project: even a technically correct result feels unreliable when users cannot tell why it appeared. The problem was never just ranking. It was legibility.

AI without explanation feels random
Users do not trust magic. They trust clarity.
Grouping text and visual matches separately meant people could immediately recognize the kind of result they were seeing. We also explored subtle cues to communicate why something appeared, without exposing embedding scores or internal model complexity.
This work helped define how ranking logic is expressed in the interface. Instead of surfacing raw intelligence, we translated it into something understandable. Transparency turned the system from mysterious to useful.

Fast enough to feel alive
Recall relied on semantic embeddings and local indexing, which can be computationally intensive. While many queries returned quickly, there were moments, particularly during initial indexing or system load, where responses took longer.
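To see why queries can be expensive, consider the simplest form of semantic retrieval: scoring a query embedding against every indexed snapshot. This is a brute-force sketch for intuition, not Recall's actual indexing pipeline, which would use more efficient structures.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def top_k(query_vec, index, k=3):
    """Brute-force nearest neighbors over (snapshot_id, vector) pairs.

    Every query scores every snapshot, which is why a large local
    index makes latency visible; production systems typically use
    approximate nearest-neighbor structures instead.
    """
    scored = [(sid, cosine(query_vec, vec)) for sid, vec in index]
    return sorted(scored, key=lambda s: s[1], reverse=True)[:k]
```

The cost grows linearly with the number of snapshots, so a machine that has been capturing for weeks pays more per keystroke than one indexed yesterday.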
Search needed to feel responsive as users typed. We implemented a live updating experience where results evolved with each keystroke. This created a sense of intelligence and fluidity. However, when latency surfaced, we needed to ensure it did not feel like failure.
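A common way to keep as-you-type search responsive without flooding the backend is to debounce keystrokes, issuing only the query the user settles on. A minimal sketch; `search_fn` and `on_results` are assumed callbacks, not actual Recall APIs.

```python
import threading

class LiveSearch:
    """Debounce keystrokes so only the latest query reaches the index.

    search_fn runs the query; on_results receives its output; delay is
    the quiet period (seconds) before a pending query is issued.
    """
    def __init__(self, search_fn, on_results, delay=0.15):
        self.search_fn = search_fn
        self.on_results = on_results
        self.delay = delay
        self._timer = None

    def on_keystroke(self, query):
        # Cancel any pending query; the user is still typing.
        if self._timer is not None:
            self._timer.cancel()
        self._timer = threading.Timer(
            self.delay, lambda: self.on_results(self.search_fn(query))
        )
        self._timer.start()
```

Results still appear to evolve with each pause in typing, but intermediate fragments like "bl" and "blu" never trigger a full semantic query.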
We designed loading states that communicated activity clearly without exposing unnecessary technical detail. Subtle motion and progress indicators reassured users that the system was working. The interface needed to remain calm and composed even when the backend required time.
The challenge was balancing transparency with simplicity. Users did not need to understand embeddings or indexing pipelines. They needed confidence that the system would respond reliably.
We operated in a tight designer-engineer loop throughout development. As models improved, scope expanded. When performance constraints appeared, we recalibrated. The UI and model evolved together through trade-offs. The final experience reflects that constant negotiation between ambition and feasibility.
Trust and privacy framing were parallel workstreams. Recall includes opt-in controls, local-only messaging, and indexing transparency. While that design effort is a project in itself, the search interface had to reinforce those trust decisions visually and behaviorally.

Separating visual and text results improved comprehension significantly in usability studies. Users could articulate why a result appeared and whether it matched their intent. The feature began to feel predictable rather than mysterious.
Participants reported rediscovering information they assumed was lost, from code snippets to documents viewed weeks earlier. While the technology powered that capability, the clarity of the interface made it usable.
Beyond the feature itself, Recall influenced broader design patterns for AI in Windows. It reinforced the importance of explaining relevance, hiding unnecessary complexity, and collaborating closely with engineering throughout the process. The project demonstrated that advanced AI features can feel approachable when transparency and interaction design are treated as first-class concerns.
Recall was not just a feature. It was a shift in how Windows thinks about memory.