§ 00 / WINDOWS RECALL

Designing semantic search for everything you’ve seen.

FIG. 0.1
Recall app on a Windows desktop, search results page with multiple match cards
Recall on the Windows desktop. Semantic search surfaced as moments, not documents.
ROLE
Senior UX/Product Designer
PLATFORM
Windows 11 (Copilot+ PCs)
TIMELINE
2023 – 2025
TEAM
Cross-functional team of design, research, and ML engineering
MY FOCUS
Semantic search experience: ranking, relevance, and trust
STATUS
Shipped at Build 2024; reshaped post-launch around privacy
§ 01 / CONTEXT

Where was that thing I saw last week?


The problem was simple and unsolved. You'd seen something on your computer: a presentation, a snippet of code, a reference in an email. But you couldn't find it. You'd try different search terms, retrace your steps, open files one by one. Minutes wasted. Information you knew existed but couldn't retrieve.

Recall aimed to capture everything appearing on screen and make it searchable through meaning rather than filenames. But solving that technically wasn't the real challenge. We had to build something users actually trusted.

My role focused specifically on the semantic search experience: how users search their memories, how results are ranked and displayed, and how relevance is communicated in a way that feels understandable and trustworthy.

Recall mark. A glowing blue rotating-arrow glyph centered on a faint capture grid.

§ 02 / 09

System

§ 02 / SYSTEM

On demand intelligence


RECALL QUERY LIFECYCLE

  1. 04 · SEARCH & INDEX SERVICE

    Stores enriched content in a semantic index and retrieves it by matching the meaning of a query against similarity scores, finding the right files even when the exact words don't match.

  2. 03 · MEANING ANALYSIS

    Interprets raw extracted text by running meaning analysis to identify concepts and intent, then maps those concepts to semantic vectors for downstream understanding.

  3. 02 · OCR PROCESSING

    Converts raw image input into structured text by straightening and denoising the image, identifying characters through optical recognition, and reconstructing the output into formatted, flowing text.

  4. 01 · SCREENSHOT CAPTURE

    Continuously monitors and captures the screen at intervals, saving raw visual snapshots as the source material that enters the processing pipeline.

Isometric exploded view of the Recall query lifecycle. Four stages stacked from Search & Index Service down to Screenshot Capture.
§ 03 / PROBLEM

Search wants precision. Memory offers fragments.


File search is built on certainty. You give it a filename or keyword. It matches exactly. Done.

But that's not how memory works. You remember a chart, maybe the color was blue, maybe it was from an email or a browser. Nothing precise enough for traditional search to grab hold of.

Recall flipped this. Instead of requiring exact queries, it indexed everything the system could see and made it searchable through semantic understanding.

If you're indexing everything, how do you let people search without overwhelming them? The system needed to think like a person, not force people to think like the system.

Gmail showing an OpenTable reservation confirmation, with extracted Recall chips below: reservation, The Front Room, July 16 2023, OpenTable, Menu, Confirmation 25564, Portland ME
§ 04 / CARDS

Cards as moments, not documents


Six Recall cards in a 3-by-2 grid. Each anchored on a desktop screenshot with timestamp and app metadata secondary.

A Recall card needed to hold several pieces of information: a screenshot, app name, timestamp, extracted text, and relevance signals.

We built around the screenshot as the primary anchor. Not a cropped asset preview, but the actual desktop as it appeared. That context is what lodges in memory.

App name, timestamp, and extracted text stayed visible but secondary. The hierarchy pushed away from “found document” and toward “revisited moment.”

The result is something between a timeline and a search interface: visual enough to scan like memory works, structured enough to act predictably.

§ 05 / TRANSPARENCY

AI-powered search has a trust problem


Technically correct results can feel wrong. Search for “blue chart spreadsheet” and the system might return something from an unrelated app that simply had a blue element.

We didn't try to eliminate every false positive at the model layer. Instead, we made results understandable.

Every card explained how it matched. Text matches were labeled as text matches. Visual matches were labeled as visual matches. Separating and showing these signals let users judge relevance themselves.

Perfection wasn't the goal. Legibility was.

Recall search results for the query 'Catering'. Match cards labeled with source app and match-type signals.
§ 06 / CARDS

Merged results killed clarity


First version blended everything. Text matches and visual matches went into one ranked list. Clean, elegant, simple.

Testing proved it didn't work. Text matches dominated the ranking. Visual matches got buried. Users couldn't figure out why something appeared.

We split text and visual matches into separate sections. That single structural change made the system's logic transparent. Cognitive load dropped. Trust went up.

The AI was technically correct even in the first version. But opaque correctness erodes trust faster than transparent mistakes.

Recall search results for ‘Presentation with a red barn’. Visual matches separated into a 4×3 grid of close-match cards with source domains and timestamps.

We made waiting feel like progress.

§ 07 / PERFORMANCE

Fast enough to feel alive


Embedding indexing is computationally heavy, so we tuned retrigger cadence to refine results per keystroke without ever blocking the user.

PER KEYSTROKE

200MS WINDOW

NO BLOCKING WAIT

§ 08 / 09

Trust

§ 08 / TRUST

Privacy was the whole product


Recall captures everything. That only works if people trust where the data sits, who can see it, and what control they actually have.

On-device processing wasn't optional. All capture, all indexing, all retrieval happened locally. Nothing left the machine.

When public scrutiny hit, we made the call to flip Recall from opt-out to opt-in, ship per-app exclusion, and give users a pause control. I owned the IA for the new privacy surface.

We made those boundaries tangible. Cards showed where results came from and when. Excluded content got explicit explanation instead of silent gaps. Trust wasn't a single setting. It lived in every interaction.

Recall card menu showing sensitive content (a credit-card snapshot from Fidelity) with a Delete snapshot actionSnapshot removed confirmation modal with an option to update Recall capture settings to block specific apps and websites
DATA STORAGE
Everything stays on-device. Local capture, local indexing, local retrieval. Nothing leaves the machine.
USER CONTROL
Opt-in by default. Users can exclude apps, pause indexing, and delete any memory at any time.
TRANSPARENCY
Every card shows where the result came from and when. System boundaries are visible, never hidden.
DELETION
Delete individual memories or wipe everything. No hidden caches. No silent retention.
§ 09 / CONSTRAINTS

We killed RAG to keep search fast


Early on we explored RAG synthesis across screenshots, but latency broke the core expectation that search should feel instantaneous. We abandoned synthesis for speed and legibility. Surface relevant moments, let people interpret them.

The system helps people rediscover what they saw. It doesn’t rewrite their history for them.

§ 10 / IMPACT

From rediscovery to reference pattern


Separating visual and text matches proved it wasn't just philosophy. In testing, users could explain why each result appeared and quickly reject things that didn't fit. Mysterious AI behavior became rational.

People recovered information they'd written off as lost. A code snippet they saw once in documentation. Files from months ago they'd forgotten existed.

The work rippled further. Privacy and trust patterns from Recall became reference points across Windows teams.

The search approach itself became foundational. Principles of relevance transparency and match-type separation showed up in Windows Search and File Explorer updates.

FIG. 10.1
Recall introduced on stage at Build. System architecture diagram (Screen Region Detector, Optical Character Recognition, Parser, Text Encoder, Image Encoder) framing the Recall pill.
Recall announcement
Copilot+
DEVICES
Local intelligence required for on-device indexing
WS + FE
TEAMS ADOPTING
Windows Search and File Explorer adopting the patterns
2
YEARS
Shipping cycle, including privacy redesign

NEXT PROJECT

Teams for Education

READ