Intent-grounded browsing dataset for evaluating Edge Journeys quality
Edge Journeys is an AI-powered browser feature in Microsoft Edge's Copilot Mode that transforms browsing history into task-themed clusters, helping users resume and continue their work without starting over. The feature surfaces up to 3 Journey cards on the New Tab Page (NTP), each with a title, preview image, and suggested next-step action.
This dataset provides intent-grounded evaluation data — browsing sessions where we know exactly what the user was trying to do. This lets us objectively measure whether a generated Journey "got it right" instead of relying on heuristic metrics alone.
The dataset was collected using an LLM-driven browsing agent (claude-opus-4.6-1m) that role-plays as different user personas, making reactive browsing decisions based on actual page content — not scripted paths.
Each Layer 1 task is a single-topic browsing session (~10 pages) with a defined browsing goal and ground-truth intent. It tests whether Journeys can generate a good card from relatively clean data.
Each profile is a multi-topic, multi-day browsing history (~100 pages) that combines 3-4 Layer 1 tasks with 60-70% background noise (email, news, social media). It simulates what a real user's browsingHistory looks like in production.
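The proportions above are internally consistent: 3-4 tasks of ~10 pages each give 30-40 task pages, which leaves 60-70 noise pages in a ~100-page history. The sketch below illustrates that arithmetic by mixing task visits with background noise. It is not the dataset's actual generation pipeline or schema; the names `Visit`, `Task`, `build_profile`, and `noise_fraction`, and the example URLs, are hypothetical and chosen only for illustration.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Visit:
    url: str
    topic: str  # a task's intent label, or "noise" for background browsing


@dataclass
class Task:
    intent: str  # ground-truth intent for the session (hypothetical field name)
    visits: list[Visit] = field(default_factory=list)


def build_profile(tasks, noise_pool, noise_fraction=0.7, seed=0):
    """Mix task visits with noise so that noise is about `noise_fraction` of the history."""
    rng = random.Random(seed)
    task_visits = [v for t in tasks for v in t.visits]
    # Solve noise / (noise + task) = noise_fraction for the number of noise visits.
    n_noise = round(len(task_visits) * noise_fraction / (1 - noise_fraction))
    history = list(task_visits)
    for _ in range(n_noise):
        # Insert each noise visit at a random position; the relative order
        # of the task visits is preserved.
        history.insert(rng.randint(0, len(history)), rng.choice(noise_pool))
    return history


# Three single-topic tasks of 10 pages each (made-up intents and URLs).
tasks = [
    Task(intent=name, visits=[Visit(f"https://example.com/{name}/{i}", name) for i in range(10)])
    for name in ("plan-trip", "compare-laptops", "learn-rust")
]
noise_pool = [Visit(f"https://example.com/{kind}", "noise") for kind in ("email", "news", "social")]

history = build_profile(tasks, noise_pool, noise_fraction=0.7)
noise_share = sum(v.topic == "noise" for v in history) / len(history)
print(len(history), round(noise_share, 2))
```

With three 10-page tasks and a noise fraction of 0.7, the sketch produces a 100-entry history of which 70 entries are noise. Using four tasks with a fraction of 0.6 also lands at 100 entries, which matches the stated ranges.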