Agent UX design patterns are reusable interface rules that let people delegate work to AI systems that plan, use tools, and act on their behalf while keeping control over risk. Between May 19 and June 12, 2026, four major launches proved those patterns are no longer optional: Gemini Spark (Google I/O), Microsoft Scout (Build), Claude Cowork (Anthropic), and Siri agentic actions via App Intents (WWDC). Each product chose a different surface (inbox, Teams taskbar, desktop IDE, system intents), but every credible agent UI needs the same five families: confirmation gates, streaming state, trust indicators, error recovery, and notifications. This checklist maps those families to what each vendor shipped so you can design or evaluate any human-agent interface in 2026.

Key takeaways:

  • Five pattern families cover every agent UI: confirmation before action, visible progress while working, trust signals, recovery paths, and notification discipline.
  • Surface choice matters: Spark lives in Gmail and Workspace, Scout in Teams and Outlook, Cowork in the Claude desktop IDE, Apple in system-wide Siri intents.
  • Risk-tiered gates are non-negotiable: Spark and Cowork both pause before emails, spending, or destructive file operations.
  • OS-level status (Android Halo) sets the bar for background agents that run when the laptop is closed.

What agent UX patterns are

An AI agent, in Nielsen Norman Group's 2026 definition, is a system that pursues a goal by iteratively taking actions, evaluating progress, and deciding its own next steps. That loop is what separates agents from chatbots that reply once and stop. Agent UX (sometimes called agentic experience or AX design) is the interface layer that makes that loop legible: what the agent plans, what it is doing now, what it already changed, and how a human can stop or reverse it.

The design job is not to hide autonomy. It is to make autonomy bounded, reviewable, and reversible. NN/G adds the evaluation lens practitioners should use: watch where users lose trust, have to correct the agent, or abandon the task entirely. (Source: NN/G)

Google's Josh Woodward noted at I/O 2026 that more than 900 million people across 230 countries now use the Gemini app monthly. That scale means small UX failures in agent flows multiply into millions of mistaken sends, deleted files, or ignored background tasks. (Source: Google Gemini Spark)

What shipped: four surfaces, four philosophies

The May-June 2026 cluster is useful because each vendor optimized for a different "home screen" for agency.

Gemini Spark, announced May 19 at Google I/O, is a 24/7 personal agent built on Gemini 3.5 and the Antigravity harness. It runs on dedicated Google Cloud virtual machines, so it keeps working when you close your laptop. Spark integrates natively with Gmail, Docs, Slides, and Calendar, and users can email Spark through a dedicated Gmail address. For mobile visibility, Google previewed Android Halo, a system-level indicator at the top of the screen that shows task progress without forcing an app switch. (Sources: Google Gemini Spark, Android Halo)

Microsoft Scout, debuted June 2 at Build in San Francisco, is a work-focused personal agent built on OpenClaw and WorkIQ. Scout lives alongside Teams and Outlook, proactively handling scheduling conflicts and meeting prep without an explicit prompt. At launch it was available to Frontier customers only. (Source: The New Stack) For architecture context, see our Scout and OpenClaw coverage. Spark's inbox-first model is unpacked in Gemini Spark as a personal agent.

Claude Cowork ships inside the Claude desktop app as a third mode beside Chat and Code. Anthropic positions it around outcomes, not prompts: you describe the deliverable, Cowork decomposes steps, and loops you in before anything significant. Folder and connector permissions fence what Cowork can touch. (Source: Claude Cowork)

Apple's WWDC 2026 Siri push is developer-facing but sets consumer UX expectations. App Intents schemas let apps expose entities and actions Siri can run through natural language, validated early via the AppIntentsTesting framework without UI automation. Susan Prescott, Apple's VP of Worldwide Developer Relations, said Xcode 27's agentic coding gives developers "the tools they need to focus on what they do best." (Source: Apple Newsroom)

The pattern framework: pre-action, in-action, post-action

Smashing Magazine's 2026 lifecycle model groups agent UX into three phases and six patterns. You do not need all six on every screen, but you need coverage across the lifecycle:

  • Pre-action: Intent Preview (plan summary before execution) and Autonomy Dial (per-task permission level).
  • In-action: Explainable Rationale (why the agent chose this path) and Confidence Signal (certainty indicators).
  • Post-action: Action Audit and Undo (chronological log with rollback) and Escalation Pathway (handoff to a human when stuck).

Map those six onto the five families this article emphasizes. Confirmation gates and autonomy dials are pre-action. Streaming state and explainable rationale are in-action. Error recovery, audit logs, and escalation are post-action. Notifications span all three when agents run in the background. Google documents Spark's opt-in connectors as pre-action scope; Cowork's plan preview is the same phase in a desktop context. (Sources: Google Gemini Spark, Claude Cowork)

Confirmation gates: when agents must pause

The Intent Preview is the highest-leverage confirmation pattern. Before irreversible, financial, or outbound actions, the agent shows a plain-language plan with explicit choices: proceed, edit, or handle it yourself. Inference: practitioner frameworks suggest tracking plan acceptance and override rates, but most May-June 2026 shipping products have not published those metrics publicly.

Vendors are converging on high-stakes gates even when they differ on low-stakes automation.

Woodward wrote that Spark "is designed to ask you first before performing high-stakes actions like spending money or sending emails." Cowork's product page states it "shows you what it plans to do and waits for your approval" before significant actions, with explicit folder scoping so it cannot read outside allowed paths. Scout's proactive posture (resolving conflicts without being asked) raises the stakes for confirmation UX: users need preview surfaces before Scout acts on calendar or email, not only after. (Sources: Google Gemini Spark, Claude Cowork, The New Stack)

PatternGemini SparkMicrosoft ScoutClaude CoworkApple Siri (App Intents)
Primary surfaceGmail / Gemini appTeams / Outlook taskbarClaude Desktop IDESystem-wide Siri + host app
Confirmation gateAsk before spend or send emailProactive; preview TBD at scalePlan preview before significant actionsIntent perform + user confirmation in app
Permission modelOpt-in app + MCP connectionsWorkIQ enterprise contextFolder + connector allowlistApp schema + on-device permissions
Background executionCloud VM (laptop can close)Enterprise M365 graphDesktop sandboxOn-device + Private Cloud Compute
Progress visibilityAndroid Halo (later 2026)Teams notificationsReal-time step stream in IDEShortcuts / Siri feedback
RecoveryInference: audit via Gmail threadInference: Teams activity logRedirect or refine at any stepAppIntentsTesting + rollback in app

Streaming state: making background work visible

When agents run 24/7 on cloud VMs, invisible work is the default failure mode. Android Halo puts a subtle indicator at the top of the screen when Spark or other supported agents are on a task, in live mode, or sending a message. You see progress from any app without context switching. (Source: Android Halo)

Cowork instead streams steps inside the Claude desktop IDE, where you can watch in real time or return later to a finished deliverable. Scout surfaces progress through Teams and Outlook notifications. Inference: enterprise agents will standardize on the host app's feed before OS-wide indicators are universal.

Trust indicators users actually need

Trust is not a badge. It is a stack of legible signals.

Explainable rationale answers "why did the agent do that?" in user language, grounded in stated preferences. Confidence signals expose uncertainty before the user commits. Opt-in connectors make scope visible: Spark requires you to turn it on and choose connected apps; Cowork limits reads to folders you select.

Apple's approach shifts part of the trust burden to developers. AppIntentsTesting lets teams validate intents through system pathways before users ever talk to Siri, reducing "Siri did the wrong thing in my app" failures. That is trust through pre-ship verification, complementary to consumer-facing confidence UI. (Source: Apple Newsroom)

For Microsoft's enterprise trust layer (identity, policy, audit), see Microsoft's agent trust stack from Build 2026. Platform-level Siri and Gemini positioning is covered in Google, Apple, and Siri agent strategy.

Error recovery and escalation

Post-action patterns prevent one bad step from becoming a bad week. Action audit logs with undo are the minimum bar for file, calendar, and messaging agents. Escalation pathways hand control back when confidence drops or stakes rise. NN/G's usefulness criteria require agents that adapt when steps fail and minimize supervision without eliminating it. When users repeatedly correct the same failure mode, the UX fix is usually clearer previews or tighter permissions, not a smarter model. (Source: NN/G)

Apple's AppIntentsTesting pushes recovery left into development. Consumer agents like Spark and Cowork must still ship in-product undo for the actions users feel in Gmail or local folders. (Source: Apple Newsroom)

Operator note (first-hand)

AgenticWire audited the four May-June 2026 launches using only public product pages. Score each surface on five columns (confirmation gate, streaming state, trust signal, recovery path, notification discipline) with 0 = not documented, 1 = partial, 2 = explicit. Reproduce it: open the Google Gemini Spark blog, Anthropic Cowork product page, The New Stack Scout article, and Apple Newsroom WWDC post; list every user-visible pause, progress indicator, and undo path. Spark and Cowork scored 2 on confirmation gates. Android Halo scored 2 on mobile streaming state. Scout scored 1 on confirmation at Frontier launch. Apple scored 2 on pre-ship recovery via AppIntentsTesting. Any column at 0 is a ship blocker for that risk tier.

Frequently asked questions

What are agent UX design patterns?

Agent UX design patterns are reusable interaction rules for systems that plan, call tools, and act on a user's behalf. They cover how agents ask permission, show progress, explain decisions, recover from errors, and notify users without surprise. The May 2026 launches from Google, Microsoft, Anthropic, and Apple each implemented different surfaces but shared the same underlying needs.

How should AI agents ask for confirmation before acting?

Show a plain-language Intent Preview before irreversible, financial, or outbound actions. Offer proceed, edit, and opt-out paths. Spark asks before spending money or sending email; Cowork waits for approval before significant file or app actions. Match gate strictness to risk: low-stakes summaries can auto-run; deletes and sends should never surprise users.

What is the best way to show agent progress to users?

Match visibility to execution context. Background cloud agents need ambient indicators (Android Halo at the OS level). Desktop file agents benefit from step streams inside the workspace (Cowork in Claude Desktop). Enterprise agents can use native collaboration notifications (Scout in Teams) until OS-wide standards mature.

How do Gemini Spark, Scout, and Cowork differ in UI design?

Spark is inbox- and Workspace-centric with 24/7 cloud execution and Gmail as an input channel. Scout is Teams- and Outlook-centric with proactive enterprise task handling. Cowork is a desktop IDE mode built around outcome delegation, folder sandboxes, and plan previews. Spark optimizes personal digital life; Scout optimizes work graphs; Cowork optimizes local file deliverables.

What trust indicators do users need for autonomous agents?

Users need scoped permissions (opt-in connectors and folders), plain-language rationale for non-obvious steps, confidence cues when the agent is unsure, and auditable action history with undo. Developer platforms like Apple's App Intents add pre-ship testing trust; consumer agents must still earn trust in the Gmail thread or Downloads folder where mistakes hurt.

References