Computer Use Agents

Computer use agents can see a screen and control a computer (clicking, typing, scrolling, and navigating any application) without needing an API. This unlocks automation for legacy systems, web UIs, and anything else that lacks a formal interface, but it also introduces security and reliability requirements that demand careful architecture.
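The see-then-act cycle described above can be sketched as a small loop. This is a minimal illustration, not a reference implementation: `Action`, `ALLOWED_KINDS`, `run_agent`, and the `capture`/`decide`/`execute` callables are all hypothetical names introduced here, and the screenshot and model are replaced by stubs so the example runs on its own. The step cap and the action allowlist stand in for the kind of reliability and security guardrails a real deployment would likely need.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """Hypothetical action record; a real agent would carry richer data."""
    kind: str      # e.g. "click", "type", "scroll", or "done"
    payload: dict  # action arguments such as coordinates or text


# Assumed allowlist: anything outside this set is rejected before execution.
ALLOWED_KINDS = {"click", "type", "scroll", "done"}


def run_agent(
    capture: Callable[[], bytes],
    decide: Callable[[bytes, List[Action]], Action],
    execute: Callable[[Action], None],
    max_steps: int = 10,
) -> List[Action]:
    """Observe-decide-act loop with a step cap and an action allowlist."""
    history: List[Action] = []
    for _ in range(max_steps):
        screenshot = capture()                 # observe: grab the screen
        action = decide(screenshot, history)   # decide: pick the next action
        if action.kind not in ALLOWED_KINDS:
            raise ValueError(f"disallowed action: {action.kind}")
        history.append(action)
        if action.kind == "done":              # agent reports the task finished
            break
        execute(action)                        # act: perform the click/type/scroll
    return history


# Stubbed usage: a scripted "model" and a recorder instead of real input events.
def fake_capture() -> bytes:
    return b"fake-screenshot"


script = iter([
    Action("click", {"x": 10, "y": 20}),
    Action("type", {"text": "hello"}),
    Action("done", {}),
])


def fake_decide(screenshot: bytes, history: List[Action]) -> Action:
    return next(script)


executed: List[Action] = []
history = run_agent(fake_capture, fake_decide, executed.append)
print([a.kind for a in history])
```

Keeping `capture`, `decide`, and `execute` as injected callables is one way to keep the loop platform-agnostic: the same control flow could wrap a browser driver, a desktop automation layer, or a test double.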

This section covers platform-agnostic computer use concepts and implementations. For Claude-specific computer use — browser automation, desktop pipelines, and Claude Code as a local agent — see Master Claude → Computer Use.

Page built: 01 Jun 2026