Intermediate
How Computer Use Works
The vision and action loop: screenshot, decide, click or type, observe, repeat.
What You Will Learn
- The tool set: computer (screenshot), bash (run commands), text_editor (file editing)
- Loop: take screenshot → Claude decides action → execute → repeat
- Action types and their API representations
- How Claude reasons about what it sees on screen
- Latency and token cost of vision-based control loops
This page is under development. Content is being added progressively. Check back soon for the full article.