Anthropic released a beta version of Claude 3.5 Sonnet that can navigate a computer screen and execute clicks. The model interprets visual pixels to operate a cursor and type text. This shift toward agentic action allows the AI to handle complex multi-step workflows. Developers can now automate repetitive software tasks via API.