Google DeepMind integrated computer use capabilities into Gemini 3.5 Flash, allowing the model to interact directly with desktop interfaces. The system perceives screens through screenshots and executes precise mouse and keyboard actions. This update transforms a standard LLM into a functional agent. Developers can now automate complex, multi-step software workflows via API.