Google DeepMind integrated computer use capabilities into Gemini 3.5 Flash, allowing the model to interact directly with desktop interfaces. The system perceives screens via screenshots and executes keyboard and mouse actions. This shift moves the model from a text-based assistant to an active agent capable of navigating software for users.