India, Oct. 8 -- Google DeepMind officially released the Gemini 2.5 Computer Use model in public preview, a specialised version of Gemini 2.5 Pro built to power AI agents that directly interact with graphical user interfaces (GUIs). This marks a significant move toward creating agents capable of performing complex digital tasks that previously required human-like interaction, tasks like filling out web forms, clicking buttons, and operating behind login screens.

The model is accessible to developers through the Gemini API via Google AI Studio and Vertex AI. Its core purpose is to let agents perform multi-step digital workflows on web browsers and, promisingly, on mobile applications.

While most AI models communicate with software throug...