Google’s Gemini 2.5 AI set to control web interfaces, moving toward full computer automation

by China Urdu

Google has introduced a preview version of its latest artificial intelligence model, Gemini 2.5 Computer Use, designed to perform tasks on web browsers much like a human user. The system can move the mouse cursor, click buttons, fill out forms, scroll through pages, and type text marking a major step toward giving AI direct control over computers.

According to Google, developers can now use the new Gemini 2.5 Computer Use API to create agents capable of interpreting user interfaces through screenshots and translating commands into visual actions such as clicking, dragging, and typing. At this stage, the AI is primarily optimized for web browsers and cannot yet operate independently across full desktop environments.

Google emphasized that the model includes strong safety protocols. When faced with sensitive actions, such as sending an email or making a financial transaction, the AI will first request explicit confirmation from the user before proceeding.

The development signals a shift in artificial intelligence from conversational chatbots to more advanced “AI agents” capable of performing practical digital tasks. These agents can act beyond the chat window to execute real-world operations in software systems, similar to how a human assistant would navigate a computer.

Google claims that Gemini 2.5 Computer Use performs better than competing systems in terms of precision and speed, especially when interacting with web and mobile applications. However, the company admits the model is not yet advanced enough to control an entire operating system or handle file management and system configurations.

Despite its promise, the new model raises significant concerns about potential misuse and security vulnerabilities.

Google has acknowledged risks such as malicious use, unexpected behaviors, data fraud, and manipulation within online environments. To minimize these threats, developers can limit the AI’s permissions and must require user authorization for sensitive commands.

The Computer Use model remains in its experimental phase and may still produce errors. Google has advised developers and users to avoid applying it to high-risk or mission-critical operations without supervision.

Currently, the model is available to developers through Google AI Studio and Vertex AI. This rollout reflects Google’s broader strategy to advance autonomous AI agents capable of performing complex workflows, a move that could redefine how people interact with technology in the near future.

China Urdu

+ posts

Google’s Gemini 2.5 AI set to control web interfaces, moving toward full computer automation

China Urdu

Latest

Arshad Nadeem resumes training after injury for World Islamic Games

Pakistan hockey team to tour Spain ahead of Pro League

Cricket Australia approves BBL privatisation plan

PSB opens inquiry into controversial ban on Arshad Nadeem’s coach

Pakistan, Iran move to ease cross-border trade, Says Abdul Aleem Khan

Pakistan to revamp PARC under china-backed reform plan

Govt slashes smart meter prices to accelerate digital power reforms

Punjab launches mobile app for prison visit bookings

Saudi Arabia cuts Umrah visa entry deadline to 30 days

SC stops Peshawar Cantonment Board from issuing tax notices on evacuee trust property

ATC issues notice to police in case against journalist Matiullah Jan

Smog crisis deepens in Punjab as Faisalabad tops global pollution index

About Us

Vital Infos

From China Urdu

Editor Picks

Arshad Nadeem resumes training after injury for World Islamic Games

Pakistan hockey team to tour Spain ahead of Pro League

Cricket Australia approves BBL privatisation plan

Google’s Gemini 2.5 AI set to control web interfaces, moving toward full computer automation

RELATED ARTICLES

Vital Infos

From China Urdu

Editor Picks