OpenAI is releasing a research preview of an AI agent called Operator, that can pretty much perform all the tasks for you on the web. Imagine how convenient it would be to have an AI agent who can not only browse the web for you but also order groceries, book a ride and fill out forms for you. OpenAI in its blog has mentioned that Operator is an intelligent browser-based assistant which will save time and make online tasks super easy for users.
What is Operator?
Operator is an AI agent that can browse the web and perform tasks for you. In the blog the company mentions that using its own browser, Operator can see web pages even through screenshots, interact with them by clicking, typing and scrolling. It can even correct itself if it makes a mistake. Operator can handle a lot of repetitive tasks such as ordering groceries, reserving a dinner table.
The Operator AI agent research preview is currently available in the US. It has plans to expand to more users and integrate the agent into ChatGPT in the future.
What all can the Operator do?
Operator can perform some of the simplest tasks for you such as registering or signing up for a service on the web. It can restock your grocery by browsing grocery platforms like Instacart. It can also book rides for you and reserved dinner tables. It can also work on multiple tasks simultaneously.
Notably OpenAI has partnered with companies like DoorDash, Instacart, Uber to help Operator perform the real world tasks for you.
How does it work
Operator uses a powerful AI model called the Computer-Using Agent (CUA). This model allows it to understand and interact with the buttons, menus, and text fields on your screen, just like a human would. It uses screenshots to figure out what’s on a webpage and performs actions like clicking, typing, or scrolling with a virtual mouse and keyboard.
OpenAI can also fix its own mistakes. It tries using its reasoning skills if it encounters a problem. For tasks involving using passwords or payments, it asks the user to perform them.
Operator will available to Pro users in the US currently. It will be available to the Plus, Team and Enterprise users and integrate it into ChatGPT in the future
Discover the latest Business News, Sensex, and Nifty updates. Obtain Personal Finance insights, tax queries, and expert opinions on Moneycontrol or download the Moneycontrol App to stay updated!
Find the best of Al News in one place, specially curated for you every weekend.
Stay on top of the latest tech trends and biggest startup news.