"Check my calendar and brief me on next week's meetings." "Buy ingredients for a Japanese breakfast for four and have them delivered." "Analyze three competitors and build me a slide deck." — All of this is now something you can just ask ChatGPT to do. The chatbot era is over. AI now opens a browser, clicks, types, and brings you the results.

TL;DR
Give instructions AI operates the browser Navigates/clicks/fills forms Compiles results Confirm & execute

What Is It?

ChatGPT Agent is a feature OpenAI launched in July 2025. The core idea is simple — ChatGPT got its own virtual computer. It can open browsers, use the terminal, run code, and download files. Nearly everything a person does at a computer, AI can now do on your behalf.

It started with Operator in January 2025 — a research preview where OpenAI introduced the concept of "AI operating a browser for you." The model powering it is CUA (Computer-Using Agent). It combines GPT-4o's visual capabilities with reinforcement learning, enabling it to "see" the screen via screenshots and "operate" it with mouse and keyboard.

But Operator had limitations. It was great at navigating the web but couldn't do deep analysis. Deep Research was good at analysis but couldn't click through websites. ChatGPT Agent, released in July, merged both. OpenAI researcher Casey Chu put it best: "These two approaches are actually deeply complementary. Operator struggles with reading long documents, but Deep Research handles that well. Conversely, Deep Research can't deal with interactive web pages, but that's exactly what Operator excels at."

$20/mo
Plus plan (40 uses/month)
5–30 min
Task completion time
65.4%
WebArena benchmark

Here's a breakdown of the tools it can use:

  1. Visual Browser
    It sees websites and interacts with them — clicking, scrolling, dismissing cookie popups, filling out forms, and filtering search results, just like a person.
  2. Text Browser
    A lightweight browser optimized for reading and analyzing long documents quickly. Faster and leaner than the visual browser.
  3. Terminal + Code Execution
    Runs Python scripts, downloads files, and processes data. All within a sandboxed virtual machine.
  4. App Connectors
    Connect Gmail, Google Drive, GitHub, and more to enable tasks like checking email and reviewing your schedule.

Self-Correction

This is a key feature of the CUA model. When it makes a mistake mid-task, it recognizes the error, goes back, and tries again. In DataCamp's testing, the agent was observed misreading a website, navigating back to the previous page, and self-correcting.

What\'s Different?

Until now, AI assistants could only "talk." No matter how smart they were, their limit was telling you "here's how to do it." ChatGPT Agent breaks past that — it's shifted from conversation to execution, from advice to action.

Traditional ChatGPT ChatGPT Agent
Role Provides information, generates answers Directly operates browser + executes tasks
Web Search Summarizes search results Visits sites, clicks, filters results
Data Analysis Writes code for you to run Collects data + analyzes + creates spreadsheets
Booking/Orders Explains how to do it Books directly on the site / adds to cart
Presentations Drafts content outline Creates slides + provides editable files
Control None (output only) Intervene/pause/take over anytime

The partnerships OpenAI is building matter too. They're working with DoorDash, Instacart, OpenTable, Priceline, StubHub, and Uber. The agent can book, order, and buy tickets directly on these platforms. Instacart CPO Daniel Danker called it "a technological breakthrough that makes processes like grocery ordering unbelievably easy."

Benchmark scores are strong too. 65.4% on WebArena, 41.6% on Humanity's Last Exam (SOTA), and 89.9% on the data analysis benchmark DSBench — beating the human score of 64.1%.

That said, it's not perfect yet. Timothy B. Lee of Understanding AI ran a grocery test — the agent correctly added 15 out of 16 items to the cart but forgot the onions, and a security monitor blocked the login. In DataCamp's test, it successfully collected UNESCO data from 222 countries but forgot the instruction to create a summary tab.

Good to Know

ChatGPT Agent is still in its early stages. Reviews note that complex design tasks (like creating an image collage in Canva) can take over 75 minutes with underwhelming results. For sensitive actions like payments or sending emails, always verify personally. It's designed to refuse high-risk tasks like banking transactions. It excels at research and data collection but is still weak at visual/design work.

Quick Start Guide

  1. Subscribe to ChatGPT
    Choose between Plus ($20/month, 40 uses) or Pro ($200/month, 400 uses). Starting with Plus is recommended.
  2. Activate Agent Mode
    In the ChatGPT chat window, open the tools dropdown and select "agent mode," or type /agent.
  3. Assign Your First Task
    Start simple. Research tasks work great — try "Check the weather in Seoul this week and summarize it" or "Compare pricing for competitors A, B, and C in a table."
  4. Connect App Connectors (Optional)
    Link Gmail, Google Calendar, and more to enable tasks like "Check my calendar and show me free slots this week."
  5. Automate Recurring Tasks
    After completing a task, click the clock icon to schedule it daily, weekly, or monthly. Try setting up routines like "Summarize competitor news every Monday morning."