Check the calendar, research the news, and create a briefing document — all three done with a single prompt. The ChatGPT Agent that OpenAI first unveiled in July 2025 became a completely different product in March 2026 when it was equipped with GPT-5.4. Operator, which had been running as a separate service, is set to be discontinued, and Deep Research’s analytical power has been absorbed into the agent. It’s now truly an "all-in-one" AI.

3-Second Summary
Powered by GPT-5.4 Operator + Deep Research merged Browser, terminal, APIs freely used in virtual computer Multi-step tasks completed in one prompt

What Is This?

Let me quickly walk through the timeline. In January 2025, OpenAI launched Operator — a service where AI directly controls a browser. Around the same time, there was Deep Research — a research-specialized feature that reads dozens of sources and creates comprehensive reports. The problem was that these two operated completely separately.

Operator was great at clicking and scrolling websites but clumsy at reading and analyzing long documents. Deep Research was a master analyst but couldn’t even access sites that required login. In July 2025, OpenAI merged both teams to create ChatGPT Agent.

Then on March 5, 2026, GPT-5.4 launched and ChatGPT Agent entered an entirely new phase. GPT-5.4 is OpenAI’s most powerful frontier model. It’s the first to unify reasoning, coding, and agent workflows in a single model, and the first to ship simultaneously to ChatGPT, API, and Codex.

The key change is native Computer Use. GPT-5.4 can directly interpret screens and control mouse and keyboard to automate complex workflows. Text browser, visual browser, terminal, API integrations — all tools share a single state and switch seamlessly.

75%
OSWorld benchmark (human avg: 72.4%)
83%
GDPval expert comparison match rate
272K
Default context window (up to 1M)

The inside story shared by OpenAI researchers on the Sequoia Capital podcast is impressive. The team that built the agent was surprisingly small. 3–4 Deep Research researchers, 6–8 Operator researchers, and an applied engineering team. This small group used reinforcement learning (RL) to train across thousands of virtual machines on diverse tasks. The key was that "they didn’t prescribe tool-use patterns — they let the model find the optimal strategy on its own."

Operator Being Discontinued

With the virtual browser now built into ChatGPT Agent, the separately operated operator.chatgpt.com is set to be discontinued within weeks. Existing Operator users can simply switch to ChatGPT’s agent mode with no additional steps needed.

What Changes?

The previous ChatGPT Agent (July 2025) could already manipulate browsers. But what changed with GPT-5.4 is that it went from "can do it" to "does it well."

Before (GPT-5.2 based) Now (GPT-5.4 based)
Reasoning Model GPT-5.2 Thinking + o3 GPT-5.4 Thinking single model
Desktop Control Primarily web browser Native computer use (mouse + keyboard)
Expert-Level Tasks 70.9% match across 44 professions 83.0% match across 44 professions (GDPval)
Coding Basic code generation GPT-5.3-Codex-level coding + frontend polish
Context Limited 272K default, up to 1M tokens
Spreadsheets/PPT Basic generation Directly creates and edits editable files
Thought Process Black box Shows thinking plan upfront, user can adjust

The OSWorld-Verified benchmark is symbolic. It measures AI’s ability to perform tasks in a real desktop environment, and GPT-5.4 scored 75% — surpassing the average human score of 72.4%. It’s the first benchmark proof that "AI can use a computer better than humans."

Sequoia Capital’s analysis nails it — they called it "1+1=3." Combining Operator’s visual web manipulation with Deep Research’s text analysis and synthesis lets it accomplish things neither could do alone. For example, if you say "analyze 3 competitors and make a slide deck" — it browses websites and collects data (Operator capability), synthesizes the collected information (Deep Research capability), and generates an editable presentation file (new GPT-5.4 capability) — the entire process runs as one.

Things to Know

While GPT-5.4 is a major improvement, agent tasks still take 5–30 minutes. It can still stumble on simple UI elements like datepickers, and adapting to different website layouts remains a challenge. Always manually verify high-stakes actions like payments or sending emails.

The Essentials: How to Get Started

  1. Check your plan
    GPT-5.4 Thinking is available to ChatGPT Plus ($20/mo), Team, and Pro ($200/mo) users. Plus gets 40 agent runs/month, Pro gets 400. GPT-5.4 Pro (highest performance) is exclusive to Pro and Enterprise.
  2. Enter agent mode
    Select "agent mode" from the tool dropdown at the bottom of the ChatGPT chat, or type /agent. You can switch mid-conversation at any time.
  3. Connect app connectors
    Link Google Calendar, Gmail, Google Drive, GitHub, etc. to enable personalized tasks like "Check my calendar and brief me on next week’s meetings."
  4. Assign your first task
    Research + organization combos have the highest success rate. Try things like "Find 5 AI news stories from this week and create a summary table" or "Compare pricing for competitors A/B/C and organize it in a spreadsheet."
  5. Automate recurring tasks
    Click the clock icon on a completed task to schedule daily/weekly/monthly repeats. Try automating "Every Monday morning, competitor news briefing."