An AI model just found a security vulnerability that had been hiding for 27 years, in an operating system security experts had reviewed for decades. Another flaw it caught had survived 5 million rounds of automated testing. And the company that built the model decided not to release it.
What Is This?
On April 7, 2026, Anthropic announced Project Glasswing. The story is simple — Anthropic's latest frontier model Claude Mythos Preview has surpassed the best humans at finding and exploiting software vulnerabilities, and they decided not to release it publicly.
Mythos achieved this without any specialized cybersecurity training — purely through improved coding and reasoning abilities. It scored 93.9% on SWE-bench Verified (vs. Opus 4.6's 80.8%) and 77.8% on SWE-bench Pro (vs. 53.4%). As Theo put it — "Mythos is to Opus what Opus is to Sonnet."
The specific results are what's truly alarming. On Firefox exploit generation, where Opus 4.6 managed 2 working exploits out of hundreds of attempts, Mythos hit 181. Here's what it found:
- 27-year-old OpenBSD bug
  Found a remote crash vulnerability in one of the world's most security-hardened operating systems. Just connecting to the machine was enough to take it down.
- 16-year-old FFmpeg vulnerability
  Caught a flaw in the near-ubiquitous video encoder that automated testing tools had exercised 5 million times without flagging.
- Linux kernel privilege escalation chain
  Autonomously found and chained multiple vulnerabilities to escalate from regular user access to complete machine control.
All of this happened fully autonomously, without any human steering. Mythos read the code, found the vulnerabilities, and developed the exploits on its own.
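The FFmpeg result hints at why this is hard for existing tooling: coverage-guided fuzzers can execute a code path millions of times without ever satisfying the precise precondition that triggers the bug. Here's a minimal toy sketch in Python (a hypothetical parser, nothing to do with FFmpeg's actual code) where the crash is guarded by a semantic relationship between fields that random mutation almost never produces, but that a reader who understands the code can construct directly:

```python
import random

def parse_header(data: bytes) -> str:
    """Hypothetical parser: the bug fires only when a declared length
    field exactly equals a checksum of the payload, a relationship
    random byte mutation almost never produces by accident."""
    if len(data) < 4:
        return "too short"
    declared_len = int.from_bytes(data[:2], "big")
    checksum = sum(data[2:]) & 0xFFFF
    if declared_len == checksum and declared_len > 0:
        raise RuntimeError("crash: state-dependent bug reached")
    return "ok"

def naive_fuzz(trials: int, seed: int = 0) -> int:
    """Throw random 8-byte inputs at the parser; count bug hits."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        data = bytes(rng.randrange(256) for _ in range(8))
        try:
            parse_header(data)
        except RuntimeError:
            hits += 1
    return hits

print(naive_fuzz(50_000))  # random mutation rarely, if ever, reaches the bug

# Reading the code, the precondition can be solved for directly:
crafted = bytes([0x00, 0x06]) + bytes([1] * 6)  # declared_len 6 == checksum 6
try:
    parse_header(crafted)
except RuntimeError as e:
    print("reasoned input:", e)
```

The gap between the two approaches is the article's point: the fuzzer samples the input space blindly, while a model that actually reasons about the code can derive the triggering input in one step.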
Why Does This Matter?
The key here is an unprecedented decision: an AI company admitting its own model is too dangerous to release. Newton Cheng, Anthropic's Frontier Red Team Cyber Lead, told VentureBeat:
"Given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout — for economies, public safety, and national security — could be severe."
| | Traditional AI Security Tools | Claude Mythos |
|---|---|---|
| Approach | Pattern matching (known vuln DBs) | Reasoning-based (understands code context) |
| Autonomy | Follows human-set rules | Fully autonomous exploration + exploit dev |
| Complex attacks | Detects individual vulnerabilities | Auto-chains multiple vulns together |
| Discovery scope | Within known patterns | Includes decades-old undiscovered zero-days |
| Availability | Anyone can use | Restricted to partners only |
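The "pattern matching vs. reasoning" row in the table is worth making concrete. A toy contrast (purely illustrative; the signature list, function names, and C snippet are all invented for this sketch, not taken from any real tool): a signature scanner flags known-dangerous calls by regex, but a bug that requires tracking attacker-controlled data across statements matches no signature and slips through.

```python
import re

# Hypothetical signature database of known-dangerous C calls
SIGNATURES = [r"\bstrcpy\s*\(", r"\bgets\s*\(", r"\bsystem\s*\("]

def signature_scan(source: str) -> list[str]:
    """Pattern-based scan: flag any line matching a known-bad call."""
    findings = []
    for lineno, line in enumerate(source.splitlines(), 1):
        if any(re.search(sig, line) for sig in SIGNATURES):
            findings.append(f"line {lineno}: known-dangerous call")
    return findings

# Invented C snippet: the overflow comes from an unchecked,
# attacker-controlled length flowing into memcpy -- no single line
# matches a signature, so the scanner reports nothing.
VULNERABLE_C = """
size_t n = read_len_from_network(sock);   /* attacker-controlled */
char buf[64];
memcpy(buf, payload, n);                  /* no bound check: overflow */
"""

print(signature_scan(VULNERABLE_C))  # [] -- the data-flow bug is invisible
```

Seeing that `n` is attacker-controlled and unbounded before reaching `memcpy` requires reasoning about data flow and context, which is the capability jump the table attributes to Mythos.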
Platformer's Casey Newton nailed the tension — this whole initiative "is built on a deeply uncomfortable premise: that the only way to protect us from dangerous AI models is to build them first."
Alex Stamos, CPO at Corridor and former Facebook/Yahoo security chief, gave a blunt warning:
"We only have something like six months before the open-weight models catch up to the foundation models in bug finding. At which point every ransomware actor will be able to find and weaponize bugs without leaving traces for law enforcement to find."
Defenders have months, not years — that's the industry consensus.
Project Glasswing — Arming the Defenders First
Anthropic's response is Project Glasswing. Here's the structure:
- Coalition of 12 big tech companies
  AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks. Competitors Google and Microsoft on the same team tells you how serious this is.
- 40+ additional organizations
  Organizations that build or maintain critical software infrastructure get Mythos Preview access to scan and secure both proprietary and open-source systems.
- $100M in usage credits + $4M donations
  Anthropic covers the model usage costs for defensive research, plus direct donations to the Linux Foundation and Apache Software Foundation.
- Responsible disclosure pipeline
  Vulnerabilities aren't dumped on maintainers. Professional triagers manually validate every report, coordinating at a pace maintainers can handle, and technical details are published only after a 45-day buffer following the patch.
The irony
While Anthropic positions itself as the safe custodian of this unprecedented cyber capability, in March they accidentally leaked 3,000 internal documents through a CMS misconfiguration, and Claude Code's 512,000-line source code was publicly accessible via npm for about 3 hours. Neither was a core system breach, but for a company asking governments to trust it with infrastructure-level security tools, the optics weren't great.
Business Context — Why Now?
On the same day as the Glasswing launch, Anthropic disclosed $30B in annualized revenue (3x year-over-year), a 3.5GW compute deal with Google and Broadcom, and the hiring of former Microsoft exec Eric Boyd. Reports suggest an IPO as early as October 2026.
Korean tech outlet THE ELEC analyzed the timing: "beneath the defensive initiative surface lies a complex intent — IPO narrative building, leverage in the Pentagon dispute, and securing partner endorsements."
What's telling is that companies with their own AI security tech — CrowdStrike, Palo Alto Networks — joined the coalition. The interpretation? "They're effectively admitting their own security AI can't defend against Mythos-level attacks."
Key Takeaways: Why This Matters to You
- If you build software
  Within 6 months, open-weight models will have similar vulnerability-finding capabilities. Time to overhaul your security pipeline; there's no reason to delay adopting AI security tools.
- If you're a business decision-maker
  "Defending against AI-created threats with AI" is the new paradigm. Review your security budget and strategy from this perspective.
- If you follow the AI industry
  This is the first time an AI company has admitted its own model is too dangerous to release. Not marketing, but a new benchmark in AI safety discourse that will directly influence regulation.
The defender's timeline
Anthropic committed to publicly reporting Project Glasswing learnings within 90 days. That's your "defender head start" window. If you're a security practitioner, consider applying to Anthropic's Claude for Open Source program.



