OpenBSD is considered the world's most security-hardened OS. A bug had been hiding there for 27 years. AI found it.

Alone. Automatically. And you can't use that AI yet.

30-second summary
AI surpasses top hackers 23,000+ vulnerabilities found Coalition instead of public release The AI security game has changed

You still think AI security tools are just "assistants," right?

Honestly? That was accurate until recently. AI-based security scanners were good at finding known patterns, but real zero-day vulnerabilities remained the domain of elite security researchers.

In April 2026, that assumption was officially overturned. Anthropic revealed an unreleased model called Claude Mythos Preview — and the benchmark numbers are quite provocative.

83.1%
CyberGym (vulnerability reproduction)
93.9%
SWE-bench Verified
77.8%
SWE-bench Pro

More telling than the benchmarks: independent security firm XBOW called Mythos absolutely unprecedented precision, while the UK AI Security Institute declared it "the first model to solve both of their cyber ranges end to end."

Previous security AIs would find a bug and stop at "this looks suspicious." Mythos finds the bug, converts it into working exploit code, and chains multiple vulnerabilities into larger attacks. It runs full penetration tests, not just bug identification.

How does AI find bugs that hid for 27 and 16 years?

Numbers first. Mythos Preview scanned over 1,000 open-source projects and flagged 23,019 issues total — 6,202 rated high or critical severity. Six independent security firms verified 1,752 of these; 90.6% confirmed real vulnerabilities.

SoftwareHidden forSeverityImpact
wolfSSL cryptography libraryCVE-2026-5194CriticalCertificate forgery — enables convincing fake bank and email sites
OpenBSD27 yearsHighRemote system crash
FFmpeg (video encoder)16 yearsHighSurvived 5M+ automated test runs undetected
Linux kernelMultipleCriticalPrivilege escalation from user to full system control

You may not know wolfSSL, but your devices almost certainly use it. It's embedded in billions of IoT devices, automotive systems, and hardware. The exploit would have let attackers forge certificates — making fake websites appear completely legitimate with a green padlock. It's patched now.

Cloudflare found 2,000 bugs in their codebase, 400 of them high or critical severity. False positive rate was better than human testers. Mozilla found 271 vulnerabilities in Firefox 150 — more than 10x what they found with Claude Opus 4.6 on prior versions.

Why didn't anyone catch that 16-year-old FFmpeg bug?

Automated tests ran 5 million+ times and missed it. Traditional fuzzers ask "does this input crash the program?" in brute repetition. AI understands the logic of the code and constructs sophisticated edge cases. It attacks meaning, not just surface patterns.

So why didn't Anthropic release it publicly?

Here's the core. Mythos Preview is not available to general users. Access is restricted to specific partner organizations, and public release has been pushed until "stronger safety features are added."

This isn't just "it's dangerous, lock it up." The logic is: the same AI can attack or defend — who gets there first determines the outcome. Attackers gaining this capability is only a matter of time. Anthropic's bet was getting defense there first.

That's Project Glasswing — an industry defense coalition launched in April 2026.

AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, Linux Foundation, Microsoft, NVIDIA, Palo Alto Networks

— 11 founding partners of Project Glasswing

It launched with 50 partners and expanded to 150+ organizations across 15+ countries by June 2026. Korea is included — Samsung, SK Hynix, and SK Telecom are among the partners. Anthropic has committed up to $100M in model credits and $4M in direct grants to open-source foundations.

Old security paradigmPost-Glasswing
Vulnerability discovery speedMonthsWeeks
Monthly high-risk findings150–300900+
Patch deployment time60–150 days avgTarget: under 2 weeks
False positive rateHigh (noisy)Better than human testers

Results are already showing. A partner bank intercepted a $1.5M fraudulent wire transfer, and Palo Alto Networks has been patching at 5x their normal rate.

What your team should do right now

  1. Audit your open-source dependencies
    Like wolfSSL, you may be using libraries you don't know about. Run npm audit, pip-audit, or trivy to map your dependency tree today, and automate alerts with GitHub Dependabot or Snyk.
  2. Cut your patch deployment cycle
    Industry average is 60–150 days. AI has compressed discovery to weeks — patches need to keep up. Build security patch validation and deployment into your CI/CD pipeline. That's the most urgent structural change right now.
  3. Join the Claude Security waitlist
    Anthropic is running an enterprise beta of Claude Security. Early data: 2,100 vulnerabilities patched in 3 weeks using Claude Opus 4.7. Sign up now to move fast when access opens.
  4. Harden your defense architecture
    Cloudflare's lesson: "Patching alone isn't enough." WAF, Zero Trust, and system segmentation need to run in parallel. Even when you find vulnerabilities, attackers can move before patches ship.
  5. Understand the AI-vs-AI era
    Security expert Sejun Park (CEO, Theori) notes the acceleration in AI-driven vulnerability discovery predates Mythos. Defense teams without AI tooling will find it increasingly hard to keep pace with AI-powered attacks.