Anthropic Expands Project Glasswing to Defend Critical Infrastructure With AI

Anthropic expanded Project Glasswing to about 150 organizations across 15-plus countries, using a frontier model to find vulnerabilities before attackers do.

Sam CarterJun 27, 2026 8 min read

Cover image for Anthropic Expands Project Glasswing to Defend Critical Infrastructure With AI — Photo: Pacific Northwest National Laboratory - PNNL / flickr (BY-NC-SA 2.0)

In early June 2026, Anthropic expanded Project Glasswing, a program that uses an advanced AI model to find and fix software vulnerabilities in critical infrastructure. The expansion brings the effort to roughly 150 organizations across more than 15 countries, spanning sectors like energy, water, healthcare, communications, and technology. It is one of the most concrete examples yet of frontier AI being aimed squarely at defense.

Quick answer

In early June 2026, Anthropic expanded Project Glasswing to roughly 150 organizations across more than 15 countries, putting a gated frontier model (Claude Mythos Preview) in the hands of critical-infrastructure operators in energy, water, healthcare, communications, and hardware. Since launching in April 2026 the model has surfaced more than 10,000 high or critical vulnerabilities, and early partners like Cloudflare (2,000 bugs) and Mozilla (271 Firefox fixes) acted on thousands of them. Anthropic says the bottleneck is no longer finding flaws but human capacity to triage, disclose, and patch them.

Key takeaways

Anthropic added about 150 organizations across more than 15 countries, broadening Glasswing into power, water, healthcare, communications, and hardware sectors.
The program runs on Claude Mythos Preview, a gated, unreleased frontier model unusually capable at finding software flaws.
Since launching in early April 2026, the model has surfaced more than 10,000 high- or critical-severity vulnerabilities.
Early partners reported striking results: Cloudflare found 2,000 bugs (400 high or critical) and Mozilla fixed 271 vulnerabilities in Firefox while testing the model.
Anthropic says the real bottleneck is now human capacity to triage, disclose, and patch the flaws the model finds.

What happened

Project Glasswing pairs Anthropic with critical-infrastructure operators and major technology companies to test next-generation AI tools for defensive cybersecurity. The June expansion widened both the number of participating organizations and the sectors involved, deliberately reaching into areas underrepresented in the first wave, including power, water, healthcare, communications, and hardware. Many of the new partners are vendors whose codebases underpin critical infrastructure systems.

At the center is a gated research-preview model Anthropic refers to as Claude Mythos Preview, an unreleased frontier model the company says is unusually capable at finding and exploiting software flaws. Since the program launched in early April 2026, the model has surfaced more than 10,000 high- or critical-severity vulnerabilities. In one effort, Anthropic said the model scanned more than 1,000 open-source projects and flagged over 23,000 potential vulnerabilities, with several thousand rated high or critical. Of a sample reviewed independently, the company said more than 90% were confirmed as valid.

Note

A "zero-day" is a vulnerability that defenders do not yet know about and have not patched. Finding them first lets defenders fix the hole before attackers can use it; finding them second usually means cleaning up after a breach.

The details

Early partner results give a sense of scale. Cloudflare said the model identified 2,000 bugs across its critical-path systems, including 400 rated high or critical. Mozilla reported finding and fixing 271 vulnerabilities in Firefox 150 while testing the model, more than ten times the number found in a previous Firefox version. Those are not abstract figures; they represent real flaws in software that millions of people rely on every day.

Partner / effort	Reported result	Why it matters
Cloudflare	2,000 bugs found, 400 high or critical	Flaws in critical-path edge systems
Mozilla (Firefox 150)	271 vulnerabilities fixed	More than 10x a prior Firefox version
Open-source scan	23,000+ potential flaws across 1,000+ projects	90%+ of a reviewed sample confirmed valid
Program total (since April 2026)	10,000+ high or critical findings	Scale no manual team could match

A security analyst studying highlighted source code on a large monitor — Photo: ricardodiaz11 / flickr (BY 2.0)

Anthropic is candid about the limiting factor. As the company put it, the bottleneck in fixing bugs like these is the human capacity to triage, report, and design and deploy patches for them. An AI that surfaces thousands of valid flaws only helps if organizations can act on the findings, which is why the program emphasizes coordination with vendors rather than dumping vulnerability lists into the open.

Why it matters

The premise behind Glasswing is a double-edged observation: AI has reached a point where it can find software vulnerabilities better than all but the most skilled human researchers. That capability is enormously useful for defense, but the same skill is exactly what an attacker would want. The pace at which exploited flaws now move from disclosure to attack is sobering, as seen in cases like the Cisco Unified CM SSRF flaw weaponized within 24 hours.

Anthropic frames the program as an attempt to get ahead of that shift. By putting a powerful vulnerability-finding model in the hands of defenders now, the company argues, critical systems can be hardened before similarly capable models become widely available, potentially without safety guardrails, to anyone including attackers. Anthropic has said it expects other AI companies to field comparable models within roughly 6 to 12 months.

For the operators involved, the appeal is practical. Power grids, hospitals, and telecom networks run on layers of software, much of it old and hard to audit. An AI that can systematically scan large codebases and surface the most serious flaws could find problems that manual review would take years to reach.

The risks and the debate

The flip side is unavoidable. A tool good enough to find thousands of real vulnerabilities is also a tool that, in the wrong hands, could accelerate attacks rather than prevent them. That is why Mythos is described as gated and limited to a controlled set of partners rather than released publicly. The same dual-use tension is driving governments to slow public releases of frontier models, as with OpenAI's restricted GPT-5.6 rollout.

Critics and supporters alike tend to agree on the core tension: powerful offensive-security capabilities are emerging whether or not any single company releases them, so the policy question is how to distribute the defensive version responsibly and quickly enough to matter. The broader category of AI-amplified attacks, including prompt injection against agents, is explored in our 2026 prompt injection defense playbook.

Warning

The same AI capability that helps defenders patch flaws first can help attackers find them first. The value of programs like Glasswing depends heavily on access staying controlled and defenders moving faster than adversaries.

What is next

Things to watch:

Confirmed fixes, not just findings. The real measure is how many flagged vulnerabilities actually get patched across participating organizations.
Independent validation. Outside review of the model's accuracy and false-positive rate will shape trust in the approach.
The 6-to-12-month window. If comparable models reach the open market on that timeline, the race between offense and defense intensifies.
Policy response. Governments may weigh in on how powerful vulnerability-discovery tools should be governed.

Frequently asked questions

What is Claude Mythos Preview?

It is an unreleased, gated frontier model from Anthropic, made available only to vetted Project Glasswing partners. Anthropic says it is unusually strong at finding and even exploiting software vulnerabilities, which is why access is tightly controlled.

How accurate are the model's findings?

In an independent review of a sample of its high- and critical-rated findings, more than 90% were confirmed valid. Real-world partners like Cloudflare and Mozilla also acted on thousands of flagged bugs, suggesting a meaningful signal rather than mostly noise.

Could this tool help attackers instead?

In principle, yes, which is why it is gated. A model capable of finding thousands of valid flaws would be dangerous if released openly. Anthropic's bet is that controlled defensive access lets operators patch before comparable models leak to attackers.

Why is patching the bottleneck?

Finding vulnerabilities is now faster than fixing them. Triaging, disclosing responsibly, writing patches, and deploying them across complex infrastructure all require human effort and time, which the AI does not remove.

For now, Project Glasswing is one of the most concrete examples of frontier AI being aimed at defense, and a preview of a security landscape where both attackers and defenders have far sharper tools.

#news#security#ai