Anthropic Hides New AI Model Amid Discovery Of Thousands Of External Vulnerabilities

Anthropic’s AI Breakthrough: Project Glasswing and Claude Mythos Preview

Anthropic has made waves in the tech industry by developing its latest AI model, Claude Mythos Preview, which has already identified thousands of cybersecurity vulnerabilities across major operating systems and web browsers. This groundbreaking achievement is part of a broader initiative called Project Glasswing. Instead of releasing this powerful tool to the general public, Anthropic has chosen to collaborate with organizations that play pivotal roles in the health of the internet, focusing on enhancing cybersecurity across the ecosystem.

Contents

Anthropic’s AI Breakthrough: Project Glasswing and Claude Mythos Preview

The Collaborative Power of Project Glasswing
A Model That Exceeds Benchmarks
Unprecedented Exploitation Capabilities
Why Mythos Preview Remains Under Wraps
Addressing the Open-Source Challenge
Future Directions for AI in Cybersecurity

The Collaborative Power of Project Glasswing

Project Glasswing boasts an impressive array of launch partners including tech giants like Amazon Web Services, Apple, Microsoft, and Google, among others. By fostering this collaborative effort, Anthropic extends access to over 40 additional organizations that are vital to the maintenance of critical software infrastructure. With a significant commitment of up to $100 million in usage credits for Mythos Preview and $4 million in direct donations to open-source security organizations, the impact of this initiative is poised to be profound.

A Model That Exceeds Benchmarks

Interestingly, Claude Mythos Preview was not specifically designed for cybersecurity functions. Instead, Anthropic revealed that its capabilities emerged as an unexpected yet valuable consequence of advancements in code generation, reasoning, and autonomy. This unique evolution of the model has led it to excel not only in identifying vulnerabilities but also in reverse-engineering and exploiting them.

Mythos Preview stands out for its ability to saturate existing security benchmarks, pushing Anthropic to explore novel, real-world tasks, particularly focusing on zero-day vulnerabilities—flaws that are not yet known to the developers of the software in question. For instance, it discovered a 27-year-old bug in OpenBSD, demonstrating its advanced capabilities even in seemingly secure systems.

Unprecedented Exploitation Capabilities

Perhaps one of the most remarkable aspects of Mythos Preview is its ability to autonomously identify and exploit vulnerabilities. In one striking incident, the AI successfully detected and utilized a 17-year-old remote code execution vulnerability in FreeBSD—CVE-2026-4747—allowing unauthorized users to take full control of servers running NFS without any human intervention. Nicholas Carlini from Anthropic’s research team highlighted the model’s remarkable ability to chain vulnerabilities, creating complex exploits from multiple flaws, leading to outcomes that would typically require human expertise.

Why Mythos Preview Remains Under Wraps

Anthropic’s decision not to widely release Claude Mythos Preview stems from serious safety concerns. Newton Cheng, Anthropic’s Frontier Red Team Cyber Lead, emphasized the risks associated with such advanced AI capabilities. Cheng expressed that with the rapid pace of AI development, it is likely that these capabilities could fall into the hands of malicious actors, which could have dire implications for economies, public safety, and national security. Past incidents, such as an AI-driven cyberattack executed by a state-sponsored group, underscore the potential dangers of uncontrolled AI exploitation in cybersecurity.

Addressing the Open-Source Challenge

One significant dimension of Project Glasswing is its focus on the open-source software community. Historically, organizations with substantial resources were the only ones capable of maintaining robust cybersecurity. As Jim Zemlin, CEO of the Linux Foundation, articulated, open-source maintainers often had to navigate security challenges independently. By donating $2.5 million to organizations like Alpha-Omega and OpenSSF, along with $1.5 million to the Apache Software Foundation, Anthropic is empowering these maintainers with AI tools for pinpointing cybersecurity vulnerabilities—tools that were previously out of reach.

Future Directions for AI in Cybersecurity

Anthropic has ambitious plans for the future of its AI models. While Mythos Preview exhibits extraordinary capabilities, the company aims to introduce new safeguards first and release new models at scale, starting with the Claude Opus model. This approach allows Anthropic to refine its safety mechanisms without the heightened risks associated with more powerful models.

The competitive landscape is evolving rapidly, particularly with OpenAI’s release of GPT-5.3-Codex, classified as a high-capability model for cybersecurity tasks. Anthropic’s decision to control the deployment of Mythos Preview signals a shift towards cautious, responsible AI development within the industry, setting a new standard for managing advanced capabilities.

With the landscape of AI-driven cybersecurity continuing to change, the future holds numerous questions regarding the balance between innovation and safety. The implications of such advanced technologies are far-reaching, and how they are managed will shape the dynamics of both offensive and defensive strategies in cybersecurity operations.

Inspired by: Source

Anthropic Hides New AI Model Amid Discovery of Thousands of External Vulnerabilities

Anthropic’s AI Breakthrough: Project Glasswing and Claude Mythos Preview

The Collaborative Power of Project Glasswing

A Model That Exceeds Benchmarks

Unprecedented Exploitation Capabilities

Why Mythos Preview Remains Under Wraps

Addressing the Open-Source Challenge

Future Directions for AI in Cybersecurity

Stay Connected

Explore Top AI Tools Instantly

Latest News

How Apple’s Self-Driving Car Program Paved the Way for Advanced AI Chip Technology

Paris AI Voice Startup Gradium Secures $100M Seed Funding with Nvidia Support

OpenAI’s Head of Safety Departing: What This Means for the Company

OpenAI Launches New ChatGPT Model After White House Cybersecurity Delays | Latest Update

Leading global tech insights for 20M+ innovators

Quick Link

Support

Sign Up for Our Newsletter

Anthropic’s AI Breakthrough: Project Glasswing and Claude Mythos Preview

The Collaborative Power of Project Glasswing

A Model That Exceeds Benchmarks

Unprecedented Exploitation Capabilities

More Read

Why Mythos Preview Remains Under Wraps

Addressing the Open-Source Challenge

Future Directions for AI in Cybersecurity

Sign Up For Daily Newsletter

Get AI news first! Join our newsletter for fresh updates on open-source models.

Stay Connected

Explore Top AI Tools Instantly

Latest News

How Apple’s Self-Driving Car Program Paved the Way for Advanced AI Chip Technology

Paris AI Voice Startup Gradium Secures $100M Seed Funding with Nvidia Support

OpenAI’s Head of Safety Departing: What This Means for the Company

OpenAI Launches New ChatGPT Model After White House Cybersecurity Delays | Latest Update