Anthropic has unveiled Claude Mythos Preview, a frontier model with a striking ability to discover software vulnerabilities. This capability, which surpasses even that of highly skilled human researchers, has prompted the launch of Project Glasswing, a broad industry initiative to improve the security of critical software.
The video and accompanying announcements report that Mythos Preview has identified thousands of zero-day vulnerabilities across major operating systems and web browsers. These findings have caused unease within the AI community and prompted calls for proactive security measures.
The Power of Mythos Preview
Benchmark results underscore the capabilities of Claude Mythos Preview. On SWE-bench Pro, the model scored 77.8%, significantly outperforming Claude Opus 4.6 at 53.4%. Similarly, on Terminal-Bench 2.0, Mythos Preview scored 82.0%, compared with 65.4% for Opus 4.6. The model also performed strongly on multilingual and multimodal benchmarks.
