Anthropic Unveils Claude Fable 5 with Built-in Cybersecurity Guardrails

The New Mythos Tier

Anthropic has released Claude Fable 5, marking the debut of its Mythos capability tier. The model surpasses the Claude Opus line, achieving state-of-the-art results on complex, multi-step reasoning tasks. The company acknowledges that such advanced capabilities are dual-use: the model can discover and exploit software vulnerabilities and perform agentic hacking across a full attack lifecycle.

Built-in Safeguards

Rather than outright refusing risky prompts, Fable 5 routes suspicious requests to a less capable model. A classifier layer detects requests related to cybersecurity, biology, chemistry, or model distillation and hands those sessions to Claude Opus 4.8 instead. Users receive a notification when a fallback occurs. The company says the classifiers trigger in under 5% of sessions, meaning over 95% run on Fable’s full capability. Internal evaluations show the classifiers effectively block meaningful progress on offensive cybersecurity tasks.
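The routing behavior described above can be illustrated with a minimal sketch. Everything in it is hypothetical: the model identifiers, the `classify` and `route` functions, and the keyword matching are stand-ins chosen for illustration, not Anthropic's implementation, which has not been published. The sketch also routes a single prompt, whereas the article describes whole sessions being handed off.

```python
from dataclasses import dataclass

# Hypothetical names throughout; this does not reflect Anthropic's actual system.
RESTRICTED_TOPICS = {"cybersecurity", "biology", "chemistry", "model distillation"}

PRIMARY_MODEL = "claude-fable-5"    # assumed identifier for the full-capability model
FALLBACK_MODEL = "claude-opus-4.8"  # assumed identifier for the less capable fallback


@dataclass(frozen=True)
class RoutingDecision:
    model: str                  # model that will serve the request
    notify_user: bool           # True when the user should be told a fallback occurred
    matched_topics: frozenset   # restricted topics the toy classifier flagged


def classify(prompt: str) -> frozenset:
    """Toy stand-in for the classifier layer: plain substring matching."""
    text = prompt.lower()
    return frozenset(topic for topic in RESTRICTED_TOPICS if topic in text)


def route(prompt: str) -> RoutingDecision:
    """Send flagged prompts to the fallback model instead of refusing them."""
    matched = classify(prompt)
    if matched:
        return RoutingDecision(FALLBACK_MODEL, True, matched)
    return RoutingDecision(PRIMARY_MODEL, False, matched)


if __name__ == "__main__":
    for prompt in ("Summarize this quarterly report",
                   "Explain this cybersecurity exploit chain"):
        decision = route(prompt)
        print(prompt, "->", decision.model, "| notify:", decision.notify_user)
```

The point of the design is that a flagged request still gets an answer, only from a weaker model, and the user is informed of the switch. A production classifier would presumably be a trained model rather than a keyword list.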

Defensive Deployment

Anthropic is also offering Claude Mythos 5, the same model with the cybersecurity safeguards removed, to a restricted group of defenders and infrastructure providers. This version is deployed through Project Glasswing in collaboration with the US government. The company reports that external red teaming found no universal jailbreaks across more than 1,000 hours of testing, though the UK AI Safety Institute made early progress toward one within a short testing window.

Source: Cyber Security News
