Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable
7.6 relevance
Score Breakdown
technical depth 8
novelty 8
actionability 5
community 9
strategic 8
personal 9
Scored daily by a customisable AI persona to surface the most relevant engineering leadership news.
Anthropic's Fable guardrails controversy is critical for AI security roles.
Summary
Anthropic's Fable, a restricted version of its Mythos cybersecurity model, faces backlash from researchers like IBM's Valentina Palmiotti and Tolmo's Matt Suiche due to overly broad keyword-based guardrails that block innocuous requests (e.g., code review, secure coding) and force fallback to Claude Opus 4.8. Anthropic offers a Cyber Verification Program for approved researchers to bypass restrictions, mirroring OpenAI's Trusted Access for Cyber, while Project Glasswing expands Mythos access to hundreds of organizations in 15 countries.