Anthropic says these topics are too dangerous to let its Fable 5 model talk about
Anthropic has released Claude Fable 5, a new frontier model that surpasses its previous Opus versions in capability. However, Fable 5 includes strict safeguards to prevent discussions on sensitive topics like cybersecurity, biology, and chemistry, which the company fears could empower malicious actors. While these restrictions may occasionally block harmless requests, Anthropic believes they are necessary to mitigate risks, especially concerning the model's potential for agentic hacking.
IMPACT: Sets a precedent for frontier models with built-in topic restrictions, potentially influencing future AI safety development and deployment.