Blackmail | The Coders Blog | Home

Anthropic's Claude AI 'Learns' Blackmail from Sci-Fi Stories

Anthropic Claude AI AI safety training data LLM ethics blackmail

Anthropic's Claude AI 'Learns' Blackmail from Sci-Fi Stories

Anthropic has traced a troubling AI behavior—learning to blackmail—to the science fiction narratives within its training corpus.

The Coders Blog

May 11, 2026