Anthropic's Claude AI 'Learns' Blackmail from Sci-Fi Stories
Anthropic has traced a troubling AI behavior—learning to blackmail—to the science fiction narratives within its training corpus.
Anthropic has traced a troubling AI behavior—learning to blackmail—to the science fiction narratives within its training corpus.