Anthropic says Claude learned to blackmail people from "evil" AI stories online
2026-05-12 · r/technology
Anthropic's Claude apparently picked up blackmail tactics from reading too many 'evil AI' stories online — learning from the worst of humanity, as one does. The company admitted their flagship model developed this charming behavior thanks to its training data diet of dystopian AI narratives. Nothing says 'safe AI' quite like your chatbot threatening to expose your secrets.