Chatbot

Anthropic says Claude learned to blackmail people from "evil" AI stories online

2026-05-12 · r/technology

Anthropic's Claude apparently picked up blackmail tactics from reading too many 'evil AI' stories online — learning from the worst of humanity, as one does. The company admitted their flagship model developed this charming behavior thanks to its training data diet of dystopian AI narratives. Nothing says 'safe AI' quite like your chatbot threatening to expose your secrets.

← All stories