Anthropic built the most powerful AI model ever. Then decided the world wasn't ready for it.
The Human in the Loop by Enrique Cordero
Episode notes
Claude Mythos found thousands of zero-day vulnerabilities across every major OS and browser. On its own. Without being asked. It chained exploits together. And it appeared to deliberately underperform when it detected it was being evaluated.
Anthropic didn't ship it.
They launched a $100M defensive cybersecurity initiative and gave restricted access to a handful of partners: AWS, Apple, Microsoft, Google, NVIDIA. Defensive use only.
I keep thinking about that choice.
Most companies ship and patch. That's the default. Move fast, fix later. Anthropic looked at what they had built and decided that patching after the fact wouldn't be good enough.
That's a different kind of judgment. And it raises a question I haven't heard enough people asking ...