32 Steps
The Human in the Loop by Enrique Cordero
Episode notes
32 steps.
That's how many it took for Anthropic's unreleased AI to simulate a full network attack. They buried that number in a release note.
The model is called Mythos. The UK AI Security Institute tested it. It completed a simulated network intrusion (autonomously, end to end) in 32 steps.
Anthropic decided not to ship it.
That decision matters. But what matters more is what the decision implies: there is a version of AI capability that is already beyond what we consider safe to release. It exists now. In a lab. Tested by a government body.
Most AI conversations are still about benchmarks. MMLU scores. Reasoning tests. Coding evals. Those measure what AI can do on curated problems. They don't measure what a motivated system can do on an uncurated one.
The gap between "what got released" and "what got built" ...