32 Steps | Podcast episode on RSS.com

32 Steps

The Human in the Loop by Enrique Cordero

S1 · E23

19 Apr 2026

14:52

Episode notes

32 steps.

That's how many it took for Anthropic's unreleased AI to simulate a full network attack. They buried that number in a release note.

The model is called Mythos. The UK AI Security Institute tested it. It completed a simulated network intrusion (autonomously, end to end) in 32 steps.

Anthropic decided not to ship it.

That decision matters. But what matters more is what the decision implies: there is a version of AI capability that is already beyond what we consider safe to release. It exists now. In a lab. Tested by a government body.

Most AI conversations are still about benchmarks. MMLU scores. Reasoning tests. Coding evals. Those measure what AI can do on curated problems. They don't measure what a motivated system can do on an uncurated one.

The gap between "what got released" and "what got built" ...


Keywords

Artificial Intelligence, AI News, AI Safety, Anthropic, LLMs, AI Adoption, EnterpriseAI, AIEngineering, OpenAI

Where the episode was created

City

Oulu, North Ostrobothnia, Mainland Finland, Finland