AI
32 Steps
AI

The Human in the Loop by Enrique Cordero

Episode notes

32 steps.

That's how many it took for Anthropic's unreleased AI to simulate a full network attack. They buried that number in a release note.

The model is called Mythos. The UK AI Security Institute tested it. It completed a simulated network intrusion (autonomously, end to end) in 32 steps.

Anthropic decided not to ship it.

That decision matters. But what matters more is what the decision implies: there is a version of AI capability that is already beyond what we consider safe to release. It exists now. In a lab. Tested by a government body.

Most AI conversations are still about benchmarks. MMLU scores. Reasoning tests. Coding evals. Those measure what AI can do on curated problems. They don't measure what a motivated system can do on an uncurated one.

The gap between "what got released" and "what got built"  ... 

Read more
Keywords
Artificial IntelligenceAI NewsAI SafetyAnthropicLLMsAI AdoptionEnterpriseAIAIEngineeringOpenAI