32 Steps | Episodes on RSS.com

32 Steps

The Human in the Loop by Enrique Cordero

S1 · E23

Apr 19, 2026

14:52

Episode notes

32 steps.

That's how many steps it took for Anthropic's unreleased AI to complete a full simulated network attack. They buried that number in a release note.

The model is called Mythos. The UK AI Security Institute tested it. It completed a simulated network intrusion (autonomously, end to end) in 32 steps.

Anthropic decided not to ship it.

That decision matters. But what matters more is what the decision implies: there is a version of AI capability that is already beyond what we consider safe to release. It exists now. In a lab. Tested by a government body.

Most AI conversations are still about benchmarks. MMLU scores. Reasoning tests. Coding evals. Those measure what AI can do on curated problems. They don't measure what a motivated system can do on an uncurated one.

The gap between "what got released" and "what got built" ...

Keywords

Artificial Intelligence, AI News, AI Safety, Anthropic, LLMs, AI Adoption, EnterpriseAI, AIEngineering, OpenAI

Where this episode is produced

City

Oulu, North Ostrobothnia, Mainland Finland, Finland