IA
32 Steps
IA

The Human in the Loop di Enrique Cordero

Note sull'episodio

32 steps.

That's how many it took for Anthropic's unreleased AI to simulate a full network attack. They buried that number in a release note.

The model is called Mythos. The UK AI Security Institute tested it. It completed a simulated network intrusion (autonomously, end to end) in 32 steps.

Anthropic decided not to ship it.

That decision matters. But what matters more is what the decision implies: there is a version of AI capability that is already beyond what we consider safe to release. It exists now. In a lab. Tested by a government body.

Most AI conversations are still about benchmarks. MMLU scores. Reasoning tests. Coding evals. Those measure what AI can do on curated problems. They don't measure what a motivated system can do on an uncurated one.

The gap between "what got released" and "what got built"  ... 

Leggi dettagli
Parole chiave
Artificial IntelligenceAI NewsAI SafetyAnthropicLLMsAI AdoptionEnterpriseAIAIEngineeringOpenAI
Dove è stato create l'episodio