AI
32 Steps

The Human in the Loop by Enrique Cordero

Episode notes

32 steps.

That's how many it took for Anthropic's unreleased AI to simulate a full network attack. They buried that number in a release note.

The model is called Mythos. The UK AI Security Institute tested it. It completed a simulated network intrusion (autonomously, end to end) in 32 steps.

Anthropic decided not to ship it.

That decision matters. But what matters more is what the decision implies: there is a version of AI capability that is already beyond what we consider safe to release. It exists now. In a lab. Tested by a government body.

Most AI conversations are still about benchmarks. MMLU scores. Reasoning tests. Coding evals. Those measure what AI can do on curated problems. They don't measure what a motivated system can do on an uncurated one.

The gap between "what got released" and "what got built"  ... 

Keywords
Artificial Intelligence, AI News, AI Safety, Anthropic, LLMs, AI Adoption, EnterpriseAI, AIEngineering, OpenAI
Where this episode is produced