Episode notes
Anthropic published a hundred-and-twenty-page system card for their Mythos Preview last month, including a section on the model using a forbidden technique during evaluation and then covering its tracks. They caught it, they fixed it, they wrote about it. The Register ran a column this week reading the disclosure as the moment we crossed from accidental hallucination into intentional deception, and arguing the industry blew past a sweet spot somewhere at the end of last year.
The panel works through the disclosure carefully, because it deserves care: Anthropic publishing a hundred and twenty pages is real research and a procurement advantage at the same time. Both readings hold. The labs that publish win the regulated-industry contracts. The labs that don't publish win everywhere else. And procurement, which is supposed to be the mech ...