AI

Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models

AI Based Paper Discussions by Sigurd