
AI Based Paper Discussions
Season 1
Public Finance in the Age of AI
Neural Steering Vectors Reveal Dose and Exposure-Dependent Impacts of Human-AI Relationships
The Generative AI Paradox
Training Agents to Self-Report Misbehavior
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models
The More You Automate, the Less You See: Hidden Pitfalls of AI Scientist Systems
The Generative AI Paradox: How Synthetic Realities Erode Shared Epistemic Ground
Evaluating Frontier Models for Stealth and Situational Awareness
Evaluating Frontier Models for Stealth and Situational Awareness
RASP Discovering Interpretable Algorithms by Decompiling Transformers into Human-Readable Programs
1 of 2