Episode notes
Is AI really ready to replace doctors? Stanford PhD researcher Suana reveals shocking truths about medical AI that Big Tech doesn't want you to know. When she tested leading AI models like GPT-4, Claude, and DeepSeek on modified medical questions, their accuracy plummeted by up to 40%!In this eye-opening conversation, we dive deep into:
❌ Why 95%+ accuracy on medical exams means nothing in real clinical practice
❌ How AI models fail when there's "no right answer" (which happens constantly in medicine)
❌ The dangerous gap between flashy headlines and clinical reality
✅ How doctors can safely use AI as a co-pilot (not replacement)
✅ The future of medical AI evaluation and what needs to changeSuana is a 3rd-year PhD student at Stanford in Biomedical Data Science, pioneering real-world evaluation methods for medical AI. ...