news
Selected Media Coverage
- The Guardian — AI outperforms doctors in Harvard trial of emergency triage diagnoses (April 2026)
- NPR — In real-world test, an AI model did better than ER doctors at diagnosing patients (April 2026)
- Harvard Magazine — AI Outperforms Doctors in Emergency Room Tasks, New Harvard Study Shows (April 2026)
- Harvard Medicine — Study Suggests AI Good Enough at Diagnosing Complex Medical Cases to Warrant Clinical Testing (April 2026)
- Harvard Medicine — Harvard Medical School’s Top 10 Science Stories of 2025 (December 2025)
- Harvard Medicine — AI System with Detailed Diagnostic Reasoning Makes Its Case (October 2025)
- The New Yorker — If AI Can Diagnose Patients, What Are Doctors For? (September 2025)
- JAMA+ AI Podcast — Interview on LLMs for Clinical Reasoning (April 2025)
- Harvard Medicine — Open-Source AI Matches Top Proprietary LLM in Solving Tough Medical Cases (March 2025)
- The Harvard Crimson — HMS Researchers Develop AI Diagnostic Tool (March 2025)
- Harvard DBMI News — Gift Empowers Trainees to Leverage Clinical AI for Better Health (November 2024)
Notable Mentions
Greg Brockman (Co-founder and President of OpenAI) posted about our paper: Superhuman Performance of a Large Language Model on the Reasoning Tasks of a Physician:

Shoutout from Polymarket:

Updates
| Apr 30, 2026 | Our paper evaluating LLMs across six clinical reasoning tasks was published in Science. Coverage in The Guardian, NPR, Harvard Magazine, and Harvard Medical School. |
|---|---|
| Mar 25, 2026 | Our reply about Dr. CaBot was published in the New England Journal of Medicine. |
| Dec 22, 2025 | Dr. CaBot was named one of Harvard Medical School’s Top 10 Stories of 2025. |
| Oct 08, 2025 | Harvard Medicine covered our AI-generated diagnosis published in the New England Journal of Medicine: AI System with Detailed Diagnostic Reasoning Makes Its Case. |
| Sep 29, 2025 | Dr. CaBot featured in The New Yorker: If AI Can Diagnose Patients, What Are Doctors For? |
| Sep 15, 2025 | Launched CPCBench, the public website for CPC-Bench, a large-scale benchmark based on the NEJM CPCs, and Dr. CaBot. |
| Apr 04, 2025 | Interviewed on the JAMA+ AI Podcast about our paper showing that open-source LLMs can now compete with closed-source models in diagnostic reasoning. |
| Mar 31, 2025 | Research on Llama 3.1 for complex diagnosis featured in The Harvard Crimson and Harvard Medicine News. |
| Nov 06, 2024 | Awarded the inaugural Dunleavy Fellowship for Clinical AI at Harvard Medical School. |