news

Selected Media Coverage

The Guardian — AI outperforms doctors in Harvard trial of emergency triage diagnoses (April 2026)
NPR — In real-world test, an AI model did better than ER doctors at diagnosing patients (April 2026)
Harvard Magazine — AI Outperforms Doctors in Emergency Room Tasks, New Harvard Study Shows (April 2026)
Harvard Medicine — Study Suggests AI Good Enough at Diagnosing Complex Medical Cases to Warrant Clinical Testing (April 2026)
Harvard Medicine — Harvard Medical School’s Top 10 Science Stories of 2025 (December 2025)
Harvard Medicine — AI System with Detailed Diagnostic Reasoning Makes Its Case (October 2025)
The New Yorker — If AI Can Diagnose Patients, What Are Doctors For? (September 2025)
JAMA+ AI Podcast — Interview on LLMs for Clinical Reasoning (April 2025)
Harvard Medicine — Open-Source AI Matches Top Proprietary LLM in Solving Tough Medical Cases (March 2025)
The Harvard Crimson — HMS Researchers Develop AI Diagnostic Tool (March 2025)
Harvard DBMI News — Gift Empowers Trainees to Leverage Clinical AI for Better Health (November 2024)

Notable Mentions

Greg Brockman (Co-founder and President of OpenAI) posted about our paper: Superhuman Performance of a Large Language Model on the Reasoning Tasks of a Physician:

Greg Brockman comment on the Superhuman paper

Shoutout from Polymarket:

Polymarket shoutout

Updates

Apr 30, 2026	Our paper evaluating LLMs across six clinical reasoning tasks was published in Science. Coverage in The Guardian, NPR, Harvard Magazine, and Harvard Medical School.
Mar 25, 2026	Our reply about Dr. CaBot was published in the New England Journal of Medicine.
Dec 22, 2025	Dr. CaBot was named one of Harvard Medical School’s Top 10 Stories of 2025.
Oct 08, 2025	Harvard Medicine covered our AI-generated diagnosis published in the New England Journal of Medicine: AI System with Detailed Diagnostic Reasoning Makes Its Case.
Sep 29, 2025	Dr. CaBot featured in The New Yorker: If AI Can Diagnose Patients, What Are Doctors For?
Sep 15, 2025	Launched CPCBench, the public website for CPC-Bench, a large-scale benchmark based on the NEJM CPCs, and Dr. CaBot.
Apr 04, 2025	Interviewed on the JAMA+ AI Podcast about our paper showing that open-source LLMs can now compete with closed-source models in diagnostic reasoning.
Mar 31, 2025	Research on Llama 3.1 for complex diagnosis featured in The Harvard Crimson and Harvard Medicine News.
Nov 06, 2024	Awarded the inaugural Dunleavy Fellowship for Clinical AI at Harvard Medical School.