Large Language Models Rival Physicians in Complex Lung Cancer Decisions

A real-world study reveals that large language models (LLMs) can match or exceed human physicians' performance in challenging lung cancer case decision-making, especially for rare cases.
Key Details
- 150 challenging lung cancer cases (complex, rare, refractory) were evaluated using blinded, multidimensional scoring by experts.
- 2LLMs reviewed: DeepSeek R1, Claude 3.5, Gemini 1.5, and GPT-4o; physician decisions stratified by experience; some juniors received AI assistance.
- 3DeepSeek R1 performed between intermediate and senior physicians overall; LLMs outperformed intermediates in rare cases but lagged in refractory (longitudinal) cases.
- 4AI-augmented junior physicians saw 80-90% boosts in comprehensiveness and specificity for rare cases, but specificity slightly dropped for refractory cases.
- 5Error profiling showed LLMs are strong in knowledge breadth/updates, while physicians excel in longitudinal reasoning and stability.
Why It Matters

Source
EurekAlert
Related News

NIH-Backed AI Model Predicts Cancer Survival Using Single-Cell Data
Researchers have developed scSurvival, a machine learning tool that uses single-cell tumor data to accurately predict cancer patient survival and identify high-risk cell populations.

AI Pathology Model Outperforms PD-L1 in Predicting NSCLC Immunotherapy Response
MD Anderson's Path-IO machine learning platform accurately predicts immunotherapy responses in metastatic non-small cell lung cancer, surpassing current biomarker standards.

Deep Learning Pathomics Platform Improves Immunotherapy Prediction in Lung Cancer
A deep learning pathomics platform accurately predicts immunotherapy response in metastatic NSCLC using routine pathology slides.