Large Language Models Rival Physicians in Complex Lung Cancer Decisions

A real-world study finds that large language models (LLMs) can match or exceed physicians in decision-making for challenging lung cancer cases, particularly rare ones.
Key Details
- 150 challenging lung cancer cases (complex, rare, refractory) were evaluated using blinded, multidimensional scoring by experts.
- LLMs reviewed: DeepSeek R1, Claude 3.5, Gemini 1.5, and GPT-4o; physician decisions stratified by experience; some juniors received AI assistance.
- DeepSeek R1 performed between intermediate and senior physicians overall; LLMs outperformed intermediates in rare cases but lagged in refractory (longitudinal) cases.
- AI-augmented junior physicians saw 80-90% boosts in comprehensiveness and specificity for rare cases, but specificity slightly dropped for refractory cases.
- Error profiling showed LLMs are strong in knowledge breadth/updates, while physicians excel in longitudinal reasoning and stability.
Source
EurekAlert