Large Language Models Rival Physicians in Complex Lung Cancer Decisions

A real-world study reveals that large language models (LLMs) can match or exceed human physicians' performance in challenging lung cancer case decision-making, especially for rare cases.
Key Details
- 150 challenging lung cancer cases (complex, rare, refractory) were evaluated using blinded, multidimensional scoring by experts.
- 2LLMs reviewed: DeepSeek R1, Claude 3.5, Gemini 1.5, and GPT-4o; physician decisions stratified by experience; some juniors received AI assistance.
- 3DeepSeek R1 performed between intermediate and senior physicians overall; LLMs outperformed intermediates in rare cases but lagged in refractory (longitudinal) cases.
- 4AI-augmented junior physicians saw 80-90% boosts in comprehensiveness and specificity for rare cases, but specificity slightly dropped for refractory cases.
- 5Error profiling showed LLMs are strong in knowledge breadth/updates, while physicians excel in longitudinal reasoning and stability.
Why It Matters

Source
EurekAlert
Related News

Researchers Develop All-Optical Synapse for Neuromorphic Imaging Systems
A new artificial synapse, controlled entirely by light, enables in-sensor neuromorphic processing for more efficient and noise-resistant imaging systems.

AI-Simulation Approach Achieves 90% Faster Brain MRI with Minimal Data
A simulation-based AI method can reconstruct brain MRI scans with only 10% of the usual data, greatly reducing scan times.

Ultrasound-Guided Nerve Freezing Revolutionizes Pediatric Ear Surgery Recovery
Lurie Children’s Hospital pioneers ultrasound-guided nerve freezing to eliminate prolonged postoperative pain in microtia repair.