Back to all news

Large Language Models Rival Physicians in Complex Lung Cancer Decisions

EurekAlertResearch
Large Language Models Rival Physicians in Complex Lung Cancer Decisions

A real-world study reveals that large language models (LLMs) can match or exceed human physicians' performance in challenging lung cancer case decision-making, especially for rare cases.

Key Details

  • 150 challenging lung cancer cases (complex, rare, refractory) were evaluated using blinded, multidimensional scoring by experts.
  • 2LLMs reviewed: DeepSeek R1, Claude 3.5, Gemini 1.5, and GPT-4o; physician decisions stratified by experience; some juniors received AI assistance.
  • 3DeepSeek R1 performed between intermediate and senior physicians overall; LLMs outperformed intermediates in rare cases but lagged in refractory (longitudinal) cases.
  • 4AI-augmented junior physicians saw 80-90% boosts in comprehensiveness and specificity for rare cases, but specificity slightly dropped for refractory cases.
  • 5Error profiling showed LLMs are strong in knowledge breadth/updates, while physicians excel in longitudinal reasoning and stability.

Why It Matters

This study highlights when and how LLMs can best augment clinical decision-making—showing strong synergies for rare cases and pointing to limitations in complex, longitudinal care settings. These insights will inform the integration of AI tools into radiology and oncology multidisciplinary teams for case-based decision support.

Ready to Sharpen Your Edge?

Subscribe to join 9,000+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.