Back to all news

Study Reveals Large Performance Gaps Among AI Tools for Lung Cancer Detection

AuntMinnieIndustry

Seven commercial AI devices for lung cancer detection on chest x-rays show substantial and clinically meaningful performance variability.

Key Details

  • 1Seven commercial AI tools were evaluated on 5,235 chest x-rays in a UK single-center study.
  • 2Sensitivity ranged from 20.8% to 77.8%; specificity from 58.9% to 98.4%; positive predictive value from 1.5% to 28.4%.
  • 3Significant differences were observed in 39 of 44 pairwise comparisons between devices.
  • 4Three devices detected more tumors than radiologists; four detected fewer.
  • 5False positives for tumor detection ranged widely (10 to 2,039 cases per device).
  • 6Minimal agreement among devices (Fleiss κ = 0.24) highlights inconsistent results.

Why It Matters

The wide variability in performance among commercial AI products raises critical questions about device selection, safety, and standardization in clinical radiology. Establishing robust benchmarking and reporting frameworks is essential for effective and safe AI adoption in imaging.

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.