Back to all papers

Repeatability of AI-quantified incidental findings on lung cancer screening CT scans in the NELSON trial.

June 17, 2026pubmed logopapers

Authors

Bunk S,Bruins Slot T,Bennink E,Sidorenkov G,van der Velden N,Schurink N,Lades F,Sebald M,Heuvelmans MA,Gietema HA,Aerts JG,de Bock GH,Schaefer-Prokop C,de Jong PA,Vliegenthart R,Mohamed Hoesein FAA

Affiliations (8)

  • University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands. [email protected].
  • University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.
  • University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
  • Siemens Healthineers Nederland B.V, Zoetermeer, The Netherlands.
  • Siemens Healthineers AG, Forchheim, Germany.
  • Maastricht University Medical Center, Maastricht University, Maastricht, The Netherlands.
  • University Medical Center Rotterdam, Erasmus University Rotterdam, Rotterdam, The Netherlands.
  • Radboud University Medical Center, Nijmegen, The Netherlands.

Abstract

Computed tomography (CT) scans for lung cancer screening provide the opportunity of quantifying incidental findings. We evaluated the repeatability of AI-based measurements of incidental findings using short-term repeat CT scan pairs. AI-Rad Companion Chest CT software was applied to low-dose non-contrast CT scans from the NELSON lung cancer screening trial to measure aorta diameters, coronary artery calcium volume (CACV), vertebral height and radiodensity, and emphysema (low attenuation area percentage, LAA%). Categories (absent/present) of aortic dilatation and osteopenia, and severity (none/mild/moderate/severe) of CACV and emphysema were calculated. We included subjects who had a short-term repeat CT scan pair with a maximum interval of 120 days. We analyzed repeatability with absolute and relative differences, and agreement with the intraclass correlation coefficient (ICC) and Cohen's kappa. 1436 subjects were included, with age (mean ± SD) 59.7 ± 5.7 years, 86.3% men, 55.9% currently smoking, and scan interval 85 ± 20 days. Mean absolute differences were 0.7 to 1.5 mm for aorta diameters, 26 mm<sup>3</sup> for CACV, 0.3 mm to 0.4 mm for vertebral height, and 7.6 HU for vertebral radiodensity. Median absolute difference was 0.8% for LAA%. Aorta diameters showed good (0.75 < ICC < 0.9) to excellent (ICC > 0.9) agreement. Agreement for the rest was excellent. Categorization Cohen's kappa between the first and second measurements was 0.68 for aortic dilatation, 0.63 for CACV, 0.84 for osteopenia, and 0.70 for emphysema. In a lung cancer screening cohort with short-term repeat CT, the repeatability and agreement of automated AI measurements of aortic diameters, coronary calcium, vertebral height and radiodensity, and emphysema was good to excellent. Question Can AI measurements of incidental findings beyond lung nodules be repeatably performed on pairs of lung cancer screening chest CT scans of the same subject? Findings The repeatability and agreement of AI-based measurements of aorta diameter, coronary calcium, vertebrae, and emphysema were generally excellent. Clinical relevance Measurement by radiologists of incidental findings on lung cancer screening chest CT scans imposes a high workload. AI algorithms allow for automatic measurement of incidental findings on lung cancer screening chest CT with high repeatability and agreement.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.