ChatGPT-4o and ChatGPT-5 matched or surpassed nuclear medicine experts in diagnosing neurodegenerative diseases using textual FDG-PET/CT scan descriptions.
Key Details
- 1University of Cologne team tested ChatGPT-4o and ChatGPT-5 on 100 F-18 FDG-PET/CT brain scan reports.
- 2Models achieved median diagnostic agreement scores of 1.00 against expert interpretations.
- 3ChatGPT-4o identified the correct main diagnosis in 86% of cases, ChatGPT-5 in 89%.
- 4Performance was highest in typical cases (e.g., Alzheimer's disease), lower in complex or atypical presentations.
- 5No imaging data or specific fine-tuning was used; models relied on general training.
- 6Reproducibility from run to run was 75% for ChatGPT-4o and 55% for ChatGPT-5 in a subset.
Why It Matters

Source
AuntMinnie
Related News

AI Triage Reduces Mammography Screening Workloads by 77%
AI used as a first reader in breast cancer screening can reduce radiologist workloads by 77%.

Survey Reveals Top 6 Concerns About Healthcare AI for 2026
A new survey highlights six main concerns clinicians and patients have about healthcare AI in 2026, including bias, governance, deskilling, hallucinations, accountability, and source validation.

AI-Based Slab Reconstruction Streamlines Digital Breast Tomosynthesis
AI-driven slab reconstruction in DBT improves workflow efficiency without compromising diagnostic accuracy in breast cancer screening.