ChatGPT-4 Matches Radiologists in Classifying Pancreatic Cysts on MRI and CT
July 16, 2025
ChatGPT-4 demonstrates near-perfect accuracy in classifying pancreatic cysts on MRI and CT, matching radiologist performance.
Key Details
- Study from Memorial Sloan Kettering used ChatGPT-4 to evaluate 3,198 MRI and CT scans of 991 adults under surveillance for pancreatic cysts.
- ChatGPT-4 was assessed on its ability to identify nine variables crucial for monitoring cyst progression.
- LLM accuracy for categorical variables ranged from 97% (solid component lesions) to 99% (calcific lesions).
- Accuracy for continuous variables ranged from 92% (cyst size) to 97% (main pancreatic duct size).
- ChatGPT-4's performance was found to be equivalent to manual radiologist chart review, which is the clinical gold standard.
- Authors note limitation: only one AI model was tested; further research is needed.
Why It Matters
This study suggests that advanced language models like ChatGPT-4 can automate time-consuming radiological classification tasks with expert-level accuracy, potentially streamlining cyst surveillance workflows and augmenting clinical decision support.