GPT-4o, with prompt engineering, selected optimal abdominal/pelvic CT protocols more accurately than radiologists without increasing inappropriate selections.
Key Details
- 1Study evaluated 1,448 abdominal and pelvic CT exams between Jan-June 2024.
- 2GPT-4o with detailed prompting selected optimal protocols 96.2% of the time, compared to 88.3% for radiologists (p<0.001).
- 3Rates of inappropriate protocols were similarly low: 1.3% (GPT-4o) vs. 2.4% (radiologists), not statistically significant (p=0.21).
- 4Fine-tuning GPT-4o offered no performance increase over meticulous prompting (both 96.2%).
- 5Performance in protocol matching was consistent across training levels (radiologist, fellow, resident).
Why It Matters

Source
AuntMinnie
Related News

Study: Computer Vision Models Best LLMs in Chest CT Breast Abnormality Detection
Computer vision models (CVMs) surpass large language models (LLMs) in accurately labeling incidental breast abnormalities on chest CT scans.

Radiology Maintains Lead in FDA-Cleared AI Algorithms, Cardiology Follows
Radiology remains the top specialty for FDA-cleared AI, with cardiology as a strong second, particularly in cardiovascular imaging.

Deep Learning Models Rival Radiologists for Pancreatic Cancer Detection on CT
Deep-learning models achieved comparable or superior accuracy to experienced radiologists in detecting pancreatic cancer on CT scans, especially for small tumors.