GPT-4o's Mixed Performance in Medical Imaging Interpretation Highlighted in Study
July 15, 2025
A new study finds that GPT-4o, despite promising capabilities, still faces significant hurdles in accurately interpreting medical images.
Key Details
- Researchers evaluated GPT-4o on 377 imaging cases across X-ray, CT, and MRI.
- The model did not receive clinical context or prior imaging for analysis.
- Three radiologists rated GPT-4o's responses using a 5-point scale.
- GPT-4o was highly accurate in some cases but produced inconsistent, 'all or nothing' results in others.
- Potential applications include improving radiology workflows and expanding access to care in rural settings.
Why It Matters
The inconsistent performance of a leading large language model such as GPT-4o in medical imaging highlights both the potential value and the current limitations of applying general-purpose AI models in radiology. Progress in this area could significantly affect radiology workflows and help mitigate specialist shortages, but further development and validation are needed first.