
A new study finds that GPT-4o still faces significant hurdles in accurately interpreting medical images despite promising capabilities.
Key Details
- 1Researchers evaluated GPT-4o on 377 imaging cases across X-ray, CT, and MRI.
- 2The model did not receive clinical context or prior imaging for analysis.
- 3Three radiologists rated GPT-4o's responses using a 5-point scale.
- 4GPT-4o showed high accuracy in some instances but inconsistent, 'all or nothing' results in others.
- 5Potential applications include improving radiology workflows and expanding access to care in rural settings.
Why It Matters
The inconsistent performance of leading large language models like GPT-4o in medical imaging highlights both the potential value and current limitations of applying general AI models in radiology. Progress in this area could significantly impact radiology workflows and help mitigate specialist shortages, but further development and validation are clearly needed.

Source
Health Imaging
Related News

•Radiology Business
Radiologists Prefer Domain-Specific AI for CT Report Generation
Radiologists show a clear preference for domain-specific AI models in generating accurate and timely CT report impressions.

•AuntMinnie
Radiology Receives Declining Share of Industry Research Funding
Radiologists received only 1.1% of industry-funded research payments in 2024, with a continuing downward trend.

•AuntMinnie
GPT-4o AI Matches Radiologists in Follow-Up Imaging Recommendations
GPT-4o matched the performance of experienced radiologists and surpassed residents in recommending follow-up imaging from routine radiology reports.