
A new study evaluates the diagnostic accuracy of three leading generative multimodal AI models in interpreting CT images for lung cancer detection.
Key Details
- Three models were compared: Gemini-pro-vision (Google), Claude-3-opus (Anthropic), and GPT-4-turbo (OpenAI).
- On 184 malignant lung cases, Gemini achieved the highest single-image accuracy (>90%), followed by Claude-3-opus; GPT scored lowest (65.2%).
- Gemini's performance dropped to 58.5% with continuous CT slices, indicating challenges with spatial reasoning in imaging.
- Simplified text prompts improved diagnostic AUCs: Gemini (0.76), GPT (0.73), and Claude (0.69).
- Claude-3-opus showed superior consistency and lower variation in lesion feature analysis.
- External validation with the TCGA and MIDRC datasets supported the findings, especially with simplified prompt strategies.
Source
EurekAlert