
A new study evaluates the diagnostic accuracy of three leading generative multimodal AI models in interpreting CT images for lung cancer detection.
Key Details
- Three models were compared: Gemini-pro-vision (Google), Claude-3-opus (Anthropic), and GPT-4-turbo (OpenAI).
- On 184 malignant lung cases, Gemini achieved the highest single-image accuracy (>90%), followed by Claude-3-opus; GPT had the lowest (65.2%).
- Gemini's accuracy dropped to 58.5% when it was given continuous CT slices, indicating difficulty with spatial reasoning across images.
- Simplified text prompts improved diagnostic AUCs: Gemini (0.76), GPT (0.73), and Claude (0.69).
- Claude-3-opus showed superior consistency and lower variation in lesion feature analysis.
- External validation on the TCGA and MIDRC datasets supported the findings, especially with simplified prompt strategies.
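
For readers unfamiliar with the AUC figures above: AUC is the probability that a model scores a randomly chosen malignant case higher than a randomly chosen benign one, so 0.5 is chance and 1.0 is perfect ranking. The sketch below shows that calculation in general terms. It is not the study's evaluation code; the function name `auc` and the toy labels and scores are illustrative assumptions.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney U)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, NOT from the study: 1 = malignant, 0 = benign.
labels = [1, 1, 1, 0, 0]
scores = [0.9, 0.6, 0.4, 0.5, 0.2]
toy_auc = auc(labels, scores)
print(f"AUC = {toy_auc:.2f}")
```

In this toy case five of the six malignant/benign pairs are ranked correctly, so the AUC is 5/6.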
Why It Matters

Source
EurekAlert