
A new study evaluates the diagnostic accuracy of three leading generative multimodal AI models in interpreting CT images for lung cancer detection.
Key Details
- 1Three models compared: Gemini-pro-vision (Google), Claude-3-opus (Anthropic), and GPT-4-turbo (OpenAI).
- 2On 184 malignant lung cases, Gemini achieved highest single-image accuracy (>90%), followed by Claude-3-opus, GPT lowest (65.2%).
- 3Gemini's performance dropped to 58.5% with continuous CT slices, indicating challenges with spatial reasoning in imaging.
- 4Simplified text prompts improved diagnostic AUCs: Gemini (0.76), GPT (0.73), and Claude (0.69).
- 5Claude-3-opus showed superior consistency and lower variation in lesion feature analysis.
- 6External validation with TCGA and MIDRC datasets supported findings, especially with simplified prompt strategies.
Why It Matters

Source
EurekAlert
Related News

FDA Approves Johns Hopkins AI Tool for Early Sepsis Detection
FDA clears an AI-driven system developed by Johns Hopkins to detect sepsis up to 48 hours earlier and reduce mortality rates.

AI-Driven Handheld Endomicroscope Enhances Early Cancer Detection
Researchers develop PrecisionView, a handheld AI-powered endomicroscope for real-time, high-resolution cancer diagnostics.

New AI Vision-Language Model Enhances Chest CT Diagnostics
Researchers developed an interpretable AI model that uses visual question answering to generate detailed diagnostic findings from chest CT scans, aimed at improving lung cancer diagnosis.