A new AI model, LLaVA-Endo, dramatically improves diagnostic accuracy during gastrointestinal endoscopies by merging visual and language analysis.
Key Details
- 1LLaVA-Endo is developed by an international team including universities in China and the UAE.
- 2The model combines image recognition and natural language processing for GI endoscopy.
- 3It outperformed major AI systems, such as GPT-4V, Gemini, and MiniGPT-v2, on diagnostic tasks.
- 4LLaVA-Endo was validated through expert review and benchmark tests using thousands of annotated GI images.
- 5The study appears in Frontiers of Computer Science, with publication scheduled for April 2025.
Why It Matters

Source
EurekAlert
Related News

AI-Powered OCT Enables Rapid 'Optical Biopsy' for Early Endometrial Cancer Detection
A team at Washington University has developed a catheter-based 3D OCT system with AI to quickly and noninvasively detect early endometrial cancers.

AI Clinical Reasoning in Diagnostics and Digital Fatigue in Healthcare
Recent JMIR features explore large language models in clinical diagnostics and digital fatigue among healthcare professionals.

KAIST, MIT, Microsoft Develop Efficient AI Image Upsampling for Robotics
KAIST, MIT, and Microsoft have created 'Upsample Anything,' a training-free AI method to restore high-resolution visual data from compressed images with up to 16x improved GPU memory efficiency.