Advanced large language models like GPT-4 accurately identify thoracic diseases in chest CT reports, enhancing pre-operative surgical planning.
Key Details
- 1Five LLMs (GPT-4, Claude-3.5, Qwen-Max, GPT-3.5-Turbo, Gemini-Pro) compared using 13,489 real-world chest CT reports.
- 2GPT-4 achieved up to 75% accuracy in identifying 13 common chest diseases with multiple-choice prompts.
- 3Multiple-choice prompts significantly improved model accuracy compared to open-ended questions.
- 4Fine-tuning GPT-3.5-Turbo increased its accuracy from 42% to 65% in challenging cases.
- 5No single LLM was best for all diseases, suggesting a tailored approach may be optimal.
- 6Future research will use explainable AI tools to increase transparency and reliability.
Why It Matters

Source
EurekAlert
Related News

AI-Powered OCT Enables Rapid 'Optical Biopsy' for Early Endometrial Cancer Detection
A team at Washington University has developed a catheter-based 3D OCT system with AI to quickly and noninvasively detect early endometrial cancers.

AI Clinical Reasoning in Diagnostics and Digital Fatigue in Healthcare
Recent JMIR features explore large language models in clinical diagnostics and digital fatigue among healthcare professionals.

KAIST, MIT, Microsoft Develop Efficient AI Image Upsampling for Robotics
KAIST, MIT, and Microsoft have created 'Upsample Anything,' a training-free AI method to restore high-resolution visual data from compressed images with up to 16x improved GPU memory efficiency.