Advanced large language models like GPT-4 accurately identify thoracic diseases in chest CT reports, enhancing pre-operative surgical planning.
Key Details
- 1Five LLMs (GPT-4, Claude-3.5, Qwen-Max, GPT-3.5-Turbo, Gemini-Pro) compared using 13,489 real-world chest CT reports.
- 2GPT-4 achieved up to 75% accuracy in identifying 13 common chest diseases with multiple-choice prompts.
- 3Multiple-choice prompts significantly improved model accuracy compared to open-ended questions.
- 4Fine-tuning GPT-3.5-Turbo increased its accuracy from 42% to 65% in challenging cases.
- 5No single LLM was best for all diseases, suggesting a tailored approach may be optimal.
- 6Future research will use explainable AI tools to increase transparency and reliability.
Why It Matters

Source
EurekAlert
Related News

AI Model Improves Early Detection of Primary Aldosteronism via EHR Data
An AI-driven model using 30 years of EHR data enhances screening for primary aldosteronism, a frequently underdiagnosed hypertension cause.

Peking University Debuts LargePNet for Superior Fluorescence Image Restoration
Peking University's Xi Peng lab introduces LargePNet, a new AI for robust fluorescence image restoration, outperforming patch-based methods.

AI Detects Smuggled Marine Life in Airport CT Scans
Researchers developed an AI algorithm to identify smuggled marine wildlife in airport luggage using CT scans with high accuracy.