Advanced large language models such as GPT-4 identified thoracic diseases in chest CT reports with up to 75% accuracy, a capability that could support pre-operative surgical planning.
Key Details
- Five LLMs (GPT-4, Claude-3.5, Qwen-Max, GPT-3.5-Turbo, Gemini-Pro) compared using 13,489 real-world chest CT reports.
- GPT-4 achieved up to 75% accuracy in identifying 13 common chest diseases with multiple-choice prompts.
- Multiple-choice prompts significantly improved model accuracy compared to open-ended questions.
- Fine-tuning GPT-3.5-Turbo increased its accuracy from 42% to 65% in challenging cases.
- No single LLM was best for all diseases, suggesting a tailored approach may be optimal.
- Future research will use explainable AI tools to increase transparency and reliability.
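To make the difference between the two prompt styles concrete, here is a minimal Python sketch of how an open-ended and a multiple-choice prompt might be built from a report, and how accuracy could be scored. It is an illustration under assumptions, not the study's actual method: the function names, the prompt wording, and the four option labels are hypothetical, and the study's real list of 13 diseases is not given in this summary.

```python
from string import ascii_uppercase

# Illustrative labels only (assumption); NOT the study's 13-disease list.
OPTIONS = ["pneumonia", "pulmonary nodule", "pleural effusion", "emphysema"]


def open_ended_prompt(report: str) -> str:
    """Ask the model to name the disease with no candidate list."""
    return (
        f"Chest CT report:\n{report}\n\n"
        "Which thoracic disease does this report describe?"
    )


def multiple_choice_prompt(report: str, options: list[str]) -> str:
    """Ask the same question, but constrain the answer to lettered options."""
    lines = [f"{ascii_uppercase[i]}. {opt}" for i, opt in enumerate(options)]
    return (
        f"Chest CT report:\n{report}\n\n"
        "Which thoracic disease does this report describe? "
        "Answer with a single letter.\n" + "\n".join(lines)
    )


def parse_choice(answer: str, options: list[str]):
    """Map a model reply such as 'B' back to its option, or None if invalid."""
    letter = answer.strip()[:1].upper()
    if not letter:
        return None
    idx = ascii_uppercase.find(letter)
    return options[idx] if 0 <= idx < len(options) else None


def accuracy(predictions: list, labels: list) -> float:
    """Fraction of predictions that exactly match the reference labels."""
    if not labels:
        raise ValueError("labels must not be empty")
    return sum(p == l for p, l in zip(predictions, labels)) / len(labels)
```

A constrained answer format like this is also simpler to score automatically, since each reply maps to exactly one option or is rejected as invalid.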
Why It Matters

Source
EurekAlert