Advanced large language models like GPT-4 accurately identify thoracic diseases in chest CT reports, enhancing pre-operative surgical planning.
Key Details
- 1Five LLMs (GPT-4, Claude-3.5, Qwen-Max, GPT-3.5-Turbo, Gemini-Pro) compared using 13,489 real-world chest CT reports.
- 2GPT-4 achieved up to 75% accuracy in identifying 13 common chest diseases with multiple-choice prompts.
- 3Multiple-choice prompts significantly improved model accuracy compared to open-ended questions.
- 4Fine-tuning GPT-3.5-Turbo increased its accuracy from 42% to 65% in challenging cases.
- 5No single LLM was best for all diseases, suggesting a tailored approach may be optimal.
- 6Future research will use explainable AI tools to increase transparency and reliability.
Why It Matters

Source
EurekAlert
Related News

New VIS-Fb Nanobody Probes Transform High-Precision Cellular Imaging
Salk and Einstein researchers have developed visible-spectrum antigen-stabilizable fluorescent nanobodies (VIS-Fbs) for sharper, multi-color live-cell imaging with minimal background noise.

NIH-Backed AI Model Predicts Cancer Survival Using Single-Cell Data
Researchers have developed scSurvival, a machine learning tool that uses single-cell tumor data to accurately predict cancer patient survival and identify high-risk cell populations.

Deep Learning Pathomics Platform Improves Immunotherapy Prediction in Lung Cancer
A deep learning pathomics platform accurately predicts immunotherapy response in metastatic NSCLC using routine pathology slides.