Toronto Study: LLMs Must Cite Sources for Radiology Decision Support

University of Toronto researchers found that large language models (LLMs) such as DeepSeek V3 and GPT-4o offer promising support for radiology decision-making in pancreatic cancer when their recommendations cite guideline sources.

Key Details

1LLMs were tested for generating NCCN-compliant management plans for 328 pancreatic ductal adenocarcinoma (PDAC) cases.
2DeepSeek V3 had a 100% completion rate and 1.5% discordance; GPT-4o had a 96.3% completion rate and 8.8% discordance, a statistically significant difference.
3Both LLMs had high (>91%) category-specific concordance, though DeepSeek outperformed GPT-4o—except GPT-4o had 86% for locally advanced nonresectable cancer.
4Radiologist review flagged occasional inaccurate recommendations, including misclassification of tumor resectability and overtreatment.
5Researchers emphasized that LLM explainability and the ability to cite guidelines are key for clinical trust and workflow integration.

Why It Matters

Effective and explainable LLM integration could enhance radiology workflow efficiency and clinical safety, especially where multidisciplinary tumor board processes are resource-intensive. Citation of guideline sources is essential for radiologist trust and broader adoption of AI tools in oncology decision-making.

Read the full article on AuntMinnie

Toronto Study: LLMs Must Cite Sources for Radiology Decision Support

Key Details

Why It Matters

Related News

Deep Learning AI Outperforms Radiologists in Detecting ENE on CT

Patients Favor AI in Imaging Diagnostics, Hesitate on Triage Use

FDA Clears Multi-Disease AI Screening Platform for CT Imaging

Ready to Sharpen Your Edge?