Multi-modal large language models in radiology: principles, applications, and potential.
Authors
Affiliations (4)
- New York University Langone Medical Center, New York, USA.
- New York University, New York, USA.
- New York University Shanghai, Shanghai, China.
- New York University Langone Medical Center, New York, USA.
Abstract
Large language models (LLMs) and multi-modal large language models (MLLMs) represent the cutting edge of artificial intelligence. This review provides a comprehensive overview of their capabilities and potential impact on radiology. Unlike most existing reviews, which focus solely on LLMs, this work examines both LLMs and MLLMs, highlighting their potential to support radiology workflows such as report generation, image interpretation, electronic health record (EHR) summarization, differential diagnosis generation, and patient education. By streamlining these tasks, LLMs and MLLMs could reduce radiologist workload, improve diagnostic accuracy, support interdisciplinary collaboration, and ultimately enhance patient care. We also discuss key limitations, including the limited capacity of current MLLMs to interpret 3D medical images and to integrate information from image and text data, as well as the lack of effective evaluation methods. Ongoing efforts to address these challenges are also introduced.