Efficacy of a large language model in classifying branch-duct intraductal papillary mucinous neoplasms.

June 11, 2025

DOI: 10.1007/s00261-025-05062-z PMID: 40498341

Authors

Sato M, Yasaka K, Abe S, Kurashima J, Asari Y, Kiryu S, Abe O

Affiliations (3)

The University of Tokyo, Tokyo, Japan.
The University of Tokyo, Tokyo, Japan.
International University of Health and Welfare, Ōtawara, Japan.

Abstract

Appropriate categorization based on magnetic resonance imaging (MRI) findings is important for managing intraductal papillary mucinous neoplasms (IPMNs). In this study, a large language model (LLM) that classifies IPMNs based on MRI findings was developed, and its performance was compared with that of less experienced human readers. The medical image management and processing systems of our hospital were searched to identify MRI reports of branch-duct IPMNs (BD-IPMNs). The reports were assigned to the training, validation, and test datasets in chronological order. The model was trained on the training dataset, and the model that performed best on the validation dataset was evaluated on the test dataset. In addition, two radiology residents (Readers 1 and 2) and an intern (Reader 3) manually categorized the reports in the test dataset. The accuracy, sensitivity, and time required for categorization were compared between the model and the readers. The accuracy of the fine-tuned LLM on the test dataset was 0.966, which was comparable to that of Readers 1 and 2 (0.931-0.972) and significantly better than that of Reader 3 (0.907). The fine-tuned LLM had an area under the receiver operating characteristic curve of 0.982 for the classification of cyst diameter ≥ 10 mm, which was significantly superior to that of Reader 3 (0.944). Furthermore, the fine-tuned LLM completed the test dataset in 25 s, considerably faster than the readers (1,887-2,646 s). The fine-tuned LLM classified BD-IPMNs based on MRI findings with performance comparable to that of radiology residents and significantly reduced the time required.
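
The evaluation design described in the abstract (a chronological split of the reports, then accuracy and sensitivity on the held-out test set) can be illustrated with a minimal Python sketch. It is not the authors' code. The `Report` class, the binary `label` field, and the split sizes are hypothetical choices made only for illustration, since the abstract does not state the dataset sizes or the label scheme.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Report:
    """Hypothetical container for one MRI report (not from the paper)."""
    exam_date: date
    label: int  # assumed binary label: 1 = positive category, 0 = negative


def chronological_split(reports, n_train, n_val):
    """Sort reports by date; the oldest go to training, the newest to test.

    n_train and n_val are illustrative counts chosen by the caller.
    """
    ordered = sorted(reports, key=lambda r: r.exam_date)
    train = ordered[:n_train]
    val = ordered[n_train:n_train + n_val]
    test = ordered[n_train + n_val:]
    return train, val, test


def accuracy(y_true, y_pred):
    """Fraction of reports whose predicted category matches the reference."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def sensitivity(y_true, y_pred, positive=1):
    """True positives divided by all reference positives (recall)."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    return tp / (tp + fn)
```

Splitting by date rather than at random keeps every test report later in time than the training reports, which is closer to how such a model would meet new reports in practice.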

Topics

Journal Article
