Ultrasound-based Detection and Malignancy Prediction of Breast Lesions Eligible for Biopsy: A Multi-center Clinical-scenario Study Using Nomograms, Large Language Models, and Radiologist Evaluation.
Authors
Affiliations (9)
Affiliations (9)
- Department of Radiology Technology, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran (A.A.A., A.G., F.F.). Electronic address: [email protected].
- Department of Radiology, Faculty of Medicine, Urmia University of Medical Science, Urmia, Iran (A.M.).
- Department of Radiology Sancaktepe Şehit Prof. Dr. İlhan Varank Training and Research Hospital, University of Health Sciences, Istanbul, Turkey (T.Y.K.).
- Kartal Dr. LÜtfi Kırdar City Hospital, Istanbul, Turkey (B.N.K.).
- Department of Information Engineering, University of Padova, Padova, Italy (H.K.).
- Department of Radiology Technology, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran (A.A.A., A.G., F.F.).
- Universal Scientific Education and Research Network (USERN), Tehran, Iran (A.M.); School of Medicine, Tehran University of Medical Sciences, Tehran, Iran (A.M.).
- Clinical AI-Research in Omics and Medical Data Science (CAROM) group, Department of Medicine, Faculty of Medicine and Dentistry, Danube Private University, Krems an der Donau, Austria (S.H.); Austrian Center for Medical Innovation and Technology (ACMIT), Wiener Neustadt, Austria (S.H.).
- School of Mathematics, Physics and Computing, University of Southern Queensland, Springfield, Queensland, Australia (U.R.A.); Centre for Health Research, University of Southern Queensland, Springfield, Queensland, Australia (U.R.A.).
Abstract
To develop and externally validate ultrasound nomograms combining BI-RADS features and quantitative morphometric characteristics, and to compare their performance with expert radiologists and large language models in biopsy recommendation and malignancy prediction for breast lesions. In this multi-center, multi-national study, 1747 women with breast lesions underwent ultrasound across three centers in Iran and Turkey. A total of 10 BIRADS and 26 morphological features were extracted from each lesion. Three nomograms based on BI-RADS, morphometric, and both feature sets were constructed. Three radiologists (one senior, two general) and two ChatGPTs including ChatGPT-o3 and o4-mini-high interpreted de-identified breast lesion images. Diagnostic performance for biopsy recommendation and malignancy prediction was assessed across all cohorts. According to the pooled results, although the difference between the fused nomogram and the BI-RADS version was not statistically significant, the fused version consistently outperformed all models in biopsy recommendation and malignancy prediction (AUCs of 0.901 and 0.853, respectively) compared to BI-RADS nomogram (AUCs of 0.898 and 0.834), morphometric nomogram (AUCs of 0.825 and 0.708), radiologist1 (AUCs of 0.820 and 0.729), radiologist2 (AUCs of 0.605 and 0.719), radiologist3 (AUCs of 0.728 and 0.699), ChatGPT-o3 (AUCs of 0.729 and 0.689), and o4-mini-high (AUCs of 0.713 and 0.695). The proposed BI-RADS-morphometric nomogram outperforms standalone nomogram models, LLMs, and radiologists in guiding biopsy decisions and predicting malignancy. The proposed novel fused nomogram has the potential to reduce unnecessary biopsies and enhance personalized decision-making in breast imaging.