Back to all papers

Development of Machine Learning Models for Predicting Prostate Cancer in Biopsy Candidates Using Prostate-Specific Antigen, Magnetic Resonance Imaging, and Hematologic Parameters.

March 30, 2026pubmed logopapers

Authors

Özlü DN,Arıkan Y,Emir B,Ayten A,Sungur U,Ekşi M,Polat H,Karadağ S,Bitkin A

Affiliations (4)

  • Department of Urology, Bakırköy Dr. Sadi Konuk Training and Research Hospital, University of Health Sciences, Istanbul, Türkiye.
  • Department of Urology, Izmir Tepecik Training and Research Hospital, University of Health Sciences, Izmir, Türkiye.
  • Department of Biostatistics, Faculty of Medicine, İzmir Katip Celebi University, Izmir, Türkiye.
  • Department of Urology, Gaziosmanpasa Training and Research Hospital, University of Health Sciences, İstanbul, Türkiye.

Abstract

Prostate-specific antigen (PSA) alone is insufficient for the diagnosis of prostate cancer (PCa), particularly within the gray zone range of 4-10 ng/mL. Multiparametric magnetic resonance imaging (mpMRI), although widely used, has notable limitations in the diagnostic pathway. The aim of this study was to develop a biopsy prediction model by evaluating multiple machine learning (ML) algorithms incorporating PSA-related variables, mpMRI findings, and hematologic parameters. This study included patients who underwent either systematic biopsy or combined biopsy (systematic plus fusion) based on mpMRI findings and had pre-biopsy PSA levels ≤ 10 ng/mL between 2017 and 2024 at our center. Laboratory findings, mpMRI results, and prostate biopsy outcomes were recorded. Based on the pathological evaluation of biopsy specimens, the patients were divided into two groups: those without malignancy (Group 1) and those with malignancy (Group 2). To develop a useful model for prediction, five ML techniques were applied: logistic regression, random forest (RF), extra trees, extreme gradient boosting (XGBoost) classifier, and light gradient-boosting machine classifier. There were 1223 patients (84.5%) in Group 1 and 225 patients (15.5%) in Group 2. The model with the best accuracy was XGBoost, with a sensitivity of 94.74% and a specificity of 100% on the test data set. For the RF model, sensitivity and specificity on the test data set were determined to be 78.95% and 100%, respectively. The area under the curve (AUC) values for XGBoost and RF were 0.97 and 0.89, respectively. According to the permutation feature importance analysis, the three most influential variables were free/total PSA, Prostate Imaging-Reporting and Data System Score 4, and platelet-to-lymphocyte ratio. XGBoost and RF models demonstrated excellent performance, with high AUC values. To generalize the results, it is necessary to confirm the accuracy of ML models through external validation in different populations.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.