Sort by:
Page 23 of 1431427 results

Preoperative prediction model for benign and malignant gallbladder polyps on the basis of machine-learning algorithms.

Zeng J, Hu W, Wang Y, Jiang Y, Peng J, Li J, Liu X, Zhang X, Tan B, Zhao D, Li K, Zhang S, Cao J, Qu C

pubmed logopapersJun 10 2025
This study aimed to differentiate between benign and malignant gallbladder polyps preoperatively by developing a prediction model integrating preoperative transabdominal ultrasound and clinical features using machine-learning algorithms. A retrospective analysis was conducted on clinical and ultrasound data from 1,050 patients at 2 centers who underwent cholecystectomy for gallbladder polyps. Six machine-learning algorithms were used to develop preoperative models for predicting benign and malignant gallbladder polyps. Internal and external test cohorts evaluated model performance. The Shapley Additive Explanations algorithm was used to understand feature importance. The main study cohort included 660 patients with benign polyps and 285 patients with malignant polyps, randomly divided into a 3:1 stratified training and internal test cohorts. The external test cohorts consisted of 73 benign and 32 malignant polyps. In the training cohort, the Shapley Additive Explanations algorithm, on the basis of variables selected by Least Absolute Shrinkage and Selection Operator regression and multivariate logistic regression, further identified 6 key predictive factors: polyp size, age, fibrinogen, carbohydrate antigen 19-9, presence of stones, and cholinesterase. Using these factors, 6 predictive models were developed. The random forest model outperformed others, with an area under the curve of 0.963, 0.940, and 0.958 in the training, internal, and external test cohorts, respectively. Compared with previous studies, the random forest model demonstrated excellent clinical utility and predictive performance. In addition, the Shapley Additive Explanations algorithm was used to visualize feature importance, and an online calculation platform was developed. The random forest model, combining preoperative ultrasound and clinical features, accurately predicts benign and malignant gallbladder polyps, offering valuable guidance for clinical decision-making.

U<sub>2</sub>-Attention-Net: a deep learning automatic delineation model for parotid glands in head and neck cancer organs at risk on radiotherapy localization computed tomography images.

Wen X, Wang Y, Zhang D, Xiu Y, Sun L, Zhao B, Liu T, Zhang X, Fan J, Xu J, An T, Li W, Yang Y, Xing D

pubmed logopapersJun 10 2025
This study aimed to develop a novel deep learning model, U<sub>2</sub>-Attention-Net (U<sub>2</sub>A-Net), for precise segmentation of parotid glands on radiotherapy localization CT images. CT images from 79 patients with head and neck cancer were selected, on which the label maps were delineated by relevant practitioners to construct a dataset. The dataset was divided into the training set (n = 60), validation set (n = 6), and test set (n = 13), with the training set augmented. U<sub>2</sub>A-Net, divided into U<sub>2</sub>A-Net V<sub>1</sub> (sSE) and U<sub>2</sub>A-Net V<sub>2</sub> (cSE) based on different attention mechanisms, was evaluated for parotid gland segmentation based on the DL loss function with U-Net, Attention U-Net, DeepLabV3+, and TransUNet as comparision models. Segmentation was also performed using GDL and GD-BCEL loss functions. Model performance was evaluated using DSC, JSC, PPV, SE, HD, RVD, and VOE metrics. The quantitative results revealed that U<sub>2</sub>A-Net based on DL outperformed the comparative models. While U<sub>2</sub>A-Net V<sub>1</sub> had the highest PPV, U<sub>2</sub>A-Net V<sub>2</sub> demonstrated the best quantitative results in other metrics. Qualitative results showed that U<sub>2</sub>A-Net's segmentation closely matched expert delineations, reducing oversegmentation and undersegmentation, with U<sub>2</sub>A-Net V<sub>2</sub> being more effective. In comparing loss functions, U<sub>2</sub>A-Net V<sub>1</sub> using GD-BCEL and U<sub>2</sub>A-Net V<sub>2</sub> using DL performed best. The U<sub>2</sub>A-Net model significantly improved parotid gland segmentation on radiotherapy localization CT images. The cSE attention mechanism showed advantages with DL, while sSE performed better with GD-BCEL.

Empirical evaluation of artificial intelligence distillation techniques for ascertaining cancer outcomes from electronic health records.

Riaz IB, Naqvi SAA, Ashraf N, Harris GJ, Kehl KL

pubmed logopapersJun 10 2025
Phenotypic information for cancer research is embedded in unstructured electronic health records (EHR), requiring effort to extract. Deep learning models can automate this but face scalability issues due to privacy concerns. We evaluated techniques for applying a teacher-student framework to extract longitudinal clinical outcomes from EHRs. We focused on the challenging task of ascertaining two cancer outcomes-overall response and progression according to Response Evaluation Criteria in Solid Tumors (RECIST)-from free-text radiology reports. Teacher models with hierarchical Transformer architecture were trained on data from Dana-Farber Cancer Institute (DFCI). These models labeled public datasets (MIMIC-IV, Wiki-text) and GPT-4-generated synthetic data. "Student" models were then trained to mimic the teachers' predictions. DFCI "teacher" models achieved high performance, and student models trained on MIMIC-IV data showed comparable results, demonstrating effective knowledge transfer. However, student models trained on Wiki-text and synthetic data performed worse, emphasizing the need for in-domain public datasets for model distillation.

Uncertainty estimation for trust attribution to speed-of-sound reconstruction with variational networks.

Laguna S, Zhang L, Bezek CD, Farkas M, Schweizer D, Kubik-Huch RA, Goksel O

pubmed logopapersJun 10 2025
Speed-of-sound (SoS) is a biomechanical characteristic of tissue, and its imaging can provide a promising biomarker for diagnosis. Reconstructing SoS images from ultrasound acquisitions can be cast as a limited-angle computed-tomography problem, with variational networks being a promising model-based deep learning solution. Some acquired data frames may, however, get corrupted by noise due to, e.g., motion, lack of contact, and acoustic shadows, which in turn negatively affects the resulting SoS reconstructions. We propose to use the uncertainty in SoS reconstructions to attribute trust to each individual acquired frame. Given multiple acquisitions, we then use an uncertainty-based automatic selection among these retrospectively, to improve diagnostic decisions. We investigate uncertainty estimation based on Monte Carlo Dropout and Bayesian Variational Inference. We assess our automatic frame selection method for differential diagnosis of breast cancer, distinguishing between benign fibroadenoma and malignant carcinoma. We evaluate 21 lesions classified as BI-RADS 4, which represents suspicious cases for probable malignancy. The most trustworthy frame among four acquisitions of each lesion was identified using uncertainty-based criteria. Selecting a frame informed by uncertainty achieved an area under curve of 76% and 80% for Monte Carlo Dropout and Bayesian Variational Inference, respectively, superior to any uncertainty-uninformed baselines with the best one achieving 64%. A novel use of uncertainty estimation is proposed for selecting one of multiple data acquisitions for further processing and decision making.

Automated Diffusion Analysis for Non-Invasive Prediction of IDH Genotype in WHO Grade 2-3 Gliomas.

Wu J, Thust SC, Wastling SJ, Abdalla G, Benenati M, Maynard JA, Brandner S, Carrasco FP, Barkhof F

pubmed logopapersJun 10 2025
Glioma molecular characterization is essential for risk stratification and treatment planning. Noninvasive imaging biomarkers such as apparent diffusion coefficient (ADC) values have shown potential for predicting glioma genotypes. However, manual segmentation of gliomas is time-consuming and operator-dependent. To address this limitation, we aimed to establish a single-sequence-derived automatic ADC extraction pipeline using T2-weighted imaging to support glioma isocitrate dehydrogenase (IDH) genotyping. Glioma volumes from a hospital data set (University College London Hospitals; n=247) were manually segmented on T2-weighted MRI scans using ITK-Snap Toolbox and co-registered to ADC maps sequences using the FMRIB Linear Image Registration Tool in FSL, followed by ADC histogram extraction (Python). Separately, a nnUNet deep learning algorithm was trained to segment glioma volumes using T2w only from BraTS 2021 data (n=500, 80% training, 5% validation and 15% test split). nnUnet was then applied to the University College London Hospitals (UCLH) data for segmentation and ADC read-outs. Univariable logistic regression was used to test the performance manual and nnUNet derived ADC metrics for IDH status prediction. Statistical equivalence was tested (paired two-sided t-test). nnUnet segmentation achieved a median Dice of 0.85 on BraTS data, and 0.83 on UCLH data. For the best performing metric (rADCmean) the area under the receiver operating characteristic curve (AUC) for differentiating IDH-mutant from IDHwildtype gliomas was 0.82 (95% CI: 0.78-0.88), compared to the manual segmentation AUC 0.84 (95% CI: 0.77-0.89). For all ADC metrics, manually and nnUNet extracted ADC were statistically equivalent (p<0.01). nnUNet identified one area of glioma infiltration missed by human observers. In 0.8% gliomas, nnUnet missed glioma components. In 6% of cases, over-segmentation of brain remote from the tumor occurred (e.g. temporal poles). The T2w trained nnUnet algorithm achieved ADC readouts for IDH genotyping with a performance statistically equivalent to human observers. This approach could support rapid ADC based identification of glioblastoma at an early disease stage, even with limited input data. AUC = Area under the receiver operating characteristic curve, BraTS = The brain tumor segmentation challenge held by MICCAI, Dice = Dice Similarity Coefficient, IDH = Isocitrate dehydrogenase, mGBM = Molecular glioblastoma, ADCmin = Fifth ADC histogram percentile, ADCmean = Mean ADC value, ADCNAWM = ADC in the contralateral centrum semiovale normal white matter, rADCmin = Normalized ADCmin, VOI rADCmean = Normalized ADCmean.

Challenges and Advances in Classifying Brain Tumors: An Overview of Machine, Deep Learning, and Hybrid Approaches with Future Perspectives in Medical Imaging.

Alshomrani F

pubmed logopapersJun 10 2025
Accurate brain tumor classification is essential in neuro-oncology, as it directly informs treatment strategies and influences patient outcomes. This review comprehensively explores machine learning (ML) and deep learning (DL) models that enhance the accuracy and efficiency of brain tumor classification using medical imaging data, particularly Magnetic Resonance Imaging (MRI). As a noninvasive imaging technique, MRI plays a central role in detecting, segmenting, and characterizing brain tumors by providing detailed anatomical views that help distinguish various tumor types, including gliomas, meningiomas, and metastatic brain lesions. The review presents a detailed analysis of diverse ML approaches, from classical algorithms such as Support Vector Machines (SVM) and Decision Trees to advanced DL models, including Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), and hybrid architectures that combine multiple techniques for improved performance. Through comparative analysis of recent studies across various datasets, the review evaluates these methods using metrics such as accuracy, sensitivity, specificity, and AUC-ROC, offering insights into their effectiveness and limitations. Significant challenges in the field are examined, including the scarcity of annotated datasets, computational complexity requirements, model interpretability issues, and barriers to clinical integration. The review proposes future directions to address these challenges, highlighting the potential of multi-modal imaging that combines MRI with other imaging modalities, explainable AI frameworks for enhanced model transparency, and privacy-preserving techniques for securing sensitive patient data. This comprehensive analysis demonstrates the transformative potential of ML and DL in advancing brain tumor diagnosis while emphasizing the necessity for continued research and innovation to overcome current limitations and ensure successful clinical implementation for improved patient care.

DWI-based Biologically Interpretable Radiomic Nomogram for Predicting 1- year Biochemical Recurrence after Radical Prostatectomy: A Deep Learning, Multicenter Study.

Niu X, Li Y, Wang L, Xu G

pubmed logopapersJun 10 2025
It is not rare to experience a biochemical recurrence (BCR) following radical prostatectomy (RP) for prostate cancer (PCa). It has been reported that early detection and management of BCR following surgery could improve survival in PCa. This study aimed to develop a nomogram integrating deep learning-based radiomic features and clinical parameters to predict 1-year BCR after RP and to examine the associations between radiomic scores and the tumor microenvironment (TME). In this retrospective multicenter study, two independent cohorts of patients (n = 349) who underwent RP after multiparametric magnetic resonance imaging (mpMRI) between January 2015 and January 2022 were included in the analysis. Single-cell RNA sequencing data from four prospectively enrolled participants were used to investigate the radiomic score-related TME. The 3D U-Net was trained and optimized for prostate cancer segmentation using diffusion-weighted imaging, and radiomic features of the target lesion were extracted. Predictive nomograms were developed via multivariate Cox proportional hazard regression analysis. The nomograms were assessed for discrimination, calibration, and clinical usefulness. In the development cohort, the clinical-radiomic nomogram had an AUC of 0.892 (95% confidence interval: 0.783--0.939), which was considerably greater than those of the radiomic signature and clinical model. The Hosmer-Lemeshow test demonstrated that the clinical-radiomic model performed well in both the development (P = 0.461) and validation (P = 0.722) cohorts. Decision curve analysis revealed that the clinical-radiomic nomogram displayed better clinical predictive usefulness than the clinical or radiomic signature alone in both cohorts. Radiomic scores were associated with a significant difference in the TME pattern. Our study demonstrated the feasibility of a DWI-based clinical-radiomic nomogram combined with deep learning for the prediction of 1-year BCR. The findings revealed that the radiomic score was associated with a distinctive tumor microenvironment.

RadGPT: A system based on a large language model that generates sets of patient-centered materials to explain radiology report information.

Herwald SE, Shah P, Johnston A, Olsen C, Delbrouck JB, Langlotz CP

pubmed logopapersJun 10 2025
The Cures Act Final Rule requires that patients have real-time access to their radiology reports, which contain technical language. Our objective to was to use a novel system called RadGPT, which integrates concept extraction and a large language model (LLM), to help patients understand their radiology reports. RadGPT generated 150 concept explanations and 390 question-and-answer pairs from 30 radiology report impressions from between 2012 and 2020. The extracted concepts were used to create concept-based explanations, as well as concept-based question-and-answer pairs where questions were generated using either a fixed template or an LLM. Additionally, report-based question-and-answer pairs were generated directly from the impression using an LLM without concept extraction. One board-certified radiologist and 4 radiology residents rated the material quality using a standardized rubric. Concept-based LLM-generated questions were significantly higher quality than concept-based template-generated questions (p < 0.001). Excluding those template-based question-and-answer pairs from further analysis, nearly all (> 95%) of RadGPT-generated materials were rated highly, with at least 50% receiving the highest possible ranking from all 5 raters. No answers or explanations were rated as likely to affect the safety or effectiveness of patient care. Report-level LLM-based questions and answers were rated particularly highly, with 92% of report-level LLM-based questions and 61% of the corresponding report-level answers receiving the highest rating from all raters. The educational tool RadGPT generated high-quality explanations and question-and-answer pairs that were personalized for each radiology report, unlikely to produce harmful explanations and likely to enhance patient understanding of radiology information.

Ultrasound Radiomics and Dual-Mode Ultrasonic Elastography Based Machine Learning Model for the Classification of Benign and Malignant Thyroid Nodules.

Yan J, Zhou X, Zheng Q, Wang K, Gao Y, Liu F, Pan L

pubmed logopapersJun 9 2025
The present study aims to construct a random forest (RF) model based on ultrasound radiomics and elastography, offering a new approach for the differentiation of thyroid nodules (TNs). We retrospectively analyzed 152 TNs from 127 patients and developed four machine learning models. The examination was performed using the Resona 9Pro equipped with a 15-4 MHz linear array probe. The region of interest (ROI) was delineated with 3D Slicer. Using the RF algorithm, four models were developed based on sound touch elastography (STE) parameters, strain elastography (SE) parameters, and the selected radiomic features: the STE model, SE model, radiomics model, and the combined model. Decision Curve Analysis (DCA) is employed to assess the clinical benefit of each model. The DeLong test is used to determine whether the area under the curves (AUC) values of different models are statistically significant. A total of 1396 radiomic features were extracted using the Pyradiomics package. After screening, a total of 7 radiomic features were ultimately included in the construction of the model. In STE, SE, radiomics model, and combined model, the AUCs are 0.699 (95% CI: 0.570-0.828), 0.812 (95% CI: 0.683-0.941), 0.851 (95% CI: 0.739-0.964) and 0.911 (95% CI: 0.806-1.000), respectively. In these models, the combined model and the radiomics model exhibited outstanding performance. The combined model, integrating elastography and radiomics, demonstrates superior predictive accuracy compared to single models, offering a promising approach for the diagnosis of TNs.

Improving Patient Communication by Simplifying AI-Generated Dental Radiology Reports With ChatGPT: Comparative Study.

Stephan D, Bertsch AS, Schumacher S, Puladi B, Burwinkel M, Al-Nawas B, Kämmerer PW, Thiem DG

pubmed logopapersJun 9 2025
Medical reports, particularly radiology findings, are often written for professional communication, making them difficult for patients to understand. This communication barrier can reduce patient engagement and lead to misinterpretation. Artificial intelligence (AI), especially large language models such as ChatGPT, offers new opportunities for simplifying medical documentation to improve patient comprehension. We aimed to evaluate whether AI-generated radiology reports simplified by ChatGPT improve patient understanding, readability, and communication quality compared to original AI-generated reports. In total, 3 versions of radiology reports were created using ChatGPT: an original AI-generated version (text 1), a patient-friendly, simplified version (text 2), and a further simplified and accessibility-optimized version (text 3). A total of 300 patients (n=100, 33.3% per group), excluding patients with medical education, were randomly assigned to review one text version and complete a standardized questionnaire. Readability was assessed using the Flesch Reading Ease (FRE) score and LIX indices. Both simplified texts showed significantly higher readability scores (text 1: FRE score=51.1; text 2: FRE score=55.0; and text 3: FRE score=56.4; P<.001) and lower LIX scores, indicating enhanced clarity. Text 3 had the shortest sentences, had the fewest long words, and scored best on all patient-rated dimensions. Questionnaire results revealed significantly higher ratings for texts 2 and 3 across clarity (P<.001), tone (P<.001), structure, and patient engagement. For example, patients rated the ability to understand findings without help highest for text 3 (mean 1.5, SD 0.7) and lowest for text 1 (mean 3.1, SD 1.4). Both simplified texts significantly improved patients' ability to prepare for clinical conversations and promoted shared decision-making. AI-generated simplification of radiology reports significantly enhances patient comprehension and engagement. These findings highlight the potential of ChatGPT as a tool to improve patient-centered communication. While promising, future research should focus on ensuring clinical accuracy and exploring applications across diverse patient populations to support equitable and effective integration of AI in health care communication.
Page 23 of 1431427 results
Show
per page

Ready to Sharpen Your Edge?

Join hundreds of your peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.