Sort by:
Page 7 of 1261257 results

DWI-based Biologically Interpretable Radiomic Nomogram for Predicting 1- year Biochemical Recurrence after Radical Prostatectomy: A Deep Learning, Multicenter Study.

Niu X, Li Y, Wang L, Xu G

pubmed logopapersJun 10 2025
It is not rare to experience a biochemical recurrence (BCR) following radical prostatectomy (RP) for prostate cancer (PCa). It has been reported that early detection and management of BCR following surgery could improve survival in PCa. This study aimed to develop a nomogram integrating deep learning-based radiomic features and clinical parameters to predict 1-year BCR after RP and to examine the associations between radiomic scores and the tumor microenvironment (TME). In this retrospective multicenter study, two independent cohorts of patients (n = 349) who underwent RP after multiparametric magnetic resonance imaging (mpMRI) between January 2015 and January 2022 were included in the analysis. Single-cell RNA sequencing data from four prospectively enrolled participants were used to investigate the radiomic score-related TME. The 3D U-Net was trained and optimized for prostate cancer segmentation using diffusion-weighted imaging, and radiomic features of the target lesion were extracted. Predictive nomograms were developed via multivariate Cox proportional hazard regression analysis. The nomograms were assessed for discrimination, calibration, and clinical usefulness. In the development cohort, the clinical-radiomic nomogram had an AUC of 0.892 (95% confidence interval: 0.783--0.939), which was considerably greater than those of the radiomic signature and clinical model. The Hosmer-Lemeshow test demonstrated that the clinical-radiomic model performed well in both the development (P = 0.461) and validation (P = 0.722) cohorts. Decision curve analysis revealed that the clinical-radiomic nomogram displayed better clinical predictive usefulness than the clinical or radiomic signature alone in both cohorts. Radiomic scores were associated with a significant difference in the TME pattern. Our study demonstrated the feasibility of a DWI-based clinical-radiomic nomogram combined with deep learning for the prediction of 1-year BCR. The findings revealed that the radiomic score was associated with a distinctive tumor microenvironment.

RadGPT: A system based on a large language model that generates sets of patient-centered materials to explain radiology report information.

Herwald SE, Shah P, Johnston A, Olsen C, Delbrouck JB, Langlotz CP

pubmed logopapersJun 10 2025
The Cures Act Final Rule requires that patients have real-time access to their radiology reports, which contain technical language. Our objective to was to use a novel system called RadGPT, which integrates concept extraction and a large language model (LLM), to help patients understand their radiology reports. RadGPT generated 150 concept explanations and 390 question-and-answer pairs from 30 radiology report impressions from between 2012 and 2020. The extracted concepts were used to create concept-based explanations, as well as concept-based question-and-answer pairs where questions were generated using either a fixed template or an LLM. Additionally, report-based question-and-answer pairs were generated directly from the impression using an LLM without concept extraction. One board-certified radiologist and 4 radiology residents rated the material quality using a standardized rubric. Concept-based LLM-generated questions were significantly higher quality than concept-based template-generated questions (p < 0.001). Excluding those template-based question-and-answer pairs from further analysis, nearly all (> 95%) of RadGPT-generated materials were rated highly, with at least 50% receiving the highest possible ranking from all 5 raters. No answers or explanations were rated as likely to affect the safety or effectiveness of patient care. Report-level LLM-based questions and answers were rated particularly highly, with 92% of report-level LLM-based questions and 61% of the corresponding report-level answers receiving the highest rating from all raters. The educational tool RadGPT generated high-quality explanations and question-and-answer pairs that were personalized for each radiology report, unlikely to produce harmful explanations and likely to enhance patient understanding of radiology information.

Ultrasound Radiomics and Dual-Mode Ultrasonic Elastography Based Machine Learning Model for the Classification of Benign and Malignant Thyroid Nodules.

Yan J, Zhou X, Zheng Q, Wang K, Gao Y, Liu F, Pan L

pubmed logopapersJun 9 2025
The present study aims to construct a random forest (RF) model based on ultrasound radiomics and elastography, offering a new approach for the differentiation of thyroid nodules (TNs). We retrospectively analyzed 152 TNs from 127 patients and developed four machine learning models. The examination was performed using the Resona 9Pro equipped with a 15-4 MHz linear array probe. The region of interest (ROI) was delineated with 3D Slicer. Using the RF algorithm, four models were developed based on sound touch elastography (STE) parameters, strain elastography (SE) parameters, and the selected radiomic features: the STE model, SE model, radiomics model, and the combined model. Decision Curve Analysis (DCA) is employed to assess the clinical benefit of each model. The DeLong test is used to determine whether the area under the curves (AUC) values of different models are statistically significant. A total of 1396 radiomic features were extracted using the Pyradiomics package. After screening, a total of 7 radiomic features were ultimately included in the construction of the model. In STE, SE, radiomics model, and combined model, the AUCs are 0.699 (95% CI: 0.570-0.828), 0.812 (95% CI: 0.683-0.941), 0.851 (95% CI: 0.739-0.964) and 0.911 (95% CI: 0.806-1.000), respectively. In these models, the combined model and the radiomics model exhibited outstanding performance. The combined model, integrating elastography and radiomics, demonstrates superior predictive accuracy compared to single models, offering a promising approach for the diagnosis of TNs.

Improving Patient Communication by Simplifying AI-Generated Dental Radiology Reports With ChatGPT: Comparative Study.

Stephan D, Bertsch AS, Schumacher S, Puladi B, Burwinkel M, Al-Nawas B, Kämmerer PW, Thiem DG

pubmed logopapersJun 9 2025
Medical reports, particularly radiology findings, are often written for professional communication, making them difficult for patients to understand. This communication barrier can reduce patient engagement and lead to misinterpretation. Artificial intelligence (AI), especially large language models such as ChatGPT, offers new opportunities for simplifying medical documentation to improve patient comprehension. We aimed to evaluate whether AI-generated radiology reports simplified by ChatGPT improve patient understanding, readability, and communication quality compared to original AI-generated reports. In total, 3 versions of radiology reports were created using ChatGPT: an original AI-generated version (text 1), a patient-friendly, simplified version (text 2), and a further simplified and accessibility-optimized version (text 3). A total of 300 patients (n=100, 33.3% per group), excluding patients with medical education, were randomly assigned to review one text version and complete a standardized questionnaire. Readability was assessed using the Flesch Reading Ease (FRE) score and LIX indices. Both simplified texts showed significantly higher readability scores (text 1: FRE score=51.1; text 2: FRE score=55.0; and text 3: FRE score=56.4; P<.001) and lower LIX scores, indicating enhanced clarity. Text 3 had the shortest sentences, had the fewest long words, and scored best on all patient-rated dimensions. Questionnaire results revealed significantly higher ratings for texts 2 and 3 across clarity (P<.001), tone (P<.001), structure, and patient engagement. For example, patients rated the ability to understand findings without help highest for text 3 (mean 1.5, SD 0.7) and lowest for text 1 (mean 3.1, SD 1.4). Both simplified texts significantly improved patients' ability to prepare for clinical conversations and promoted shared decision-making. AI-generated simplification of radiology reports significantly enhances patient comprehension and engagement. These findings highlight the potential of ChatGPT as a tool to improve patient-centered communication. While promising, future research should focus on ensuring clinical accuracy and exploring applications across diverse patient populations to support equitable and effective integration of AI in health care communication.

Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis.

Su H, Sun Y, Li R, Zhang A, Yang Y, Xiao F, Duan Z, Chen J, Hu Q, Yang T, Xu B, Zhang Q, Zhao J, Li Y, Li H

pubmed logopapersJun 9 2025
The integration of large language models (LLMs) into medical diagnostics has garnered substantial attention due to their potential to enhance diagnostic accuracy, streamline clinical workflows, and address health care disparities. However, the rapid evolution of LLM research necessitates a comprehensive synthesis of their applications, challenges, and future directions. This scoping review aimed to provide an overview of the current state of research regarding the use of LLMs in medical diagnostics. The study sought to answer four primary subquestions, as follows: (1) Which LLMs are commonly used? (2) How are LLMs assessed in diagnosis? (3) What is the current performance of LLMs in diagnosing diseases? (4) Which medical domains are investigating the application of LLMs? This scoping review was conducted according to the Joanna Briggs Institute Manual for Evidence Synthesis and adheres to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews). Relevant literature was searched from the Web of Science, PubMed, Embase, IEEE Xplore, and ACM Digital Library databases from 2022 to 2025. Articles were screened and selected based on predefined inclusion and exclusion criteria. Bibliometric analysis was performed using VOSviewer to identify major research clusters and trends. Data extraction included details on LLM types, application domains, and performance metrics. The field is rapidly expanding, with a surge in publications after 2023. GPT-4 and its variants dominated research (70/95, 74% of studies), followed by GPT-3.5 (34/95, 36%). Key applications included disease classification (text or image-based), medical question answering, and diagnostic content generation. LLMs demonstrated high accuracy in specialties like radiology, psychiatry, and neurology but exhibited biases in race, gender, and cost predictions. Ethical concerns, including privacy risks and model hallucination, alongside regulatory fragmentation, were critical barriers to clinical adoption. LLMs hold transformative potential for medical diagnostics but require rigorous validation, bias mitigation, and multimodal integration to address real-world complexities. Future research should prioritize explainable artificial intelligence frameworks, specialty-specific optimization, and international regulatory harmonization to ensure equitable and safe clinical deployment.

Automated detection of spinal bone marrow oedema in axial spondyloarthritis: training and validation using two large phase 3 trial datasets.

Jamaludin A, Windsor R, Ather S, Kadir T, Zisserman A, Braun J, Gensler LS, Østergaard M, Poddubnyy D, Coroller T, Porter B, Ligozio G, Readie A, Machado PM

pubmed logopapersJun 9 2025
To evaluate the performance of machine learning (ML) models for the automated scoring of spinal MRI bone marrow oedema (BMO) in patients with axial spondyloarthritis (axSpA) and compare them with expert scoring. ML algorithms using SpineNet software were trained and validated on 3483 spinal MRIs from 686 axSpA patients across two clinical trial datasets. The scoring pipeline involved (i) detection and labelling of vertebral bodies and (ii) classification of vertebral units for the presence or absence of BMO. Two models were tested: Model 1, without manual segmentation, and Model 2, incorporating an intermediate manual segmentation step. Model outputs were compared with those of human experts using kappa statistics, balanced accuracy, sensitivity, specificity, and AUC. Both models performed comparably to expert readers, regarding presence vs absence of BMO. Model 1 outperformed Model 2, with an AUC of 0.94 (vs 0.88), accuracy of 75.8% (vs 70.5%), and kappa of 0.50 (vs 0.31), using absolute reader consensus scoring as the external reference; this performance was similar to the expert inter-reader accuracy of 76.8% and kappa of 0.47, in a radiographic axSpA dataset. In a non-radiographic axSpA dataset, Model 1 achieved an AUC of 0.97 (vs 0.91 for Model 2), accuracy of 74.6% (vs 70%), and kappa of 0.52 (vs 0.27), comparable to the expert inter-reader accuracy of 74.2% and kappa of 0.46. ML software shows potential for automated MRI BMO assessment in axSpA, offering benefits such as improved consistency, reduced labour costs, and minimised inter- and intra-reader variability. Clinicaltrials.gov, MEASURE 1 study (NCT01358175); PREVENT study (NCT02696031).

Radiomics-based machine learning atherosclerotic carotid artery disease in ultrasound: systematic review with meta-analysis of RQS.

Vacca S, Scicolone R, Pisu F, Cau R, Yang Q, Annoni A, Pontone G, Costa F, Paraskevas KI, Nicolaides A, Suri JS, Saba L

pubmed logopapersJun 9 2025
Stroke, a leading global cause of mortality and neurological disability, is often associated with atherosclerotic carotid artery disease. Distinguishing between symptomatic and asymptomatic carotid artery disease is crucial for appropriate treatment decisions. Radiomics, a quantitative image analysis technique, and machine learning (ML) have emerged as promising tools in Ultrasound (US) imaging, potentially providing a helpful tool in the screening of such lesions. Pubmed, Web of Science and Scopus databases were searched for relevant studies published from January 2005 to May 2023. The Radiomics Quality Score (RQS) was used to assess methodological quality of studies included in the review. The Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) assessed the risk of bias. Sensitivity, specificity, and logarithmic diagnostic odds ratio (logDOR) meta-analyses have been conducted, alongside an influence analysis. RQS assessed methodological quality, revealing an overall low score and consistent findings with other radiology domains. QUADAS-2 indicated an overall low risk, except for two studies with high bias. The meta-analysis demonstrated that radiomics-based ML models for predicting culprit plaques on US had a satisfactory performance, with a sensitivity of 0.84 and specificity of 0.82. The logDOR analysis confirmed the positive results, yielding a pooled logDOR of 3.54. The summary ROC curve provided an AUC of 0.887. Radiomics combined with ML provide high sensitivity and low false positive rate for carotid plaque vulnerability assessment on US. However, current evidence is not definitive, given the low overall study quality and high inter-study heterogeneity. High quality, prospective studies are needed to confirm the potential of these promising techniques.

Multi-task and multi-scale attention network for lymph node metastasis prediction in esophageal cancer.

Yi Y, Wang J, Li Z, Wang L, Ding X, Zhou Q, Huang Y, Li B

pubmed logopapersJun 9 2025
The accurate diagnosis of lymph node metastasis in esophageal squamous cell carcinoma is crucial in the treatment workflow, and the process is often time-consuming for clinicians. Recent deep learning models predicting whether lymph nodes are affected by cancer in esophageal cancer cases suffer from challenging node delineation and hence gain poor diagnosis accuracy. This paper proposes an innovative multi-task and multi-scale attention network (M <math xmlns="http://www.w3.org/1998/Math/MathML"><mmultiscripts><mrow></mrow> <mrow></mrow> <mn>2</mn></mmultiscripts> </math> ANet) to predict lymph node metastasis precisely. The network softly expands the regions of the node mask and subsequently utilizes the expanded mask to aggregate image features, thereby amplifying the node contexts. It additionally proposes a two-branch training strategy that compels the model to simultaneously predict metastasis probability and node masks, fostering a more comprehensive learning process. The node metastasis prediction performance has been evaluated on a self-collected dataset with 177 patients. Our model finally achieves a competitive accuracy of 83.7% on the test set comprising 577 nodes. With the adaptability to intricate patterns and ability to handle data variations, M <math xmlns="http://www.w3.org/1998/Math/MathML"><mmultiscripts><mrow></mrow> <mrow></mrow> <mn>2</mn></mmultiscripts> </math> ANet emerges as a promising tool for robust and comprehensive lymph node metastasis prediction in medical image analysis.

Brain tau PET-based identification and characterization of subpopulations in patients with Alzheimer's disease using deep learning-derived saliency maps.

Li Y, Wang X, Ge Q, Graeber MB, Yan S, Li J, Li S, Gu W, Hu S, Benzinger TLS, Lu J, Zhou Y

pubmed logopapersJun 9 2025
Alzheimer's disease (AD) is a heterogeneous neurodegenerative disorder in which tau neurofibrillary tangles are a pathological hallmark closely associated with cognitive dysfunction and neurodegeneration. In this study, we used brain tau data to investigate AD heterogeneity by identifying and characterizing the subpopulations among patients. We included 615 cognitively normal and 159 AD brain <sup>18</sup>F-flortaucipr PET scans, along with T1-weighted MRI from the Alzheimer Disease Neuroimaging Initiative database. A three dimensional-convolutional neural network model was employed for AD detection using standardized uptake value ratio (SUVR) images. The model-derived saliency maps were generated and employed as informative image features for clustering AD participants. Among the identified subpopulations, statistical analysis of demographics, neuropsychological measures, and SUVR were compared. Correlations between neuropsychological measures and regional SUVRs were assessed. A generalized linear model was utilized to investigate the sex and APOE ε4 interaction effect on regional SUVRs. Two distinct subpopulations of AD patients were revealed, denoted as S<sub>Hi</sub> and S<sub>Lo</sub>. Compared to the S<sub>Lo</sub> group, the S<sub>Hi</sub> group exhibited a significantly higher global tau burden in the brain, but both groups showed similar cognition distribution levels. In the S<sub>Hi</sub> group, the associations between the neuropsychological measurements and regional tau deposition were weakened. Moreover, a significant interaction effect of sex and APOE ε4 on tau deposition was observed in the S<sub>Lo</sub> group, but no such effect was found in the S<sub>Hi</sub> group. Our results suggest that tau tangles, as shown by SUVR, continue to accumulate even when cognitive function plateaus in AD patients, highlighting the advantages of PET in later disease stages. The differing relationships between cognition and tau deposition, and between gender, APOE4, and tau deposition, provide potential for subtype-specific treatments. Targeting gender-specific and genetic factors influencing tau deposition, as well as interventions aimed at tau's impact on cognition, may be effective.

Curriculum check, 2025-equipping radiology residents for AI challenges of tomorrow.

Venugopal VK, Kumar A, Tan MO, Szarf G

pubmed logopapersJun 9 2025
The exponential rise in the artificial intelligence (AI) tools for medical imaging is profoundly impacting the practice of radiology. With over 1000 FDA-cleared AI algorithms now approved for clinical use-many of them designed for radiologic tasks-the responsibility lies with training institutions to ensure that radiology residents are equipped not only to use AI systems, but to critically evaluate, monitor, respond to their output in a safe, ethical manner. This review proposes a comprehensive framework to integrate AI into radiology residency curricula, targeting both essential competencies required of all residents, optional advanced skills for those interested in research or AI development. Core educational strategies include structured didactic instruction, hands-on lab exposure to commercial AI tools, case-based discussions, simulation-based clinical pathways, teaching residents how to interpret model cards, regulatory documentation. Clinical examples such as stroke triage, Urinary tract calculi detection, AI-CAD in mammography, false-positive detection are used to anchor theory in practice. The article also addresses critical domains of AI governance: model transparency, ethical dilemmas, algorithmic bias, the role of residents in human-in-the-loop oversight systems. It outlines mentorship, faculty development strategies to build institutional readiness, proposes a roadmap to future-proof radiology education. This includes exposure to foundation models, vision-language systems, multi-agent workflows, global best practices in post-deployment AI monitoring. This pragmatic framework aims to serve as a guide for residency programs adapting to the next era of radiology practice.
Page 7 of 1261257 results
Show
per page
Get Started

Upload your X-ray image and get interpretation.

Upload now →

Disclaimer: X-ray Interpreter's AI-generated results are for informational purposes only and not a substitute for professional medical advice. Always consult a healthcare professional for medical diagnosis and treatment.