Page 35 of 99986 results

Coarse-to-Fine Joint Registration of MR and Ultrasound Images via Imaging Style Transfer

Junyi Wang, Xi Zhu, Yikun Guo, Zixi Wang, Haichuan Gao, Le Zhang, Fan Zhang

arXiv preprint · Aug 7, 2025
We developed a pipeline for registering pre-surgery Magnetic Resonance (MR) images and post-resection Ultrasound (US) images. Our approach leverages unpaired style transfer using a 3D CycleGAN to generate synthetic T1 images, thereby enhancing registration performance. Additionally, our registration process employs both affine and local deformable transformations for coarse-to-fine registration. The results demonstrate that our approach improves the consistency between MR and US image pairs in most cases.
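The coarse-to-fine idea (a cheap global search followed by a local refinement around its optimum) can be illustrated with a toy translation-only registration in NumPy. This is a didactic stand-in, not the paper's affine-plus-deformable pipeline; the Gaussian test image, search radii, and step sizes are invented for the example:

```python
import numpy as np

def mse(a, b):
    return float(np.mean((a - b) ** 2))

def shift(img, dy, dx):
    # Circular integer-pixel translation (a toy stand-in for resampling).
    return np.roll(np.roll(img, dy, axis=0), dx, axis=1)

def register_translation(fixed, moving, coarse_step=4, fine_radius=4, search=16):
    """Coarse-to-fine search for the translation aligning `moving` to `fixed`."""
    # Coarse stage: evaluate a sparse grid of candidate offsets.
    coarse = [(dy, dx) for dy in range(-search, search + 1, coarse_step)
                       for dx in range(-search, search + 1, coarse_step)]
    dy0, dx0 = min(coarse, key=lambda t: mse(fixed, shift(moving, *t)))
    # Fine stage: dense search in a small window around the coarse optimum.
    fine = [(dy, dx) for dy in range(dy0 - fine_radius, dy0 + fine_radius + 1)
                     for dx in range(dx0 - fine_radius, dx0 + fine_radius + 1)]
    return min(fine, key=lambda t: mse(fixed, shift(moving, *t)))

# Synthetic smooth "image": a Gaussian blob, displaced by a known offset.
y, x = np.mgrid[0:64, 0:64]
fixed = np.exp(-((y - 32.0) ** 2 + (x - 32.0) ** 2) / 200.0)
moving = shift(fixed, -5, 7)
dy, dx = register_translation(fixed, moving)  # recovers (5, -7)
```

The coarse grid keeps the global search affordable; the fine window then recovers the exact offset, mirroring the affine-then-deformable ordering of the pipeline above.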

Enhancing Domain Generalization in Medical Image Segmentation With Global and Local Prompts.

Zhao C, Li X

PubMed paper · Aug 7, 2025
Enhancing domain generalization (DG) is a crucial and compelling research pursuit within the field of medical image segmentation, owing to the inherent heterogeneity observed in medical images. The recent success of large-scale pre-trained vision models (PVMs), such as the Vision Transformer (ViT), inspires us to explore their application in this specific area. While a straightforward strategy involves fine-tuning the PVM using supervised signals from the source domains, this approach overlooks the domain shift issue and neglects the rich knowledge inherent in the instances themselves. To overcome these limitations, we introduce a novel framework enhanced by global and local prompts (GLPs). Specifically, to adapt the PVM to the medical DG scenario, we explicitly separate domain-shared and domain-specific knowledge in the form of GLPs. Furthermore, we develop an individualized domain adapter to intricately investigate the relationship between each target domain sample and the source domains. To harness the inherent knowledge within instances, we devise two innovative regularization terms from both the consistency and anatomy perspectives, encouraging the model to preserve instance discriminability and organ position invariance. Extensive experiments and in-depth discussions in both vanilla and semi-supervised DG scenarios drawn from five diverse medical datasets consistently demonstrate the superior segmentation performance achieved by GLPs. Our code and datasets are publicly available at https://github.com/xmed-lab/GLP.
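The prompt mechanism described above, with domain-shared and domain-specific knowledge held in separate learnable banks, can be sketched as prompt tokens prepended to a ViT patch sequence. The shapes, prompt counts, and the `build_input` helper are illustrative assumptions, not the GLP implementation:

```python
import numpy as np

rng = np.random.default_rng(0)
n_domains, n_prompts, n_patches, dim = 3, 2, 16, 8

# One prompt bank shared across all domains (domain-shared knowledge) and
# one bank per source domain (domain-specific knowledge).
global_prompts = rng.normal(size=(n_prompts, dim))
local_prompts = rng.normal(size=(n_domains, n_prompts, dim))

def build_input(patch_tokens, domain_id):
    """Prepend global and domain-specific prompt tokens to the patch sequence."""
    return np.concatenate(
        [global_prompts, local_prompts[domain_id], patch_tokens], axis=0
    )

patches = rng.normal(size=(n_patches, dim))
seq = build_input(patches, domain_id=1)  # shape: (2 + 2 + 16, 8)
```

During training, only the prompt banks (and a small adapter) would typically be updated while the PVM backbone stays frozen, which is what makes prompt tuning cheap relative to full fine-tuning.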

MLAgg-UNet: Advancing Medical Image Segmentation with Efficient Transformer and Mamba-Inspired Multi-Scale Sequence.

Jiang J, Lei S, Li H, Sun Y

PubMed paper · Aug 7, 2025
Transformers and state space sequence models (SSMs) have attracted interest in biomedical image segmentation for their ability to capture long-range dependencies. However, traditional visual state space (VSS) methods suffer from the incompatibility of image tokens with the autoregressive assumption. Although Transformer attention does not require this assumption, its high computational cost limits effective channel-wise information utilization. To overcome these limitations, we propose the Mamba-Like Aggregated UNet (MLAgg-UNet), which introduces a Mamba-inspired mechanism to enrich Transformer channel representations and exploit the implicit autoregressive characteristic within the U-shaped architecture. To establish dependencies among image tokens at a single scale, the Mamba-Like Aggregated Attention (MLAgg) block is designed to balance representational ability and computational efficiency. Inspired by the human foveal vision system, the Mamba macro-structure, and differential attention, the MLAgg block can slide its focus over each image token, suppress irrelevant tokens, and simultaneously strengthen channel-wise information utilization. Moreover, leveraging causal relationships between consecutive low-level and high-level features in the U-shaped architecture, we propose the Multi-Scale Mamba Module with Implicit Causality (MSMM) to optimize complementary information across scales. Embedded within skip connections, this module enhances semantic consistency between encoder and decoder features. Extensive experiments on four benchmark datasets (AbdomenMRI, ACDC, BTCV, and EndoVis17), covering MRI, CT, and endoscopy modalities, demonstrate that the proposed MLAgg-UNet consistently outperforms state-of-the-art CNN-based, Transformer-based, and Mamba-based methods. Specifically, it achieves improvements of at least 1.24%, 0.20%, 0.33%, and 0.39% in DSC scores on these datasets, respectively.
These results highlight the model's ability to effectively capture feature correlations and integrate complementary multi-scale information, providing a robust solution for medical image segmentation. The implementation is publicly available at https://github.com/aticejiang/MLAgg-UNet.
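The differential-attention idea the abstract cites (a second attention map subtracted from the first to cancel attention assigned to irrelevant tokens) can be sketched generically. This is not the MLAgg block itself; the weight shapes, the fixed λ, and the random inputs are assumptions for illustration:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def differential_attention(x, wq1, wk1, wq2, wk2, wv, lam=0.5):
    """Subtract a second attention map from the first to suppress attention noise."""
    d = wq1.shape[1]
    a1 = softmax((x @ wq1) @ (x @ wk1).T / np.sqrt(d))  # primary attention map
    a2 = softmax((x @ wq2) @ (x @ wk2).T / np.sqrt(d))  # "noise" attention map
    return (a1 - lam * a2) @ (x @ wv)

rng = np.random.default_rng(0)
n, d = 16, 8
x = rng.normal(size=(n, d))
ws = [rng.normal(size=(d, d)) * 0.1 for _ in range(5)]
out = differential_attention(x, *ws)  # shape: (16, 8)
```

In the published differential-attention formulation λ is a learned scalar; it is fixed here only to keep the sketch self-contained.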

A Multimodal Deep Learning Ensemble Framework for Building a Spine Surgery Triage System.

Siavashpour M, McCabe E, Nataraj A, Pareek N, Zaiane O, Gross D

PubMed paper · Aug 7, 2025
Spinal radiology reports and physician-completed questionnaires serve as crucial resources for medical decision-making for patients experiencing low back and neck pain. However, because this triage process is time-consuming, individuals with severe conditions may experience a deterioration in their health before receiving professional care. In this work, we propose an ensemble framework built on top of pre-trained BERT-based models that classifies patients by their need for surgery given multiple data modalities, including radiology reports and questionnaires. Our results demonstrate that our approach outperforms previous studies, effectively integrating information from multiple data modalities and serving as a valuable tool to assist physicians in decision-making.
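The fusion step of such a framework (combining per-modality classifier outputs into one triage score) can be sketched as a weighted average of probabilities. The modality names, probability values, and 0.5 decision threshold below are hypothetical; the paper's actual fusion rule may differ:

```python
def ensemble_surgery_probability(modality_probs, weights=None):
    """Combine per-modality surgery probabilities into one triage score."""
    if weights is None:
        weights = [1.0] * len(modality_probs)  # default: simple averaging
    total = sum(weights)
    return sum(p * w for p, w in zip(modality_probs, weights)) / total

# Hypothetical outputs of two fine-tuned BERT classification heads:
p_report = 0.82         # radiology-report model
p_questionnaire = 0.64  # questionnaire model
score = ensemble_surgery_probability([p_report, p_questionnaire])  # 0.73
needs_surgery_review = score >= 0.5
```

Unequal weights would let a stronger modality dominate; in practice the weights (or a stacked meta-classifier) would be tuned on a validation set.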

On the effectiveness of multimodal privileged knowledge distillation in two vision transformer based diagnostic applications

Simon Baur, Alexandra Benova, Emilio Dolgener Cantú, Jackie Ma

arXiv preprint · Aug 6, 2025
Deploying deep learning models in clinical practice often requires leveraging multiple data modalities, such as images, text, and structured data, to achieve robust and trustworthy decisions. However, not all modalities are always available at inference time. In this work, we propose multimodal privileged knowledge distillation (MMPKD), a training strategy that uses additional modalities available only during training to guide a unimodal vision model. Specifically, we used a text-based teacher model for chest radiographs (MIMIC-CXR) and a tabular metadata-based teacher model for mammography (CBIS-DDSM) to distill knowledge into a vision transformer student model. We show that MMPKD improves the zero-shot ability of the student's attention maps to localize regions of interest in input images, although, contrary to what prior research suggests, this effect does not generalize across domains.
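A common way to realize privileged knowledge distillation is to penalize the divergence between the teacher's and student's class distributions. The sketch below uses a temperature-softened KL term, which is a standard distillation objective but not necessarily the exact loss used in MMPKD:

```python
import numpy as np

def softened(logits, T):
    # Temperature-softened softmax (numerically stabilized).
    z = np.asarray(logits, dtype=float) / T
    e = np.exp(z - z.max())
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, T=2.0):
    """KL(teacher || student) between temperature-softened class distributions."""
    p = softened(teacher_logits, T)  # privileged-modality teacher
    q = softened(student_logits, T)  # vision-only student
    return float(np.sum(p * np.log(p / q)))
```

The loss is zero when the student matches the teacher exactly and positive otherwise; in training it would be added to the student's ordinary supervised loss, and the teacher is discarded at inference time, which is what makes the extra modality "privileged".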

The Effectiveness of Large Language Models in Providing Automated Feedback in Medical Imaging Education: A Protocol for a Systematic Review

Al-Mashhadani, M., Ajaz, F., Guraya, S. S., Ennab, F.

medRxiv preprint · Aug 6, 2025
Background: Large Language Models (LLMs) are a rapidly evolving generative artificial intelligence (AI) modality with promising developments in the field of medical education. LLMs can provide automated feedback to medical trainees (i.e., medical students, residents, fellows, etc.) and may serve a role in medical imaging education.

Aim: This systematic review aims to comprehensively explore the current applications and educational outcomes of LLMs in providing automated feedback on medical imaging reports.

Methods: This study employs a comprehensive systematic review strategy, involving an extensive search of the literature (PubMed, Scopus, Embase, and Cochrane), data extraction, and synthesis of the data.

Conclusion: This systematic review will highlight best practices for LLM use in automated feedback on medical imaging reports and guide further development of these models.

Foundation models for radiology-the position of the AI for Health Imaging (AI4HI) network.

de Almeida JG, Alberich LC, Tsakou G, Marias K, Tsiknakis M, Lekadir K, Marti-Bonmati L, Papanikolaou N

PubMed paper · Aug 6, 2025
Foundation models are large models trained on big data which can be used for downstream tasks. In radiology, these models can potentially address several gaps in fairness and generalization, as they can be trained on massive datasets without labelled data and adapted to tasks for which only a small amount of annotated data is available. This relieves one of the limiting bottlenecks in clinical model construction, data annotation, as these models can be trained through a variety of techniques that require little more than radiological images, with or without their corresponding radiological reports. However, foundation models may be insufficient, as they are affected, albeit to a smaller extent than traditional supervised learning approaches, by the same issues that lead to underperforming models, such as a lack of transparency/explainability, and biases. To address these issues, we advocate that the development of foundation models should not only be pursued but also accompanied by the development of a decentralized clinical validation and continuous training framework. This does not guarantee the resolution of the problems associated with foundation models, but it enables developers, clinicians and patients to know when, how and why models should be updated, creating a clinical AI ecosystem that is better capable of serving all stakeholders. CRITICAL RELEVANCE STATEMENT: Foundation models may mitigate issues like bias and poor generalization in radiology AI, but challenges persist. We propose a decentralized, cross-institutional framework for continuous validation and training to enhance model reliability, safety, and clinical utility. KEY POINTS: Foundation models trained on large datasets reduce annotation burdens and improve fairness and generalization in radiology. Despite improvements, they still face challenges like limited transparency, explainability, and residual biases.
A decentralized, cross-institutional framework for clinical validation and continuous training can strengthen reliability and inclusivity in clinical AI.

The development of a multimodal prediction model based on CT and MRI for the prognosis of pancreatic cancer.

Dou Z, Lin J, Lu C, Ma X, Zhang R, Zhu J, Qin S, Xu C, Li J

PubMed paper · Aug 6, 2025
We aimed to develop and validate a hybrid radiomics model to predict overall survival in pancreatic cancer patients and to identify risk factors that affect patient prognosis. We conducted a retrospective analysis of 272 pancreatic cancer patients diagnosed at the First Affiliated Hospital of Soochow University from January 2013 to December 2023, divided into a training set and a test set at a ratio of 7:3. Pre-treatment contrast-enhanced computed tomography (CT) images, magnetic resonance imaging (MRI) images, and clinical features were collected. Dimensionality reduction was performed on the radiomics features using principal component analysis (PCA), and important features with non-zero coefficients were selected using the least absolute shrinkage and selection operator (LASSO) with 10-fold cross-validation. In the training set, we built prediction models using both random survival forests (RSF) and traditional Cox regression analysis. These models included a radiomics model based on contrast-enhanced CT, a radiomics model based on MRI, a clinical model, three bimodal models combining two feature types, and a multimodal model combining radiomics features with clinical features. Model performance in the test set was evaluated along two dimensions: discrimination and calibration. In addition, risk stratification was performed in the test set based on predicted risk scores to evaluate the model's prognostic utility. The RSF-based hybrid model performed best, with a C-index of 0.807 and a Brier score of 0.101, outperforming the Cox hybrid model (C-index of 0.726, Brier score of 0.145) and all unimodal and bimodal models. The SurvSHAP(t) plot highlighted CA125 as the most important variable. In the test set, patients were stratified into high- and low-risk groups based on the predicted risk scores, and Kaplan-Meier analysis demonstrated a significant survival difference between the two groups (p < 0.0001).
A multimodal RSF model combining clinical tabular data with radiomics features from contrast-enhanced CT and MRI showed strong performance in predicting prognosis for pancreatic cancer patients.
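The C-index reported above measures how often the model ranks patient risk in the order of observed survival: over all comparable patient pairs, the fraction where the patient who died earlier was assigned the higher risk score. A minimal pure-Python version of Harrell's concordance index, on invented data:

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C-index: fraction of comparable pairs ordered correctly by risk."""
    concordant, comparable = 0.0, 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable if subject i had the event before subject j's time.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5   # ties get half credit
    return concordant / comparable

# Invented example: survival times, event indicators (1 = death), risk scores.
times = [5, 10, 12, 20]
events = [1, 1, 0, 1]
scores = [0.9, 0.2, 0.4, 0.6]
cindex = concordance_index(times, events, scores)  # 3 of 5 comparable pairs: 0.6
```

A C-index of 0.5 is chance-level ranking and 1.0 is perfect ranking, which is why the hybrid model's 0.807 versus the Cox model's 0.726 is a meaningful gap.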

Beyond the type 1 pattern: comprehensive risk stratification in Brugada syndrome.

Kan KY, Van Wyk A, Paterson T, Ninan N, Lysyganicz P, Tyagi I, Bhasi Lizi R, Boukrid F, Alfaifi M, Mishra A, Katraj SVK, Pooranachandran V

PubMed paper · Aug 6, 2025
Brugada Syndrome (BrS) is an inherited cardiac ion channelopathy associated with an elevated risk of sudden cardiac death, particularly due to ventricular arrhythmias in structurally normal hearts. Affecting approximately 1 in 2,000 individuals, BrS is most prevalent among middle-aged males of Asian descent. Although diagnosis is based on the presence of a Type 1 electrocardiographic (ECG) pattern, either spontaneous or induced, accurately stratifying risk in asymptomatic and borderline patients remains a major clinical challenge. This review explores current and emerging approaches to BrS risk stratification, focusing on electrocardiographic, electrophysiological, imaging, and computational markers. Non-invasive ECG indicators such as the β-angle, fragmented QRS, S wave in lead I, early repolarisation, aVR sign, and transmural dispersion of repolarisation have demonstrated predictive value for arrhythmic events. Adjunctive tools like signal-averaged ECG, Holter monitoring, and exercise stress testing enhance diagnostic yield by capturing dynamic electrophysiological changes. In parallel, imaging modalities, particularly speckle-tracking echocardiography and cardiac magnetic resonance, have revealed subclinical structural abnormalities in the right ventricular outflow tract and atria, challenging the paradigm of BrS as a purely electrical disorder. Invasive electrophysiological studies and substrate mapping have further clarified the anatomical basis of arrhythmogenesis, while risk scoring systems (e.g., Sieira, BRUGADA-RISK, PAT) and machine learning models offer new avenues for personalised risk assessment. Together, these advances underscore the importance of an integrated, multimodal approach to BrS risk stratification. Optimising these strategies is essential to guide implantable cardioverter-defibrillator decisions and improve outcomes in patients vulnerable to life-threatening arrhythmias.

Development and validation of the multidimensional machine learning model for preoperative risk stratification in papillary thyroid carcinoma: a multicenter, retrospective cohort study.

Feng JW, Zhang L, Yang YX, Qin RJ, Liu SQ, Qin AC, Jiang Y

PubMed paper · Aug 6, 2025
This study aims to develop and validate a multi-modal machine learning model for preoperative risk stratification in papillary thyroid carcinoma (PTC), addressing limitations of current systems that rely on postoperative pathological features. We analyzed 974 PTC patients from three medical centers in China using a multi-modal approach integrating: (1) clinical indicators, (2) immunological indices, (3) ultrasound radiomics features, and (4) CT radiomics features. Our methodology employed gradient boosting machine for feature selection and random forest for classification, with model interpretability provided through SHapley Additive exPlanations (SHAP) analysis. The model was validated on internal (n = 225) and two external cohorts (n = 51, n = 174). The final 15-feature model achieved AUCs of 0.91, 0.84, and 0.77 across validation cohorts, improving to 0.96, 0.95, and 0.89 after cohort-specific refitting. SHAP analysis revealed CT texture features, ultrasound morphological features, and immune-inflammatory markers as key predictors, with consistent patterns across validation sites despite center-specific variations. Subgroup analysis showed superior performance in tumors > 1 cm and patients without extrathyroidal extension. Our multi-modal machine learning approach provides accurate preoperative risk stratification for PTC with robust cross-center applicability. This computational framework for integrating heterogeneous imaging and clinical data demonstrates the potential of multi-modal joint learning in healthcare imaging to transform clinical decision-making by enabling personalized treatment planning.
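The AUCs reported above can be computed without any ROC machinery via the Mann-Whitney formulation: the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The labels and scores below are invented for illustration:

```python
def roc_auc(labels, scores):
    """AUC as the probability a positive outranks a negative (Mann-Whitney)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    # Count positive-negative pairs ordered correctly; ties get half credit.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

labels = [1, 1, 1, 0, 0]
scores = [0.9, 0.8, 0.3, 0.6, 0.2]
auc = roc_auc(labels, scores)  # 5 of 6 pairs ordered correctly: 0.8333...
```

This pairwise view also makes the cross-cohort drop (0.91 to 0.77) easy to interpret: on the external cohorts, the model orders a smaller fraction of high-risk/low-risk pairs correctly.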