Page 7 of 19190 results

Development of a deep learning model for measuring sagittal parameters on cervical spine X-ray.

Wang S, Li K, Zhang S, Zhang D, Hao Y, Zhou Y, Wang C, Zhao H, Ma Y, Zhao D, Chen J, Li X, Wang H, Li Z, Shi J, Wang X

PubMed · Jun 5, 2025
To develop a deep learning model to automatically measure curvature-related sagittal parameters on cervical spine X-ray images. This retrospective study collected 700 lateral cervical spine X-ray images from three hospitals: 500 for training, 100 for internal testing, and 100 for external testing. Six measured parameters and 34 landmarks were measured and labeled by two doctors and averaged as the gold standard. A convolutional neural network (CNN) model was built by training on 500 images and testing on 200 images. Statistical analysis was used to evaluate labeling differences and model performance. The percentages of landmark distance differences within 4 mm were 96.90% (Dr. A vs. Dr. B), 98.47% (Dr. A vs. model), and 97.31% (Dr. B vs. model); within 3 mm, they were 94.88% (Dr. A vs. Dr. B), 96.43% (Dr. A vs. model), and 94.16% (Dr. B vs. model). The mean landmark labeling difference of the model was 1.17 ± 1.14 mm. The mean absolute error (MAE) of the model for the Borden method, cervical curvature index (CCI), vertebral centroid measurement of cervical lordosis (CCL), and the C<sub>0</sub>-C<sub>7</sub>, C<sub>1</sub>-C<sub>7</sub>, and C<sub>2</sub>-C<sub>7</sub> Cobb angles in the test sets was 1.67 mm, 2.01%, 3.22°, 2.37°, 2.49°, and 2.81°, respectively; the symmetric mean absolute percentage error (SMAPE) was 20.06%, 21.68%, 20.02%, 6.68%, 5.28%, and 20.46%, respectively. The model's six cervical sagittal parameters also showed good agreement with the gold standard (intraclass correlation coefficient 0.983; p < 0.001). Our deep learning model recognized cervical spine landmarks and automatically measured cervical sagittal parameters with high accuracy, which can help radiologists improve their diagnostic efficiency.
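
The abstract reports agreement via MAE and SMAPE. A minimal sketch of how these two metrics are typically computed, using illustrative Cobb-angle values rather than the study's data:

```python
import numpy as np

def mae(pred, gold):
    """Mean absolute error between predicted and gold-standard values."""
    pred, gold = np.asarray(pred, float), np.asarray(gold, float)
    return np.mean(np.abs(pred - gold))

def smape(pred, gold):
    """Symmetric mean absolute percentage error, in percent."""
    pred, gold = np.asarray(pred, float), np.asarray(gold, float)
    return 100.0 * np.mean(2.0 * np.abs(pred - gold) / (np.abs(pred) + np.abs(gold)))

# Illustrative C2-C7 Cobb angles in degrees (hypothetical, not study data).
model_angles = [14.2, 20.1, 9.8, 31.0]
gold_angles = [15.0, 18.5, 11.2, 29.4]
print(round(mae(model_angles, gold_angles), 2), round(smape(model_angles, gold_angles), 2))
```

SMAPE is useful alongside MAE here because the six parameters live on different scales (mm, %, degrees); note that several SMAPE variants exist, and the paper does not state which it used.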

Role of Large Language Models for Suggesting Nerve Involvement in Upper Limbs MRI Reports with Muscle Denervation Signs.

Martín-Noguerol T, López-Úbeda P, Luna A, Gómez-Río M, Górriz JM

PubMed · Jun 5, 2025
Determining the involvement of specific peripheral nerves (PNs) in the upper limb associated with signs of muscle denervation can be challenging. This study aims to develop, compare, and validate various large language models (LLMs) to automatically identify and establish potential relationships between denervated muscles and their corresponding PNs. We collected 300 retrospective MRI reports in Spanish from upper limb examinations conducted between 2018 and 2024 that showed signs of muscle denervation. An expert radiologist manually annotated these reports based on the affected peripheral nerves (median, ulnar, radial, axillary, and suprascapular). BERT, DistilBERT, mBART, RoBERTa, and Medical-ELECTRA models were fine-tuned and evaluated on the reports. Additionally, an automatic voting system was implemented to consolidate predictions through majority voting. The voting system achieved the highest F1 scores for the median, ulnar, and radial nerves, with scores of 0.88, 1.00, and 0.90, respectively. Medical-ELECTRA also performed well, achieving F1 scores above 0.82 for the axillary and suprascapular nerves. In contrast, mBART demonstrated lower performance, particularly with an F1 score of 0.38 for the median nerve. Our voting system generally outperforms the individually tested LLMs in determining the specific PN likely associated with muscle denervation patterns detected in upper limb MRI reports. This system can thereby assist radiologists by suggesting the implicated PN when generating their radiology reports.
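
The paper's voting system consolidates per-model predictions by majority vote. A minimal sketch of hard majority voting for one nerve in one report; the model names and binary labels are illustrative, not the paper's exact setup:

```python
from collections import Counter

def majority_vote(predictions):
    """Return the label predicted by most voters (ties broken by first seen)."""
    return Counter(predictions).most_common(1)[0][0]

# Hypothetical per-model predictions for the median nerve in a single report:
# 1 = "involved", 0 = "not involved".
votes = {"BERT": 1, "DistilBERT": 1, "RoBERTa": 0, "Medical-ELECTRA": 1, "mBART": 0}
print(majority_vote(list(votes.values())))  # → 1
```

With an odd number of voters, ties cannot occur for binary labels, which is one reason five models is a convenient ensemble size.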

A Machine Learning Method to Determine Candidates for Total and Unicompartmental Knee Arthroplasty Based on a Voting Mechanism.

Zhang N, Zhang L, Xiao L, Li Z, Hao Z

PubMed · Jun 5, 2025
Knee osteoarthritis (KOA) is a prevalent condition. Accurate selection between total knee arthroplasty (TKA) and unicompartmental knee arthroplasty (UKA) is crucial for optimal treatment in patients who have end-stage KOA, particularly for improving clinical outcomes and reducing healthcare costs. This study proposes a machine learning model based on a voting mechanism to enhance the accuracy of surgical decision-making for KOA patients. Radiographic data were collected from a high-volume joint arthroplasty practice, focusing on anterior-posterior, lateral, and skyline X-ray views. The dataset included 277 TKA and 293 UKA cases, each labeled through intraoperative observations (indicating whether TKA or UKA was the appropriate choice). A five-fold cross-validation approach was used for training and validation. In the proposed method, three base models were first trained independently on single-view images, and a voting mechanism was implemented to aggregate model outputs. The performance of the proposed method was evaluated by using metrics such as accuracy and the area under the receiver operating characteristic curve (AUC). The proposed method achieved an accuracy of 94.2% and an AUC of 0.98, demonstrating superior performance compared to existing models. The voting mechanism enabled base models to effectively utilize the detailed features from all three X-ray views, leading to enhanced predictive accuracy and model interpretability. This study provides a high-accuracy method for surgical decision-making between TKA and UKA for KOA patients, requiring only standard X-rays and offering potential for clinical application in automated referrals and preoperative planning.
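
One common way to aggregate three single-view classifiers is soft voting: average the per-view class probabilities and take the argmax. The abstract does not specify whether the voting is hard or soft, so this is a sketch of the soft variant with hypothetical probabilities:

```python
import numpy as np

# Hypothetical softmax outputs (P(TKA), P(UKA)) from three single-view CNNs.
view_probs = {
    "anterior_posterior": np.array([0.62, 0.38]),
    "lateral": np.array([0.55, 0.45]),
    "skyline": np.array([0.70, 0.30]),
}

def soft_vote(probs_by_view):
    """Average class probabilities across views; return label and mean probs."""
    mean_probs = np.mean(list(probs_by_view.values()), axis=0)
    return ("TKA", "UKA")[int(np.argmax(mean_probs))], mean_probs

label, probs = soft_vote(view_probs)
print(label, np.round(probs, 3))
```

Soft voting preserves each view's confidence, so a strongly confident skyline view can outweigh two weakly opposed views, which hard voting cannot do.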

UltraBones100k: A reliable automated labeling method and large-scale dataset for ultrasound-based bone surface extraction.

Wu L, Cavalcanti NA, Seibold M, Loggia G, Reissner L, Hein J, Beeler S, Viehöfer A, Wirth S, Calvet L, Fürnstahl P

PubMed · Jun 4, 2025
Ultrasound-based bone surface segmentation is crucial in computer-assisted orthopedic surgery. However, ultrasound images have limitations, including a low signal-to-noise ratio, acoustic shadowing, and speckle noise, which make interpretation difficult. Existing deep learning models for bone segmentation rely primarily on costly manual labeling by experts, limiting dataset size and model generalizability. Additionally, the complexity of ultrasound physics and acoustic shadow makes the images difficult for humans to interpret, leading to incomplete labels in low-intensity and anechoic regions and limiting model performance. To advance the state-of-the-art in ultrasound bone segmentation and establish effective model benchmarks, larger and higher-quality datasets are needed. We propose a methodology for collecting ex-vivo ultrasound datasets with automatically generated bone labels, including anechoic regions. The proposed labels are derived by accurately superimposing tracked bone Computed Tomography (CT) models onto the tracked ultrasound images. These initial labels are refined to account for ultrasound physics. To clinically evaluate the proposed method, an expert physician from our university hospital specialized in orthopedic sonography assessed the quality of the generated bone labels. A neural network for bone segmentation was trained on the collected dataset and its predictions were compared to expert manual labels, evaluating accuracy, completeness, and F1-score. We collected UltraBones100k, the largest known dataset comprising 100k ex-vivo ultrasound images of human lower limbs with bone annotations, specifically targeting the fibula, tibia, and foot bones. A Wilcoxon signed-rank test with Bonferroni correction confirmed that the bone alignment after our optimization pipeline significantly improved the quality of bone labeling (p < 0.001). The model trained on UltraBones100k consistently outperformed manual labeling on all metrics, particularly in low-intensity regions (at a distance threshold of 0.5 mm: 320% improvement in completeness, 27.4% in accuracy, and 197% in F1-score). This work promises to facilitate research and clinical translation of ultrasound imaging in computer-assisted interventions, particularly for applications such as 2D bone segmentation, 3D bone surface reconstruction, and multi-modality bone registration.
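
The paired, non-parametric comparison described here (Wilcoxon signed-rank test with Bonferroni correction) can be sketched as follows; the per-image quality scores and the number of corrected tests are simulated assumptions, not the study's data:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# Hypothetical per-image label-quality scores before vs. after the
# optimization pipeline (simulated improvement).
before = rng.normal(0.70, 0.05, 100)
after = before + rng.normal(0.05, 0.02, 100)

# Paired, non-parametric test on the same images before/after.
stat, p = stats.wilcoxon(before, after)

n_tests = 3                      # illustrative: one test per metric compared
p_bonf = min(p * n_tests, 1.0)   # Bonferroni correction caps at 1.0
print(p_bonf < 0.001)
```

The Wilcoxon test is chosen over a paired t-test when per-image score differences cannot be assumed normally distributed; Bonferroni simply multiplies each p-value by the number of comparisons.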

Evaluating the Diagnostic Accuracy of ChatGPT-4.0 for Classifying Multimodal Musculoskeletal Masses: A Comparative Study with Human Raters.

Bosbach WA, Schoeni L, Beisbart C, Senge JF, Mitrakovic M, Anderson SE, Achangwa NR, Divjak E, Ivanac G, Grieser T, Weber MA, Maurer MH, Sanal HT, Daneshvar K

PubMed · Jun 3, 2025
Novel artificial intelligence tools have the potential to significantly enhance productivity in medicine while maintaining or even improving treatment quality. In this study, we aimed to evaluate the current capability of ChatGPT-4.0 to accurately interpret multimodal musculoskeletal tumor cases. We created 25 cases, each containing images from X-ray, computed tomography, magnetic resonance imaging, or scintigraphy. ChatGPT-4.0 was tasked with classifying each case using a six-option, two-choice question, where both a primary and a secondary diagnosis were allowed. For performance evaluation, human raters also assessed the same cases. When only the primary diagnosis was taken into account, the accuracy of human raters was greater than that of ChatGPT-4.0 by a factor of nearly 2 (87% vs. 44%). However, in a setting that also considered secondary diagnoses, the performance gap shrank substantially (accuracy: 94% vs. 71%). A power analysis relying on Cohen's w confirmed the adequacy of the sample size (n = 25). The tested artificial intelligence tool demonstrated lower performance than human raters. Considering factors such as speed, constant availability, and potential future improvements, it appears plausible that artificial intelligence tools could serve as valuable assistance systems for doctors in future clinical settings. · ChatGPT-4.0 classifies musculoskeletal cases using multimodal imaging inputs. · Human raters outperform AI in primary diagnosis accuracy by a factor of nearly two. · Including secondary diagnoses improves AI performance and narrows the gap. · AI demonstrates potential as an assistive tool in future radiological workflows. · Power analysis confirms robustness of study findings with the current sample size. · Bosbach WA, Schoeni L, Beisbart C et al. Evaluating the Diagnostic Accuracy of ChatGPT-4.0 for Classifying Multimodal Musculoskeletal Masses: A Comparative Study with Human Raters. Rofo 2025; DOI 10.1055/a-2594-7085.
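
Cohen's w, the effect size underlying the study's power analysis, is computed from observed vs. expected proportions. A minimal sketch with illustrative proportions (not the study's actual contingency data):

```python
import math

def cohens_w(observed, expected):
    """Cohen's w effect size from observed and expected proportions."""
    assert abs(sum(observed) - 1.0) < 1e-9 and abs(sum(expected) - 1.0) < 1e-9
    return math.sqrt(sum((o - e) ** 2 / e for o, e in zip(observed, expected)))

# Illustrative correct/incorrect proportions for the two rater groups.
observed = [0.87, 0.13]   # hypothetical: human-rater accuracy split
expected = [0.44, 0.56]   # hypothetical: AI accuracy split as the reference
print(round(cohens_w(observed, expected), 3))
```

By Cohen's conventions, w ≈ 0.1 is a small effect, 0.3 medium, and 0.5 large; a large w is what allows a sample of only 25 cases to yield adequate power.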

Artificial intelligence vs human expertise: A comparison of plantar fascia thickness measurements through MRI imaging.

Alyanak B, Çakar İ, Dede BT, Yıldızgören MT, Bağcıer F

PubMed · Jun 3, 2025
This study aims to evaluate the reliability of plantar fascia thickness measurements performed by ChatGPT-4 using magnetic resonance imaging (MRI) compared to those obtained by an experienced clinician. In this retrospective, single-center study, foot MRI images from the hospital archive were analysed. Plantar fascia thickness was measured under both blinded and non-blinded conditions by an experienced clinician and ChatGPT-4 at two separate time points. Measurement reliability was assessed using the intraclass correlation coefficient (ICC), mean absolute error (MAE), and mean relative error (MRE). A total of 41 participants (32 females, 9 males) were included. The average plantar fascia thickness measured by the clinician was 4.20 ± 0.80 mm and 4.25 ± 0.92 mm under blinded and non-blinded conditions, respectively, while ChatGPT-4's measurements were 6.47 ± 1.30 mm and 6.46 ± 1.31 mm, respectively. Human evaluators demonstrated excellent agreement (ICC = 0.983-0.989), whereas ChatGPT-4 exhibited low reliability (ICC = 0.391-0.432). In thin plantar fascia cases, ChatGPT-4's error rate was higher, with MAE = 2.70 mm and MRE = 77.17% under blinded conditions, and MAE = 2.91 mm and MRE = 87.02% under non-blinded conditions. ChatGPT-4 demonstrated lower reliability in plantar fascia thickness measurements compared to an experienced clinician, with increased error rates in thin structures. These findings highlight the limitations of AI-based models in medical image analysis and emphasize the need for further refinement before clinical implementation.
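
The ICC values reported here can be computed from an ANOVA decomposition. A minimal sketch of the single-rater, absolute-agreement form ICC(2,1) (the abstract does not state which ICC variant was used, and the thickness values below are hypothetical):

```python
import numpy as np

def icc2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.
    ratings: (n_subjects, k_raters) array."""
    Y = np.asarray(ratings, float)
    n, k = Y.shape
    grand = Y.mean()
    ss_rows = k * np.sum((Y.mean(axis=1) - grand) ** 2)   # between subjects
    ss_cols = n * np.sum((Y.mean(axis=0) - grand) ** 2)   # between raters
    ss_err = np.sum((Y - grand) ** 2) - ss_rows - ss_cols
    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Hypothetical paired measurements (clinician mm, ChatGPT-4 mm) per subject.
thickness = np.array([[4.1, 6.3], [4.5, 6.8], [3.9, 6.1], [4.3, 6.5]])
print(round(icc2_1(thickness), 3))
```

Because ICC(2,1) measures absolute agreement, a systematic offset like the roughly 2 mm overestimation reported for ChatGPT-4 drives the coefficient down even when rankings are consistent.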

Automated Classification of Cervical Spinal Stenosis using Deep Learning on CT Scans.

Zhang YL, Huang JW, Li KY, Li HL, Lin XX, Ye HB, Chen YH, Tian NF

PubMed · Jun 3, 2025
Retrospective study. To develop and validate a computed tomography-based deep learning (DL) model for diagnosing cervical spinal stenosis (CSS). Although magnetic resonance imaging (MRI) is widely used for diagnosing CSS, its inherent limitations, including prolonged scanning time, limited availability in resource-constrained settings, and contraindications for patients with metallic implants, make computed tomography (CT) a critical alternative in specific clinical scenarios. The development of CT-based DL models for CSS detection holds promise in transcending the diagnostic efficacy limitations of conventional CT imaging, thereby serving as an intelligent auxiliary tool to optimize healthcare resource allocation. Paired CT/MRI images were collected. CT images were divided into training, validation, and test sets in an 8:1:1 ratio. The two-stage model architecture comprised: (1) a Faster R-CNN-based detection model for localization, annotation, and extraction of regions of interest (ROI); and (2) a comparison of 16 convolutional neural network (CNN) models for stenosis classification to select the best-performing model. The evaluation metrics included accuracy, F1-score, and Cohen's κ coefficient, with comparisons made against diagnostic results from physicians with varying years of experience. In the multiclass classification task, four high-performing models (DL1-b0, DL2-121, DL3-101, and DL4-26d) achieved accuracies of 88.74%, 89.40%, 89.40%, and 88.08%, respectively. All models demonstrated >80% consistency with senior physicians and >70% consistency with junior physicians. In the binary classification task, the models achieved accuracies of 94.70%, 96.03%, 96.03%, and 94.70%, respectively. All four models demonstrated consistency rates slightly below 90% with junior physicians. However, when compared with senior physicians, three models (excluding DL4-26d) exhibited consistency rates exceeding 90%.
The DL model developed in this study demonstrated high accuracy in CT image analysis of CSS, with a diagnostic performance comparable to that of senior physicians.
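
Cohen's κ, one of the evaluation metrics above, corrects raw agreement for the agreement expected by chance. A self-contained sketch with hypothetical stenosis grades (0/1/2), not the study's data:

```python
from collections import Counter

def cohens_kappa(a, b):
    """Cohen's kappa agreement between two label sequences."""
    assert len(a) == len(b)
    n = len(a)
    po = sum(x == y for x, y in zip(a, b)) / n                  # observed agreement
    ca, cb = Counter(a), Counter(b)
    pe = sum(ca[l] * cb[l] for l in set(a) | set(b)) / n ** 2   # chance agreement
    return (po - pe) / (1 - pe)

# Hypothetical stenosis grades from a DL model and a senior physician.
model = [0, 1, 2, 2, 1, 0, 2, 1]
senior = [0, 1, 2, 1, 1, 0, 2, 1]
print(round(cohens_kappa(model, senior), 3))
```

κ is preferred over raw accuracy for this kind of model-vs-physician comparison precisely because class imbalance (most spines graded normal) inflates chance agreement.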

Prediction of hip fracture by high-resolution peripheral quantitative computed tomography in older Swedish women.

Jaiswal R, Pivodic A, Zoulakis M, Axelsson KF, Litsne H, Johansson L, Lorentzon M

PubMed · Jun 3, 2025
The socioeconomic burden of hip fractures, the most severe osteoporotic fracture outcome, is increasing, and current clinical risk assessment lacks sensitivity. This study aimed to develop a method for improved prediction of hip fracture by incorporating measurements of bone microstructure and composition derived from HR-pQCT. In a prospective cohort study of 3028 community-dwelling women aged 75-80, all participants answered questionnaires and underwent baseline examinations of anthropometrics and bone by DXA and HR-pQCT. Medical records, a regional X-ray archive, and registers were used to identify incident fractures and death. Prediction models for hip fracture, major osteoporotic fracture (MOF), and any fracture were developed using Cox proportional hazards regression and machine learning algorithms (neural network, random forest, ensemble, and Extreme Gradient Boosting). In the 2856 (94.3%) women with complete HR-pQCT data at 2 tibia sites (distal and ultra-distal), the median follow-up period was 8.0 yr, and 217 hip, 746 MOF, and 1008 any type of incident fracture occurred. In Cox regression models adjusted for age, BMI, clinical risk factors (CRFs), and FN BMD, the strongest predictors of hip fracture were tibia total volumetric BMD and cortical thickness. The performance of the Cox regression-based prediction models for hip fracture was significantly improved by HR-pQCT (time-dependent area under the receiver operating characteristic curve [AUC] at 5 yr of follow-up: 0.75 [0.64-0.85]), compared to a reference model including CRFs and FN BMD (AUC = 0.71 [0.58-0.81], p < .001) and a Fracture Risk Assessment Tool risk score model (AUC = 0.70 [0.60-0.80], p < .001). The Cox regression model for hip fracture had a significantly higher accuracy than the neural network-based model, the best-performing machine learning algorithm, at clinically relevant sensitivity levels.
We conclude that the addition of HR-pQCT parameters improves the prediction of hip fractures in a cohort of older Swedish women.
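
The time-dependent AUC at 5 years used above can be approximated, when censoring before the horizon is ignored, by a fixed-horizon ROC AUC: cases fractured before the horizon vs. controls known fracture-free past it. This is a simplified sketch with invented data; proper time-dependent AUC estimators additionally reweight for censoring:

```python
import numpy as np

def auc(scores_pos, scores_neg):
    """Rank-based AUC: probability a case outranks a control (ties count half)."""
    wins = sum((sp > sn) + 0.5 * (sp == sn) for sp in scores_pos for sn in scores_neg)
    return wins / (len(scores_pos) * len(scores_neg))

def fixed_horizon_auc(risk, time, event, horizon=5.0):
    """Naive AUC at a fixed horizon, excluding subjects censored before it."""
    risk, time, event = map(np.asarray, (risk, time, event))
    cases = risk[(time <= horizon) & (event == 1)]
    controls = risk[time > horizon]
    return auc(cases, controls)

# Illustrative data: risk score, follow-up time (yr), fracture indicator.
risk = [0.9, 0.8, 0.3, 0.2, 0.6, 0.1]
time = [2.0, 4.0, 8.0, 9.0, 3.0, 7.0]
event = [1, 1, 0, 0, 1, 0]
print(fixed_horizon_auc(risk, time, event))  # → 1.0 (perfect separation)
```

In this toy example every fractured subject has a higher risk score than every fracture-free one, so the AUC is 1.0; real cohorts, as in the study, land well below that.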

A first-of-its-kind two-body statistical shape model of the arthropathic shoulder: enhancing biomechanics and surgical planning.

Blackman J, Giles JW

PubMed · Jun 3, 2025
Statistical Shape Models are machine learning tools in computational orthopedics that enable the study of anatomical variability and the creation of synthetic models for pathogenetic analysis and surgical planning. Current models of the glenohumeral joint either describe individual bones or are limited to non-pathologic datasets, failing to capture coupled shape variation in arthropathic anatomy. We aimed to develop a novel combined scapula-proximal-humerus model applicable to clinical populations. Preoperative computed tomography scans from 45 Reverse Total Shoulder Arthroplasty patients were used to generate three-dimensional models of the scapula and proximal humerus. Correspondence point clouds were combined into a two-body shape model using Principal Component Analysis. Individual scapula-only and proximal-humerus-only shape models were also created for comparison. The models were validated using compactness, specificity, generalization ability, and leave-one-out cross-validation. The modes of variation for each model were also compared. The combined model was described using eigenvector decomposition into single body models. The models were further compared in their ability to predict the shape of one body when given the shape of its counterpart, and the generation of diverse realistic synthetic pairs de novo. The scapula and proximal-humerus models performed comparably to previous studies with median average leave-one-out cross-validation errors of 1.08 mm (IQR: 0.359 mm), and 0.521 mm (IQR: 0.111 mm); the combined model was similar with median error of 1.13 mm (IQR: 0.239 mm). The combined model described coupled variations between the shapes equalling 43.2% of their individual variabilities, including the relationship between glenoid and humeral head erosions. The combined model outperformed the individual models generatively with reduced missing shape prediction bias (> 10%) and uniformly diverse shape plausibility (uniformity p-value < .001 vs. .59). 
This study developed the first two-body scapulohumeral shape model that captures coupled variations in arthropathic shoulder anatomy and the first proximal-humeral statistical model constructed using a clinical dataset. While single-body models are effective for descriptive tasks, combined models excel in generating joint-level anatomy. This model can be used to augment computational analyses of synthetic populations investigating shoulder biomechanics and surgical planning.
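
The core of a two-body statistical shape model is PCA over concatenated correspondence points, so each mode captures coupled scapula-humerus variation. A minimal sketch with random stand-in data (the point counts and subject count are assumptions loosely matched to the abstract):

```python
import numpy as np

rng = np.random.default_rng(42)
# Hypothetical correspondence clouds: 45 subjects; scapula (200 points) and
# proximal humerus (150 points), each flattened to x,y,z and concatenated.
n_subjects, n_scap, n_hum = 45, 200 * 3, 150 * 3
shapes = rng.normal(size=(n_subjects, n_scap + n_hum))

mean_shape = shapes.mean(axis=0)
X = shapes - mean_shape
# PCA via SVD: rows of Vt are the coupled modes of variation.
U, S, Vt = np.linalg.svd(X, full_matrices=False)
explained = S**2 / np.sum(S**2)

# Generate one synthetic subject by perturbing the first three modes.
coeffs = rng.normal(size=3) * S[:3] / np.sqrt(n_subjects - 1)
synthetic = mean_shape + coeffs @ Vt[:3]
scapula_part, humerus_part = synthetic[:n_scap], synthetic[n_scap:]
print(scapula_part.shape, humerus_part.shape)
```

Because each mode spans both bones, sampling mode coefficients always yields a matched scapula-humerus pair; this is the mechanism behind the combined model's advantage in generating joint-level anatomy.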

Development and validation of machine learning models for distal instrumentation-related problems in patients with degenerative lumbar scoliosis based on preoperative CT and MRI.

Feng Z, Yang H, Li Z, Zhang X, Hai Y

PubMed · Jun 3, 2025
This investigation proposes a machine learning framework leveraging preoperative MRI and CT imaging data to predict postoperative distal instrumentation-related problems (DIP) in degenerative lumbar scoliosis patients undergoing long-segment fusion procedures. We retrospectively analyzed 136 patients, categorized by the development of DIP. Preoperative MRI and CT scans provided muscle function and bone density data, including the relative gross cross-sectional area and relative functional cross-sectional area of the multifidus, erector spinae, paraspinal extensor, and psoas major muscles; the gross muscle fat index and functional muscle fat index; and Hounsfield unit values of the lumbosacral region and the lower instrumented vertebra. Predictive factors for DIP were selected through stepwise LASSO regression. Both the selected factors and the full factor set were incorporated into six machine learning algorithms, namely k-nearest neighbors, decision tree, support vector machine, random forest, multilayer perceptron (MLP), and Naïve Bayes, with tenfold cross-validation. Among patients, 16.9% developed DIP, with the functional cross-sectional area of the multifidus and the Hounsfield unit value of the lumbosacral region as significant predictors. The MLP model exhibited superior performance when all predictive factors were input, with an average AUC of 0.98 and recall of 0.90. We compared various machine learning algorithms and constructed, trained, and validated predictive models based on muscle function and bone density-related variables obtained from preoperative CT and MRI, which could identify patients at high risk of DIP after long-segment spinal fusion surgery.
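
The tenfold cross-validation and recall reporting described above can be sketched without any ML library; the dataset here is random stand-in data with the abstract's 16.9% event rate, and the classifier is a trivial majority-class baseline standing in for the MLP:

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical dataset: 136 patients, 10 muscle/bone features, ~16.9% DIP rate.
X = rng.normal(size=(136, 10))
y = (rng.random(136) < 0.169).astype(int)

def kfold_indices(n, k=10, seed=0):
    """Shuffle indices and split into k roughly equal folds."""
    idx = np.random.default_rng(seed).permutation(n)
    return np.array_split(idx, k)

def recall(y_true, y_pred):
    tp = np.sum((y_true == 1) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    return tp / (tp + fn) if (tp + fn) else 0.0

recalls = []
for test_idx in kfold_indices(len(y)):
    train_idx = np.setdiff1d(np.arange(len(y)), test_idx)
    majority = int(y[train_idx].mean() >= 0.5)        # dummy "trained" model
    y_pred = np.full(len(test_idx), majority)
    recalls.append(recall(y[test_idx], y_pred))
print(round(float(np.mean(recalls)), 3))
```

The majority-class baseline scores recall 0 on this imbalanced outcome, which is exactly why the study reports recall alongside AUC: a model must actually find the 16.9% of DIP patients, not just predict "no DIP" everywhere.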