Page 17 of 133 (1,322 results)

Implicit Neural Representations of Intramyocardial Motion and Strain

Andrew Bell, Yan Kit Choi, Steffen E Petersen, Andrew King, Muhummad Sohaib Nazir, Alistair A Young

arXiv preprint · Sep 10 2025
Automatic quantification of intramyocardial motion and strain from tagging MRI remains an important but challenging task. We propose a method using implicit neural representations (INRs), conditioned on learned latent codes, to predict continuous left ventricular (LV) displacement without requiring inference-time optimisation. Evaluated on 452 UK Biobank test cases, our method achieved the best tracking accuracy (2.14 mm RMSE) and the lowest combined error in global circumferential (2.86%) and radial (6.42%) strain compared to three deep learning baselines. In addition, our method is ~380× faster than the most accurate baseline. These results highlight the suitability of INR-based models for accurate and scalable analysis of myocardial strain in large CMR datasets.
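
As a concrete illustration of the idea, the sketch below shows a latent-conditioned INR in PyTorch that maps spatio-temporal coordinates plus a per-case latent code to a displacement vector. The layer sizes, latent dimension, and absence of a positional encoding are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of a latent-conditioned implicit neural representation (INR)
# for LV displacement. Architecture details are assumptions for illustration.
import torch
import torch.nn as nn

class DisplacementINR(nn.Module):
    def __init__(self, coord_dim=3, latent_dim=64, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(coord_dim + latent_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 2),  # in-plane displacement (dx, dy)
        )

    def forward(self, coords, z):
        # coords: (N, 3) points (x, y, t); z: (latent_dim,) code for one case
        z = z.unsqueeze(0).expand(coords.shape[0], -1)
        return self.net(torch.cat([coords, z], dim=-1))

model = DisplacementINR()
coords = torch.rand(1024, 3)   # query points in normalised (x, y, t)
z = torch.randn(64)            # learned latent code for one test case
disp = model(coords, z)        # (1024, 2) continuous displacement field
```

Because the network is a continuous function of the coordinates, displacement (and hence strain) can be queried at arbitrary points in a single forward pass, which is what removes the need for inference-time optimisation.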

Artificial Intelligence in Breast Cancer Care: Transforming Preoperative Planning and Patient Education with 3D Reconstruction

Mustafa Khanbhai, Giulia Di Nardo, Jun Ma, Vivienne Freitas, Caterina Masino, Ali Dolatabadi, Zhaoxun "Lorenz" Liu, Wey Leong, Wagner H. Souza, Amin Madani

arXiv preprint · Sep 10 2025
Effective preoperative planning requires accurate algorithms for segmenting anatomical structures across diverse datasets, but traditional models struggle with generalization. This study presents a novel machine learning methodology to improve algorithm generalization for 3D anatomical reconstruction beyond breast cancer applications. We processed 120 retrospective breast MRIs (January 2018-June 2023) through three phases: anonymization and manual segmentation of T1-weighted and dynamic contrast-enhanced sequences; co-registration and segmentation of whole breast, fibroglandular tissue, and tumors; and 3D visualization using ITK-SNAP. A human-in-the-loop approach refined segmentations using U-Mamba, designed to generalize across imaging scenarios. The Dice similarity coefficient (DSC) assessed overlap between automated segmentation and ground truth. Clinical relevance was evaluated through clinician and patient interviews. U-Mamba showed strong performance with DSC values of 0.97 (±0.013) for whole organs, 0.96 (±0.024) for fibroglandular tissue, and 0.82 (±0.12) for tumors on T1-weighted images. The model generated accurate 3D reconstructions enabling visualization of complex anatomical features. Clinician interviews indicated improved planning, intraoperative navigation, and decision support. Integration of 3D visualization enhanced patient education, communication, and understanding. This human-in-the-loop machine learning approach successfully generalizes algorithms for 3D reconstruction and anatomical segmentation across patient datasets, offering enhanced visualization for clinicians, improved preoperative planning, and more effective patient education, facilitating shared decision-making and empowering informed patient choices across medical applications.
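
The reported overlap metric is the standard Dice similarity coefficient; a minimal sketch, assuming binary NumPy masks, is shown below (illustrative, not the study's code).

```python
# Dice similarity coefficient (DSC) between a predicted and a ground-truth mask.
import numpy as np

def dice(pred: np.ndarray, truth: np.ndarray, eps: float = 1e-7) -> float:
    pred, truth = pred.astype(bool), truth.astype(bool)
    intersection = np.logical_and(pred, truth).sum()
    return (2.0 * intersection + eps) / (pred.sum() + truth.sum() + eps)

pred = np.random.rand(64, 64, 32) > 0.5    # stand-in for a U-Mamba prediction
truth = np.random.rand(64, 64, 32) > 0.5   # stand-in for a manual segmentation
print(f"DSC = {dice(pred, truth):.3f}")    # 1.0 = perfect overlap
```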

Multimodal Contrastive Pretraining of CBCT and IOS for Enhanced Tooth Segmentation

Moo Hyun Son, Juyoung Bae, Zelin Qiu, Jiale Peng, Kai Xin Li, Yifan Lin, Hao Chen

arXiv preprint · Sep 9 2025
Digital dentistry represents a transformative shift in modern dental practice. The foundational step in this transformation is the accurate digital representation of the patient's dentition, which is obtained from segmented Cone-Beam Computed Tomography (CBCT) and Intraoral Scans (IOS). Despite the growing interest in digital dental technologies, existing segmentation methodologies frequently lack rigorous validation and demonstrate limited performance and clinical applicability. To the best of our knowledge, this is the first work to introduce a multimodal pretraining framework for tooth segmentation. We present ToothMCL, a Tooth Multimodal Contrastive Learning for pretraining that integrates volumetric (CBCT) and surface-based (IOS) modalities. By capturing modality-invariant representations through multimodal contrastive learning, our approach effectively models fine-grained anatomical features, enabling precise multi-class segmentation and accurate identification of Fédération Dentaire Internationale (FDI) tooth numbering. Along with the framework, we curated CBCT-IOS3.8K, the largest paired CBCT and IOS dataset to date, comprising 3,867 patients. We then evaluated ToothMCL on a comprehensive collection of independent datasets, representing the largest and most diverse evaluation to date. Our method achieves state-of-the-art performance in both internal and external testing, with an increase of 12% for CBCT segmentation and 8% for IOS segmentation in the Dice Similarity Coefficient (DSC). Furthermore, ToothMCL consistently surpasses existing approaches across tooth groups and demonstrates robust generalizability across varying imaging conditions and clinical scenarios.
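
The abstract does not give the exact loss, but a symmetric InfoNCE objective over paired CBCT/IOS embeddings is one standard way to realise multimodal contrastive pretraining; the sketch below, with placeholder embedding size and temperature, is in that spirit rather than ToothMCL's actual code.

```python
# Symmetric InfoNCE over paired CBCT/IOS embeddings of the same patients.
import torch
import torch.nn.functional as F

def multimodal_infonce(z_cbct, z_ios, temperature=0.07):
    # z_cbct, z_ios: (B, D) embeddings; row i of each comes from patient i
    z_cbct = F.normalize(z_cbct, dim=-1)
    z_ios = F.normalize(z_ios, dim=-1)
    logits = z_cbct @ z_ios.t() / temperature   # (B, B) similarity matrix
    targets = torch.arange(z_cbct.shape[0])     # matched pairs on the diagonal
    return 0.5 * (F.cross_entropy(logits, targets)
                  + F.cross_entropy(logits.t(), targets))

loss = multimodal_infonce(torch.randn(8, 256), torch.randn(8, 256))
```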

[⁹⁹ᵐTc]Tc-Sestamibi/[⁹⁹ᵐTc]NaTcO₄ Subtraction SPECT of Parathyroid Glands Using Analysis of Principal Components.

Maříková I, Balogová S, Zogala D, Ptáčník V, Raška I, Libánský P, Talbot JN, Šámal M, Trnka J

PubMed paper · Sep 9 2025
The aim of the study was to validate a new method for semiautomatic subtraction of [⁹⁹ᵐTc]Tc-sestamibi and [⁹⁹ᵐTc]NaTcO₄ SPECT 3-dimensional datasets using principal component analysis (PCA) against the results of parathyroid surgery and to compare its performance with an interactive method for visual comparison of images. We also sought to identify factors that affect the accuracy of lesion detection using the two methods. Methods: Scintigraphic data from [⁹⁹ᵐTc]Tc-sestamibi and [⁹⁹ᵐTc]NaTcO₄ SPECT were analyzed using semiautomatic subtraction of the 2 registered datasets based on PCA applied to the region of interest (ROI) including the thyroid and an interactive method for visual comparison of the 2 image datasets. The findings of both methods were compared with those of surgery. Agreement with surgery was assessed with respect to the lesion quadrant, affected side of the neck, and patient positivity regardless of location. Results: The results of parathyroid surgery and histology were available for 52 patients who underwent [⁹⁹ᵐTc]Tc-sestamibi/[⁹⁹ᵐTc]NaTcO₄ SPECT. Semiautomatic image subtraction identified the correct lesion quadrant in 46 patients (88%), the correct side of the neck in 51 patients (98%), and true pathologic lesions regardless of location in 51 patients (98%). Visual interactive analysis identified the correct lesion quadrant in 44 patients (85%), the correct side of the neck in 49 patients (94%), and true pathologic lesions regardless of location in 50 patients (96%). There was no significant difference between the results of the 2 methods (P > 0.05). The factors supporting lesion detection were accurate positioning of the patient on the camera table, which facilitated subsequent image registration of the neck, and, after excluding ectopic parathyroid glands, focusing detection on the thyroid ROI. Conclusion: The results of semiautomatic subtraction of [⁹⁹ᵐTc]Tc-sestamibi/[⁹⁹ᵐTc]NaTcO₄ SPECT using PCA showed good agreement with the findings of surgery as well as with the visual interactive method, comparable to the high diagnostic accuracy of [⁹⁹ᵐTc]Tc-sestamibi/[¹²³I]NaI subtraction scintigraphy and [¹⁸F]fluorocholine PET/CT reported in the literature. The main advantages of semiautomatic subtraction are minimal user interaction and automatic adjustment of the subtraction weight. Principal component images may serve as optimized input objects, potentially useful in machine-learning algorithms aimed at fully automated detection of hyperfunctioning parathyroid glands.
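
One plausible reading of the PCA subtraction step is sketched below: each ROI voxel contributes a two-channel sample (sestamibi, pertechnetate), the first principal component captures uptake common to both tracers, and the second captures residual sestamibi retention. Volume shapes, the ROI, and the component interpretation are assumptions, not the authors' code.

```python
# PCA-based subtraction of two registered SPECT volumes within a thyroid ROI.
import numpy as np
from sklearn.decomposition import PCA

mibi = np.random.rand(64, 64, 64)   # stand-in [99mTc]Tc-sestamibi volume (registered)
tco4 = np.random.rand(64, 64, 64)   # stand-in [99mTc]NaTcO4 volume (registered)
roi = np.zeros(mibi.shape, dtype=bool)
roi[20:44, 20:44, 20:44] = True     # stand-in for the thyroid ROI

X = np.stack([mibi[roi], tco4[roi]], axis=1)   # (n_voxels, 2) tracer pairs
scores = PCA(n_components=2).fit_transform(X)
# 1st component ~ uptake common to both tracers (thyroid);
# 2nd component ~ sestamibi-specific retention, highlighting parathyroid tissue.
subtraction = np.zeros(mibi.shape)
subtraction[roi] = scores[:, 1]
```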

Barlow-Swin: Toward a novel siamese-based segmentation architecture using Swin-Transformers

Morteza Kiani Haftlang, Mohammadhossein Malmir, Foroutan Parand, Umberto Michelucci, Safouane El Ghazouali

arXiv preprint · Sep 8 2025
Medical image segmentation is a critical task in clinical workflows, particularly for the detection and delineation of pathological regions. While convolutional architectures like U-Net have become standard for such tasks, their limited receptive field restricts global context modeling. Recent efforts integrating transformers have addressed this, but often result in deep, computationally expensive models unsuitable for real-time use. In this work, we present a novel end-to-end lightweight architecture designed specifically for real-time binary medical image segmentation. Our model combines a Swin Transformer-like encoder with a U-Net-like decoder, connected via skip pathways to preserve spatial detail while capturing contextual information. Unlike existing designs such as the Swin Transformer or U-Net, our architecture is significantly shallower while remaining competitive in efficiency. To improve the encoder's ability to learn meaningful features without relying on large amounts of labeled data, we first train it using Barlow Twins, a self-supervised learning method that helps the model focus on important patterns by reducing redundancy in the learned features. After this pretraining, we fine-tune the entire model for our specific task. Experiments on benchmark binary segmentation tasks demonstrate that our model achieves competitive accuracy with a substantially reduced parameter count and faster inference, positioning it as a practical alternative for deployment in real-time and resource-limited clinical environments. The code for our method is available at the GitHub repository: https://github.com/mkianih/Barlow-Swin.
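
Barlow Twins itself is a published objective: it drives the cross-correlation matrix of two augmented views' embeddings toward the identity. A compact sketch follows; the batch size, embedding width, and lambda are illustrative values.

```python
# Barlow Twins loss: decorrelate embedding dimensions across two views.
import torch

def barlow_twins_loss(z1, z2, lam=5e-3):
    # z1, z2: (B, D) embeddings of two augmented views of the same batch
    B, D = z1.shape
    z1 = (z1 - z1.mean(0)) / (z1.std(0) + 1e-6)   # standardise each dimension
    z2 = (z2 - z2.mean(0)) / (z2.std(0) + 1e-6)
    c = z1.t() @ z2 / B                            # (D, D) cross-correlation
    on_diag = (torch.diagonal(c) - 1).pow(2).sum()               # invariance term
    off_diag = (c - torch.diag(torch.diagonal(c))).pow(2).sum()  # redundancy reduction
    return on_diag + lam * off_diag

loss = barlow_twins_loss(torch.randn(32, 128), torch.randn(32, 128))
```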

Leveraging Information Divergence for Robust Semi-Supervised Fetal Ultrasound Image Segmentation

Fangyijie Wang, Guénolé Silvestre, Kathleen M. Curran

arXiv preprint · Sep 8 2025
Maternal-fetal Ultrasound is the primary modality for monitoring fetal development, yet automated segmentation remains challenging due to the scarcity of high-quality annotations. To address this limitation, we propose a semi-supervised learning framework that leverages information divergence for robust fetal ultrasound segmentation. Our method employs a lightweight convolutional network (1.47M parameters) and a Transformer-based network, trained jointly with labelled data through standard supervision and with unlabelled data via cross-supervision. To encourage consistent and confident predictions, we introduce an information divergence loss that combines per-pixel Kullback-Leibler divergence and Mutual Information Gap, effectively reducing prediction disagreement between the two models. In addition, we apply mixup on unlabelled samples to further enhance robustness. Experiments on two fetal ultrasound datasets demonstrate that our approach consistently outperforms seven state-of-the-art semi-supervised methods. When only 5% of training data is labelled, our framework improves the Dice score by 2.39%, reduces the 95% Hausdorff distance by 14.90, and decreases the Average Surface Distance by 4.18. These results highlight the effectiveness of leveraging information divergence for annotation-efficient and robust medical image segmentation. Our code is publicly available on GitHub.
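
Of the two divergence terms, the per-pixel Kullback-Leibler component is straightforward to write down; a minimal sketch is below. The Mutual Information Gap term and the exact weighting between the two are not specified in the abstract and are omitted here.

```python
# Per-pixel KL consistency between the CNN and Transformer predictions.
import torch
import torch.nn.functional as F

def kl_consistency(logits_cnn, logits_trans):
    # logits_*: (B, C, H, W) raw outputs of the two co-trained models
    log_p = F.log_softmax(logits_cnn, dim=1)   # log-probabilities of model 1
    q = F.softmax(logits_trans, dim=1)         # probabilities of model 2
    # KL(q || p), averaged over the batch; symmetrise if desired
    return F.kl_div(log_p, q, reduction="batchmean")

loss = kl_consistency(torch.randn(2, 2, 64, 64), torch.randn(2, 2, 64, 64))
```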

GCond: Gradient Conflict Resolution via Accumulation-based Stabilization for Large-Scale Multi-Task Learning

Evgeny Alves Limarenko, Anastasiia Alexandrovna Studenikina

arXiv preprint · Sep 8 2025
In multi-task learning (MTL), gradient conflict poses a significant challenge. Effective methods for addressing this problem, including PCGrad, CAGrad, and GradNorm, in their original implementations are computationally demanding, which significantly limits their application in modern large models and transformers. We propose Gradient Conductor (GCond), a method that builds upon PCGrad principles by combining them with gradient accumulation and an adaptive arbitration mechanism. We evaluated GCond on self-supervised learning tasks using MobileNetV3-Small and ConvNeXt architectures on the ImageNet 1K dataset and a combined head and neck CT scan dataset, comparing the proposed method against baseline linear combinations and state-of-the-art gradient conflict resolution methods. The stochastic mode of GCond achieved a two-fold computational speedup while maintaining optimization quality, and demonstrated superior performance across all evaluated metrics, achieving lower L1 and SSIM losses compared to other methods on both datasets. GCond exhibited high scalability, being successfully applied to both compact models (MobileNetV3-Small) and large architectures (ConvNeXt-tiny and ConvNeXt-Base). It also showed compatibility with modern optimizers such as AdamW and Lion/LARS. Therefore, GCond offers a scalable and efficient solution to the problem of gradient conflicts in multi-task learning.
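
The PCGrad principle that GCond builds on is simple to state: when two task gradients point against each other, project one onto the normal plane of the other. The sketch below shows just that projection; GCond's gradient accumulation and adaptive arbitration are omitted.

```python
# PCGrad-style projection: remove the conflicting component of one gradient.
import torch

def project_conflict(g_i: torch.Tensor, g_j: torch.Tensor) -> torch.Tensor:
    # g_i, g_j: flattened task gradients w.r.t. the shared parameters
    dot = torch.dot(g_i, g_j)
    if dot < 0:  # gradients conflict
        g_i = g_i - (dot / g_j.norm().pow(2)) * g_j
    return g_i

g1, g2 = torch.randn(1000), torch.randn(1000)
g1_adj = project_conflict(g1, g2)
assert torch.dot(g1_adj, g2) >= -1e-4   # conflict removed (up to float error)
```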

XBusNet: Text-Guided Breast Ultrasound Segmentation via Multimodal Vision-Language Learning

Raja Mallina, Bryar Shareef

arXiv preprint · Sep 8 2025
Background: Precise breast ultrasound (BUS) segmentation supports reliable measurement, quantitative analysis, and downstream classification, yet remains difficult for small or low-contrast lesions with fuzzy margins and speckle noise. Text prompts can add clinical context, but directly applying weakly localized text-image cues (e.g., CAM/CLIP-derived signals) tends to produce coarse, blob-like responses that smear boundaries unless additional mechanisms recover fine edges. Methods: We propose XBusNet, a novel dual-prompt, dual-branch multimodal model that combines image features with clinically grounded text. A global pathway based on a CLIP Vision Transformer encodes whole-image semantics conditioned on lesion size and location, while a local U-Net pathway emphasizes precise boundaries and is modulated by prompts that describe shape, margin, and Breast Imaging Reporting and Data System (BI-RADS) terms. Prompts are assembled automatically from structured metadata, requiring no manual clicks. We evaluate on the Breast Lesions USG (BLU) dataset using five-fold cross-validation. Primary metrics are Dice and Intersection over Union (IoU); we also conduct size-stratified analyses and ablations to assess the roles of the global and local paths and the text-driven modulation. Results: XBusNet achieves state-of-the-art performance on BLU, with mean Dice of 0.8765 and IoU of 0.8149, outperforming six strong baselines. Small lesions show the largest gains, with fewer missed regions and fewer spurious activations. Ablation studies show complementary contributions of global context, local boundary modeling, and prompt-based modulation. Conclusions: A dual-prompt, dual-branch multimodal design that merges global semantics with local precision yields accurate BUS segmentation masks and improves robustness for small, low-contrast lesions.
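
Since the prompts are assembled automatically from structured metadata, the assembly step can be as simple as string templating; the field names and templates below are hypothetical stand-ins, not XBusNet's actual schema.

```python
# Hypothetical assembly of global/local text prompts from structured metadata.
def build_prompts(meta: dict) -> tuple[str, str]:
    global_prompt = (
        f"A {meta['size']} breast lesion located in the "
        f"{meta['location']} of the ultrasound image."
    )
    local_prompt = (
        f"The lesion has {meta['shape']} shape, {meta['margin']} margins, "
        f"and BI-RADS category {meta['birads']}."
    )
    return global_prompt, local_prompt

g, l = build_prompts({"size": "small", "location": "upper outer quadrant",
                      "shape": "irregular", "margin": "indistinct", "birads": "4"})
```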

Two step approach for detecting and segmenting the second mesiobuccal canal of maxillary first molars on cone beam computed tomography (CBCT) images via artificial intelligence.

Mansour S, Anter E, Mohamed AK, Dahaba MM, Mousa A

PubMed paper · Sep 8 2025
The purpose of this study was to assess the accuracy of a customized deep learning model based on a CNN and U-Net for detecting and segmenting the second mesiobuccal canal (MB2) of maxillary first molar teeth on cone beam computed tomography (CBCT) scans. CBCT scans of 37 patients were imported into 3D Slicer software to crop and segment the canals of the mesiobuccal (MB) root of the maxillary first molar. The annotated data were divided into two groups: 80% for training and validation and 20% for testing. The data were used to train the AI model in 2 separate steps: a classification model based on a customized CNN and a segmentation model based on U-Net. A confusion matrix and receiver-operating characteristic (ROC) analysis were used in the statistical evaluation of the classification model, whereas the Dice coefficient (DCE) was used to express segmentation accuracy. The F1 score, testing accuracy, recall, and precision values were 0.93, 0.87, 1.0, and 0.87, respectively, for the cropped images of the MB root of maxillary first molar teeth in the testing group. The testing loss was 0.4, and the area under the curve (AUC) value was 0.57. The segmentation accuracy results were satisfactory: the training DCE was 0.85 and the testing DCE was 0.79. MB2 in the maxillary first molar can be precisely detected and segmented via the developed AI algorithm on CBCT images. Current Controlled Trial Number NCT05340140. April 22, 2022.
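
For reference, the reported classification metrics all derive from a binary confusion matrix; the sketch below computes them from placeholder counts (e.g., recall 1.0 with precision 0.87 gives F1 ≈ 0.93, consistent with the values above).

```python
# Precision, recall, F1, and accuracy from a binary confusion matrix.
def classification_metrics(tp: int, fp: int, fn: int, tn: int):
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)   # a.k.a. sensitivity
    f1 = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return precision, recall, f1, accuracy

# Placeholder counts, not the study's data:
p, r, f1, acc = classification_metrics(tp=13, fp=2, fn=0, tn=5)
print(f"precision={p:.2f} recall={r:.2f} F1={f1:.2f} accuracy={acc:.2f}")
```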