A Synthetic Data-Augmented Deep Learning Framework for Robust Segmentation and Quantification of the Carotid Artery in Ultrasound Images.
Authors
Affiliations (4)
Affiliations (4)
- Department of Ultrasound Medicine, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China.
- Department of Ultrasound Medicine, Haining Central Hospital, Jiaxing, Zhejiang, China.
- School of Mathematical Sciences, Zhejiang University, Hangzhou, Zhejiang, China.
- Department of Ultrasound Medicine, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China. Electronic address: [email protected].
Abstract
To address the severe limitation imposed by the scarcity of annotated data on deep learning-based automated segmentation of the carotid artery in ultrasound images, this study aimed to develop and evaluate a synthetic data generation framework for enhancing segmentation performance. We proposed a Conditional Diffusion model with ControlNet guidance (Carotid Ultrasound Conditional Diffusion Generation Framework, CU-CDGF) to generate realistic carotid ultrasound images from segmentation masks. The framework was evaluated by both image quality metrics (e.g., Fréchet Inception Distance, FID) and its impact on augmenting the training set for a downstream segmentation network. The proposed CU-CDGF framework generated carotid ultrasound images with high visual fidelity. Quantitative metrics indicated favorable distribution alignment with real data (e.g., FID 87.41). More importantly, in a blinded observer study, both junior and senior sonographers frequently mistook synthetic images for real scans, confirming their perceptual realism. When used for data augmentation, the synthetic data significantly improved segmentation performance for the challenging intima-media complex. On the internal test set, the Dice coefficient increased from 0.60 to 0.64 (corrected p = .013); on an independent external test set, it increased from 0.61 to 0.66 (corrected p = .0004). The proposed diffusion-based framework effectively mitigates data scarcity in carotid ultrasound analysis. By enhancing the segmentation of clinically relevant fine structures, it provides a robust tool to advance the reliability and clinical adoption of automated quantitative ultrasound assessment.