Adversarial vulnerability and robustness of deep learning models for panoramic dental X-ray segmentation.
Authors
Affiliations (7)
Affiliations (7)
- Department of Information and Computer Science, King Fahd University of Petroleum and Minerals, Dhahran, 31261, Saudi Arabia.
- Interdisciplinary Computer Science (InteX) Research Lab, Sylhet, 3100, Bangladesh.
- Department of Software Engineering, College of Engineering and Advanced Computing, Alfaisal University, Riyadh, 11533, Saudi Arabia.
- Department of Electrical Engineering, College of Engineering, King Faisal University, Al-Ahsa, 31982, Saudi Arabia. [email protected].
- Interdisciplinary Research Center for Intelligent Secure Systems, King Fahd University of Petroleum and Minerals, Dhahran, 31261, Saudi Arabia.
- Department of Information and Computer Science, King Fahd University of Petroleum and Minerals, Dhahran, 31261, Saudi Arabia. [email protected].
- Interdisciplinary Research Center for Intelligent Secure Systems, King Fahd University of Petroleum and Minerals, Dhahran, 31261, Saudi Arabia. [email protected].
Abstract
Deep learning-based segmentation has become essential in computer-aided dental diagnosis and treatment planning. However, these models remain highly vulnerable to adversarial perturbations, like small and imperceptible changes in input images, which can drastically alter segmentation outputs and compromise clinical reliability. In this work, we present the first systematic study on adversarial vulnerability and robustness of deep learning models for panoramic dental X-ray segmentation. We curated a dataset of 995 panoramic images by combining 361 expert-annotated radiographs with 634 refined masks from the DENTEX 2023 challenge. Under identical training conditions, initially, we benchmarked 11 unique model variants, including five core architectures (Attention UNet, SegNet, Trans UNet, Vanilla UNet, and UNet++) and their corresponding ablations on training and preprocessing techniques. UNet++ emerged as the most practical backbone (clean IoU [Formula: see text], Dice [Formula: see text]) and subjected to a suite of white-box attacks with FGSM, I-FGSM, PGD, and DeepFool across perturbation ([Formula: see text]). Our results reveal that even minimal perturbations caused large performance drops, such as at [Formula: see text], IoU collapsed to 23.5% (0.851 to 0.649). To mitigate this fragility, we implemented a customized multi-attack adversarial defense strategy to ensure the model's robustness, which preserved a modest clean-accuracy trade-off by increasing 14.9% (IoU 0.649 to 0.798) at [Formula: see text] and 12.5% at [Formula: see text]. Our qualitative and quantitative analyses demonstrate that the defended model produces more stable and anatomically consistent masks under attack and set the benchmark of adversarial robustness in dental image segmentation as an effective defense strategy for safety-critical clinical deployment.