Expert-guided StyleGAN2 image generation elevates AI diagnostic accuracy for maxillary sinus lesions.
Authors
Affiliations (7)
Affiliations (7)
- Hospital of Stomatology, Guanghua School of Stomatology, Sun Yat-sen University and Guangdong Provincial Key Laboratory of Stomatology, Guangzhou, Guangdong, China.
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong, China.
- School of Mechanical and Automation Engineering, Wuyi University, Jiangmen, Guangdong, China.
- School of Information Technology, Guangdong Industry Polytechnic University, Foshan, Guangdong, China.
- Hospital of Stomatology, Guanghua School of Stomatology, Sun Yat-sen University and Guangdong Provincial Key Laboratory of Stomatology, Guangzhou, Guangdong, China. [email protected].
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong, China. [email protected].
- Hospital of Stomatology, Guanghua School of Stomatology, Sun Yat-sen University and Guangdong Provincial Key Laboratory of Stomatology, Guangzhou, Guangdong, China. [email protected].
Abstract
The progress of artificial intelligence (AI) research in dental medicine is hindered by data acquisition challenges and imbalanced distributions. These problems are especially apparent when planning to develop AI-based diagnostic or analytic tools for various lesions, such as maxillary sinus lesions (MSL) including mucosal thickening and polypoid lesions. Traditional unsupervised generative models struggle to simultaneously control the image realism, diversity, and lesion-type specificity. This study establishes an expert-guided framework to overcome these limitations to elevate AI-based diagnostic accuracy. A StyleGAN2 framework was developed for generating clinically relevant MSL images (such as mucosal thickening and polypoid lesion) under expert control. The generated images were then integrated into training datasets to evaluate their effect on ResNet50's diagnostic performance. Here we show: 1) Both lesion subtypes achieve satisfactory fidelity metrics, with structural similarity indices (SSIM > 0.996) and maximum mean discrepancy values (MMD < 0.032), and clinical validation scores close to those of real images; 2) Integrating baseline datasets with synthetic images significantly enhances diagnostic accuracy for both internal and external test sets, particularly improving area under the precision-recall curve (AUPRC) by approximately 8% and 14% for mucosal thickening and polypoid lesions in the internal test set, respectively. The StyleGAN2-based image generation tool effectively addressed data scarcity and imbalance through high-quality MSL image synthesis, consequently boosting diagnostic model performance. This work not only facilitates AI-assisted preoperative assessment for maxillary sinus lift procedures but also establishes a methodological framework for overcoming data limitations in medical image analysis.