Transfer learning-enhanced CNN model for integrative ultrasound and biomarker-based diagnosis of polycystic ovarian disease.
Authors
Affiliations (6)
Affiliations (6)
- BVRIT HYDERABAD College of Engineering for Women, Computer Science and Engineering, Hyderabad, 500090, India. [email protected].
- VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, India.
- BVRIT HYDERABAD College of Engineering for Women, Computer Science and Engineering, Hyderabad, 500090, India.
- Department of CSE, Vasavi College of Engineering, Ibrahimbagh, Hyderabad, 500031, India.
- Department of Computer Science and Artificial Intelligence, School of Computer Science and Artificial Intelligence, SR University, Warangal, Telangana, 506371, India.
- Department of ECE, KLEF, Guntur, Vaddeswaram, Andhrapradesh, India. [email protected].
Abstract
Polycystic Ovarian Disease (PCOD), also known as Polycystic Ovary Syndrome (PCOS), is a prevalent hormonal and metabolic condition primarily affecting women of reproductive age worldwide. It is typically marked by disrupted ovulation, an increase in circulating androgen hormones, and the presence of multiple small ovarian follicles, which collectively result in menstrual irregularities, infertility challenges, and associated metabolic disturbances. This study presents an automated diagnostic framework for PCOD detection from transvaginal ultrasound images, leveraging an Enhanced [Formula: see text] convolutional neural network architecture. The model incorporates attention mechanisms, batch normalization, and dropout regularization to improve feature learning and generalization. Bayesian Optimization was employed to fine-tune critical hyperparameters, including learning rate, batch size, and dropout rate, ensuring optimal model performance. The proposed system was trained and validated on a curated ovarian ultrasound image dataset, applying data augmentation and SMOTE techniques to address class imbalance. Experimental evaluation demonstrated that the Enhanced [Formula: see text] model achieved a classification accuracy of 94.8%, sensitivity of 93.2%, specificity of 95.5%, precision of 94.0%, and an F1-score of 93.6% on the independent test set. Interpretability was enhanced through Grad-CAM visualization, which effectively localized diagnostically significant regions within the ultrasound images, corroborating clinical findings. These results highlight the potential of the proposed deep learning-based framework to serve as a reliable, scalable, and interpretable decision-support tool for PCOD diagnosis, offering improved diagnostic consistency and reducing operator dependency in clinical workflows.