The impact of pre-processing techniques on deep learning breast image segmentation.
Authors
Affiliations (5)
Affiliations (5)
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisbon, Portugal. [email protected].
- Digital Surgery Lab, Breast Cancer Research Program, Champalimaud Foundation, 1400-038, Lisbon, Portugal. [email protected].
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisbon, Portugal.
- Digital Surgery Lab, Breast Cancer Research Program, Champalimaud Foundation, 1400-038, Lisbon, Portugal. [email protected].
- Faculty of Medicine, University of Lisbon, 1649-028, Lisbon, Portugal. [email protected].
Abstract
Breast cancer is one of the most common forms of cancer worldwide, making breast imaging a critical area for developing and evaluating Deep Learning methods. In this study, we investigate how different pre-processing techniques influence model performance in breast image segmentation. Pre-processing is a crucial step in the Deep Learning pipeline that directly impacts model performance, yet studies on its role in medical imaging remain limited. We assess the influence of different pre-processing techniques on a U-Net segmentation model applied to two breast public imaging datasets: CBIS-DDSM and Duke-Breast-Cancer-MRI. We systematically explored commonly used methods, including pixel intensity normalization, spacing harmonization, resizing/padding, and orientation standardization. Two processing pipelines were developed: Domain Non-Specific, integrating standard practices from natural and medical image analysis, and Domain Specific, which preserves anatomical information through careful handling of breast imaging metadata. A detailed comparative analysis of each pre-processing technique was conducted to evaluate its impact on model performance. Despite challenges and limitations associated with dataset size and scope, our findings identify pre-processing strategies tailored for breast imaging that can improve segmentation accuracy and analysis. This study represents an initial step in evaluating pre-processing for medical image analysis, providing a foundation for future work. Our results highlight significant differences in a 3-way ANOVA F-test ([Formula: see text]) for U-Net segmentation outcomes, attributed to different pixel intensity normalization approaches, offering valuable insights for future research.