Back to all papers

Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning models.

Authors

Calle P,Bates A,Reynolds JC,Liu Y,Cui H,Ly S,Wang C,Zhang Q,de Armendi AJ,Shettar SS,Fung KM,Tang Q,Pan C

Affiliations (7)

  • School of Computer Science, University of Oklahoma, Norman, 73019, OK, USA.
  • Stephenson School of Biomedical Engineer, University of Oklahoma, Norman, 73019, OK, USA.
  • Department of Anesthesiology, University of Oklahoma Health Sciences Center, Oklahoma City, 73104, OK, USA.
  • Department of Pathology, University of Oklahoma Health Sciences Center, Oklahoma City, 73104, OK, USA.
  • Department of Pathology, University of Oklahoma Health Sciences Center, Oklahoma City, 73104, OK, USA; Stephenson Cancer Center, University of Oklahoma Health Sciences Center, Oklahoma City, 73104, OK, USA.
  • Stephenson School of Biomedical Engineer, University of Oklahoma, Norman, 73019, OK, USA; Stephenson Cancer Center, University of Oklahoma Health Sciences Center, Oklahoma City, 73104, OK, USA. Electronic address: [email protected].
  • School of Computer Science, University of Oklahoma, Norman, 73019, OK, USA; Stephenson School of Biomedical Engineer, University of Oklahoma, Norman, 73019, OK, USA. Electronic address: [email protected].

Abstract

The variability and biases in the real-world performance benchmarking of deep learning models for medical imaging compromise their trustworthiness for real-world deployment. The common approach of holding out a single fixed test set fails to quantify the variance in the estimation of test performance metrics. This study introduces NACHOS (Nested and Automated Cross-validation and Hyperparameter Optimization using Supercomputing) to reduce and quantify the variance of test performance metrics of deep learning models. NACHOS integrates Nested Cross-Validation (NCV) and Automated Hyperparameter Optimization (AHPO) within a parallelized high-performance computing (HPC) framework. NACHOS was demonstrated on a chest X-ray repository and an Optical Coherence Tomography (OCT) dataset under multiple data partitioning schemes. Beyond performance estimation, DACHOS (Deployment with Automated Cross-validation and Hyperparameter Optimization using Supercomputing) is introduced to leverage AHPO and cross-validation to build the final model on the full dataset, improving expected deployment performance. The findings underscore the importance of NCV in quantifying and reducing estimation variance, AHPO in optimizing hyperparameters consistently across test folds, and HPC in ensuring computational feasibility. By integrating these methodologies, NACHOS and DACHOS provide a scalable, reproducible, and trustworthy framework for DL model evaluation and deployment in medical imaging. To maximize public availability, the full open-source codebase is provided at https://github.com/thepanlab/NACHOS.

Topics

Journal Article

Ready to Sharpen Your Edge?

Join hundreds of your peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.