Cloud-to-Edge Deployment of Optimized nnU-Net for Ischemic Stroke Lesion Segmentation on Resource-Constrained Embedded Devices.
Authors
Affiliations (1)
Affiliations (1)
- Escuela Técnica Superior de Ingeniería Industrial, Campus Muralla del Mar, Universidad Politécnica de Cartagena, European University of Technology EUT+, C/Doctor Fleming, s/n, 30202 Cartagena, Spain.
Abstract
Ischemic stroke remains a leading cause of global mortality and long-term neurological disability, where the "Time is Brain" paradigm dictates that rapid and accurate lesion assessment is fundamental for effective clinical intervention. While the nnU-Net v2 framework has established a new state of the art in medical image segmentation, its high computational demands and reliance on data-center-grade GPUs hinder its translation into real-time, point-of-care clinical workflows. This study presents a technical feasibility analysis of a Cloud-to-Edge optimization pipeline designed to transfer a 3D nnU-Net v2 model from a high-performance cloud environment to a resource-constrained embedded device. Experimental results showed that edge deployment was associated with a reduction in overlap-based segmentation metrics compared with the cloud reference, with Dice decreasing from approximately 0.78 to 0.67. However, TensorRT FP32 and FP16 inference produced nearly identical mean segmentation metrics, suggesting that reduced-precision inference did not introduce additional measurable degradation under the evaluated conditions. The optimized FP16 configuration achieved a processing time of 10.2 s per 3D volume, representing a 33% reduction compared with embedded FP32 inference, while operating within a low-power envelope of approximately 10-13 W. These findings support the preliminary technical feasibility of executing advanced 3D volumetric segmentation models on low-power edge hardware. Nevertheless, the evaluation was limited to an internal 25-case test subset and did not include external validation, prospective clinical assessment, or reader studies. Therefore, the proposed system should be interpreted as a preliminary deployment framework rather than a clinically validated tool for autonomous stroke imaging.