A Simplified and Data-Driven Lung Ultrasound Approach for Predicting Surfactant Need in Preterm Infants: A Pilot Study Using Machine Learning and Rule-Based Models.
Authors
Affiliations (9)
Affiliations (9)
- Neonatal Unit, Franco-Britannique Hospital, Levallois-Perret, France.
- Centre of Artificial Intelligence in Medicine and Imaging, Stanford University, Stanford, California, USA.
- Institute of Computational and Mathematics Engineering, Stanford University, Stanford, California, USA.
- Oliver Fisher Neonatal Unit, Medway NHS Foundation Trust, Kent, UK.
- Neonatal Unit, University Hospital Wishaw, Scotland, UK.
- Department of Pediatrics, Division of Neonatology, University of California, UC Davis Children's Hospital, Sacramento, California, USA.
- Neonatal Unit, James Cook University Hospital, Middlesbrough, UK.
- Clinical Academic Office, Faculty of Medical Sciences, Newcastle University, Newcastle, UK.
- Department of Physics, Durham University, Durham, UK.
Abstract
Six-region lung ultrasound (LUS) scores show good predictive value for predicting surfactant need in preterm infants but rely on a fixed threshold, which may lead to misclassification near the cut-off and lack data-driven justification for selecting these 6 regions. This study explored whether evaluating individual regions-and combinations-could improve predictive accuracy and utility. Data from preterm infants born at ≤34 weeks and enrolled in the Serial Lung Ultrasound for Surfactant Replacement Therapy (SLURP) cohort study were analyzed to develop predictive models for surfactant administration based on regional LUS scores. Univariate, bivariate, and machine learning analyses were conducted to identify the most informative lung regions. Rule-based, decision tree, and logistic regression models were then developed, compared to the 6-region model, and validated on an external dataset. The training set consisted of 77 patients from the SLURP cohort study. The rule-based, decision tree, and logistic regression models showed the best performance, primarily using 2 lung regions-left lateral and left upper posterior. A refined model that included the right upper anterior (RUA) region further improved performance. On the external test set (n = 42), the rule-based model with RUA achieved the highest accuracy (0.93) and the lowest false negative rate (0.11), outperforming the 6-region model. Adding more regions did not enhance accuracy. A simplified, rule-based model that accounts for the differential predictive value of individual lung regions may enhance the accuracy of LUS-based prediction of surfactant need in preterm infants. It is also more accessible, effective, and time-efficient for clinicians.