Appl Clin Inform 2025; 16(04): 1332-1340
DOI: 10.1055/a-2618-4470
Case Report

Refining a Machine Learning Model for Predicting Infant Sepsis: A Multidisciplinary Team Supported by Human-Centered Design Methods

Authors

  • Dean Karavite

    1   Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States
  • Lusha Cao

    1   Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States
  • Mary C. Harris

    2   Department of Neonatology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States
    3   Department of Pediatrics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, United States
  • Alex Fidel

    1   Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States
  • Lyle Ungar

    4   Department of Computer and Information Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States
  • Gerald Shaeffer

    1   Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States
  • Rui Xiao

    5   Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States
  • Patrick Brady

    6   Cincinnati Children's Hospital Medical Center, Department of Pediatrics, Perinatal Institute and James M. Anderson Center for Health Systems Excellence, Cincinnati, Ohio, United States
    7   Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States
  • Heather C. Kaplan

    6   Cincinnati Children's Hospital Medical Center, Department of Pediatrics, Perinatal Institute and James M. Anderson Center for Health Systems Excellence, Cincinnati, Ohio, United States
    7   Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States
  • Robert W. Grundmeier

    1   Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States
    3   Department of Pediatrics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, United States

Funding This study was funded by the Foundation for the National Institutes of Health (fund no.: 1R01LM013526-01A1).
Preview

Abstract

Background

Human-centered design (HCD) methods in machine learning generally focus on workflow, user interfaces, and data visualizations, but there is the potential to apply these methods to inform the model development and testing process.

Objectives

This study aimed to demonstrate the potential of HCD methods to support the design and testing of machine learning models developed for clinical decision-making.

Methods

In preparing for formative user testing of clinician facing representations of a machine learning model for detecting sepsis in neonatal intensive care unit (NICU) patients, we discovered that interactive low fidelity mockups using real patient data revealed potential model anomalies. To further investigate these potential anomalies, we utilized the qualitative analysis of interviews with 31 NICU clinicians concerning their experience with neonatal sepsis. The review process was conducted by a multidisciplinary team with members having expertise in neonatology, informatics, data science, and human computer interaction (HCI). Anomalies identified via the mockups and interview analysis were further analyzed by inspections of patient charts and model features and code.

Results

The HCD-facilitated review revealed anomalies in three categories: (1) feature inclusion and exclusion, (2) feature importance, and (3) model stability over time. Data entry errors in the electronic health record and their impact on model output were also noted. The review resulted in 41 changes to the model.

Conclusion

The discovery of over 41 opportunities to improve our prediction model was a serendipitous by-product of the HCD process. Our results suggest that HCD can be applied not only to model display design and measures of explainability, but to the development and evaluation of the model itself. This case report also demonstrates the need for a multidisciplinary team of clinicians, data scientists, and HCI experts in identifying and addressing issues involving machine learning model performance.

Protection of Human and Animal Subjects

The study was determined to be exempt from human studies by the Children's Hospital of Philadelphia, Institutional Review Board (IRB 21-018777).




Publication History

Received: 30 December 2024

Accepted: 21 May 2025

Article published online:
10 October 2025

© 2025. Thieme. All rights reserved.

Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany