Methods Inf Med 2007; 46(03): 352-359
DOI: 10.1160/ME0368
paper
Schattauer GmbH

Modeling Length of Stay as an Optimized Two-class Prediction Problem

M. Verduijn
1   Department of Medical Informatics, Academic Medical Center (AMC), Amsterdam, The Netherlands
4   Department of Biomedical Engineering, University of Technology, Eindhoven, The Netherlands
,
N. Peek
1   Department of Medical Informatics, Academic Medical Center (AMC), Amsterdam, The Netherlands
,
F. Voorbraak
1   Department of Medical Informatics, Academic Medical Center (AMC), Amsterdam, The Netherlands
,
E. de Jonge
2   Department of Intensive Care Medicine, AMC, Amsterdam, The Netherlands
,
B. A. J. M. de Mol
3   Department of Cardiothoracic Surgery, AMC, Amsterdam, The Netherlands
4   Department of Biomedical Engineering, University of Technology, Eindhoven, The Netherlands
› Institutsangaben
Weitere Informationen

Publikationsverlauf

Publikationsdatum:
20. Januar 2018 (online)

Preview

Summary

Objectives: To develop a predictive model for the outcome length of stay at the Intensive Care Unit (ICU LOS), including the choice of an optimal dichotomization threshold for this outcome. Reduction of prediction problems of this type of outcome to a two-class problem is a common strategy to identify high-risk patients.

Methods: Threshold selection and model development are performed simultaneously. From the range of possible threshold values, the value is chosen for which the corresponding predictive model has maximal precision based on the data. To compare the precision of models for different dichotomizations of the outcome, the MALOR performance statistic is introduced. This statistic is insensitive to the prevalence of positive cases in a two-class prediction problem.

Results: The procedure is applied to data from cardiac surgery patients to dichotomize the outcome ICU LOS. The class probabilitytree method is used to develop predictive models. Within our data, the best model precision is found at the threshold of seven days.

Conclusions: The presented method extends existing procedures for predictive modeling with optimization of the outcome definition for predictive purposes. The method can be applied to all prediction problems where the outcome variable needs to be dichotomized, and is insensitive to changes in the prevalence of positive cases with different dichotomization thresholds.