Methods Inf Med 2006; 45(01): 37-43
DOI: 10.1055/s-0038-1634034
Original Article
Schattauer GmbH

Scoring and Staging Systems Using Cox Linear Regression Modeling and Recursive Partitioning

J. W. Lee
1   Department of Statistics, Korea University, Seoul, Korea
,
S. H. Um
2   Department of Internal Medicine, Korea University School of Medicine, Seoul, Korea
,
J. B. Lee
1   Department of Statistics, Korea University, Seoul, Korea
,
J. Mun
3   Department of Statistics, University of Wisconsin-Madison, USA
,
H. Cho
4   Department of Health Evaluation Sciences, University of Virginia School of Medicine, Charlottesville, Virginia, USA
› Author Affiliations
Further Information

Publication History

Received: 29 July 2004

accepted: 16 May 2005

Publication Date:
06 February 2018 (online)

Summary

Objectives: Scoring and staging systems are used to determine the order and class of data according to predictors. Systems used for medical data, such as the Child-Turcotte-Pugh scoring and staging systems for ordering and classifying patients with liver disease, are often derived strictly from physicians’ experience and intuition. We construct objective and data-based scoring/staging systems using statistical methods.

Methods: We consider Cox linear regression modeling and recursive partitioning techniques for censored survival data. In particular, to obtain a target number of stages we propose cross-validation and amalgamation algorithms. We also propose an algorithm for constructing scoring and staging systems by integrating local Cox linear regression models into recursive partitioning, so that we can retain the merits of both methods such as superior predictive accuracy, ease of use, and detection of interactions between predictors. The staging system construction algorithms are compared by cross-validation evaluation of real data.

Results: The data-based cross-validation comparison shows that Cox linear regression modeling is somewhat better than recursive partitioning when there are only continuous predictors, while recursive partitioning is better when there are significant categorical predictors. The proposed local Cox linear recursive partitioning has better predictive accuracy than Cox linear modeling and simple recursive partitioning.

Conclusions: This study indicates that integrating local linear modeling into recursive partitioning can significantly improve prediction accuracy in constructing scoring and staging systems.

 
  • References

  • 1 Lucey MR, Brown KA, Everson GT, Fung JJ, Gish R, Keefe EB, Kneteman NM, Lake JR, Martin P, Rakela J, Shiffman ML, So S, Wiesner RH. Minimal criteria for placement of adults on the liver transplant waiting list: a report of a national conference organized by the American Society of Transplant Physicians and the American Association for the Study of Liver Diseases. Transplantation 2001; 15: 956-62.
  • 2 Pugh RNH, Murray-Lyon IM, Dawson JL, Pietroni MC, Williams R. Transection of the oesophagus for bleeding oesophageal varices. British Journal of Surgery 1973; 60: 646-9.
  • 3 Kamath PS, Wiesner RH, Malinchoc M, Kremers W, Therneau TM, Kosberg CL, D’Amico G, Dickson ER, Kim WR. A model to predict survival in patients with endstage liver disease. Hepatology 2001; 33: 464-70.
  • 4 Dickson ER, Gramsch PM, Fleming TR, Fisher LD, Langworthy A. Prognosis in primary biliary cirrhosis: model for decision making. Hepatholgy 1989; 10: 1-7.
  • 5 Ricci P, Therneau TM, Malinchoc M, Benson JT, Petz JL, Klintmalm GB, Crippin JS, Wiesner RH, Steers JL, Rakela J, Starzl TE, Dickson ER. A prognostic model for the outcome of liver transplantation in patients with cholestatic liver disease. Hepatology 1997; 25: 672-7.
  • 6 Malinchoc M, Kamath PS, Gordon FD, Peine CJ, Rank J, Borg PCJ. A model to predict poor survival in patients undergoing transjugular intrahepatic portosystemic shunts. Hepathology 2000; 31: 864-71.
  • 7 D’Antico G, Franchis RD. and a Cooperative Study Group. Upper digestive bleeding in cirrhosis. posttherapeutic outcome and prognostic indicators. Hepatology 2003; 38: 599-612.
  • 8 Morgan JN, Sonquist JA. Problems in the analysis of survey data and a proposal. Journal of the American Statistical Association 1963; 58: 415-34.
  • 9 Kass GV. Significance testing in automatic interaction detection (a.i.d.). Applied Statistics 1975; 24: 178-89.
  • 10 Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and Regression Trees. New York: Chapman and Hall; 1984
  • 11 Ciampi A. Generalized regression trees. Computational Statistics and Data Analysis 1991; 26: 239-56.
  • 12 Loh WY, Shih YS. Split selection methods for classification trees. Statistica Sinica 1997; 7: 815-40.
  • 13 Sauerbrei W, Madjar H, Prompeler HJ. Differentiation of benign and malignant breast tumors by logistic regression and a classification tree using Doppler flow signals. Methods Inf Med 1998; 37: 226-34.
  • 14 Clark DE, El-Taha M. Generation of correlated logisticnormal random variates for medical decision trees. Methods Inf Med 1998; 37: 235-8.
  • 15 Kim H, Loh WY. Classification trees with unbiased multiway splits. Journal of the American Statistical Association 2001; 96: 589-604.
  • 16 McQuatt A, Sleeman D, Andrews PJD, Corruble V, Jones PA. Discussing anomalous situations using decision trees: A head injury case study. Methods Inf M 2001; 40: 373-9.
  • 17 Dusseldorp E, Meulman JJ. Prediction in medicine by integrating regression trees into regression analysis with optimal scaling. Methods Inf Med 2001; 40: 403-9.
  • 18 Ciampi A, Couturier A, Li S. Prediction trees with soft nodes for binary outcomes. Statistics in Medicine 2002; 21: 1145-65.
  • 19 Loh WY. Regression trees with unbiased variable selection and interaction detection. Statistica Sinica 2002; 12: 361-86.
  • 20 Schwarzer G, Nagata T, Mattern D, Schmelzeisen R, Schumacher M. Comparison of fuzzy inference, logistic regression, and classification trees (CART) prediction of cervical lymph node metastasis in carcinoma of the tongue. Methods Inf Med 2003; 42: 572-7.
  • 21 Fu CY. Combining loglinear model with classification and regression tree (cart): an application to birth data. Computational Statistics and Data Analysis 2004; 45: 865-74.
  • 22 Segal MR. Tree-structured methods for longitudinal data. Journal of the American Statistical Association 1992; 87: 407-18.
  • 23 Lauren B. Generalized regression trees applied to longitudinal nutritional survey data. Studies in Classification, Data Analysis, and Knowledge Organization 1997; 9: 467-74.
  • 24 Zhang H. Classification trees for multiple binary responses. Journal of the American Statistical Association 1998; 93: 180-93.
  • 25 Gordon L, Olshen RA. Tree-structured survival analysis. Cancer treatment Reports 1985; 69: 1065-9.
  • 26 Ciampi A, Thiffault J, Nakache J, Asselain B. Stratification by stepwise regression, correspondence analysis and recursive partition: A comparison of three methods of analysis for survival data with covariates. Computational Statistics and Data Analysis. 1986; 4: 185-204.
  • 27 Segal MR. Regression trees for censored data. Biometrics 1988; 44: 35-48.
  • 28 Davis R, Anderson J. Exponential survival trees. Statistics in Medicine 1989; 8: 947-62.
  • 29 LeBlanc M, Crowley J. Relative risk trees for censored survival data. Biometrics 1992; 48: 411-25.
  • 30 LeBlanc M, Crowley J. Survival trees by goodness- of-split. Journal of the American Statistical Association 1993; 88: 457-67.
  • 31 Ahn H, Loh WY. Tree-structured proportional hazards regression modeling. Biometrics 1994; 50: 471-85.
  • 32 Schlittgen R. Regression trees for survival data: an approach to select discontinuous split points by rank statistics. Biometrical Journal 1999; 41: 943-54.
  • 33 Xu R, Adak S. Survival analysis with time-varying relative risks: A tree-based approach. Methods Inf Med 2001; 40: 141-7.
  • 34 Keles S, Segal MR. Residual-based tree-structured survival analysis. Statistics in Medicine 2002; 21: 313-26.
  • 35 Radespiel-Troger M, Rabenstein T, Schneider HT, Lauren B. Comparison of tree-based methods for prognostic stratification of survival data. Artificial Intelligence in Medicine. 2003; 28: 323-41.
  • 36 Cox DR. Regression models and life tables (with discussion). Journal of the Royal Statistical Society B 1972; 34: 187-220.
  • 37 Therneau TM, Atkinson EJ. An introduction to recursive partitioning using the RPART routines. Technical Report 61, Mayo Foundation. 1997
  • 38 Breiman L. Statistical modeling: the two cultures. Statistical Science 2001; 16: 199-231.
  • 39 Ciampi A, Hogg S, McKinney S, Thiffault J. A computer program for recursive partition and amalgamation for censored survival data. Computer Methods and Programs in Biomedicine 1988; 26: 239-56.
  • 40 Cho H. Tree-structured regression model for censored data. PhD thesis, University of Wisconsin- Madison. 2002