Comparison of tree-based methods used in survival data


Yabacı Tak A., Sığırlı D.

STATISTICS IN TRANSITION, vol.23, no.1, pp.21-38, 2022 (SCI-Expanded)

  • Publication Type: Article / Article
  • Volume: 23 Issue: 1
  • Publication Date: 2022
  • Doi Number: 10.2478/stattrans-2022-0002
  • Journal Name: STATISTICS IN TRANSITION
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, International Bibliography of Social Sciences, Central & Eastern European Academic Source (CEEAS), Directory of Open Access Journals
  • Page Numbers: pp.21-38
  • Bezmialem Vakıf University Affiliated: Yes

Abstract

Survival trees and forests are popular non-parametric alternatives to parametric and semiparametric survival models. Conditional inference trees (Ctree) form a non-parametric class of regression trees embedding tree-structured regression models into a well-defined theory of conditional inference procedures. The Ctree is applicable in a varietyof regression-related issues, involving nominal, ordinal, numeric, censored, as well as multivariate response variables and arbitrary measurement scales of covariates. Conditional inference forests (Cforest) consitute a survival forest method which combines a large number of Ctrees. The Cforest provides a unified and flexible framework for ensemble learning in the presence of censoring. The random survival forests (RSF) methodology extends the random forests method enabling the approximation of rich classes of functions while maintaining generalisation errors low. In the present study, the Ctree, Cforest and RSF methods are discussed in detail and the performances of the survival forest methods, namely the Cforest and RSF have been compared with a simulation study. The results of the simulation demonstrate that the RSF method with a log-rank score distinction criteria outperforms the Cforest and the RSF with log-rank distinction criteria. Key words: tree-based methods, conditional inference trees, conditional inference forests, random survival forests.