Probabilistic survival analysis methods
using simulation and cancer data
Yo.Xu, C.P.Tsokos
The object of the present study is to
probabilistically evaluate commonly used methods to perform survival analysis of
medical patients. Our study includes evaluation of parametric, semi-parametric
and nonparametric analysis of probability survival models. We will evaluate the
popular Kaplan-Meier (KM), Cox Proportional Hazard (Cox PH), and Kernel density
(KD) models using both
Wikipedia defines
survival analysis as a branch of statistics which deals with
death in biological organisms and failure in mechanical systems. Scientists
have developed and used many different probabilistic survival analysis methods
including parametric, nonparametric and semi-parametric models. In the present
study we will compare all commonly used methods and propose which ones give the
best probabilistic survival results [1-23].
The first part of
our study is based on simulating data from a well defined probability failure
distribution by identifying the sample size so that the maximum likelihood
estimates converge to the assumed parametric values in
The second part of
our study uses actual survival time of breast cancer data to compare the above
mentioned survival models, in addition to the Cox Proportional (Cox PH)
survival hazard function.
Upon completing the
evaluation, we will propose a ranking of the analytical methods evaluated for
performing survival analysis. The breast cancer data that we used were given by
N.A.Ibrahim, whose results and analysis were
published in [1, 14]. Some other relevant references that we use in the present
study are [2, 11, 12, 13, 15, 18, 19].
In conclusion we
note. The present study consists of three parts in which the effectiveness of
three survival analysis models, namely, KM, Cox PH and KD, is compared.
Initially, using
The second part
consists of using actual uncensored breast cancer data. Performing similar evaluation,
the results support that the proposed KD model gives better estimates than the popular
KM and Cox PH models with interactions.
Thirdly, we performed
the same analysis with actual censored breast cancer data. Although working
with censored data, it is quite difficult to justify such an analysis, we
analyzed the data, and the results are similar to
1.
W.Fyles, D.R.McCready,
L.AManchul, M.E.Trudeau, P.Merante, M.Pintile, L.M.Weir, I.A.Olivotto. Tamoxifen
with or without breast irradiation in women 50 years of age or older with early
breast cancer.
2.
A.McGilchrist, REML.
Estimation for Survival Models with Fraility. Biometrics, 49(1), 1993, 221-225.
3.
Collett. Modeling survival
data in medical research. Chapman & Hall/CRC, 2003.
4.
P.Harrington, T.R.Fleming. A Class of Rank Test
Procedures for Censored Survival Data. Biometrika, 69(3), 1982, 553-566.
5.
D.R.Cox. Regression models and
life-tables (with discussion). Journal of the Royal Statistical Society, Series
B, 34, 1972, 187-220.
6.
D.R.Cox, D.Oakes. Analysis of survival
data. London: Chapman & Hall, 1984.
7.
E.L.Kaplan. In a retrospective on the seminal paper in "This week's
citation classic". Current Contents 24, 14, 1983.
8.
E.L.Kaplan, P.Meier. Non-parametric estimation from incomplete observations. J. Am. Stat. Assoc, 53, 1958, 457-448.
9.
E.Pena. Goodness of fit tests with right censored
data. Colloquium Talk,
10. J.P.Klein. Semiparametric Estimation of Random Effects Using the Cox Model
Based on the EM Algorithm. Biometrics, 48(3), 1992, 795-806.
11. K.Liu, C.P.Tsokos. Non-parametric Density Estimation for the Sum of Two
Independent RandomVariables. Journal of Stochastic Analysis, 2000.
12. K.Liu, C.P.Tsokos. Non-parametric Reliability Modeling for Parallel Systems.
Journal of StochasticAnalysis, 1999.
13. K.Liu, C.P.Tsokos. Optimal Bandwidth Selection for a Nonparametric Estimate
of the Cumulative Distribution Function. International Journal of Applied
Mathematics, Vol.10, No.1, 2002, 33-49.
14.
N.A.Ibrahim, A.Kudus,
I.Daud, M.R.Abu Bakar. Decision tree for competing
risks survival probability in breast cancer study. International Journal of
Biomedical Sciences Vol.3, No.1, 2008.
15. N.Mantel, W.Haenszel. Statical aspects of the analysis of data from
retrospective studies of disease. Journal of the National cancer Institute,
22(4), 1959.
16. P.Hougaard. Modelling Heterogeneity in Survival Data. Journal of Applied
Probabdity, 28(3), 695-701, 1991.
17. P.M.Grambsch, T.M.Therneau. Proportional Hazards Tests and Diagnostics
Based on Weighted Residuals. Biometrika, 81(3), 1994, 515-526.
18. R.Duda, P.Hart. Pattern Classification and Scene Analysis. John Wiley &
Sons, ISBN
0-471-22361-1, 1973.
19. P.Qiu, C.P.Tsokos. Accelerated Life-Testing Model Building with Box-Cox
Transformation. Sankhya, Vol. 62, Series A, Pt. 2, 2000, pp.223-235.
20. T.M.Therneau, P.M.Grambsch. Modeling Survival Data: Extending the Cox
Model. New York, Springer, 2000.
21. T.M.Therneau, P.M.Grambsch, T.R.Fleming. Martingale-Based Residuals for
Survival Models. Biometrika, 77(1), 1990, 147-160.
22. C.P.Tsokos. Modelling of environmental
engineering and health problems. Int.J. Problems of
nonlinear analysis in engineering systems, No.1(35),
v.17, 2011, 1-5.
23. D.Vovoras, C.P.TsokosMethod
of flexible co-variates in the proportional hazards
models. Int.J. Problems of nonlinear analysis in
engineering systems, No.2(36), v.17, 2011, 14-31.
© 1995-2008 Kazan State University