Science

Probabilistic survival analysis methods

using simulation and cancer data

Yo.Xu, C.P.Tsokos

University of South Florida

Tampa, FL 33620, USA

The object of the present study is to probabilistically evaluate commonly used methods to perform survival analysis of medical patients. Our study includes evaluation of parametric, semi-parametric and nonparametric analysis of probability survival models. We will evaluate the popular Kaplan-Meier (KM), Cox Proportional Hazard (Cox PH), and Kernel density (KD) models using both Monte Carlo simulation and actual breast cancer data. The first part of the evaluation will be based on how these methods measure up to parametric analysis and the second part using actual cancer data. As expected, the parametric survival analysis when applicable gives the best results followed by not commonly used non-parametric Kernel density approach for both evaluations using simulation and actual cancer data.

 

Wikipedia defines survival analysis as a branch of statistics which deals with death in biological organisms and failure in mechanical systems. Scientists have developed and used many different probabilistic survival analysis methods including parametric, nonparametric and semi-parametric models. In the present study we will compare all commonly used methods and propose which ones give the best probabilistic survival results [1-23].

The first part of our study is based on simulating data from a well defined probability failure distribution by identifying the sample size so that the maximum likelihood estimates converge to the assumed parametric values in Monte Carlo simulation procedure. Using this information we develop and compare the parametric estimated probabilistic survival function with the Kernel density (non-parametric), and the popular Kaplan-Meier (KM) model.

The second part of our study uses actual survival time of breast cancer data to compare the above mentioned survival models, in addition to the Cox Proportional (Cox PH) survival hazard function.

Upon completing the evaluation, we will propose a ranking of the analytical methods evaluated for performing survival analysis. The breast cancer data that we used were given by N.A.Ibrahim, whose results and analysis were published in [1, 14]. Some other relevant references that we use in the present study are [2, 11, 12, 13, 15, 18, 19].

In conclusion we note. The present study consists of three parts in which the effectiveness of three survival analysis models, namely, KM, Cox PH and KD, is compared.

Initially, using Monte Carlo simulation we compare the subject models with parametric survival models and find that the proposed KD survival model gives good results, if not better, than KM.

The second part consists of using actual uncensored breast cancer data. Performing similar evaluation, the results support that the proposed KD model gives better estimates than the popular KM and Cox PH models with interactions.

Thirdly, we performed the same analysis with actual censored breast cancer data. Although working with censored data, it is quite difficult to justify such an analysis, we analyzed the data, and the results are similar to Monte Carlo simulation and the case of using the uncensored data.

 

1.          W.Fyles, D.R.McCready, L.AManchul, M.E.Trudeau, P.Merante, M.Pintile, L.M.Weir, I.A.Olivotto. Tamoxifen with or without breast irradiation in women 50 years of age or older with early breast cancer. New England Journal of Medicine, 351, 2004, 963-970.

2.          A.McGilchrist, REML. Estimation for Survival Models with Fraility. Biometrics, 49(1), 1993, 221-225.

3.          Collett. Modeling survival data in medical research. Chapman & Hall/CRC, 2003.

4.          P.Harrington, T.R.Fleming. A Class of Rank Test Procedures for Censored Survival Data. Biometrika, 69(3), 1982, 553-566.

5.          D.R.Cox. Regression models and life-tables (with discussion). Journal of the Royal Statistical Society, Series B, 34, 1972, 187-220.

6.          D.R.Cox, D.Oakes. Analysis of survival data. London: Chapman & Hall, 1984.

7.          E.L.Kaplan. In a retrospective on the seminal paper in "This week's citation classic". Current Contents 24, 14, 1983.

8.          E.L.Kaplan, P.Meier. Non-parametric estimation from incomplete observations. J. Am. Stat. Assoc, 53, 1958, 457-448.

9.          E.Pena. Goodness of fit tests with right censored data. Colloquium Talk, Columbia, South Carolina, 2000.

10.       J.P.Klein. Semiparametric Estimation of Random Effects Using the Cox Model Based on the EM Algorithm. Biometrics, 48(3), 1992, 795-806.

11.       K.Liu, C.P.Tsokos. Non-parametric Density Estimation for the Sum of Two Independent RandomVariables. Journal of Stochastic Analysis, 2000.

12.       K.Liu, C.P.Tsokos. Non-parametric Reliability Modeling for Parallel Systems. Journal of StochasticAnalysis, 1999.

13.       K.Liu, C.P.Tsokos. Optimal Bandwidth Selection for a Nonparametric Estimate of the Cumulative Distribution Function. International Journal of Applied Mathematics, Vol.10, No.1, 2002, 33-49.

14.       N.A.Ibrahim, A.Kudus, I.Daud, M.R.Abu Bakar. Decision tree for competing risks survival probability in breast cancer study. International Journal of Biomedical Sciences Vol.3, No.1, 2008.

15.       N.Mantel, W.Haenszel. Statical aspects of the analysis of data from retrospective studies of disease. Journal of the National cancer Institute, 22(4), 1959.

16.       P.Hougaard. Modelling Heterogeneity in Survival Data. Journal of Applied Probabdity, 28(3), 695-701, 1991.

17.       P.M.Grambsch, T.M.Therneau. Proportional Hazards Tests and Diagnostics Based on Weighted Residuals. Biometrika, 81(3), 1994, 515-526.

18.       R.Duda, P.Hart. Pattern Classification and Scene Analysis. John Wiley & Sons, ISBN 0-471-22361-1, 1973.

19.       P.Qiu, C.P.Tsokos. Accelerated Life-Testing Model Building with Box-Cox Transformation. Sankhya, Vol. 62, Series A, Pt. 2, 2000, pp.223-235.

20.       T.M.Therneau, P.M.Grambsch. Modeling Survival Data: Extending the Cox Model. New York, Springer, 2000.

21.       T.M.Therneau, P.M.Grambsch, T.R.Fleming. Martingale-Based Residuals for Survival Models. Biometrika, 77(1), 1990, 147-160.

22.       C.P.Tsokos. Modelling of environmental engineering and health problems. Int.J. Problems of nonlinear analysis in engineering systems, No.1(35), v.17, 2011, 1-5.

23.       D.Vovoras, C.P.TsokosMethod of flexible co-variates in the proportional hazards models. Int.J. Problems of nonlinear analysis in engineering systems, No.2(36), v.17, 2011, 14-31.

 




[Contents]

homeKazanUniversitywhat's newsearchlevel upfeedback

© 1995-2008 Kazan State University