Biometrika Advance Access originally published online on January 24, 2008
Biometrika 2008 95(1):123-137; doi:10.1093/biomet/asm081
Articles |
Nonparametric regression using local kernel estimating equations for correlated failure time data
Division of Biostatistics, The Ohio State University College of Public Health, Columbus, Ohio 43210, U.S.A. zyu{at}cph.osu.edu
Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts 02115, U.S.A. xlin{at}hsph.harvard.edu
Received for publication 1 January 2006. Revision received 1 June 2007.
We study nonparametric regression for correlated failure time data. Kernel estimating equations are used to estimate nonparametric covariate effects. Independent and weighted-kernel estimating equations are studied. The derivative of the nonparametric function is first estimated and the nonparametric function is then estimated by integrating the derivative estimator. We show that the nonparametric kernel estimator is consistent for any arbitrary working correlation matrix and that its asymptotic variance is minimized by assuming working independence. We evaluate the performance of the proposed kernel estimator using simulation studies, and apply the proposed method to the western Kenya parasitaemia data.
Key Words: Asymptotics Clustered survival data Marginal model Sandwich Estimator Weighted kernel smoothing Working-independence kernel estimator
References
-
Andersen P. K., Gill R. D. Cox's regression model for counting processes: a large sample study. Ann. Statist. (1982) 10:1100–20.[CrossRef]
Cai J., Prentice R. L. Estimating equations for hazard ratio parameters based on correlated failure time data. Biometrika (1995) 82:151–64.
Cai J., Prentice R. L. Regression estimation using multivariate failure time data and a common baseline hazard function model. Lifetime Data Anal. (1997) 3:197–213.[CrossRef][Medline]
Dabrowska D. M. Non-parametric regression with censored survival time data. Scand. J. Statist. (1987) 14:181–97.
Durrett R. Probability Theory and Examples (1995) Pacific Grove, CA: Duxbury Press.
Fan J., Gijbels I., King M. Local likelihood and local partial likelihood in hazard regression. Ann. Statist. (1997) 25:1661–90.[CrossRef]
Foutz R. V. On the unique consistent solution to the likelihood equations. J. Am. Statist. Assoc. (1977) 72:147–8.[CrossRef][ISI]
Gray R. Flexible methods for analyzing survival data using splines, with application to breast cancer prognosis. J. Am. Statist. Assoc. (1992) 87:942–51.[CrossRef][ISI]
Gray R., Li Y. Optimal weight functions for marginal proportional hazards analysis of clustered failure time data. Lifetime Data Anal. (2002) 8:5–19.[CrossRef][ISI][Medline]
Hastie T., Tibshirani R. Exploring the nature of covariate effects in the proportional hazards model. Biometrics (1990) 46:1005–16.[CrossRef][ISI][Medline]
Kalbfleisch J. D., Prentice R. L. The Statistical Analysis of Failure Time Data (2002) New York: John Wiley and Sons.
Lin X., Carroll R. J. Nonparametric function estimation for clustered data when the predictor is measured without/with error. J. Am. Statist. Assoc. (2000) 95:520–34.[CrossRef][ISI]
Lin X., Wang N., Welsh A. H., Carroll R. J. Equivalent kernels of smoothing splines in nonparametric regression for cluster/longitudinal data. Biometrika (2004) 91:177–93.
McElroy P. D., Beier J. C., Oster C. N., Onyango F. K., Beedle Lin X., C.&Hoffman S. L. Dose- and time-dependent relations between infective anopheles inoculation and outcomes of plasmodium falciparum parasitemia among children in Western Kenya. Am. J. Epidemiol. (1997) 145:945–56.
O'Sullivan F. Nonparametric estimation of relative risk using splines and cross-validation. SIAM J. Sci. Statist. Comp. (1988) 9:531–42.[CrossRef]
Pollard D. Convergence of Stochastic Process (1984) New York: Springer.
Spiekerman C. F., Lin D. Y. Marginal regression models for multivariate failure time data. J. Am. Statist. Assoc. (1998) 93:1164–75.[CrossRef][ISI]
Tibshirani R., Hastie T. Local likelihood estimation. J. Am. Statist. Assoc. (1987) 82:559–67.[CrossRef][ISI]
Wang N. Marginal nonparametric kernel regression accounting for within-subject correlation. Biometrika (2003) 90:43–52.
Wei L. J., Lin D. Y., Weissfeld L. Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. J. Am. Statist. Assoc. (1989) 84:1065–73.[CrossRef][ISI]
| ||||||||||||||||||||||||||||||||||||||||||||||||