Biometrika Advance Access originally published online on June 22, 2009
Biometrika 2009 96(3):617-633; doi:10.1093/biomet/asp027
Pseudo-partial likelihood estimators for the Cox regression model with missing covariates
Department of Psychiatry, Mount Sinai School of Medicine, New York, New York 10029, U.S.A. Xiaodong.Luo{at}mssm.edu
Department of Biostatistics, Columbia University, New York, New York 10032, U.S.A. wt5{at}columbia.edu
Food and Drug Administration, Silver Spring, Maryland 20993, U.S.A. Qiang.Xu{at}fda.hhs.gov
Received for publication 1 May 2007. Revision received 1 November 2008.
By embedding the missing covariate data into a left-truncated and right-censored survival model, we propose a new class of weighted estimating functions for the Cox regression model with missing covariates. The resulting estimators, called the pseudo-partial likelihood estimators, are shown to be consistent and asymptotically normal. A simulation study demonstrates that, compared with the popular inverse-probability weighted estimators, the new estimators perform better when the observation probability is small and estimate the effects of the missing covariates more efficiently. An application to a practical example is reported.
Key Words: Augmented estimator; Biased sampling data; Embedding missing data; Left-truncation; Martingale structure; Right censoring; U-statistic
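For orientation, the inverse-probability weighted approach that the abstract uses as its comparison can be sketched as follows. This is a minimal illustration of a simple Horvitz-Thompson-type weighted partial-likelihood score, not the pseudo-partial likelihood estimator proposed in the article. The function names, the restriction to a single scalar covariate, the known observation probability of 0.5, and the simulated data are all assumptions made for the example.

```python
import numpy as np

def ipw_cox_score(beta, time, event, z, observed, prob):
    """Weighted partial-likelihood score and information (scalar covariate).

    Subjects with a missing covariate get weight 0; complete cases get 1/prob.
    """
    w = observed / prob
    zf = np.where(observed == 1, z, 0.0)   # placeholder value; weight is 0 anyway
    risk = w * np.exp(beta * zf)           # weighted relative risks
    score, info = 0.0, 0.0
    for i in np.flatnonzero((event == 1) & (observed == 1)):
        at_risk = time >= time[i]          # risk set at the i-th event time
        s0 = np.sum(risk[at_risk])
        s1 = np.sum(risk[at_risk] * zf[at_risk])
        s2 = np.sum(risk[at_risk] * zf[at_risk] ** 2)
        zbar = s1 / s0                     # weighted risk-set mean of z
        score += w[i] * (zf[i] - zbar)
        info += w[i] * (s2 / s0 - zbar ** 2)
    return score, info

def fit_ipw_cox(time, event, z, observed, prob, n_iter=25):
    """Solve score(beta) = 0 by Newton-Raphson, starting from beta = 0."""
    beta = 0.0
    for _ in range(n_iter):
        score, info = ipw_cox_score(beta, time, event, z, observed, prob)
        beta += score / info
    return beta

# Simulated data (assumed setup): true beta = 0.5, covariate missing
# completely at random with known observation probability 0.5.
rng = np.random.default_rng(0)
n = 500
z_full = rng.normal(size=n)
t_event = rng.exponential(1.0 / np.exp(0.5 * z_full))
t_cens = rng.exponential(2.0, size=n)
time = np.minimum(t_event, t_cens)
event = (t_event <= t_cens).astype(int)
observed = rng.binomial(1, 0.5, size=n)
prob = np.full(n, 0.5)
z = np.where(observed == 1, z_full, np.nan)  # covariate unavailable when not observed

beta_hat = fit_ipw_cox(time, event, z, observed, prob)
print(f"IPW estimate of beta: {beta_hat:.3f}")
```

Because every complete case is weighted by the inverse of its observation probability, a small observation probability leaves few heavily weighted subjects in each risk set, which is the regime in which the abstract reports that the new estimators perform better.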