© 1996 by Biometrika Trust
MISCELLANEA |
A conditional model for incomplete covariates in parametric regression models
Division of Biostatistics, Dana Farber Cancer Institute 44 Binney Street, Boston, Massachusetts 02115, U.S.A.
Department of Biostatistics, Harvard School of Public Health 677 Huntington Avenue, Boston, Massachusetts 02115, U.S.A.
Incomplete covariate data arise in many data sets. When the missing covariates are categorical, a useful technique for obtaining parameter estimates is the EM algorithm by the method of weights proposed in Ibrahim (1990). This method requires the estimation of many nuisance parameters for the distribution of the covariates. Unfortunately, in data sets when the percentage of missing data is high, and the missing covariate patterns are highly non-monotone, the estimates of the nuisance parameters can lead to highly unstable estimates of the parameters of interest. We propose a conditional model for the covariate distribution that has several modelling advantages for the E-step and provides a reduction in the number of nuisance parameters, thus providing more stable estimates in finite samples. We present a clinical trials example with six covariates, five of which have some missing values.
Key Words: EM-algorithm Missing at random Non-monotone missing data
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Ryan, W. Huang, S. W Thurston, K. T Kelsey, J. K Wiencke, and D. C Christiani On the use of biomarkers for environmental health research Statistical Methods in Medical Research, June 1, 2004; 13(3): 207 - 225. [Abstract] [PDF] |
||||
![]() |
N. J Horton and N. M Laird Maximum likelihood analysis of generalized linear models with missing covariates Statistical Methods in Medical Research, February 1, 1999; 8(1): 37 - 50. [Abstract] [PDF] |
||||
