© 1995 by Biometrika Trust
A mean score method for missing and auxiliary covariate data in regression models
Department of Statistics, University College Dublin Belfield, Dublin 4, Ireland
Fred Hutchinson Cancer Research Center 1124 Columbia Street, Seattle, Washington 98104, U.S.A.
We consider regression analysis when incomplete or auxiliary covariate data are available for all study subjects and, in addition, for a subset called the validation sample, true covariate data of interest have been ascertained. The term auxiliary data refers to data not in the regression model, but thought to be informative about the true missing covariate data of interest. We discuss a method which is nonparametric with respect to the association between available and missing data, allows missingness to depend on available response and covariate values, and is applicable to both cohort and case-control study designs. The method previously proposed by Flanders & Greenland (1991) and by Zhao & Lipsitz (1992) is generalised and asymptotic theory is derived. Our expression for the asymptotic variance of the estimator provides intuition regarding performance of the method. Optimal sampling strategies for the validation set are also suggested by the asymptotic results.
Key Words: Double sampling EM algorithm Hot-deck Semiparametric Surrogate data Two-stage sampling
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Guolo Robust techniques for measurement error correction: a review Statistical Methods in Medical Research, December 1, 2008; 17(6): 555 - 580. [Abstract] [PDF] |
||||
![]() |
S.-C. Yun, Y. Lee, and M. G. Kenward Using Hierarchical Likelihood for Missing Data Problems Biometrika, December 1, 2007; 94(4): 905 - 919. [Abstract] [PDF] |
||||
![]() |
M. G Kenward and J. Carpenter Multiple imputation: current perspectives Statistical Methods in Medical Research, June 1, 2007; 16(3): 199 - 218. [Abstract] [PDF] |
||||
![]() |
C. Wang, S.-M. Lee, and E. C. Chao Numerical equivalence of imputing scores and weighted estimators in regression analysis with missing covariates Biostat., April 1, 2007; 8(2): 468 - 473. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Mcnamee Optimal design and efficiency of two-phase case-control studies with error-prone and error-free exposure measures Biostat., October 1, 2005; 6(4): 590 - 603. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Ryan, W. Huang, S. W Thurston, K. T Kelsey, J. K Wiencke, and D. C Christiani On the use of biomarkers for environmental health research Statistical Methods in Medical Research, June 1, 2004; 13(3): 207 - 225. [Abstract] [PDF] |
||||
![]() |
M. E. Halloran and I. M. Longini Jr. Using Validation Sets for Outcomes and Exposure to Infection in Vaccine Field Studies Am. J. Epidemiol., September 1, 2001; 154(5): 391 - 398. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Thurigen, D. Spiegelman, M. Blettner, C. Heuer, and H. Brenner Measurement error correction using validation data: a review of methods and their applicability in case-control studies Statistical Methods in Medical Research, October 1, 2000; 9(5): 447 - 474. [Abstract] [PDF] |
||||
![]() |
A. Pickles Psychiatric epidemiology Statistical Methods in Medical Research, June 1, 1998; 7(3): 235 - 251. [Abstract] [PDF] |
||||



