Biometrika Advance Access originally published online on November 3, 2008
Biometrika 2008 95(4):933-946; doi:10.1093/biomet/asn042
| ||||||||||||||||||||||||||||||||||||||||||||||||||
Articles |
Multiple imputation when records used for imputation are not used or disseminated for analysis
Department of Statistical Science, Duke University, Durham, North Carolina 27708-0251, U.S.A. jerry{at}stat.duke.edu
Received for publication 1 July 2007.
Revision received 1 March 2008.
| Abstract |
|---|
When some of the records used to estimate the imputation models in multiple imputation are not used or available for analysis, the usual multiple imputation variance estimator has positive bias. We present an alternative approach that enables unbiased estimation of variances and, hence, calibrated inferences in such contexts. First, using all records, the imputer samples m values of the parameters of the imputation model. Second, for each parameter draw, the imputer simulates the missing values for all records n times. From these mn completed datasets, the imputer can analyse or disseminate the appropriate subset of records. We develop methods for interval estimation and significance testing for this approach. Methods are presented in the context of multiple imputation for measurement error.
Key Words: Combining data Confidentiality Measurement error Missing data Multiple imputation