Skip Navigation

Biometrika 1995 82(2):315-325; doi:10.1093/biomet/82.2.315
© 1995 by Biometrika Trust
This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by CHU, C. K.
Right arrow Articles by CHENG, K. F.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nonparametric regression estimates using misclassified binary responses

C. K. CHU and K. F. CHENG

Institute of Statistics, National Tsinghua University Hsinchu, Taiwan
Institute of Statistics, National Central University Chungli, Taiwan

For random design nonparametric regression, in the case that the responses are binary and subject to misclassification, the performance of the kernel estimator is investigated. The kernel estimator is generally biased for the local proportion. To adjust for the bias, the double sampling scheme of Tenenbein (1970, 1971) is considered. A plugged-in kernel estimator and an imputed kernel estimator, which adjust for the effect of misclassification on the kernel estimator, are proposed, and their asymptotic mean squared errors are analysed. The plugged-in kernel estimator is better than the simple kernel estimator, which uses only the data without misclassification in the validation subsample, in the sense of having smaller asymptotic mean squared error. However, the imputed kernel estimator has smaller asymptotic variance. If the misclassification probabilities are constant, then the two proposed estimators have the same asymptotic bias. In this case, the imputed kernel estimator is always better than the plugged-in kernel estimator. For general misclassification probabilities, the asymptotic biases of the two proposed estimators are not comparable in magnitude. However, our simulation results demonstrate that, even when the misclassification probabilities are not constant, the imputed kernel estimator is still better for reasonable sample sizes.

Key Words: Double sampling scheme • Imputed kernel estimator • Kernel estimator • Misclassification • Nonparametric regression • Plugged-in kernel estimator


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.