Skip Navigation


Biometrika Advance Access originally published online on January 30, 2009
Biometrika 2009 96(1):175-186; doi:10.1093/biomet/asn068
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
96/1/175    most recent
asn068v1
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Hall, P.
Right arrow Articles by Robinson, A. P.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2009 Biometrika Trust

Articles

Reducing variability of crossvalidation for smoothing-parameter choice

Peter Hall and Andrew P. Robinson

Department of Mathematics and Statistics, The University of Melbourne, Victoria 3010, Australia halpstat{at}ms.unimelb.edu.au andrewpr{at}ms.unimelb.edu.au

Received for publication 1 August 2007. Revision received 1 June 2008.
   Abstract

One of the attractions of crossvalidation, as a tool for smoothing-parameter choice, is its applicability to a wide variety of estimator types and contexts. However, its detractors comment adversely on the relatively high variance of crossvalidatory smoothing parameters, noting that this compromises the performance of the estimators in which those parameters are used. We show that the variability can be reduced simply, significantly and reliably by employing bootstrap aggregation or bagging. We establish that in theory, when bagging is implemented using an adaptively chosen resample size, the variability of crossvalidation can be reduced by an order of magnitude. However, it is arguably more attractive to use a simpler approach, based for example on half-sample bagging, which can reduce variability by approximately 50%.

Key Words: Bagging • Bandwidth • Bootstrap aggregation • Half-sampling • Kernel estimation • Nonparametric density estimation • Nonparametric regression • Statistical smoothing • Subsampling


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.