© 1988 by Biometrika Trust
On stochastic complexity and nonparametric density estimation
Department of Statistics, Faculties, Australian National University Canberra, A.C.T., 2601, Australia
We use the concepts of stochastic complexity, description length, and model selection to develop data-based methods for choosing smoothing parameters in nonparametric density estimation. In the case of histogram estimators, we derive a simple, exact formula for stochastic complexity when the prior distribution of cell probabilities is uniform over the class of all possible choices. The formula depends only on the data and the smoothing parameter, which is readily chosen according to the criterion of minimum stochastic complexity. Approaches based on stochastic complexity and description length are shown to be asymptotically equivalent in certain circumstances. They produce a degree of smoothing which is almost optimal from the viewpoint of minimizing L
, or supremum, distance, but which smooths a little more than is optimal in the sense of minimizing Lr distance for any finite value of r.
Key Words: Histogram estimator Kernel estimator Minimum description length Model selection Nonparametric density estimator Penalized likelihood Stochastic complexity
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. Nemenman Fluctuation-Dissipation Theorem and Models of Learning Neural Comput., September 1, 2005; 17(9): 2006 - 2033. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Still and W. Bialek How Many Clusters? An Information-Theoretic Perspective Neural Comput., December 1, 2004; 16(12): 2483 - 2506. [Abstract] [Full Text] [PDF] |
||||
