Biometrika Advance Access originally published online on January 24, 2009
Biometrika 2009 96(1):149-162; doi:10.1093/biomet/asn054
Articles |
Bayesian nonparametric functional data analysis through density estimation
Department of Applied Mathematics and Statistics, University of California, Mailstop SOE2, Santa Cruz, California 95064, U.S.A. abel{at}soe.ucsc.edu
Department of Statistical Sciences, Box 90251, Duke University, Durham, North Carolina 27708, U.S.A. dunson{at}stat.duke.edu alan{at}isds.duke.edu
Received for publication 1 March 2007. Revision received 1 March 2008.
In many modern experimental settings, observations are obtained in the form of functions and interest focuses on inferences about a collection of such functions. We propose a hierarchical model that allows us simultaneously to estimate multiple curves nonparametrically by using dependent Dirichlet process mixtures of Gaussian distributions to characterize the joint distribution of predictors and outcomes. Function estimates are then induced through the conditional distribution of the outcome given the predictors. The resulting approach allows for flexible estimation and clustering, while borrowing information across curves. We also show that the function estimates we obtain are consistent on the space of integrable functions. As an illustration, we consider an application to the analysis of conductivity and temperature at depth data in the north Atlantic.
Key Words: Dependent Dirichlet process Functional clustering Nonparametric Bayes inference Nonparametric regression Random probability measure
References
-
Albert J., Chib S. Bayesian analysis of binary and polychotomous response data. J. Am. Statist. Assoc. (1993) 88:669–79.[CrossRef][Web of Science]
Antoniak C. Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. Ann. Statist. (1974) 2:1152–74.[CrossRef]
Basu S., Chib S. Marginal likelihood and Bayes factors for Dirichiet process mixture models. J. Am. Statist. Assoc. (2003) 98:224–35.[CrossRef][Web of Science]
Behseta S., Kass R. E., Wallstrom G. L. Hierarchical models for assessing variability among functions. Biometrika (2005) 92:419–34.
Bush C. A., MacEachern S. N. A semiparametric Bayesian model for randomised block designs. Biometrika (1996) 83:275–85.
De Iorio M., Müller P., Rosner G. L., MacEachern S. N. An anova model for dependent random measures. J. Am. Statist. Assoc. (2004) 99:205–15.[CrossRef][Web of Science]
Dunson D. Bayesian dynamic modeling of latent trait distributions. Biostatistics (2006) 7:551–68.
Dunson D. B., Pillai N., Park J.-H. Bayesian density regression. J. R. Statist. Soc. (2007) B 69:163–83.[CrossRef]
Escobar M. D. Estimating normal means with a Dirichlet process prior. J. Am. Statist. Assoc. (1994) 89:268–77.[CrossRef][Web of Science]
Escobar M. D., West M. Bayesian density estimation and inference using mixtures. J. Am. Statist. Assoc. (1995) 90:577–88.[CrossRef][Web of Science]
Fan J. Q., Yim T. H. A crossvalidation method for estimating conditional densities. Biometrika (2004) 91:819–34.
Ferguson T. S. A Bayesian analysis of some nonparametric problems. Ann. Statist. (1973) 1:209–30.[CrossRef]
Gelfand A. E., Kottas A., MacEachern S. N. Bayesian nonparametric spatial modeling with Dirichlet process mixing. J. Am. Statist. Assoc. (2005) 100:1021–35.[CrossRef][Web of Science]
Ghosal S., Ghosh J., Ramamoorthi R. V. Posterior consistency of Dirichlet mixtures in density estimation. Ann. Statist. (1999) 27:143–58.[CrossRef]
Green P., Richardson S. Modelling heterogeneity with and without the Dirichlet process. Scand. J. Statist. (2001) 28:355–75.[CrossRef]
Griffin J. E., Steel M. F. J. Order based dependent Dirichlet processes. J. Am. Statist. Assoc. (2006) 101:179–94.[CrossRef][Web of Science]
Ishwaran H., James L. F. Gibbs sampling methods for stick-breaking priors. J. Am. Statist. Assoc. (2001) 96:161–73.[CrossRef][Web of Science]
Jain S., Neal R. M. A split-merge Markov chain Monte Carlo procedure for the Dirichlet process mixture model. J. Comp. Graph. Statist. (2004) 13:158–82.[CrossRef]
MacEachern S. N. Dependent nonparametric processes. Proc. Sect. Bayesian Statist. Sci. (1999) pp. 50–5. Alexandria, VA: American Statistical Association.
Morris J. S., Carroll R. J. Wavelet-based functional mixed models. J. R. Statist. Soc. (2006) B 68:179–99.[CrossRef]
Müller P., Erkanli A., West M. Bayesian curve fitting using multivariate normal mixtures. Biometrika (1996) 83:67–79.
Müller P., Quintana F., Rosner G. Hierarchical meta-analysis over related non-parametric Bayesian models. J. R. Statist. Soc. (2004) B 66:735–49.[CrossRef]
Pennell M. L., Dunson D. B. Bayesian semiparametric dynamic frailty models for multiple event time data. Biometrics (2006) 62:1044–52.[CrossRef][Web of Science][Medline]
Ray S., Mallick B. K. Functional clustering by Bayesian wavelet methods. J. R. Statist. Soc. (2006) B 68:305–32.[CrossRef]
Rice J. A., Silverman B. W. Estimating the mean and covariance structure nonparametrically when the data are curves. J. R. Statist. Soc. (1991) B 53:233–43.
Rodriguez A., Dunson D. B., Gelfand A. E. The nested Dirichlet process (with discussion). J. Am. Statist. Assoc. (2008) 103:1131–44.[CrossRef][Web of Science]
Sethuraman J. A constructive definition of Dirichlet priors. Statist. Sinica (1994) 4:639–50.
Teh Y. W., Jordan M. I., Beal M. J., Blei D. M. Sharing clusters among related groups: hierarchical Dirichlet processes. J. Am. Statist. Assoc. (2006) 101:1566–81.[CrossRef][Web of Science]
Wu H., Zhang J. T. Local polynomial mixed-effects models for longitudinal data. J. Am. Statist. Assoc. (2002) 97:883–97.[CrossRef][Web of Science]
| ||||||||||||||||||||||||||||||||||||||||||||||||||