© 2002 by Biometrika Trust
Semiparametric regression for count data
1 Department of Statistics and Applied Mathematics D.De Castro, Università di Torino, Piazza Arbarello, 8, 10122 Torino, Italy carota@cisi.unito.it 2 Department of Oncology, Johns Hopkins University, 550 North Broadway, Suite 1103, Baltimore, Maryland 21205, U.S.A. gp@jhu.edu
We introduce a class of Bayesian semiparametric models for regression problems in which the response variable is a count.Our goal is to provide a flexible, easy-to-implement and robust extension of generalised linear models, for datasets of moderate or large size. Our approach is based on modelling the distribution of the response variable using a Dirichlet process, whose mean distribution function is itself random and is given a parametric form, such as a generalised linear model. The effects of the explanatory variables on the response are modelled via both the parameters of the mean distribution function of the Dirichlet process and the total mass parameter. We discuss modelling options and relationships with other approaches. We derive in closed form the marginal posterior distribution of the regression coefficients and discuss its use in inference and computing. We illustrate the benefits of our approach with a prognostic model for early breast cancer patients.
Key Words: Generalised linear model; Marginal model; Product of Dirichlet process mixtures
Received March 2000. Revised July 2001
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Navarrete, F. A Quintana, and P. Muller Some issues in nonparametric Bayesian modeling using species sampling models Statistical Modeling, April 1, 2008; 8(1): 3 - 21. [Abstract] [PDF] |
||||
