Skip Navigation

Biometrika 2006 93(4):877-893; doi:10.1093/biomet/93.4.877
This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kim, S.
Right arrow Articles by Vannucci, M.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2006 Biometrika Trust

Variable selection in clustering via Dirichlet process mixture models

Sinae Kim1, Mahlet G. Tadesse2 and Marina Vannucci3

1 Department of Statistics, Texas A&M University, College Station, Texas 77843-3143, U.S.A. sinae{at}stat.tamu.edu, 2 Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, Pennsylvania 19104-6021, U.S.A. mtadesse{at}cceb.upenn.edu, 3 Department of Statistics, Texas A&M University, College Station, Texas 77843-3143, U.S.A. mvannucci{at}stat.tamu.edu


   Abstract

The increased collection of high-dimensional data in various fields has raised a strong interest in clustering algorithms and variable selection procedures. In this paper, we propose a model-based method that addresses the two problems simultaneously. We introduce a latent binary vector to identify discriminating variables and use Dirichlet process mixture models to define the cluster structure. We update the variable selection index using a Metropolis algorithm and obtain inference on the cluster structure via a split-merge Markov chain Monte Carlo technique. We explore the performance of the methodology on simulated data and illustrate an application with a DNA microarray study.

Key Words: Bayesian inference; Clustering; Dirichlet process mixture model; DNA microarray data analysis; Variable selection.


Received December 2004. Revised March 2006.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Phil Trans R Soc AHome page
D. L. Banks, L. House, and K. Killourhy
Cherry-picking for complex data: robust structure discovery
Phil Trans R Soc A, November 13, 2009; 367(1906): 4339 - 4359.
[Abstract] [Full Text] [PDF]


Home page
BiometrikaHome page
D. B. Dunson and J.-H. Park
Kernel stick-breaking processes
Biometrika, June 1, 2008; 95(2): 307 - 323.
[Abstract] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.