Skip to content Skip to navigation
University of Warwick
  • Study
  • |
  • Research
  • |
  • Business
  • |
  • Alumni
  • |
  • News
  • |
  • About

University of Warwick
Publications service & WRAP

Highlight your research

  • WRAP
    • Home
    • Search WRAP
    • Browse by Warwick Author
    • Browse WRAP by Year
    • Browse WRAP by Subject
    • Browse WRAP by Department
    • Browse WRAP by Funder
    • Browse Theses by Department
  • Publications Service
    • Home
    • Search Publications Service
    • Browse by Warwick Author
    • Browse Publications service by Year
    • Browse Publications service by Subject
    • Browse Publications service by Department
    • Browse Publications service by Funder
  • Help & Advice
University of Warwick

The Library

  • Login
  • Admin

R/BHC : fast Bayesian hierarchical clustering for microarray data

Tools
- Tools
+ Tools

Savage, Richard S., Heller, K. (Katherine), Xu, Yang, Ghahramani, Zoubin, Truman, William M., Grant, Murray, Denby, Katherine J. and Wild, David L. (2009) R/BHC : fast Bayesian hierarchical clustering for microarray data. BMC Bioinformatics, 10 . 242. doi:10.1186/1471-2105-10-242 ISSN 1471-2105.

[img]
Preview
PDF
WRAP_Savage_hr-151209-bmc_article_june09.pdf - Requires a PDF viewer.

Download (227Kb)
Official URL: http://dx.doi.org/10.1186/1471-2105-10-242

Request Changes to record.

Abstract

Background:
Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtained.
Results:
We present an R/Bioconductor port of a fast novel algorithm for Bayesian agglomerative hierarchical clustering and demonstrate its use in clustering gene expression microarray data. The method performs bottom-up hierarchical clustering, using a Dirichlet Process (infinite mixture) to model uncertainty in the data and Bayesian model selection to decide at each step which clusters to merge.
Conclusion:
Biologically plausible results are presented from a well studied data set: expression profiles of A. thaliana subjected to a variety of biotic and abiotic stresses. Our method avoids several limitations of traditional methods, for example how many clusters there should be and how to choose a principled distance metric.

Item Type: Journal Article
Subjects: R Medicine > R Medicine (General)
Q Science > QA Mathematics
Divisions: Faculty of Science, Engineering and Medicine > Research Centres > Warwick Systems Biology Centre
Faculty of Science, Engineering and Medicine > Science > Life Sciences (2010- ) > Warwick HRI (2004-2010)
Library of Congress Subject Headings (LCSH): Bayesian statistical decision theory, Gene expression -- Statistical methods, Dirichlet series, Arabidopsis thaliana
Journal or Publication Title: BMC Bioinformatics
Publisher: BioMed Central Ltd.
ISSN: 1471-2105
Official Date: 6 August 2009
Dates:
DateEvent
6 August 2009Submitted
Volume: 10
Article Number: 242
DOI: 10.1186/1471-2105-10-242
Status: Peer Reviewed
Publication Status: Published
Access rights to Published version: Open Access (Creative Commons)
Funder: Engineering and Physical Sciences Research Council (EPSRC), Biotechnology and Biological Sciences Research Council (Great Britain) (BBSRC), Marie Curie Fellowship Association (MCFA)
Grant number: EP/F027400/1 (EPSRC), BB/F005806/1 (BBSRC), 46444 (MCFA)

Request changes or add full text files to a record

Repository staff actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics

twitter

Email us: wrap@warwick.ac.uk
Contact Details
About Us