The Library
ParticleMDI - Particle Monte Carlo methods for the cluster analysis of multiple datasets with applications to cancer subtype identification
Tools
Cunningham, Nathan, Griffin, Jim E. and Wild, David L. (2020) ParticleMDI - Particle Monte Carlo methods for the cluster analysis of multiple datasets with applications to cancer subtype identification. Advances in Data Analysis and Classification, 14 . pp. 463-484. doi:10.1007/s11634-020-00401-y ISSN 1862-5347.
|
PDF
WRAP-Particle-Monte-Carlo-methods-cluster-analysis-Wild-2020.pdf - Accepted Version - Requires a PDF viewer. Download (3807Kb) | Preview |
Official URL: https://doi.org/10.1007/s11634-020-00401-y
Abstract
We present a novel nonparametric Bayesian approach for performing cluster analysis in a context where observational units have data arising from multiple sources. Our approach uses a particle Gibbs sampler for inference in which cluster allocations are jointly updated using a conditional particle filter within a Gibbs sampler, improving the mixing of the MCMC chain. We develop several approaches to improving the computational performance of our algorithm. These methods can achieve greater than an order-of-magnitude improvement in performance at no cost to accuracy and can be applied more broadly to Bayesian inference for mixture models with a single dataset. We apply our algorithm to the discovery of risk cohorts amongst 243 patients presenting with kidney renal clear cell carcinoma, using samples from the Cancer Genome Atlas, for which there are gene expression, copy number variation, DNA methylation, protein expression and microRNA data. We identify 4 distinct consensus subtypes and show they are prognostic for survival rate (p < 0.0001).
Item Type: | Journal Article | ||||||||
---|---|---|---|---|---|---|---|---|---|
Subjects: | Q Science > QA Mathematics | ||||||||
Divisions: | Faculty of Science, Engineering and Medicine > Science > Mathematics Faculty of Science, Engineering and Medicine > Science > Statistics Faculty of Science, Engineering and Medicine > Research Centres > Warwick Systems Biology Centre |
||||||||
Library of Congress Subject Headings (LCSH): | Cluster analysis, Bayesian statistical decision theory, Monte Carlo method | ||||||||
Journal or Publication Title: | Advances in Data Analysis and Classification | ||||||||
Publisher: | Springer | ||||||||
ISSN: | 1862-5347 | ||||||||
Official Date: | 2020 | ||||||||
Dates: |
|
||||||||
Volume: | 14 | ||||||||
Page Range: | pp. 463-484 | ||||||||
DOI: | 10.1007/s11634-020-00401-y | ||||||||
Status: | Peer Reviewed | ||||||||
Publication Status: | Published | ||||||||
Reuse Statement (publisher, data, author rights): | This is a post-peer-review, pre-copyedit version of an article published in Advances in Data Analysis and Classification. The final authenticated version is available online at: http://dx.doi.org/10.1007/s11634-020-00401-y | ||||||||
Access rights to Published version: | Restricted or Subscription Access | ||||||||
Date of first compliant deposit: | 10 June 2020 | ||||||||
Date of first compliant Open Access: | 12 June 2021 | ||||||||
RIOXX Funder/Project Grant: |
|
||||||||
Related URLs: |
Request changes or add full text files to a record
Repository staff actions (login required)
View Item |
Downloads
Downloads per month over past year