Skip to content Skip to navigation
University of Warwick
  • Study
  • |
  • Research
  • |
  • Business
  • |
  • Alumni
  • |
  • News
  • |
  • About

University of Warwick
Publications service & WRAP

Highlight your research

  • WRAP
    • Home
    • Search WRAP
    • Browse by Warwick Author
    • Browse WRAP by Year
    • Browse WRAP by Subject
    • Browse WRAP by Department
    • Browse WRAP by Funder
    • Browse Theses by Department
  • Publications Service
    • Home
    • Search Publications Service
    • Browse by Warwick Author
    • Browse Publications service by Year
    • Browse Publications service by Subject
    • Browse Publications service by Department
    • Browse Publications service by Funder
  • Help & Advice
University of Warwick

The Library

  • Login
  • Admin

Heterogeneous large datasets integration using bayesian factor regression

Tools
- Tools
+ Tools

Avalos-Pacheco, Alejandra, Rossell, David and Savage, Richard S. (2022) Heterogeneous large datasets integration using bayesian factor regression. Bayesian Analysis, 17 (1). pp. 33-66. doi:10.1214/20-BA1240

[img]
Preview
PDF
WRAP-Heterogenous-large-datasets-integration-using-bayesian-factor-regression-Avalos-Pacheco-2022.pdf - Published Version - Requires a PDF viewer.
Available under License Creative Commons Attribution 4.0.

Download (1248Kb) | Preview
Official URL: http://dx.doi.org/10.1214/20-BA1240

Request Changes to record.

Abstract

Two key challenges in modern statistical applications are the large amount of information recorded per individual, and that such data are often not collected all at once but in batches. These batch effects can be complex, causing distortions in both mean and variance. We propose a novel sparse latent factor regression model to integrate such heterogeneous data. The model provides a tool for data exploration via dimensionality reduction and sparse low-rank covariance estimation while correcting for a range of batch effects. We study the use of several sparse priors (local and non-local) to learn the dimension of the latent factors. We provide a flexible methodology for sparse factor regression which is not limited to data with batch effects. Our model is fitted in a deterministic fashion by means of an EM algorithm for which we derive closed-form updates, contributing a novel scalable algorithm for non-local priors of interest beyond the immediate scope of this paper. We present several examples, with a focus on bioinformatics applications. Our results show an increase in the accuracy of the dimensionality reduction, with non-local priors substantially improving the reconstruction of factor cardinality. The results of our analyses illustrate how failing to properly account for batch effects can result in unreliable inference. Our model provides a novel approach to latent factor regression that balances sparsity with sensitivity in scenarios both with and without batch effects and is highly computationally efficient.

Item Type: Journal Article
Subjects: Q Science > QA Mathematics
Q Science > QH Natural history
Divisions: Faculty of Science, Engineering and Medicine > Science > Statistics
Library of Congress Subject Headings (LCSH): Bayesian statistical decision theory , Meta-analysis, Bioinformatics -- Statistical methods
Journal or Publication Title: Bayesian Analysis
Publisher: International Society for Bayesian Analysis
ISSN: 1931-6690
Official Date: 2022
Dates:
DateEvent
2022Published
15 September 2020Available
Volume: 17
Number: 1
Page Range: pp. 33-66
DOI: 10.1214/20-BA1240
Status: Peer Reviewed
Publication Status: Published
Access rights to Published version: Open Access
RIOXX Funder/Project Grant:
Project/Grant IDRIOXX Funder NameFunder ID
CVU5464444Consejo Nacional de Ciencia y Tecnologíahttp://dx.doi.org/10.13039/501100003141
R01 CA158113-01National Institutes of Healthhttp://dx.doi.org/10.13039/100000002
RyC-2015-18544National Institutes of Healthhttp://dx.doi.org/10.13039/100000002
PGC2018-101643-B-I00Ministerio de Ciencia e Innovaciónhttp://dx.doi.org/10.13039/501100004837
2017 Fundación BBVAhttps://www.fbbva.es/
Related URLs:
  • Related dataset

Request changes or add full text files to a record

Repository staff actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics

twitter

Email us: wrap@warwick.ac.uk
Contact Details
About Us