The Library

Error, reproducibility and sensitivity : a pipeline for data processing of Agilent oligonucleotide expression arrays

Tools

Chain, B. M., Bowen, Helen C., Hammond, John P., Posch, Wilfried, Rasaiyaah, Jane, Tsang, Jhen and Noursadeghi, Mahdad (2010) Error, reproducibility and sensitivity : a pipeline for data processing of Agilent oligonucleotide expression arrays. BMC Bioinformatics, Vol.11 (No.344). doi:10.1186/1471-2105-11-344 ISSN 1471-2105.

PDF
WRAP_Hammond_Agilent_oligonucleotide.pdf - Requires a PDF viewer.
Download (2140Kb)

Official URL: http://dx.doi.org/10.1186/1471-2105-11-344

Request Changes to record.

Abstract

Background
Expression microarrays are increasingly used to obtain large scale transcriptomic information on a wide range of biological samples. Nevertheless, there is still much debate on the best ways to process data, to design experiments and analyse the output. Furthermore, many of the more sophisticated mathematical approaches to data analysis in the literature remain inaccessible to much of the biological research community. In this study we examine ways of extracting and analysing a large data set obtained using the Agilent long oligonucleotide transcriptomics platform, applied to a set of human macrophage and dendritic cell samples.

Results
We describe and validate a series of data extraction, transformation and normalisation steps which are implemented via a new R function. Analysis of replicate normalised reference data demonstrate that intrarray variability is small (only around 2% of the mean log signal), while interarray variability from replicate array measurements has a standard deviation (SD) of around 0.5 log2 units ( 6% of mean). The common practise of working with ratios of Cy5/Cy3 signal offers little further improvement in terms of reducing error. Comparison to expression data obtained using Arabidopsis samples demonstrates that the large number of genes in each sample showing a low level of transcription reflect the real complexity of the cellular transcriptome. Multidimensional scaling is used to show that the processed data identifies an underlying structure which reflect some of the key biological variables which define the data set. This structure is robust, allowing reliable comparison of samples collected over a number of years and collected by a variety of operators.

Conclusions
This study outlines a robust and easily implemented pipeline for extracting, transforming normalising and visualising transcriptomic array data from Agilent expression platform. The analysis is used to obtain quantitative estimates of the SD arising from experimental (non biological) intra- and interarray variability, and for a lower threshold for determining whether an individual gene is expressed. The study provides a reliable basis for further more extensive studies of the systems biology of eukaryotic cells.

Item Type:

Journal Article

Subjects:

Q Science > QP Physiology

Divisions:

Faculty of Science, Engineering and Medicine > Science > Life Sciences (2010- ) > Warwick HRI (2004-2010)

Library of Congress Subject Headings (LCSH):

DNA microarrays, Human genome -- Data processing, Oligonucleotides -- Data processing

Journal or Publication Title:

BMC Bioinformatics

Publisher:

BioMed Central Ltd.

ISSN:

1471-2105

Official Date:

24 June 2010

Dates:

Date	Event
24 June 2010	Published

Volume:

Vol.11

Number:

No.344

DOI:

10.1186/1471-2105-11-344

Status:

Peer Reviewed

Access rights to Published version:

Open Access (Creative Commons)

Funder:

Biotechnology and Biological Sciences Research Council (Great Britain) (BBSRC), Wellcome Trust (London, England), National Institute for Health Research (Great Britain) (NIHR)

Data sourced from Thomson Reuters' Web of Knowledge

Request changes or add full text files to a record

Repository staff actions (login required)

View Item

Downloads

Downloads per month over past year

View more statistics

University of Warwick
Publications service & WRAP

Highlight your research

The Library

Error, reproducibility and sensitivity : a pipeline for data processing of Agilent oligonucleotide expression arrays

Abstract

Repository staff actions (login required)

Downloads

University of WarwickPublications service & WRAP

Highlight your research

The Library

Error, reproducibility and sensitivity : a pipeline for data processing of Agilent oligonucleotide expression arrays

Abstract

Repository staff actions (login required)

Downloads

University of Warwick
Publications service & WRAP