Skip to content Skip to navigation
University of Warwick
  • Study
  • |
  • Research
  • |
  • Business
  • |
  • Alumni
  • |
  • News
  • |
  • About

University of Warwick
Publications service & WRAP

Highlight your research

  • WRAP
    • Home
    • Search WRAP
    • Browse by Warwick Author
    • Browse WRAP by Year
    • Browse WRAP by Subject
    • Browse WRAP by Department
    • Browse WRAP by Funder
    • Browse Theses by Department
  • Publications Service
    • Home
    • Search Publications Service
    • Browse by Warwick Author
    • Browse Publications service by Year
    • Browse Publications service by Subject
    • Browse Publications service by Department
    • Browse Publications service by Funder
  • Help & Advice
University of Warwick

The Library

  • Login
  • Admin

Evaluating the performance of tools used to call minority variants from whole genome short-read data

Tools
- Tools
+ Tools

Mohammed, Khadija Said, Kibinge, Nelson, Prins, Pjotr, Agoti, Charles N., Cotten, Matthew, Nokes, D. James, Brand, Samuel and Githinji, George (2018) Evaluating the performance of tools used to call minority variants from whole genome short-read data. Wellcome Open Research, 3 . 21. doi:10.12688/wellcomeopenres.13538.2

[img]
Preview
PDF
WRAP-evaluating-performance-tools-used-call-minority-variants-whole-genome-short-read-data-Nokes-2018.pdf - Published Version - Requires a PDF viewer.
Available under License Creative Commons Attribution 4.0.

Download (2313Kb) | Preview
Official URL: http://dx.doi.org/10.12688/wellcomeopenres.13538.2

Request Changes to record.

Abstract

Background

High-throughput whole genome sequencing facilitates investigation of minority virus sub-populations from virus positive samples. Minority variants are useful in understanding within and between host diversity, population dynamics and can potentially assist in elucidating person-person transmission pathways. Several minority variant callers have been developed to describe low frequency sub-populations from whole genome sequence data. These callers differ based on bioinformatics and statistical methods used to discriminate sequencing errors from low-frequency variants.

Methods

We evaluated the diagnostic performance and concordance between published minority variant callers used in identifying minority variants from whole-genome sequence data from virus samples. We used the ART-Illumina read simulation tool to generate three artificial short-read datasets of varying coverage and error profiles from an RSV reference genome. The datasets were spiked with nucleotide variants at predetermined positions and frequencies. Variants were called using FreeBayes, LoFreq, Vardict, and VarScan2. The variant callers’ agreement in identifying known variants was quantified using two measures; concordance accuracy and the inter-caller concordance.

Results

The variant callers reported differences in identifying minority variants from the datasets. Concordance accuracy and inter-caller concordance were positively correlated with sample coverage. FreeBayes identified the majority of variants although it was characterised by variable sensitivity and precision in addition to a high false positive rate relative to the other minority variant callers and which varied with sample coverage. LoFreq was the most conservative caller.

Conclusions

We conducted a performance and concordance evaluation of four minority variant calling tools used to identify and quantify low frequency variants. Inconsistency in the quality of sequenced samples impacts on sensitivity and accuracy of minority variant callers. Our study suggests that combining at least three tools when identifying minority variants is useful in filtering errors when calling low frequency variants.

Item Type: Journal Article
Subjects: Q Science > QH Natural history
Q Science > QR Microbiology
Divisions: Faculty of Science > Life Sciences (2010- )
Library of Congress Subject Headings (LCSH): Genomics, Viruses -- Africa, Bioinformatics
Journal or Publication Title: Wellcome Open Research
Publisher: F1000Research
ISSN: 2398-502X
Official Date: 5 March 2018
Dates:
DateEvent
13 September 2018Updated
5 March 2018Published
Date of first compliant deposit: 9 October 2018
Volume: 3
Article Number: 21
DOI: 10.12688/wellcomeopenres.13538.2
Status: Peer Reviewed
Publication Status: Published
Access rights to Published version: Open Access
RIOXX Funder/Project Grant:
Project/Grant IDRIOXX Funder NameFunder ID
102975Wellcome Trusthttp://dx.doi.org/10.13039/100010269
DELTAS Africa Initiative [DEL-15-003]African Academy of Scienceshttp://viaf.org/viaf/126099153
UNSPECIFIEDNew Partnership for Africa's Developmenthttp://dx.doi.org/10.13039/501100009250
107769Wellcome Trusthttp://dx.doi.org/10.13039/100010269
UNSPECIFIEDGreat Britain. GovernmentUNSPECIFIED

Request changes or add full text files to a record

Repository staff actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics

twitter

Email us: wrap@warwick.ac.uk
Contact Details
About Us