Thursday, 13 September 2012

[pub]: accurate comparison and validation of single-nucleotide variants: the pibase software.


Item 1 of 1    (Display the citation in PubMed)

1. Nucleic Acids Res. 2012 Sep 10. [Epub ahead of print]

From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software.

Forster M, Forster P, Elsharawy A, Hemmrich G, Kreck B, Wittig M, Thomsen I, Stade B, Barann M, Ellinghaus D, Petersen BS, May S, Melum E, Schilhabel MB, Keller A, Schreiber S, Rosenstiel P, Franke A.

Source

Institute of Clinical Molecular Biology, Christian-Albrechts-University Kiel, D-24105 Kiel, Institute of Forensic Genetics, D-48161 Münster, Germany, Murray Edwards College, University of Cambridge, CB3 0DF, UK, Division of Gastroenterology, Hepatology, and Endoscopy, Brigham and Women's Hospital, Harvard Medical School, MA 02115, USA, Norwegian PSC Research Center, Clinic for specialized Medicine and Surgery, Oslo University Hospital, Rikshospitalet, Oslo, Norway, Saarland University, Department of Human Genetics, D-66123 Saarbrücken and General Internal Medicine, Christian-Albrechts-University Kiel, D-24105 Kiel, Germany.

Abstract

Scientists working with single-nucleotide variants (SNVs), inferred by next-generation sequencing software, often need further information regarding true variants, artifacts and sequence coverage gaps. In clinical diagnostics, e.g. SNVs must usually be validated by visual inspection or several independent SNV-callers. We here demonstrate that 0.5-60% of relevant SNVs might not be detected due to coverage gaps, or might be misidentified. Even low error rates can overwhelm the true biological signal, especially in clinical diagnostics, in research comparing healthy with affected cells, in archaeogenetic dating or in forensics. For these reasons, we have developed a package called pibase, which is applicable to diploid and haploid genome, exome or targeted enrichment data. pibase extracts details on nucleotides from alignment files at user-specified coordinates and identifies reproducible genotypes, if present. In test cases pibase identifies genotypes at 99.98% specificity, 10-fold better than other tools. pibase also provides pair-wise comparisons between healthy and affected cells using nucleotide signals (10-fold more accurately than a genotype-based approach, as we show in our case study of monozygotic twins). This comparison tool also solves the problem of detecting allelic imbalance within heterozygous SNVs in copy number variation loci, or in heterogeneous tumor sequences.

PMID: 22965131 [PubMed - as supplied by publisher]

No comments:

Post a Comment

Datanami, Woe be me