"De novo assembly and genotyping of variants using colored de Bruijn graphs", Iqbal, Caccamo, Turner, Flicek, McVean
 Nature Genetics, (doi:10.1038/ng.1028)
This link will work for a bit
 http://www.nature.com/ng/journal/vaop/ncurrent/full/ng.1028.html
You may be interested in some of the following things we cover
 - low and predictable memory use
  - simultaneous assembly of multiple samples, and variant calling done directly (without assembling a consensus first)
   (eg you could assemble over 2000 S. aureus in 32Gb of RAM or 10 humans in 256Gb of RAM).
  - a mathematical model extending the Lander-Waterman statistics to include information on repeat content,
   allowing you to make choices of kmer-size depending on what you want to achieve
  - validation using fully sequenced fosmids
  - comparison of Cortex variant calls with 1000genomes pilot calls
  - showing you can make good variant calls without using a reference if you sequence multiple samples from a population (we did this with chimps)
  - a proof-of-concept of HLA-typing at HLA-B using whole genome (not pull-down) data
 
 
No comments:
Post a Comment