Thursday 4 October 2012

Paired-End Sequencing of Long-Range DNA Fragments for De Novo Assembly of Large, Complex Mammalian Genomes by Direct Intra-Molecule Ligation.


Item 1 of 1    (Display the citation in PubMed)

1. PLoS One. 2012;7(9):e46211. Epub 2012 Sep 27.

Paired-End Sequencing of Long-Range DNA Fragments for De Novo Assembly of Large, Complex Mammalian Genomes by Direct Intra-Molecule Ligation.

Asan, Geng C, Chen Y, Wu K, Cai Q, Wang Y, Lang Y, Cao H, Yang H, Wang J, Zhang X.

Source

BGI-Shenzhen, Shenzhen, Guangdong, China.

Abstract

BACKGROUND:

The relatively short read lengths from next generation sequencing (NGS) technologies still pose a challenge for de novo assembly of complex mammal genomes. One important solution is to use paired-end (PE) sequence information experimentally obtained from long-range DNA fragments (>1 kb). Here, we characterize and extend a long-range PE library construction method based on direct intra-molecule ligation (or molecular linker-free circularization) for NGS.

RESULTS:

We found that the method performs stably for PE sequencing of 2- to 5- kb DNA fragments, and can be extended to 10-20 kb (and even in extremes, up to ∼35 kb). We also characterized the impact of low quality input DNA on the method, and develop a whole-genome amplification (WGA) based protocol using limited input DNA (<1 µg). Using this PE dataset, we accurately assembled the YanHuang (YH) genome, the first sequenced Asian genome, into a scaffold N50 size of >2 Mb, which is over100-times greater than the initial size produced with only small insert PE reads(17 kb). In addition, we mapped two 7- to 8- kb insertions in the YH genome using the larger insert sizes of the long-range PE data.

CONCLUSIONS:

In conclusion, we demonstrate here the effectiveness of this long-range PE sequencing method and its use for the de novo assembly of a large, complex genome using NGS short reads.
PMID: 23029438 [PubMed - as supplied by publisher]

1 comment:

  1. There are now several DNA databases in existence around the world. Some are private, but most of the largest databases are government controlled. The United States maintains the largest DNA database, with the Combined DNA Index System, holding over 5 million records as of 2007. The United Kingdom maintains the National DNA Database which is of similar size, despite the UK's smaller population.

    ReplyDelete

Datanami, Woe be me