Wednesday, 6 June 2012

14TB of liver cancer genome data from this available in GigaDB

YT plugging recent ACRG collaboration with BGI. 14TB of liver cancer genome data from this available in GigaDB

Genomic DNA was purified for at least 30-fold coverage paired-end (PE) sequencing, and PE reads were mapped on human reference genome (UCSC build hg19) and HBV (NC_003977).  Two sequencing libraries with different insert size were constructed for each genomic DNA sample (200bp and 800bp).  Paired end, 90bp read length sequencing was performed in the HiSeq 2000 sequencer according to the manufacturer's instructions.  Raw gene expression profiling data of these human HCC samples have been deposited to GEO with the accession number GSE25097.

Raw data

Raw data



May 31, 2012: Data released.

In accordance with our terms of use, please cite this dataset as:

Kan, Z; Zheng, H; Liu, X; Li, S; Barber, TD; Gong, Z; Gao, H; Hao, K; Willard, MD; Xu, J; Hauptschein, R; Rejto, PA; Fernandez, J; Wang, G; Zhang, Q; Wang, B; Chen, R; Wang, J; Lee, NP; Lee, WH; Ariyaratne, PN; Tennakoon, C; Mulawadi, FH; Wong, KF; Liu, AM; Chan, KL; Hu, Y; Chou, WC; Buser, C; Zhou, W; Lin, Z; Peng, Z; Yi, K; Chen, S; Li, L; Fan, X; Yang, J; Ye, R; Ju, J; Wang, K; Estrella, H; Deng, S; Wulur, IH; Liu, J; Ehsani, ME; Zhang, C; Loboda, A; Sung, WK; Aggarwal, A; Poon, RT; Fan, ST; Wang, J; Hardwick, J; Reinhard, C; Dai, H; Li, Y; Luk, JM; Mao, M; the Asian Cancer Research Group (2012): Hepatocellular carcinoma genomic data from the Asia Cancer Research Group. GigaScience.


Related manuscript available at:



Accession codes associated with this data:


1 comment:

  1. Except that the gigaDb doesn't work ( and apparently hasn't for several weeks )

    Theoretically data is available from SRA here :

    But the purported 14TB does not match the 2.3 TB in SRA. There appear to only be 14 Tumor/Normal pairs of drastically varying coverage (according to the annotation).


Datanami, Woe be me