A hybrid likelihood model for sequence-based disease association studies.
Source
Abstract
- PMID:
- 23358228
- [PubMed - in process]
Plug for Debian Med Sprint
---------- Forwarded message ----------
From: Steffen Möller <steffen_moeller gmx.de>
Date: Jan 25, 2013 11:25 PM
Subject: [Biopython] Debian Med Sprint in Kiel, Germany 23rd/24th of February
To: "Biopython Mailing List" <biopython lists.open-bio.org>
Cc:
> Dear all,
>
> We have our annual Debian/Ubuntu/Bio-Linux sprint on Bioinformatics again next month. Every year there are a few individuals more peripheral to the distribution attending, which usually helps us to develop our community further in some way. Anybody from BioPython interested to join in, please read through
> http://wiki.debian.org/DebianMed/Meeting/Kiel2013
> and just email me or add him/herself. There is not anything particular that I expect from the BioPython community, except for more and better ideas on how to develop research on and with tools in computational biology further.
> Registration is free. Accommodation and travel are not.
>
> Cheers,
>
> Steffen
> _______________________________________________
http://mendeliandisorder.blogspot.sg/2012/11/why-i-dont-want-to-know-my-genome.html
http://blogs.plos.org/dnascience/2012/11/01/why-i-dont-want-to-know-my-genome-sequence/
Interesting reads on a rainy Saturday.
I think (at this point in time) believing whole genome sequencing or even exome seq is the way forward in medical health is akin to buying extended warranty.
You don't need it now but you are banking on having cost savings when u actually do (doing one whole genome versus small individual regions)
No doubt eventually when prescription of drugs depends on your genetic make up, your DNA sequences will be invaluable or even compulsory. (Before I read this article I didn't even know being slow to metabolize anti psychotics and beta blockers can be deadly). Right now, genomics offer a glimpse into likely causal associations which can be hard for the man on the street to act on, beyond the advice of " don't smoke, exercise, eat a healthy diet, and don't worry about DNA sequences"
I would also add "watch out for cars" since 1.3 million people die yearly from auto accidents versus 1.4 million deaths attributed to lung cancer.
See
http://www.who.int/mediacentre/factsheets/fs358/en/index.html
http://www.cancerresearchuk.org/cancer-info/cancerstats/world/the-global-picture/
We present a new streaming algorithm for k-mer counting, called DSK (diskstreaming of k-mers), which only requires a fixed, user-defined amount of memory and disk space. This approach realizes a memory, time and disk trade-off. The multi-set of all k-mers present in the reads is partitioned and partitions are saved to disk. Then, each partition is separately loaded in memory in a temporary hash table. The k-mer counts are returned by traversing each hash table. Low-abundance k-mers are optionally filtered.
DSK is the first approach that is able to count all the 27-mers of a human genome dataset using only 4.0 GB of memory and moderate disk space (160 GB), in 17.9 hours. DSK can replace a popular k-mer counting software (Jellyfish) on small-memory servers.
Availability:http://minia.genouest.org/dsk
Subject: [Bowtie-bio-announce] Bowtie 2 2.0.5 released
Bowtie 2 version 2.0.5 - January 4, 2013
* Fixed an issue that would cause excessive memory allocation when aligning
to very repetitive genomes.
* Fixed an issue that would cause a pseudo-randomness-related assert to be
thrown in debug mode under rare circumstances.
* When bowtie2-build fails, it will now delete index files created so far so
that invalid index files don't linger.
* Tokenizer no longer has limit of 10,000 tokens, which was a problem for
users trying to index a very large number of FASTA files.
* Updated manual's discussion of the -I and -X options to mention that
setting them farther apart makes Bowtie 2 slower.
* Renamed COPYING to LICENSE and created a README to be GitHub-friendly.
Best,
Ben
--
Ben Langmead
Department of Computer Science
Johns Hopkins University
3400 North Charles St
Baltimore, MD 21218-2682
---------- Forwarded message ----------
From: "Jonathan Marchini" <marchini@ 2013 4:51 PM
Subject: [OXSTATGEN] new version of IMPUTE2 (v2.3.0)
To: <OXSTATGEN>
Cc:
> Hello,
>
> There is a new version of IMPUTE2 (v2.3.0) on the website:
>
> https://mathgen.stats.ox.ac.uk/impute/impute_v2.html
>
> There are several new features in this version:
>
> - IMPUTE2 now has a streamlined way to combine haplotypes from two reference panels and impute from the merged panel. For example, we have seen good results when merging 1000 Genomes haplotypes with sequenced haplotypes from other cohorts to form a combined reference panel. This feature provides accurate imputation of variants that are specific to each panel while maintaining accuracy at variants that are shared across panels. You can read the details of our approach at https://mathgen.stats.ox.ac.uk/impute/impute_v2.html#merging_panels.
>
> - To make the panel-merging as flexible as possible, we now allow the -k_hap parameter to take separate values for each of two reference panels. In essence, you can specify the number of "useful" haplotypes in each reference panel, then IMPUTE2 will take this information into account when merging the reference panels and imputing genotypes in your study. This feature is described here: https://mathgen.stats.ox.ac.uk/impute/impute_v2.html#-k_hap.
>
> - We have added important documentation, such as a detailed description of how IMPUTE2 creates the concordance tables that are printed at the end of most runs: https://mathgen.stats.ox.ac.uk/impute/impute_v2.html#concordance_tables.
>
> - We fixed some bugs that were present in v2.2.2:
> -- The program used to throw an error when the -use_prephased_g and -chrX flags were combined; now these options are compatible.
> -- You can get phased imputation output by combining the -use_prephased_g and -phase flags, but previously the phasing of hets in the input file (-known_haps_g) was scrambled in the output; this is now fixed.
> -- Another problem with combining the -use_prephased_g and -phase flags is that Type 3 SNPs (those present in the -known_haps_g file but not the reference panel) were omitted from the output haplotypes; now these SNPs are included in the output by default.
> -- Annotations in the reference legend file (columns 5+) used to be restricted to numeric values, but now the program can handle string values as well. This extends the flexibility of the -filt_rules_l mechanism for run-time filtering of reference variants.
>
> - IMPUTE2 can now be made even more accurate by pre-phasing your study genotypes with SHAPEIT2 [ http://www.shapeit.fr/ ], which combines ideas from SHAPEIT and IMPUTE2 to improve the accuracy and efficiency of haplotype estimation. You can read the SHAPEIT2 article here: http://www.nature.com/nmeth/journal/v10/n1/full/nmeth.2307.html.
>
> - We have redesigned the website to make it easier to navigate.
>
> We are still actively improving some of the new features, and we anticipate making another software release in the next few months. In the meantime, we would be happy to hear your feedback about the new software and website.
>
> Happy imputing and Happy New Year!
>
> Bryan and Jonathan
>
> --
> o__ Jonathan Marchini
> c/ /'_ Department of Statistics, University of Oxford
> (+) \(+) 1 South Parks Road, Oxford, OX1 3TG
>
http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002802
If you have the choice, embracing an open approach to development has tremendous benefits. It allows you to build on the work of other scientists, and enables others to build on your own efforts. To make the development of open scientific software more rewarding and the experience of using software more positive, the following ten rules are intended to serve as a guide for any computational scientist.