Tuesday, 28 September 2021

Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers | Nature Methods

 Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers | Nature Methods 

 


https://github.com/GoekeLab/bioinformatics-workflows    


Monday, 27 September 2021

Orientation Bias artifect

 strand bias and orientation bias – GATK (broadinstitute.org)

"

The read orientation artifact, also known as the orientation bias artifact, arises due to a chemical change in the nucleotide during library prep that results in, for example, G base-paring with A. This kind of artifact has a clear signature (e.g. C to A SNP that occurs predominantly for the middle C in the DNA sequence CCG), and it’s singlestranded in nature. Downstream, this artifact manifests as low allele fraction SNPs whose evidence for the alt allele consists almost entirely F1R2 reads or F2R1 reads. A read pair is F1R2 (forward 1st, reverse 2nd) if the sequence of bases in Read 1 maps to the forward strand of the reference (F1), and the sequence of Read 2 to the reverse strand
of the reference (R2). F2R1 is defined similarly

 

if someone has read the dragonbioit used guide in illumina, it just mentioned orientation bias, ignore the strand bias.

"

Friday, 17 September 2021

Benchmarking variants and comparing truth sets: List of useful tools and publications

Just realised that other than vcf-compare and bedtools intersect 

there's other options

 

https://github.com/RealTimeGenomics/rtg-tools

https://github.com/Illumina/hap.py

 

Also there's actually new variant callers ..

Molina-Mora, J.A., Solano-Vargas, M. Set-theory based benchmarking of three different variant callers for targeted sequencing. BMC Bioinformatics 22, 20 (2021). https://doi.org/10.1186/s12859-020-03926-3

 

Krishnan, V., Utiramerur, S., Ng, Z. et al. Benchmarking workflows to assess performance and suitability of germline variant calling pipelines in clinical diagnostic assays. BMC Bioinformatics 22, 85 (2021). https://doi.org/10.1186/s12859-020-03934-3

Additional file 23: File 3

. verify_variants.py

 

 

Zook, Justin M et al. “An open resource for accurately benchmarking small variant and reference calls. Nature biotechnology vol. 37,5 (2019): 561-566. doi:10.1038/s41587-019-0074-6



Python library to parse, format, validate, normalize, and map sequence variants. `pip install hgvs`



Datanami, Woe be me