Saturday, September 22, 2012

Fwd: SOAPindel: Efficient identification of indels from short paired reads [METHODS AND RESOURCES]

Fwd: please follow footer link
SOAPindel: Efficient identification of indels from short paired reads [METHODS AND RESOURCES]:

We present a new approach to indel calling which explicitly exploits that indel differences between a reference and a sequenced sample make the mapping of reads less efficient. We assign all unmapped reads with a mapped partner to their expected genomic positions and then perform extensive de novo assembly on the regions with many unmapped reads to resolve homozygous, heterozygous and complex indels by exhaustive traversal of the de Bruijn graph. The method is implemented in the software SOAPindel and provides a list of candidate indels with quality scores. We compare SOAPindel to Dindel, Pindel and GATK on simulated data and find similar or better performance for short indels (<10 bp) and higher sensitivity and specificity for long indels. A validation experiment suggests that SOAPindel has a false positive rate around 10% for long indels (>5 bp) while still providing many more candidate indels than other approaches.



(Original Post: Genome research (advanced).)