log in  |  register  |  feedback?  |  help  |  web accessibility
Pan-genomic advances for fighting reference bias
Ben Langmead
IRB- 4105- Zoom Link-https://umd.zoom.us/j/97287503999
Thursday, September 15, 2022, 2:00-3:00 pm Calendar
  • You are subscribed to this talk through .
  • You are watching this talk through .
  • You are subscribed to this talk. (unsubscribe, watch)
  • You are watching this talk. (unwatch, subscribe)
  • You are not subscribed to this talk. (watch, subscribe)

Also on Zoom- https://umd.zoom.us/j/97287503999


Sequencing data analysis often begins with aligning sequencing reads to a reference genome, where the reference takes the form of a linear string of bases.  But linearity leads to reference bias, a tendency to miss or misreport alignments containing non-reference alleles, which can confound downstream statistical and biological results.  This is a major concern in human genomics; we don't want to live in a world where diagnostics and therapeutics are differentially effective depending how closely our genome matches the reference.


Fortunately, computer science and bioinformatics are meeting the moment.  In particular, we can now index and align sequencing reads to references that include many population variants.  Here I will describe this journey from the early days of efficient genome indexing -- especially the FM index approach behind Bowtie and BWA -- continuing through more modern methods for graph-shaped references and references that include many genomes.  I will emphasize recent results that show how to optimize simple and complex pan-genome representations for effective avoidance of reference bias.  Finally, I will outline some promising future areas, including a new class of compressed indexes that improves locality of reference.



Ben Langmead is an Associate Professor in Computer Science at Johns Hopkins University, where he has received numerous prestigious awards, including the Alfred P. Sloan Research Fellowship, the NSF CAREER award, and the Benjamin Franklin Award for Open Access in Life Sciences, in recognition of his work developing innovative methods for high-throughput biological datasets, which are helping to transform how biomedical researchers and other life scientists access and use DNA sequencing data. The development of Bowtie, for which Dr. Langmead is widely known, began while he was a PhD student at the University of Maryland, College Park. We are very excited to welcome him back to UMD for this seminar talk and encourage you to attend in person!

This talk is organized by Richa Mathur