Líffræðifélag Íslands - biologia.is
Líffræðiráðstefnan 2017

Erindi/veggspjald / Talk/poster E77

Graphtyper enables population-scale genotyping using pangenome graphs

Höfundar / Authors: Hannes P Eggertsson (1,2) , Hakon Jonsson (1), Snaedis Kristmundsdottir (1,3), Eirikur Hjartarson (1), Birte Kehr (1,4), Gisli Masson (1), Florian Zink (1), Kristjan E Hjorleifsson (1), Aslaug Jonasdottir (1), Adalbjorg Jonasdottir (1), Ingileif Jonsdottir (1,5), Daniel F Gudbjartsson (1,2), Pall Melsted (1,2), Kari Stefansson (1,5), Bjarni V Halldorsson (1,3)

Starfsvettvangur / Affiliations: 1. deCODE Genetics/Amgen, Inc., Reykjavik, Iceland. 2. School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland. 3. School of Science and Engineering, Reykjavik University, Reykjavik, Iceland. 4. Berlin Institute of Health (BIH), Berlin, Germany. 5. Faculty of Medicine, School of Health Sciences, University of Iceland, Reykjavik, Iceland. C

Kynnir / Presenter: Hannes Pétur Eggertsson

A fundamental requirement for genetic studies is an accurate determination of sequence variation. While human genome sequence diversity is increasingly well characterized, there is a need for efficient ways to use this knowledge in sequence analysis. Here we present Graphtyper, a publicly available novel algorithm and software for discovering and genotyping sequence variants. Graphtyper realigns shortread sequence data to a pangenome, a variation-aware graph structure that encodes sequence variation within a population by representing possible haplotypes as graph paths. Our results show that Graphtyper is fast, highly scalable, and provides sensitive and accurate genotype calls. Graphtyper genotyped 89.4 million sequence variants in the whole genomes of 28,075 Icelanders using less than 1 00,000 CPU days, including detailed genotyping of six human leukocyte antigen (HLA) genes. We show that Graphtyper is a valuable tool in characterizing sequence variation in both small and population-scale sequencing studies.