GGTyper: genotyping complex structural variants using short-read sequencing data

Mirus, Tim and Lohmayer, Robert and Doehring, Clementine and Halldorsson, Bjarni and Kehr, Birte (2024) GGTyper: genotyping complex structural variants using short-read sequencing data. BIOINFORMATICS, 40 (suppl2). ii11-ii19. ISSN 1367-4803, 1367-4811

Full text not available from this repository. (Request a copy)

Abstract

Motivation Complex structural variants (SVs) are genomic rearrangements that involve multiple segments of DNA. They contribute to human diversity and have been shown to cause Mendelian disease. Nevertheless, our abilities to analyse complex SVs are very limited. As opposed to deletions and other canonical types of SVs, there are no established tools that have explicitly been designed for analysing complex SVs.Results Here, we describe a new computational approach that we specifically designed for genotyping complex SVs in short-read sequenced genomes. Given a variant description, our approach computes genotype-specific probability distributions for observing aligned read pairs with a wide range of properties. Subsequently, these distributions can be used to efficiently determine the most likely genotype for any set of aligned read pairs observed in a sequenced genome. In addition, we use these distributions to compute a genotyping difficulty for a given variant, which predicts the amount of data needed to achieve a reliable call. Careful evaluation confirms that our approach outperforms other genotypers by making reliable genotype predictions across both simulated and real data. On up to 7829 human genomes, we achieve high concordance with population-genetic assumptions and expected inheritance patterns. On simulated data, we show that precision correlates well with our prediction of genotyping difficulty. This together with low memory and time requirements makes our approach well-suited for application in biomedical studies involving small to very large numbers of short-read sequenced genomes.Availability and implementation Source code is available at https://github.com/kehrlab/Complex-SV-Genotyping.

Item Type: Article
Uncontrolled Keywords: GENOME
Subjects: 000 Computer science, information & general works > 004 Computer science
500 Science > 570 Life sciences
600 Technology > 610 Medical sciences Medicine
Divisions: Informatics and Data Science > Department Computational Life Science > Algorithmische Bioinformatik (Prof. Dr. rer. nat. Birte Kehr )
Depositing User: Dr. Gernot Deinzer
Date Deposited: 26 Nov 2025 09:02
Last Modified: 26 Nov 2025 09:02
URI: https://pred.uni-regensburg.de/id/eprint/64325

Actions (login required)

View Item View Item