Conserved genomic neighborhood is a strong but no perfect indicator for a direct interaction of microbial gene products

Esch, Robert and Merkl, Rainer (2020) Conserved genomic neighborhood is a strong but no perfect indicator for a direct interaction of microbial gene products. BMC BIOINFORMATICS, 21 (1): 5. ISSN 1471-2105,

Full text not available from this repository. (Request a copy)

Abstract

Background The order of genes in bacterial genomes is not random; for example, the products of genes belonging to an operon work together in the same pathway. The cotranslational assembly of protein complexes is deemed to conserve genomic neighborhoods even stronger than a common function. This is why a conserved genomic neighborhood can be utilized to predict, whether gene products form protein complexes. Results We were interested to assess the performance of a neighborhood-based classifier that analyzes a large number of genomes. Thus, we determined for the genes encoding the subunits of 494 experimentally verified hetero-dimers their local genomic context. In order to generate phylogenetically comprehensive genomic neighborhoods, we utilized the tools offered by the Enzyme Function Initiative. For each subunit, a sequence similarity network was generated and the corresponding genome neighborhood network was analyzed to deduce the most frequent gene product. This was predicted as interaction partner, if its abundance exceeded a threshold, which was the frequency giving rise to the maximal Matthews correlation coefficient. For the threshold of 16%, the true positive rate was 45%, the false positive rate 0.06%, and the precision 55%. For approximately 20% of the subunits, the interaction partner was not found in a neighborhood of +/- 10 genes. Conclusions Our phylogenetically comprehensive analysis confirmed that complex formation is a strong evolutionary factor that conserves genome neighborhoods. On the other hand, for 55% of the cases analyzed here, classification failed. Either, the interaction partner was not present in a +/- 10 gene window or was not the most frequent gene product.

Item Type: Article
Uncontrolled Keywords: PROTEIN-PROTEIN INTERACTIONS; ESCHERICHIA-COLI; EVOLUTION; ORDER; ORGANIZATION; DATABASE; OPERONS; Protein-protein interaction; Complex formation; Sequence similarity network; Genome neighborhood network; Binary classifier
Subjects: 500 Science > 570 Life sciences
Divisions: Biology, Preclinical Medicine > Institut für Biophysik und physikalische Biochemie
Biology, Preclinical Medicine > Institut für Biophysik und physikalische Biochemie > Prof. Dr. Rainer Merkl
Depositing User: Dr. Gernot Deinzer
Date Deposited: 09 Apr 2021 08:24
Last Modified: 09 Apr 2021 08:24
URI: https://pred.uni-regensburg.de/id/eprint/45367

Actions (login required)

View Item View Item