Abstract
Which variables determine the constraints on gene sequence evolution is one of the most central questions in molecular evolution. In the fission yeast Schizosaccharomyces pombe, an important model organism, the variables influencing the rate of sequence evolution have yet to be determined. Previous studies in other single celled organisms have generally found gene expression levels to be most significant, with numerous other variables such as gene length and functional importance identified as having a smaller impact. Using publicly available data, we used partial least squares regression, principal components regression, and partial correlations to determine the variables most strongly associated with sequence evolution constraints. We identify centrality in the protein-protein interactions network, amino acid composition, and cellular location as the most important determinants of sequence conservation. However, each factor only explains a small amount of variance, and there are numerous variables having a significant or heterogeneous influence. Our models explain more than half of the variance in dN, raising the possibility that future refined models could quantify the role of stochastics in evolutionary rate variation.
Original language | English |
---|---|
Pages (from-to) | 601-610 |
Journal | Journal of molecular evolution |
Volume | 89 |
Issue number | 9-10 |
Early online date | 26 Aug 2021 |
DOIs | |
Publication status | Published - Dec 2021 |
Bibliographical note
Funding Information:CAG was supported by MRC Newton as a component of the UK:Brazil Joint Centre Partnership in Leishmaniasis (MR/S019472/1).
Publisher Copyright:
© 2021, The Author(s).