📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

PoMoSelect: Disentangling Modes of Selection

Lead Research Organisation: University of St Andrews
Department Name: Biology

Abstract

Molecular phylogenetics has neglected polymorphisms within present and ancestral populations for a long time. Alternative models accounting for multi-individual data have nevertheless been proposed and are known as polymorphism-aware phylogenetic models (PoMo). PoMo adds a new layer of complexity to the standard nucleotide substitution models by accounting for the population-level (so far, genetic drift and mutations) processes to describe the evolutionary process. To do so, PoMo expands the standard substitution models to include polymorphic states. We have previously shown that PoMo accounts for incomplete lineage sorting (ILS), and improve the estimation of species tree inference.
For this project, we will develop an approach that accounts for balancing selection called PoMoSelect. Genetic drift removes polymorphism from populations over time, with the rate of polymorphism loss being accelerated when species experience strong reductions in population size. Adaptive forces that maintain genetic variation in populations, or balancing selection, might counteract this process. PoMo is a mutation-selection model which we will use disentangle balancing selection from directional selection as well as mutational effects, fixation biases, and demographic effects. Furthermore, PoMo combination of polymorphism with divergence data allows it to model short as well as long-term balancing selection. We will introduce a new mechanistic parameters to quantify the strength of balancing selection on a loci. For the inference of the new parameters we will develop a new Bayesian framework and software package PoMoSelect. Together with our collaborators, we will analyse DNA sequences of African hunter-gather and farmer populations as well as great ape species to understand the role of blood parasites on the alpha and beta globin cluster (thalasiamias). Finally, we will apply our new methodology genome-wide to study the extent and patterns of balancing selection.

Technical Summary

Molecular phylogenetics has neglected polymorphisms within present and ancestral populations for a long time. Alternative models accounting for multi-individual data have nevertheless been proposed and are known as polymorphism-aware phylogenetic models (PoMo). PoMo adds a new layer of complexity to the standard nucleotide substitution models by accounting for the population-level (so far, genetic drift and mutations) processes to describe the evolutionary process. To do so, PoMo expands the standard substitution models to include polymorphic states. We have previously shown that PoMo accounts for incomplete lineage sorting (ILS), and improve the estimation of species tree inference.
For this project, we will develop an approach that accounts for balancing selection called PoMoSelect. Genetic drift removes polymorphism from populations over time, with the rate of polymorphism loss being accelerated when species experience strong reductions in population size. Adaptive forces that maintain genetic variation in populations, or balancing selection, might counteract this process. PoMo is a mutation-selection model which we will use disentangle balancing selection from directional selection as well as mutational effects, fixation biases, and demographic effects. Furthermore, PoMo combination of polymorphism with divergence data allows it to model short as well as long-term balancing selection. We will introduce a new mechanistic parameters to quantify the strength of balancing selection on a loci. For the inference of the new parameters we will develop a new Bayesian framework and software package PoMoSelect. Together with our collaborators, we will analyse DNA sequences of African hunter-gather and farmer populations as well as great ape species to understand the role of blood parasites on the alpha and beta globin cluster (thalasiamias). Finally, we will apply our new methodology genome-wide to study the extent and patterns of balancing selection.
 
Description Bonn - St Andrews Collaboration grant
Amount £20,000 (GBP)
Organisation University of St Andrews 
Sector Academic/University
Country United Kingdom
Start 01/2025 
End 12/2026
 
Description Crop Diversity GPU - Growing Plant Understanding
Amount £654,298 (GBP)
Funding ID BB/X019683/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 07/2023 
End 07/2024
 
Description Deep learning: Linkage effects and balancing selection
Amount £258,162 (GBP)
Funding ID BB/Y513842/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 02/2024 
End 08/2025
 
Description Single cell multi-omics sequencing platform to understand the building blocks of life.
Amount £288,106 (GBP)
Funding ID BB/W019493/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 07/2022 
End 07/2023
 
Title Software implementation to detect balancing selection 
Description The polymorphism-aware phylogenetic models (PoMos) are alternative approaches to species tree estimation that add a new layer of complexity to the standard substitution models by accounting for population-level forces to describe the process of sequence evolution. We have extended the method and software to detect balancing selection. The software/tool was made available through RevBayes tutorial and a github page. 
Type Of Material Improvements to research infrastructure 
Year Produced 2023 
Provided To Others? Yes  
Impact The RevBayes project has a large user community. Making our software available with a tutorial on the RevBayes webpage made our approach accessible to more users that we would not reach otherwise. 
URL https://revbayes.github.io/tutorials/pomobalance/
 
Title Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes 
Description Supplementary files of Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes by Borges, Boussau, Höhna, Pereira and Kosiol 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact This data set allows the reproducibility of the results in the publication. It also provides users of the software package with benchmark data sets. 
URL https://zenodo.org/record/6592395
 
Title Polymorphism-aware models in RevBayes: Species trees, disentangling Balancing Selection and CG-biased gene conversion 
Description Building upon the polymorphism-aware phylogenetic models (PoMos) framework, we introduce PoMoBalance designed to disentangle the interplay of mutation, genetic drift, directional and balancing selection pressures influencing population diversity. Rooted in the Moran model, PoMos have demonstrated efficiency in species tree inference, capturing mutational effects, fixation biases, and GC-bias rates. Implemented in the open-source RevBayes Bayesian framework, PoMoBalance offers a versatile tool for multi-individual data analysis. 
Type Of Material Computer model/algorithm 
Year Produced 2023 
Provided To Others? Yes  
Impact Still early days. However, we received some notable requests and downloads of PoMo through RevBayes user community. 
URL https://www.biorxiv.org/content/10.1101/2023.12.11.571102v1
 
Description Alex Suh 
Organisation University of Bonn
Country Germany 
Sector Academic/University 
PI Contribution Expertise in phylogenomics and time-series analysis
Collaborator Contribution Expertise in Speciation Genomics and Museum Genomics
Impact Sucessfull application to the Bonn-St Andrews collaboration grant programme
Start Year 2024
 
Description Dr Ricardo Pereira 
Organisation Ludwig Maximilian University of Munich (LMU Munich)
Country Germany 
Sector Academic/University 
PI Contribution Help with analysis of next generation sequencing data set, in particular reconstruction of species tree.
Collaborator Contribution Provided interesting data set on grass hoppers as a showcase for our methods.
Impact One publication: https://doi.org/10.1111/2041-210X.13980
Start Year 2021
 
Description Dr Rui Borges, Group Leader 
Organisation University of Veterinary Medicine Vienna
Country Austria 
Sector Academic/University 
PI Contribution Rui Borges received training on methods development in population genetics and phylogenomics as postdoctoral researcher. He has recently been offered a position as Group Leader, and we continue to collaborate.
Collaborator Contribution Contributions of C++ code of our software package, and the writing of related papers.
Impact Two publications that are listed with this grant (https://doi.org/10.1111/2041-210X.13980 and https://doi.org/10.1111/jeb.14134)
Start Year 2017
 
Description Dr Sebastian Hoehna 
Organisation Ludwig Maximilian University of Munich (LMU Munich)
Country Germany 
Sector Academic/University 
PI Contribution Development of new tutorials for applying the RevBayes programming language in the specific context of polymorphism-aware phylogentic models.
Collaborator Contribution Training in the RevBayes programming language provided to the PDRA Svitlana Braichenko on the grant.
Impact Publication that is listed with this grant: https://doi.org/10.1111/2041-210X.13980
Start Year 2019
 
Title Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes 
Description We implemented polymorphism-aware phylogenetic models (PoMos), an alter- native approach for species tree estimation, in the Bayesian phylogenetic software RevBayes. PoMos naturally account for incomplete lineage sorting, which is known to cause difficulties for phylogenetic inference in species radiations, and scale well with genome-wide data. Simultaneously, PoMos can estimate mutation and selection biases. 
Type Of Technology Software 
Year Produced 2022 
Open Source License? Yes  
Impact Software enables biologists to infer species trees while accounting for selection 
URL https://revbayes.github.io/tutorials/pomos/
 
Description "Simulate Evolution" activity at the Mind & Matter Event of the University of St Andrews on the 27th October 2024 
Form Of Engagement Activity Participation in an open day or visit at my research institution
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Public/other audiences
Results and Impact This activity that demonstrates evolutionary processes using colored chocolate beans. We also have an interactive game to evolve DNA using dices and a computer simulation that can be shown on a larger screen.
Year(s) Of Engagement Activity 2024
URL https://impact.wp.st-andrews.ac.uk/engagement-with-research/mind-and-matter-2024/helix-activities/
 
Description Dundee Science Festival 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact Developed interactive game for children aged 5-10 to simulate evolutionary processes using chocolate beans and computers. The activity was popular and we were asked to come in the following week as well were primary school children visited the Dundee Science Centre.
Year(s) Of Engagement Activity 2024
URL https://www.dundeesciencecentre.org.uk/festival/
 
Description Sutton Trust Summer School 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Schools
Results and Impact On July 4th, 2023, Svitlana Braichenko, Carolin Kosiol, Shangzhe Zhang joined the Sutton Trust Summer School to teach about evolution! They took S5 students on a journey through natural selection, genetic drift, mutations, DNA, and phylogeny.

We proposed computer simulations and interactive games to make learning about evolution fun. One game involved chocolate beans with different "traits", showing how certain selected traits can be passed on and while other traits turn out be too tasty to survive in high numbers! The students also got to build phylogenetic trees using dice-generated mutations.
Year(s) Of Engagement Activity 2023
URL https://summerschools.suttontrust.com/