PoMoSelect: Disentangling Modes of Selection
Lead Research Organisation:
University of St Andrews
Department Name: Biology
Abstract
Molecular phylogenetics has neglected polymorphisms within present and ancestral populations for a long time. Alternative models accounting for multi-individual data have nevertheless been proposed and are known as polymorphism-aware phylogenetic models (PoMo). PoMo adds a new layer of complexity to the standard nucleotide substitution models by accounting for the population-level (so far, genetic drift and mutations) processes to describe the evolutionary process. To do so, PoMo expands the standard substitution models to include polymorphic states. We have previously shown that PoMo accounts for incomplete lineage sorting (ILS), and improve the estimation of species tree inference.
For this project, we will develop an approach that accounts for balancing selection called PoMoSelect. Genetic drift removes polymorphism from populations over time, with the rate of polymorphism loss being accelerated when species experience strong reductions in population size. Adaptive forces that maintain genetic variation in populations, or balancing selection, might counteract this process. PoMo is a mutation-selection model which we will use disentangle balancing selection from directional selection as well as mutational effects, fixation biases, and demographic effects. Furthermore, PoMo combination of polymorphism with divergence data allows it to model short as well as long-term balancing selection. We will introduce a new mechanistic parameters to quantify the strength of balancing selection on a loci. For the inference of the new parameters we will develop a new Bayesian framework and software package PoMoSelect. Together with our collaborators, we will analyse DNA sequences of African hunter-gather and farmer populations as well as great ape species to understand the role of blood parasites on the alpha and beta globin cluster (thalasiamias). Finally, we will apply our new methodology genome-wide to study the extent and patterns of balancing selection.
For this project, we will develop an approach that accounts for balancing selection called PoMoSelect. Genetic drift removes polymorphism from populations over time, with the rate of polymorphism loss being accelerated when species experience strong reductions in population size. Adaptive forces that maintain genetic variation in populations, or balancing selection, might counteract this process. PoMo is a mutation-selection model which we will use disentangle balancing selection from directional selection as well as mutational effects, fixation biases, and demographic effects. Furthermore, PoMo combination of polymorphism with divergence data allows it to model short as well as long-term balancing selection. We will introduce a new mechanistic parameters to quantify the strength of balancing selection on a loci. For the inference of the new parameters we will develop a new Bayesian framework and software package PoMoSelect. Together with our collaborators, we will analyse DNA sequences of African hunter-gather and farmer populations as well as great ape species to understand the role of blood parasites on the alpha and beta globin cluster (thalasiamias). Finally, we will apply our new methodology genome-wide to study the extent and patterns of balancing selection.
Technical Summary
Molecular phylogenetics has neglected polymorphisms within present and ancestral populations for a long time. Alternative models accounting for multi-individual data have nevertheless been proposed and are known as polymorphism-aware phylogenetic models (PoMo). PoMo adds a new layer of complexity to the standard nucleotide substitution models by accounting for the population-level (so far, genetic drift and mutations) processes to describe the evolutionary process. To do so, PoMo expands the standard substitution models to include polymorphic states. We have previously shown that PoMo accounts for incomplete lineage sorting (ILS), and improve the estimation of species tree inference.
For this project, we will develop an approach that accounts for balancing selection called PoMoSelect. Genetic drift removes polymorphism from populations over time, with the rate of polymorphism loss being accelerated when species experience strong reductions in population size. Adaptive forces that maintain genetic variation in populations, or balancing selection, might counteract this process. PoMo is a mutation-selection model which we will use disentangle balancing selection from directional selection as well as mutational effects, fixation biases, and demographic effects. Furthermore, PoMo combination of polymorphism with divergence data allows it to model short as well as long-term balancing selection. We will introduce a new mechanistic parameters to quantify the strength of balancing selection on a loci. For the inference of the new parameters we will develop a new Bayesian framework and software package PoMoSelect. Together with our collaborators, we will analyse DNA sequences of African hunter-gather and farmer populations as well as great ape species to understand the role of blood parasites on the alpha and beta globin cluster (thalasiamias). Finally, we will apply our new methodology genome-wide to study the extent and patterns of balancing selection.
For this project, we will develop an approach that accounts for balancing selection called PoMoSelect. Genetic drift removes polymorphism from populations over time, with the rate of polymorphism loss being accelerated when species experience strong reductions in population size. Adaptive forces that maintain genetic variation in populations, or balancing selection, might counteract this process. PoMo is a mutation-selection model which we will use disentangle balancing selection from directional selection as well as mutational effects, fixation biases, and demographic effects. Furthermore, PoMo combination of polymorphism with divergence data allows it to model short as well as long-term balancing selection. We will introduce a new mechanistic parameters to quantify the strength of balancing selection on a loci. For the inference of the new parameters we will develop a new Bayesian framework and software package PoMoSelect. Together with our collaborators, we will analyse DNA sequences of African hunter-gather and farmer populations as well as great ape species to understand the role of blood parasites on the alpha and beta globin cluster (thalasiamias). Finally, we will apply our new methodology genome-wide to study the extent and patterns of balancing selection.
Publications
Barata C
(2023)
Bait-ER: A Bayesian method to detect targets of selection in Evolve-and-Resequence experiments.
in Journal of evolutionary biology
Borges R
(2022)
Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes
in Methods in Ecology and Evolution
Braichenko S
(2024)
Polymorphism-Aware Models in RevBayes: Species Trees, Disentangling Balancing Selection, and GC-Biased Gene Conversion
in Molecular Biology and Evolution
Kotari I
(2024)
The Patterns of Codon Usage between Chordates and Arthropods are Different but Co-evolving with Mutational Biases.
in Molecular biology and evolution
| Description | Bonn - St Andrews Collaboration grant |
| Amount | £20,000 (GBP) |
| Organisation | University of St Andrews |
| Sector | Academic/University |
| Country | United Kingdom |
| Start | 01/2025 |
| End | 12/2026 |
| Description | Crop Diversity GPU - Growing Plant Understanding |
| Amount | £654,298 (GBP) |
| Funding ID | BB/X019683/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 07/2023 |
| End | 07/2024 |
| Description | Deep learning: Linkage effects and balancing selection |
| Amount | £258,162 (GBP) |
| Funding ID | BB/Y513842/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 02/2024 |
| End | 08/2025 |
| Description | Single cell multi-omics sequencing platform to understand the building blocks of life. |
| Amount | £288,106 (GBP) |
| Funding ID | BB/W019493/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 07/2022 |
| End | 07/2023 |
| Title | Software implementation to detect balancing selection |
| Description | The polymorphism-aware phylogenetic models (PoMos) are alternative approaches to species tree estimation that add a new layer of complexity to the standard substitution models by accounting for population-level forces to describe the process of sequence evolution. We have extended the method and software to detect balancing selection. The software/tool was made available through RevBayes tutorial and a github page. |
| Type Of Material | Improvements to research infrastructure |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| Impact | The RevBayes project has a large user community. Making our software available with a tutorial on the RevBayes webpage made our approach accessible to more users that we would not reach otherwise. |
| URL | https://revbayes.github.io/tutorials/pomobalance/ |
| Title | Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes |
| Description | Supplementary files of Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes by Borges, Boussau, Höhna, Pereira and Kosiol |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| Impact | This data set allows the reproducibility of the results in the publication. It also provides users of the software package with benchmark data sets. |
| URL | https://zenodo.org/record/6592395 |
| Title | Polymorphism-aware models in RevBayes: Species trees, disentangling Balancing Selection and CG-biased gene conversion |
| Description | Building upon the polymorphism-aware phylogenetic models (PoMos) framework, we introduce PoMoBalance designed to disentangle the interplay of mutation, genetic drift, directional and balancing selection pressures influencing population diversity. Rooted in the Moran model, PoMos have demonstrated efficiency in species tree inference, capturing mutational effects, fixation biases, and GC-bias rates. Implemented in the open-source RevBayes Bayesian framework, PoMoBalance offers a versatile tool for multi-individual data analysis. |
| Type Of Material | Computer model/algorithm |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| Impact | Still early days. However, we received some notable requests and downloads of PoMo through RevBayes user community. |
| URL | https://www.biorxiv.org/content/10.1101/2023.12.11.571102v1 |
| Description | Alex Suh |
| Organisation | University of Bonn |
| Country | Germany |
| Sector | Academic/University |
| PI Contribution | Expertise in phylogenomics and time-series analysis |
| Collaborator Contribution | Expertise in Speciation Genomics and Museum Genomics |
| Impact | Sucessfull application to the Bonn-St Andrews collaboration grant programme |
| Start Year | 2024 |
| Description | Dr Ricardo Pereira |
| Organisation | Ludwig Maximilian University of Munich (LMU Munich) |
| Country | Germany |
| Sector | Academic/University |
| PI Contribution | Help with analysis of next generation sequencing data set, in particular reconstruction of species tree. |
| Collaborator Contribution | Provided interesting data set on grass hoppers as a showcase for our methods. |
| Impact | One publication: https://doi.org/10.1111/2041-210X.13980 |
| Start Year | 2021 |
| Description | Dr Rui Borges, Group Leader |
| Organisation | University of Veterinary Medicine Vienna |
| Country | Austria |
| Sector | Academic/University |
| PI Contribution | Rui Borges received training on methods development in population genetics and phylogenomics as postdoctoral researcher. He has recently been offered a position as Group Leader, and we continue to collaborate. |
| Collaborator Contribution | Contributions of C++ code of our software package, and the writing of related papers. |
| Impact | Two publications that are listed with this grant (https://doi.org/10.1111/2041-210X.13980 and https://doi.org/10.1111/jeb.14134) |
| Start Year | 2017 |
| Description | Dr Sebastian Hoehna |
| Organisation | Ludwig Maximilian University of Munich (LMU Munich) |
| Country | Germany |
| Sector | Academic/University |
| PI Contribution | Development of new tutorials for applying the RevBayes programming language in the specific context of polymorphism-aware phylogentic models. |
| Collaborator Contribution | Training in the RevBayes programming language provided to the PDRA Svitlana Braichenko on the grant. |
| Impact | Publication that is listed with this grant: https://doi.org/10.1111/2041-210X.13980 |
| Start Year | 2019 |
| Title | Polymorphism-aware estimation of species trees and evolutionary forces from genomic sequences with RevBayes |
| Description | We implemented polymorphism-aware phylogenetic models (PoMos), an alter- native approach for species tree estimation, in the Bayesian phylogenetic software RevBayes. PoMos naturally account for incomplete lineage sorting, which is known to cause difficulties for phylogenetic inference in species radiations, and scale well with genome-wide data. Simultaneously, PoMos can estimate mutation and selection biases. |
| Type Of Technology | Software |
| Year Produced | 2022 |
| Open Source License? | Yes |
| Impact | Software enables biologists to infer species trees while accounting for selection |
| URL | https://revbayes.github.io/tutorials/pomos/ |
| Description | "Simulate Evolution" activity at the Mind & Matter Event of the University of St Andrews on the 27th October 2024 |
| Form Of Engagement Activity | Participation in an open day or visit at my research institution |
| Part Of Official Scheme? | No |
| Geographic Reach | Regional |
| Primary Audience | Public/other audiences |
| Results and Impact | This activity that demonstrates evolutionary processes using colored chocolate beans. We also have an interactive game to evolve DNA using dices and a computer simulation that can be shown on a larger screen. |
| Year(s) Of Engagement Activity | 2024 |
| URL | https://impact.wp.st-andrews.ac.uk/engagement-with-research/mind-and-matter-2024/helix-activities/ |
| Description | Dundee Science Festival |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Schools |
| Results and Impact | Developed interactive game for children aged 5-10 to simulate evolutionary processes using chocolate beans and computers. The activity was popular and we were asked to come in the following week as well were primary school children visited the Dundee Science Centre. |
| Year(s) Of Engagement Activity | 2024 |
| URL | https://www.dundeesciencecentre.org.uk/festival/ |
| Description | Sutton Trust Summer School |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Schools |
| Results and Impact | On July 4th, 2023, Svitlana Braichenko, Carolin Kosiol, Shangzhe Zhang joined the Sutton Trust Summer School to teach about evolution! They took S5 students on a journey through natural selection, genetic drift, mutations, DNA, and phylogeny. We proposed computer simulations and interactive games to make learning about evolution fun. One game involved chocolate beans with different "traits", showing how certain selected traits can be passed on and while other traits turn out be too tasty to survive in high numbers! The students also got to build phylogenetic trees using dice-generated mutations. |
| Year(s) Of Engagement Activity | 2023 |
| URL | https://summerschools.suttontrust.com/ |
