Mutational Scanning
Lead Research Organisation:
University of Edinburgh
Department Name: UNLISTED
Abstract
Proper diagnosis of genetic diseases is very important for patients and their families. Thanks to advances in genome sequencing, doctors are now able to list all the mutations present in a patient’s genome - but the interpretation of these mutations is difficult. In our lab, we use experiments in yeast and human cells to study the consequences of mutations found in human patients.
Our experiments are focused on selected genes including PAX6, mutations in which cause eye malformations, and TP53, frequently mutated in cancer. In our previous experiments, we found a surprisingly good agreement between mutations in PAX6 that cause slow growth of yeast, and mutations in the same gene that damage eye development in patients. This allows us to predict the severity for thousands of mutations that were never previously seen in patients.
We now aim to move beyond disease prediction and understand why certain mutations are more harmful than others. To study this, we will measure how each mutation influences the collection of all RNA molecules found in cells, known as the transcriptome. We will also analyse thousands of mutated cells under the microscope. Applying artificial intelligence methods to these measurements will help us understand why mutations are harmful, and help find appropriate treatments. Finally we will measure the performance of candidate mRNA therapeutics in human cells, in order to develop protocols and software for the design of new mRNA therapies.
Our experiments are focused on selected genes including PAX6, mutations in which cause eye malformations, and TP53, frequently mutated in cancer. In our previous experiments, we found a surprisingly good agreement between mutations in PAX6 that cause slow growth of yeast, and mutations in the same gene that damage eye development in patients. This allows us to predict the severity for thousands of mutations that were never previously seen in patients.
We now aim to move beyond disease prediction and understand why certain mutations are more harmful than others. To study this, we will measure how each mutation influences the collection of all RNA molecules found in cells, known as the transcriptome. We will also analyse thousands of mutated cells under the microscope. Applying artificial intelligence methods to these measurements will help us understand why mutations are harmful, and help find appropriate treatments. Finally we will measure the performance of candidate mRNA therapeutics in human cells, in order to develop protocols and software for the design of new mRNA therapies.
Technical Summary
With more than a million human genomes sequenced worldwide, we urgently need methods that will allow the interpretation of human genetic variation at scale. Important unsolved questions in this area include the classification of variants as benign or pathogenic; identification of molecular mechanisms through which variants influence phenotype; understanding the dependence of variant effects on genetic and environmental factors; and prediction of the effects of medical interventions for specific variants. All these questions can be addressed through an experimental strategy which has emerged over the past decade, and which recently became widely known as deep mutational scanning (DMS).
In the past five years, our group has focussed on the development of DMS and other high-throughput techniques to understand relationships between sequence, structure and function of RNA. We applied these methods to study gene regulation, noncoding RNA biology, virus genome structures, and RNA engineering. In the next quinquennium, we will translate these technologies for clinical benefit by applying them to protein-coding genes involved in human disease and by building data-driven tools for the design of mRNA therapeutics. Our aims are to:
1. Comprehensively study the effects of mutations in selected human disease genes, focusing on disease associations and molecular mechanisms;
2. Adapt multidimensional phenotypic assays: single-cell RNA sequencing and high-content microscopy, for high-throughput analysis of genetic variants;
3. Leverage our studies of codon usage to build widely accessible tools for coding sequence optimization of mRNA therapeutics.
Our research will facilitate the diagnosis and treatment of genetic diseases, clarify the molecular mechanisms of disease mutations, and facilitate the development of new RNA therapeutics.
In the past five years, our group has focussed on the development of DMS and other high-throughput techniques to understand relationships between sequence, structure and function of RNA. We applied these methods to study gene regulation, noncoding RNA biology, virus genome structures, and RNA engineering. In the next quinquennium, we will translate these technologies for clinical benefit by applying them to protein-coding genes involved in human disease and by building data-driven tools for the design of mRNA therapeutics. Our aims are to:
1. Comprehensively study the effects of mutations in selected human disease genes, focusing on disease associations and molecular mechanisms;
2. Adapt multidimensional phenotypic assays: single-cell RNA sequencing and high-content microscopy, for high-throughput analysis of genetic variants;
3. Leverage our studies of codon usage to build widely accessible tools for coding sequence optimization of mRNA therapeutics.
Our research will facilitate the diagnosis and treatment of genetic diseases, clarify the molecular mechanisms of disease mutations, and facilitate the development of new RNA therapeutics.
Publications
Farberov S
(2025)
Structural features within the NORAD long noncoding RNA underlie efficient repression of Pumilio activity.
in Nature structural & molecular biology
McDonnell AF
(2024)
Deep mutational scanning quantifies DNA binding and predicts clinical outcomes of PAX6 variants.
in Molecular systems biology
Shen Y
(2025)
Improving the generalization of protein expression models with mechanistic sequence information
in Nucleic Acids Research
| Title | Deep mutational scanning quantifies DNA binding and predicts clinical outcomes of PAX6 variants |
| Description | Nonsense and missense mutations in the transcription factor PAX6 cause a wide range of eye development defects, including aniridia, microphthalmia and coloboma. To understand how changes of PAX6:DNA binding cause these phenotypes, we combined saturation mutagenesis of the paired domain of PAX6 with a yeast one-hybrid (Y1H) assay in which expression of a PAX6-GAL4 fusion gene drives antibiotic resistance. We quantified binding of more than 2,700 single amino-acid variants to two DNA sequence elements. Mutations in DNA-facing residues of the N-terminal subdomain and linker region were most detrimental, as were mutations to prolines and to negatively charged residues. Many variants caused sequence-specific molecular gain-of-function effects, including variants in position Ile71 that increased binding to the LE9 enhancer but decreased binding to a SELEX-derived binding site. In the absence of antibiotic selection, variants that retained DNA binding slowed yeast growth, likely because such variants perturbed the yeast transcriptome. Benchmarking against known patient variants and applying ACMG/AMP guidelines to variant classification, we obtained supporting to moderate evidence to suggest that 1,306 variants are likely benign, and 977, likely pathogenic. Our analysis shows that most pathogenic mutations in the paired domain of PAX6 can be explained simply by the effects of these mutations on PAX6:DNA association, and establishes Y1H as a generalisable assay for the interpretation of variant effects in transcription factors. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| Impact | Quantification of variant effects for 2,700 single amino acid variants of gene PAX6 |
| URL | https://www.mavedb.org/#/experiments/urn:mavedb:00000665-a |
| Description | Mutational screen of GBA1, Parkin and LRRK2 |
| Organisation | University of Dundee |
| Department | MRC Protein Phosphorylation and Ubiquitylation Unit |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | In partnership with the groups of Dario Alessi, Esther Sammler and Miratul Muqit (MRC PPU, University of Dundee) we are undertaking a deep mutational scanning study to systematically measure the function of single amino acid variants in genes associated with Parkinson's diseases: GBA1, Parkin and LRRK2. The mutational screens are performed in human cell lines using clinically relevant reporters of gene function and will facilitate diagnosis and drug screening in Parkinson's disease. My group is providing expertise on the design of high-throughput assays of variant effects, including the generation of barcoded mutant libraries, integration of variants into a genomic landing pad, massively parallel functional assays, and data analysis. |
| Collaborator Contribution | The partners provide expertise on cell models and biochemical assays of GBA1, Parkin and LRRK2 function, including measurements of protein phosphorylation and ubiquitylation and lysosome function. These assays are being used as functional readouts in the high-throughput screens we develop. |
| Impact | No outputs yet |
| Start Year | 2024 |
| Description | 1st and 2nd Workshop on Codon Usage: Function, Mechanism and Evolution |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Professional Practitioners |
| Results and Impact | Together with Professors Yi Liu (University of Texas, US), Patricia Clark (University of Notre Dame, US) and Yitzhak Pilpel (Weizmann Institute, Israel), I started a series of international workshops on codon usage. Two workshops took place at the RCPE in Edinburgh in 2022 and 2024, each attended by 120 participants from around the world. The 3rd conference in the series is due to take place in Toronto in 2026. These interdisciplinary workshops are dedicated to the understanding of the functions of codon usage, mechanisms of codon usage-mediated regulatory processes, and evolution of codon usage biases. There is a substantial focus on new method development (experimental and computational) to determine the effects of synonymous codon usage on different cellular processes. This is the first international conference dedicated to the understanding of codon usage bias, bringing together a broad selection of scientists from around the globe working to understand the myriad ways that codon usage bias influences biology. |
| Year(s) Of Engagement Activity | 2022,2024 |
