Wheat Pan-Genomics

Lead Research Organisation: Natural History Museum
Department Name: Life Sciences

Abstract

Bread wheat (Triticum aestivum L.) is the UK's most important crop and the world's most widely cultivated cereal. Understanding the genetic make up of crops is essential for achieving increases in sustainable yield, disease resistance and adaptation to changing growing conditions. However this understanding has been lacking in wheat because it has a exceptionally complex genetic make-up. But recently BBSRC scientists have made a breakthrough in analysing the complete genetic make up of wheat. In this project we aim to build on the UK success by accessing the genetic make up of several key wheat varieties important for global agriculture, and to analyse this data to help identify useful genetic variation. This work will be carried out in an international framework that will maximise the benefit of UK research for this globally important crop.

Technical Summary

This project integrates recent BBSRC-funded innovation in bread wheat genome re-sequencing/analysis capabilities at TGAC/JIC, with the 8-founder MAGIC population developed at NIAB, to address timely questions regarding the composition of the wheat 'pan-genome', and the ways in which this knowledge can be used for wheat genetic improvement.
Specifically, this project will:
-Produce and annotate high-quality reference sequences for the 8 MAGIC founders.
-Produce and annotate high-quality reference sequences for two internationally important CIMMYT wheat lines.
-Survey sequence of 94 maximally diverse wheat landraces from the Watkins collection to access a wide range of useful genetic variation.
-Catalogue millions of sequence variants, including a systematic identification and characterisation of many types wheat structural variants, including chromosome rearrangements, CNV, PAV and InDels, in both genes and repeats.
-Generate GbS data for 1,000 F9 MAGIC individuals, for identification of cross-over locations, haplotype characterisation, and construction of a POPSEQ genetic map for genome sequence scaffold anchoring and QTL analysis.
-Via analysis of MAGIC phenotypic datasets (18 traits over 28 trials), delineate gene and genetic variant content within QTL intervals at unprecedented detail.
-Characterise the effects of SV on genetic recombination rates and segregation distortion.
-Deliver open-access MAGIC datasets, analysis tools, knowledge and training to the wheat R&D community and to industry.

This project will deliver world-class knowledge, resources and training, and help maintain the UK's global position in wheat genetics and genomics, within a coordinated international framework.

Planned Impact

This project aims to develop resources and analytical tools for understanding and exploiting when genome sequence assemblies. These have the potential to radically alter both wheat breeding and crop improvement, and also facilitate new areas of research that will lead to improved crops by understanding the contributions of genetic variation and gene function to traits.
We aim to achieve the following objectives that will ensure that the proposed work achieves its maximal impact.
- To contribute to developing an international strategy for coordinating future work in wheat pan-genomics;
- To establish a network of collaborators and partners for applying outcomes of the research to genomics-led wheat improvement and promoting their uptake by breeders;
- To work with technology providers to develop new services and products to support genetic analyses of wheat based on multiple wheat genome assemblies;
- To demonstrate and promote the benefits of open access to technologies for generating wheat genomic assemblies, and genetic and genomic data;
- To train researchers from industry in the analysis and application of wheat genomics for breeding and technology development;
- To engage with the public about the excitement, power and societal impacts of genomics, enabling them to understand how it it applied for crop improvement, and to continue dialog about changes in how crops are produced.
- To encourage career development in plant science and genomics by hosting school visits and laboratory placements for school children.

We will maintain/develop key partnerships
Impacts of International Coordination
We will work with the Wheat Initiative to ensure this project achieves maximum synergies with related international activities, and that BBSRC achieves the best value for money and maximises its contributions to global wheat research.
Impacts for plant breeders and biotechnology companies
We will continue in-depth engagement and training of crop breeders in new genomics technologies, and prioritise the development of new tools that will facilitate improved breeding
Impacts for Growers and the Public
We will engage with growers and the general public to establish a dialog about the potential impacts of genomics on crop production, how it might influence farming, consumers and the environment.
Impacts for technology development companies and services
We work with Illumina and other genomics companies to help develop technologies and services for cheaper, faster and better genetics and genomics analyses for large and complex crop genomes

Publications

10 25 50
 
Description We have joined the wheat initiative's pan-genomics project and have released the wheat genomes for several cultivars. The first publication was comparing genomes of accessions across the world was recently released via Nature, with some more specialist satellite papers. We are preparign additional papers on technical methods (an improved scaffolding software pipeline), on comparative genomics and our MAGIC population as a discovery platform.
Exploitation Route Researchers, especially CIMMYT were very excited about getting access to the Weebill genome sequence.
We have published the first of the 10wheatgenomes (pan-genomics) papers in Nature.

See also https://webblast.ipk-gatersleben.de/wheat_ten_genomes/ & https://www.ebi.ac.uk/ena/browser/view/PRJEB35709 for our datasets.
Sectors Agriculture, Food and Drink,Education,Environment

URL http://www.10wheatgenomes.com/data-repository/
 
Description The sequences we have released are being used by a consortium (including DFW/DSW grant holders) to generate a new improved SNP chip. The SNP chip is being designed specifically to use haplotypes to get maximal information for least number of markers, and is specifically targeted at breeders e.g. commercial breeding companies.
First Year Of Impact 2021
Sector Agriculture, Food and Drink
Impact Types Economic

 
Title MAGIC 8+ BLAST server 
Description We created a site using sequenceserver 2.0.0 to allow the wheat research community to access the genomes from our project before publication. 
Type Of Material Data analysis technique 
Year Produced 2022 
Provided To Others? Yes  
Impact This has been widely used as noted by the many searches, and has been used in teaching about wheat genomics. 
URL https://www.cropdiversity.ac.uk/8magic-blast/
 
Title Sequencing and assembly of Claire, Paragon, Robigus, Cadenza and Weebil hexaploid wheat lines 
Description Sequencing and assembly of 4 UK elites Claire, Paragon, Robigus, Cadenza and 1 Mexican (CIMMYT) Weebil hexaploid wheat cultivars 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? Yes  
Impact large dataset covering >50% UK genetic diversity, 1st Mexican (heat and drought tolerant line) made publicly available 
URL https://www.ebi.ac.uk/ena/browser/view/PRJEB35709
 
Description Collaboration with IPK for wheat scaffolding 
Organisation IPK Gatersleben
Country Germany 
Sector Private 
PI Contribution A collaboration between my team using W2RAP genome assemblies (contigs scaffolded with long mate pair data) with Martin Mascher's group at IPK who developed TRITEX assembler (contigs scaffolded with long mate pair, then 10x genomics linked reads, finally HiC data). TRITEX is similar to NRGene's de novo magic assembler. We supplied our W2RAP assemblies with 10x and HiC datasets for 10 wheat genomes, and worked with the team to apply their longer range scaffolding approach. This also includes us testing the method applying their approach implemented in our code for 10x data. We QC'd their assemblies and paid for a staff member to assemble more genomes.
Collaborator Contribution TRITEX is not am easily portable software package, and needs some human QC and tuning according to the genome and datasets. By working with IPK we were able to analyse our data on the pipeline stably installed on their servers, and with a team well used to using it. Martins team tested the approach on two varieties, Robigus and Claire. After QC we were impressed, and because the postdoc was leaving we found another postdoc who could work on the project for some months to scaffold the remaining genomes, leading to 10-100x improvement in scaffolding contiguity.
Impact 10 W2RAP+TRITEX genome assemblies with psuedomolecule data.
Start Year 2021
 
Description Genome scaffolding method - small genome test bed 
Organisation University of East Anglia
Department School of Environmental Sciences UEA
Country United Kingdom 
Sector Academic/University 
PI Contribution We were developing scaffolding approaches using W2RAP and TRITEX assembler methods. However, wheat is a huge complex system in which to test new methods. We decided to test the methods on a smaller bird genome using data from this project. This genome is ~10x smaller than wheat, and bird genomes are well decribed by the B10K project, with very high levels of synteny between avian genomes. We produced psuedomolecules which allowed production of figures showing how inbred individuals were across their chromosomes.
Collaborator Contribution Datasets, sanity checks of our data, publishing these results in Jackson et al. Conservation Biology 2022 see https://doi.org/10.1111/cobi.13918
Impact This is a multi-disciplinary collaboration between conservation biologists, genomics experts, and population geneticists. We published this study in Conservation Biology (see URL).
Start Year 2021
 
Description Illumina users group meeting 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Presentation: "Plant genomics - assembling genomies and understanding haplotypes"
Year(s) Of Engagement Activity 2018
 
Description Meetings of the Wheat 10 Genome Project on pangenomics 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact I attended a meeting on Wheat Pan-genomics connected to the Wheat Initiative meeting in Munich Germany. The International 10 Genome Project outlined their plans and progress, I updated them on the progress and plans of our project. The 10GP covers USA, Canada, China, Japan, India, Australia, Germany, France, Switzerland and UK - but each of these is studying Elite varieties from developed countries who an afford to join this project. Our grant includes the sequencing of 2 CIMMYT varieties (Weebil and Baj), which are from a for-public-good breeding program targeting the developing world. There was much interest in comparing to these lines, and so we have joined the International 10GP by committing to add these genomes to the project.
Year(s) Of Engagement Activity 2018
URL http://www.10wheatgenomes.com/
 
Description Monogram Bioinformatics workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact I presented the breakthroughs in wheat genetics that are coming from our work on 8 of the most important UK elite wheat cultivars. The audience was a mix of researchers, prebreeders, breeders, agronomists and farmers.
Year(s) Of Engagement Activity 2018
 
Description Our Broken Planet - NHM exhibition 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact I worked with the Natural History Museum team to contribute towards the exhibition "Our broken planet: How we got here and ways to fix it" this was open from May to August 2022, but was planned months before this. Specifically I contributed towards the design of a cabinet explaining how modern agriculture is carried out, and how research is working to maximise yields, minimise environmental impacts, specifically around the use of genetics. The cabinet contained different types of wheat, and in a recording I explained how they differ morphologically and genetically, and how we are seeking to breed new varieties of wheat that use less water, pesticides and fungicides by developing better wheat genetics.
Year(s) Of Engagement Activity 2022
URL https://www.nhm.ac.uk/visit/our-broken-planet.html
 
Description Presentation to international wheat pan-genome project expert group on our progress in improving UK cultivar scaffolding 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact Presentation on bringing UK academically derived assemblies towards the same level as the other international assemblies all produced by a commercial company (NRGene). This sparked discussion on how this would allow better UK cultivar and breeding study, and better comparison between UK and international cultivars.
Year(s) Of Engagement Activity 2020
 
Description Public engagement as part of the 1851 Royal Commission even on "Genetic engineering" at the NHM and broadcast on the BBC World Service 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Schools
Results and Impact As part of the reception before the debate on "Genetic Engineering" and broadcast on the BBC, I demonstrated to the public with plant samples how Wheat is a consequence of an historic, and natural, hybridization of three grass species. Furthermore that these events are still occurring and beneficial e.g. Triticale is a hybrid of wheat and rye, and wheat cultivars such as Rialto contain ~250,000,000bp of DNA from rye, which makes them resistant to fungi that otherwise would require extensive use of environmentally damaging pesticides.

With my colleague, Sandra Knapp, we also showed with samples how Solanaceae crop wild relatives can be used to introduce desirable traits controlled by single genes into crops such as potatoes and tomatoes.
Year(s) Of Engagement Activity 2018
URL https://www.bbc.co.uk/mediacentre/proginfo/2018/39/the-weekend-documentary-the-engineers
 
Description Talk at conference in MPI Golm Germany 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Attended the "plants and people" conference at MPI Golm, Germany. This conference is organised by the PhD students at the institute who are also presenting. The meeting attracted speakers from across the world and also from across the field of plant science from research, industry, EU regulations, and public engagements. It was very interesting to be able to meet with so many professionals across the field, and with young scientists doing their PhDs.
Year(s) Of Engagement Activity 2019
URL https://plants-and-people.mpg.de/node/7
 
Description Workshop talk 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Presented to "workshop on multi-parental populations", this was held in Cambridge UK, but was attended by international researchers from Europe, plus scientists from ICRISAT and IRRI ( CGIAR breeding institutes). My presentation introduced our work on Wheat and MAGIC populations in my talk "Combining genome technologies with genetics to analyse MAGIC populations". There was a lot of interest in how genomics can be used in breeding, as showed in our work, and the workshop attendees have written a review on "multi-parental populations" for the Hereditary journal.
Year(s) Of Engagement Activity 2019
URL http://mtweb.cs.ucl.ac.uk/mus/www/MAGICdiverse/MAGIC_workshop.htm