Analysis of quantitative genetic traits in a huge data set

Lead Research Organisation: University of Edinburgh
Department Name: The Roslin Institute

Abstract

The genetic basis of quantitative trait variation and covariation is central to human genetics, evolutionary biology, and plant and animal breeding. In medical genetics many diseases, including schizophrenia, heart disease and cancer, are complex traits with continuous phenotypes and liabilities, which have multiple genome variants contributing genetic variance. In evolutionary biology fitness is largely due to such quantitative traits (e.g., fecundity, longevity). In plant and animal breeding most of the economically important traits are quantitative traits (e.g., milk, meat, and grain yields, environmental footprint, fecundity).

Huge datasets are needed for statistical genomics because many variants (probably thousands), which can be clustered together, contribute to any individual quantitative trait and their effects can combine in complex ways (additive, dominant, epistatic). Moreover, important portions of the genetic variance of quantitative traits are controlled by variants that are rare, have small effect sizes or are highly correlated with other variants. The effects of such quantitative trait variants can only be separated when very powerful statistical models are used in very large data sets.

We will analyse the genetic basis of 25 quantitative traits at the molecular level by creating and analysing a dataset containing genome sequences, pedigrees and trait records of 325000 pigs from the world's biggest commercial breeding programme. The dataset will be created and analysed using imputation and analysis algorithms based on those that we developed to support the breeding programme.

The size of the dataset and the quality of the data will allow us to address three big questions:-
1. Which genome variants control which quantitative traits, how do they control them and how do the multiple variants that control a single trait interact?
2. What kinds of mechanisms cause traits to co-vary? To what extent does pleiotropy and linkage disequilibrium contribute? What is the distribution of the magnitude and sign of joint effects of genomic regions on pairs of traits?
3. To what extent do huge data sets help us address these questions? For the first time we have the technology to generate genome sequence data for hundreds of thousands of individuals at low cost and the computer power to store and analyse such data.

The aim of this project is to harvest scientific benefits from a 15 year billion dollar pig breeding program. Our previous projects asked how statistical genomics helps animal breeding; this project asks how animal breeding helps statistical genomics.

Technical Summary

This project aims to harvest scientific benefits from a 15 year, billion dollar pig breeding program. We will analyse the genome sequences, pedigrees and phenotypes of 325,000 pigs, in order to:-
- Analyse the genetic basis of 25 quantitative traits at the molecular level.
- Explain the covariance between traits.
- Quantify the extent to which huge datasets help us answer these questions
Our objectives are:-
(1)The genetic basis of quantitative traits
For each of the 25 traits we will count the number of quantitative trait variants that can be mapped, analyse how they are distributed across the genome (e.g. randomly or in clusters), which types of variant (e.g., indels or SNPs) control them; and which kinds of genome element (e.g. coding versus noncoding) contain the variants.
We will analyse the interactions between the mapped variants that control each trait to quantify the degree to which they show additivity, dominance, or epistasis. We will also quantify the joint distribution of allele frequencies, ages, and effect sizes, quantify how and by how much the genetic variation changes across many generations. We will quantify the degree to which the contributions to genetic variation differ among the 11 related populations and the 19 generations of our data set and the extent to which they compare with what is known of other species.
(2)What kinds of mechanisms cause traits to co-vary?
We will measure the correlation between traits (locally and genome wide) and identify the extent to which pleiotropy and linkage disequilibrium contribute, the distribution of the magnitude and sign of joint effects of genomic regions on pairs of traits and the degree to which these variants are new or old, common or rare, lie in each type of functional region, and have large or small effect sizes.
(3)We will quantify the extent to which huge data sets, with and without functional annotation information, help us with addressing these questions.

Planned Impact

(i) The academic community. Scientifically, the project constitutes a step change in genetics research because this is the first data set of this scale with whole genome sequence data. As outlined in the section "Academic Beneficiaries" there are several benefits that will accrue to the academic community (animal, plant, human and evolutionary geneticists and other fields that develop and utilise large scale computational methods). This impact will be delivered via publication in journals, presentations at conferences, seminars, and by making data and software available.

(ii) Animal breeding companies, breed societies, and levy boards. The biological insights about quantitative traits will guide these organisations in their efforts to turn genetic variance in traits into response to selection in a way that is sustainable. The quantification of the power of a huge data set combined with functional annotation will guide them in their investments in data for the coming years. The software and scripts that we will use to generate and analyse the data in this project will be made available to these organisations.

(iii) The entire chain of users of pig products. The entire chain of users of pig products, including meat packers, processors, retailers and consumers will benefit because the knowledge generated will equip PIC and other pig breeding companies with tools to deliver a higher quality product, which costs less, and is more environmentally friendly, healthier and suited to individual requirements of stakeholders in the supply chain.

(iv) Plant breeding organisations. The methods, data sets of this scale, and biological insights are also highly relevant to plant breeding organisations. Therefore the benefits to plant breeding organisations, in the developed and developing world, will be similar to those outlined for animal breeding companies, breed societies, and levy boards.

(v) Commercial sequence and genotype providers. Companies providing SNP or sequence data will be able to open up a completely new market based on low cost provision of huge volumes sequence data.

(vi) UK Treasury will benefit from increased tax revenues through increased profitability of PIC, the pork supply chain, other UK agricultural users should they adopt the method, and UK based sequence and genotype providers.

(vii) UK science infrastructure and capacity. The proposed methods and data set will provide a platform for increased R&D capabilities in the UK, maintaining its scientific reputation and associated institutions, with increased capability for sustainable agricultural production. The proposed research will be embedded within training courses that the PI is regularly invited to give, and the post-docs working on the project will have the opportunity to be trained at a world-class institute in a cutting edge area of research while interacting with a leading commercial partner.

(viii) Policy. Sequence data is expensive, but the research and practical benefits are potentially large. Therefore much investment will be made in sequence data in the coming years. The outcomes from this project will guide these investments. This will be particularly relevant for projects such as the Genomics England project which is spending >£300 million to sequence 100,000 individuals.

(ix) Society. All members of society who work to improve or depend upon the competitiveness and sustainability of agriculture will benefit from the downstream practical applications outlined above. The application of the outcomes by breeding organisations will lead to faster and more sustainable genetic progress, leading to healthier food, and food production that is more resource efficient and affordable. Increased efficiencies in agriculture has direct societal benefits in greater food security with less environmental impact. The knowledge will feed into educational programs.

Publications

10 25 50
publication icon
Antolín R (2017) A hybrid method for the imputation of genomic data in livestock populations. in Genetics, selection, evolution : GSE

publication icon
Gaynor RC (2021) AlphaSimR: an R package for breeding program simulations. in G3 (Bethesda, Md.)

publication icon
Hickey JM (2016) Promotion of alleles by genome editing in livestock breeding programmes. in Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie

publication icon
Johnsson M (2021) Genetic variation in recombination rate in the pig. in Genetics, selection, evolution : GSE

 
Description A genetic association analysis (GWAS) to all available 25 traits relevant to the breeding practice was completed. For most of these, significantly associated genomic regions were identified. Currently, differences between line-specific and common-across-lines associations are being interpreted.

A series of bioinformatic and population genetic predictors (GERP, SIFT, MutPred2 and RVIS) have been applied to protein-coding variants in the imputed sequence data. Some of these methods (gene and domain RVIS) have been applied to a subset of the data to analyse diversity at the CD163 gene in combination with pooled sequence data.

Maps have been created that show the recombination landscape in pig. The heritability of genome-wide recombination rate was found to be low but non-zero and comparable between lines. GWAS identified one major locus, detected in most lines, which coincides with RNF212, previously associated with recombination rate in several other species.

Simulations have been performed, showing that it is possible to improve fitness in a breeding population by targeting deleterious load (i.e. selecting against alleles with a negative impact on animal fitness).
Exploitation Route The results will have a major impact on the breeding practice in pigs and other livestock. Next rounds of breeding will focus on the specific selection for favourable alleles on identified loci associated with relevant traits (including recombination rate) and general selection against deleterious alleles. In the future, these will be the logic targets for gene editing.
Sectors Agriculture, Food and Drink,Environment

 
Description The methods and tools developed in this project have been used to process and analyse the largest whole-genome sequencing data to dat - with Genus/PIC breeding programme.
First Year Of Impact 2019
Sector Agriculture, Food and Drink
Impact Types Economic

 
Description DairyMine: Mining integrated dairy data for precision cow management and breeding (Scottish Funding Council Covid-19 Recovery funding to the University of Edinburgh)
Amount £160,067 (GBP)
Organisation University of Edinburgh 
Sector Academic/University
Country United Kingdom
Start 11/2020 
End 07/2021
 
Description Data-driven breeding of resource efficient cattle ({TRAIN@Ed: Horizon 2020 Marie Sklodowska-Curie Action COFUND with Geno)
Amount £205,000 (GBP)
Organisation University of Edinburgh 
Sector Academic/University
Country United Kingdom
Start 05/2020 
End 05/2023
 
Description Ehsani, Alireza - Train@ED Fellowship
Amount £72,690 (GBP)
Funding ID 801215 
Organisation Marie Sklodowska-Curie Actions 
Sector Charity/Non Profit
Country Global
Start 04/2020 
End 04/2023
 
Description Ehsani, Alireza - Train@ED Fellowship
Amount £72,690 (GBP)
Funding ID 801215 Horizon 2020 - Marie Sk_odowska-Curie Actions 
Organisation European Commission 
Sector Public
Country European Union (EU)
Start 04/2020 
End 04/2023
 
Description Genetics and breeding of Taurine-Indicine crossbred dairy cattle
Amount £105,000 (GBP)
Organisation East of Scotland BioScience (EastBio) 
Sector Charity/Non Profit
Country United Kingdom
Start 09/2021 
End 08/2025
 
Description GenoForage: Genomic breeding of forages
Amount £294,000 (GBP)
Organisation Lantmännen 
Sector Academic/University
Country Sweden
Start 04/2022 
End 03/2024
 
Description Newton Fund Workshop Brazil
Amount £52,000 (GBP)
Funding ID 228949780 
Organisation British Council 
Sector Charity/Non Profit
Country United Kingdom
Start 04/2016 
End 09/2016
 
Description Newton Fund Workshop Mexico
Amount £37,550 (GBP)
Funding ID 2016-RLWK7-10399 
Organisation British Council 
Sector Charity/Non Profit
Country United Kingdom
Start 04/2017 
End 03/2018
 
Description On-line course on in-silico modelling of breeding programmes (DataLab)
Amount £105,000 (GBP)
Organisation The Datalab 
Sector Charity/Non Profit
Start 01/2020 
End 03/2021
 
Description Optimising intense layer chicken selection and management of diversity
Amount £29,300 (GBP)
Funding ID BBSRC IAA PIII059 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 02/2020 
End 08/2020
 
Title Development of LCSeq 
Description LCSeq is a system that we are developing which enables sequence information to be generated for animal breeding populations at low-cost. LCSeq recognises that in livestock populations individuals are highly related and thus share long genome segments (haplotypes). Rather than sequencing individuals at high-coverage aims to sequence the haplotypes that are present in the population by spreading the sequencing resources at low-coverage across many individuals. 
Type Of Material Data analysis technique 
Provided To Others? No  
Impact the research is still on-going, however, one of the softwares that will contribute to this model is AlphaSeqOpt, freely available on the AlphaSuite. 
URL http://www.alphagenes.roslin.ed.ac.uk/alphasuite-softwares/alphaseqopt/
 
Description NextGen Breeding project 
Organisation Genus plc
Country United Kingdom 
Sector Private 
PI Contribution The AlphaSuite is a collection of software that we have developed to perform many of the common tasks in animal breeding, plant breeding, and human genetics including genomic prediction, breeding value estimation, variance component estimation, GWAS, imputation, phasing, optimal contributions, simulation, field trial designs, and various data recoding and handling tools.
Collaborator Contribution PIC is providing the DNA sequencing data from approximately 14,000 individual from their genetic nucleus.
Impact At this stage of the collaboration the outputs have not been generated.
Start Year 2015
 
Description Pig breeding with Genus PIC 
Organisation Genus plc
Country United Kingdom 
Sector Private 
PI Contribution Optimising breeding programme
Collaborator Contribution Supplying data and knowledge
Impact Collaboration just begun
Start Year 2018
 
Title AlphaAssign - a parentage assignment algorithm that works with SNP array and sequencing data 
Description AlphaAssign is a parentage assignment algorithm that works with SNP array and GBS data - https://onlinelibrary.wiley.com/doi/full/10.1111/jbg.12370 
Type Of Technology Software 
Year Produced 2018 
Open Source License? Yes  
Impact AlphaAssign has been used by researchers and practitioners in breeding and genetics 
URL https://github.com/AlphaGenes/AlphaAssign
 
Title AlphaImpute2 - Fast and accurate pedigree and population based imputation 
Description AlphaImpute2 is a phasing and imputation algorithm for massive livestock populations. The method uses a approximate version of multi-locus iterative peeling for pedigree based imputation, and a novel imputation algorithm that uses the Positional Burrows Wheeler Transform for population imputation. AlphaImpute2 has been successfully used to perform imputation in populations of hundreds of thousands of individuals. 
Type Of Technology Software 
Year Produced 2020 
Open Source License? Yes  
Impact AlphaImpute2 enabled large-scale phasing and imputation in world-leading animal breeding programmes that operate with hundreds of thousands of genotyped individuals. 
URL https://github.com/AlphaGenes/AlphaImpute2
 
Title AlphaPeel 
Description AlphaPeel is a software package for calling, phasing, and imputing genotype and sequence data in pedigree populations. This program implements single locus peeling, multi locus peeling, and hybrid peeling 
Type Of Technology Software 
Year Produced 2018 
Open Source License? Yes  
Impact AlphaPeel enabled genotype calling, phasing, and imputation of the largest whole-genome sequencing data in a pedigreed population to date. 
URL https://github.com/AlphaGenes/AlphaPeel
 
Title AlphaPhase - phasing genotype data 
Description AlphaPhase is a software package for phasing genotype data. The program implements methods to determine phase using an extended Long Range Phasing and Haplotype Library Imputation. AlphaPhase consists of a single program. All information on the model of analysis, input files and their layout, is specified in a parameter file. 
Type Of Technology Software 
Year Produced 2019 
Impact AlphaPhase has been used to phase SNP array genotype data in large pig and poultry breeding programmes. 
URL https://github.com/AlphaGenes/AlphaPhase
 
Title AlphaSim 
Description One of the fundamental questions in populations dynamic is assessing how changes in the current structure and environment affect the structure composition in both the short and long-term. Plant and animal breeding programs benefits from having a tool to evaluate the potential of different selection strategies or new emerging technologies to improve population performance. Empirical datasets to assess the effect of different factors on one population are difficult to collect, since they require substantial financial and time investments and are subject to noise and error. Simulation is a key tool for both researchers and breeders to assess the impact of different factors given a known historical and current population structure prior to implementation within a real-life setting. AlphaSim is a fast and flexible software tool that enables researchers and breeders to do this. Unlike other simulation tools, AlphaSim has the functionality to manipulate fine details of the population structure in order to simulate realistic scenarios and provides detailed outputs for use in downstream analyses. 
Type Of Technology Software 
Year Produced 2016 
Impact AlphaSim is a freely available software package that simulates genetic population and can assess breeding programs. The AlphaSim package includes a manual, tutorial, and access to technical support with the aim of benefiting the academic research community in animal breeding. This software package has already attracted users from a number of different academic institutions and has supported a number of peer-reviewed academic publications. These publications include: Potential of gene drives with genome editing to increase genetic gain in livestock breeding programs. 2017. Gonen, S, J. Jenko, G. Gorjanc, A.J. Mileham, C.B.A. Whitelaw, J.M. Hickey. Genetics Selection Evolution, 49:3. AlphaSim: Software for Breeding Program Simulation. 2016. Faux A. M., G. Gorjanc, R. C. Gaynor , M. Battagin, S. M. Edwards, D. L. Wilson, Sarah J. Hearne, S. Gonen, and J. M. Hickey. The Plant Genome vol. 9, no.3. AlphaSim is not only used in academic research, but has also attracted industrial collaborations. One such example is our recently awarded Innovate UK grant in collaboration with Driscoll's. 
URL http://www.alphagenes.roslin.ed.ac.uk/alphasuite-softwares/alphasim/
 
Title AlphaSim GUI 
Description In an effort to improve the accessibility and usability of our AlphaSim software, we have developed a graphical user interface (GUI), which uses the Java runtime environment (JRE). By increasing the usability of our software, we hope that the impact of these programs will be even greater, especially for people where available resources are at a premium. Tthe AlphaSim GUI is freely available on the AlphaGenes webpage, and includes video tutorials, practical exercises and support. 
Type Of Technology Webtool/Application 
Year Produced 2016 
Impact Currently we do not have noticeable impacts, as the GUI has been only available for a few months. 
URL http://www.alphagenes.roslin.ed.ac.uk/alphasuite-softwares/alphasim/
 
Title AlphaSimR: An R-package for Breeding Program Simulations 
Description AlphaSimR is an R package for stochastic simulations of plant and animal breeding programs. AlphaSimR is a highly flexible software package able to simulate a wide range of plant and animal breeding programs for diploid and autopolyploid species. AlphaSimR is ideal for testing the overall strategy and detailed design of breeding programs. AlphaSimR utilizes a scripting approach to building simulations that is particularly well suited for modeling highly complex breeding programs, such as commercial breeding programs. The primary benefit of this scripting approach is that it frees users from preset breeding program designs and allows them to model nearly any breeding program design. 
Type Of Technology Software 
Year Produced 2017 
Open Source License? Yes  
Impact AlphaSimR has been used widely by researchers and practitioners in breeding and genetics, most notably it drives and supports development of genomic and quantitative genetic methods and tools at Roslin, optimisation of world-leading breeding programmes, such as Genus, PIC, Bayer CropScience, KWS, Limagrain, BASF, Beta Bugs, and CGIAR Excellence in Breeding platform. 
URL https://github.com/AlphaGenes/AlphaSimR
 
Title AlphaSuite of software for data science, genetics, and breeding 
Description AlphaSuite of software for data science, genetics, and breeding available from https://github.com/AlphaGenes The major tools include: * AlphaSimR for simulation of breeding programmes https://github.com/AlphaGenes/AlphaSimR * AlphaBayes for estimation of SNP effects on phenotype https://github.com/AlphaGenes/AlphaBayes * AlphaAssign for finding progeny-parent (pedigree) relationships https://github.com/AlphaGenes/AlphaAssign * AlphaPhase for phasing and imputation of SNP array genotype data https://github.com/AlphaGenes/AlphaPhase * AlphaImpute for phasing and imputation of SNP array genotype data https://github.com/AlphaGenes/AlphaImpute * AlphaImpute2 for phasing and imputation of SNP array genotype data (version 2) https://github.com/AlphaGenes/AlphaImpute2 * AlphaPeel for genotype calling, phasing, and imputation in pedigreed populations https://github.com/AlphaGenes/AlphaPeel * AlphaFamImpute for genotype calling, phasing, and imputation in families https://github.com/AlphaGenes/AlphaFamImpute * AlphaPlantImpute for phasing and imputation in plant populations (version 2) https://github.com/AlphaGenes/AlphaPlantImpute * AlphaPlantImpute2 for phasing and imputation in plant populations (version 2) https://github.com/AlphaGenes/AlphaPlantImpute2 * AlphaMate for balancing selection and management of genetic diversity in breeding programmes https://github.com/AlphaGenes/AlphaMate * AlphaPart for analysing trend in genetic means and variances https://github.com/AlphaGenes/AlphaPart 
Type Of Technology Software 
Year Produced 2018 
Open Source License? Yes  
Impact AlphaSuite is used by leading public and private animal and plant breeding programmes that supply genetics worldwide in the Global North and Global South. 
URL https://github.com/AlphaGenes
 
Description AlphaGenes Twitter channel 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The AlphaGenes updates the scientific community and a broader audience about news around our research group, scientific output and engagement activities
Year(s) Of Engagement Activity 2012,2013,2014,2015,2016,2017,2018,2019,2020
URL https://twitter.com/Alpha_Genes
 
Description AlphaGenes website 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The AlphaGenes website informs the scientific community about the groups research activities, outputs, courses and available software tools.
Year(s) Of Engagement Activity 2017,2018,2019,2020
URL https://alphagenes.roslin.ed.ac.uk
 
Description Big Data in Agriculture, Part of the DuPont Pioneer Plant Sciences Symposia Series, at Roslin Institute, 14-15 May 2018 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other audiences
Results and Impact Symposium held at the Roslin institute, organised by members of my group, sponsored by third parties from the breeding industry
Year(s) Of Engagement Activity 2018
 
Description Chief Veterinary Officer visit 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Policymakers/politicians
Results and Impact UK Chief Veterinary Officer visited The Roslin Institute in July 2018. I contributed to the visit by giving a talk on "Genomic selection for genetic improvement of farm animals", which informed the CVO about the past, present and future of genetic improvement methods and how we are developing the methods and applying them in collaboration with public and private partners. It also informed the CVO about the power and potential of the available methods to solve some of the pressing needs of the UK agriculture and veterinary practice.
Year(s) Of Engagement Activity 2018
 
Description Contribution to the New York Time article: Open Season Is Seen in Gene Editing of Animals 
Form Of Engagement Activity A magazine, newsletter or online publication
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Open Season Is Seen in Gene Editing of Animals was a feature article on gene Editing by Amy Harmon. Professor John Hickey was interviewed as specialist in the Quantitative Genetic field.
Year(s) Of Engagement Activity 2016
URL https://www.nytimes.com/2015/11/27/us/2015-11-27-us-animal-gene-editing.html?_r=0
 
Description Data-Driven Breeding and Genetics course (2 weeks) on-line 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The principles of animal and plant breeding are increasingly coalescing due to advances in technology and increasing demands and opportunities for agriculture. This two-week graduate level course of integrated lectures and practicals is designed to equip students, academics, and practitioners with theoretical and applied knowledge, skills and tools to design, optimise, and deploy Data Driven Breeding and Genetics techniques for Animals and Plants. It was jointly delivered by scientists and teachers from the University of Edinburgh and colleagues from the Swedish University of Agricultural Sciences and the CGIAR's Excellence in Breeding Platform, with guest lectures from various academic and industry collaborators.

Due to the pandemic the course took place in virtual format from the 20th Sep and 1st Oct 2021. The course lectures were pre-recorded to enable asynchronous worldwide delivery. Course participants engaged with the lectures and practicals at their own pace. They engaged with course instructors and other participants via Slack and daily Zoom sessions (one in the UK morning and one in the UK afternoon time).

Day 1 - Introduction to breeding
Welcome and Introduction (Gregor Gorjanc)
Introduction to breeding programme modelling (Gregor Gorjanc)
AlphaSimR MOOC - Introduction (Gregor Gorjanc)
AlphaSimR MOOC - Relationship between DNA & traits (Gregor Gorjanc)
R crash course on using ggplot and tidyverse (Thiago Paula Oliveira)
The role of livestock in global food security (Geoff Simm)

Day 2 - Breeding programme design
AlphaSimR MOOC - DNA lottery (Gregor Gorjanc)
AlphaSimR MOOC - Response to selection (Gregor Gorjanc)
AlphaSimR MOOC - Modelling complex breeding programmes (Gregor Gorjanc)
How does a major multinational animal breeding programme operate in the 21st century (Andreas Kranis)
How does a major multinational plant breeding programme operate in the 21st century (Brian Gardunia)

Day 3 - Genomic data in breeding
Genomic data, SNP array genotyping and sequencing, and Strategies to generate genomic data in breeding programmes (Gregor Gorjanc)
Phasing genomic data with heuristic and probabilistic methods (Gregor Gorjanc)
Imputation of genomic data (Gregor Gorjanc)
AlphaPeel practical - probabilistic genotype calling, phasing, and imputation of genomic data in pedigreed populations (Jana Obsteter)
AlphaImpute2 practical - fast phasing and imputation (Jana Obsteter)
AlphaFamImpute practical - genotype calling, phasing, and imputation algorithm for large full-sib families (Jana Obsteter)
AlphaAssign practical - parentage assignment (Jana Obsteter)
Breeding in aquaculture (Ross Houston)
Tea breeding and a genomic selection outlook (Nelson Lubanga)

Day 4 - Modelling phenotype data to estimate environmental effects
Introduction to experimental design of field trials (Daniel Tolhurst)
Introduction to linear mixed models for plant breeding (Daniel Tolhurst)
Analysis of phenotype data, including data collected from i) single field trials (with spatial) and ii) field trials across multiple (Daniel Tolhurst)
ASReml practicals (Daniel Tolhurst & Thiago Paula Oliveira)
Overview of forest tree breeding (Jaroslav Klapste)
Genomic selection provides new opportunities for intercrop breeding (Jon Bancic)

Day 5 - Population and Quantitative genetics for breeding
Introduction to population and quantitative genetics for breeding (Martin Johnsson)
Change in frequencies with drift (Martin Johnsson)
Change in frequencies with mutation, migration and selection (Martin Johnsson)
Additive effects (Martin Johnsson)
Non-additive effects (Martin Johnsson)
Inbreeding depression and heterosis (Martin Johnsson)
Practicals (Martin Johnsson)
Genetic evaluation in a multinational plant breeding programs AND/OR CGIAR Excellence in Breeding platform (Eduardo Covarrubias-Pazaran)
Roadmap for black soldier fly breeding (Leticia de Castro Lara)

Day 6 - Quantitative genetics for breeding II
Variance, covariance, correlation and heritability (Eduardo Covarrubias-Pazaran)
Correlated response to selection (Eduardo Covarrubias-Pazaran)
Recurrent selection strategies (Eduardo Covarrubias-Pazaran)
Practicals (Eduardo Covarrubias-Pazaran)
National breeding programme for the Norwegian Red dairy cattle (Janez Jenko)
Breeding a man's best friend (Joanna Ilska)

Day 7 - Modelling phenotype data to estimate genetic effects
Genetic evaluations with focus on pedigree-based BLUP (Ivan Pocrnic)
Introduction to genome-wide association studies (Ivan Pocrnic)
Genomic evaluations (Ivan Pocrnic)
Practicals (Ivan Pocrnic)
A multipart breeding strategy for introgression of exotic germplasm in elite breeding programs using genomic selection (Irene Breider)
Population genetics tools with perspective in dog research (Mateja Janes)

Day 8 - Sustainable breeding
Breeders' dillema
Optimal contribution selection
Optimal cross selection
AlphaMate practical - optimising selection, management of diversity, and mate allocation in breeding programs
A walk-through of three examples
AlphaPart - quantifying the drivers of genetic change (Jana Obsteter & Thiago Paula Oliveira)
Recursive models in animal breeding (Maria Martinez Castillero)
Economic objectives in animal and plant breeding (Cheryl Quinton)

Day 9 - Exploiting modern technologies in breeding programmes
The role of reproductive technologies to boost animal breeding (Gabriela Mafra Fortuna & Gerson Oliveira)
Breeding for disease resistance in animals (Andrea Doeschel-Wilson)
Editing livestock genomes (Simon Lillico)
Evaluating the use of gene drives to limit the spread of invasive populations (Nicky Faber)
The potential of genome editing and gene drives for improving complex traits (Gregor Gorjanc)

Day 10 - Open-ended work on topics of participants' interest
Year(s) Of Engagement Activity 2021
 
Description HighlanderLab Twitter channel 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The HighlanderLab updates the scientific community and a broader audience about news around our research group, scientific output and engagement activities - on management and improvement of populations using data science, genetics, and breeding.
Year(s) Of Engagement Activity 2019,2020,2021,2022
URL https://twitter.com/HighlanderLab
 
Description HighlanderLab website 
Form Of Engagement Activity Engagement focused website, blog or social media channel
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The HighlanderLab updates the scientific community and a broader audience about news around our research group, scientific output and engagement activities - on management and improvement of populations using data science, genetics, and breeding.
Year(s) Of Engagement Activity 2021,2022
URL http://www.ed.ac.uk/roslin/HighlanderLab
 
Description INTRODUCTION TO GENOME-WIDE ASSOCIATION STUDIES (GWAS) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact ±50 researchers from various life science disciplines were introduced in the basics, strategy and methodology of genetic association studies.
Year(s) Of Engagement Activity 2020
URL https://www.physalia-courses.org/courses-workshops/course49/
 
Description InterBull Technical Workshop 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact In February 2018 InterBull organised a Technical Workshop in Ljubljana (Slovenia) on methodology underpinning the international genetic evaluations of dairy and beef cattle. InterBull uses the methodology on a monthly basis for a joint analysis of data from about 50 countries world-wide and their results are disseminated back to the countries and its breeders and farmers to make selection decisions. I have contributed with a talk titled "Integration of foreign estimates of SNP effects into a domestic SNPBLUP". The presentation gained substantial interest and has subsequently been published in a peer reviewed publication linked to this award.
Year(s) Of Engagement Activity 2018
 
Description John Hickey Guest in Farming Today (BBC Radio 4) 
Form Of Engagement Activity A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact On Monday 26th September, The BBC Radio 4 Farming Today had Professor John Hickey as specialist scientist on the subject of breeding programs and scientific impact.
Year(s) Of Engagement Activity 2016
URL http://www.bbc.co.uk/programmes/b07w5xxq
 
Description Lush Visiting Professors and Short Courses in Animal Breeding and Genetics 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Dr. JOHN HICKEY from the Roslin Institute (https://www.ed.ac.uk/roslin/about/contact-us/staff/john-hickey) will be a Lush Visiting Professor at Iowa State University in April and May, 2018.

Dr. Hickey will give a 2-day short course May 10 and 11 entitled "Plant and animal breeding - exploiting new technologies in different ways and at different scale". A course synopsis is given at the end of this message.
Year(s) Of Engagement Activity 2018
URL https://www.bcb.iastate.edu/animal-breeding-and-genetics-announces-lush-visiting-professors-and-shor...
 
Description Modern plant and animal applied genomics driven by genotype and sequence data, University of Zagreb, Croatia, 17-19 July 2018 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Workshop organised and given by me and two other members of my group.
Year(s) Of Engagement Activity 2018
 
Description Public engagement at the Royal Highland Show 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact All members of the research group engaged the visitors of the RHS, to show the importance of their research towards the enhancement of the agricultural sector in direct or indirect ways.
Year(s) Of Engagement Activity 2019
URL https://www.royalhighlandshow.org
 
Description Researcher Links workshop at CNRG, INIFAP, Tepatitlán and Guadalajara, Mexico, 3-7 February 2018 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Workshop organised and given by me and the members of my group
Year(s) Of Engagement Activity 2018
 
Description Short course in Evolutionary Quantitative Genetics 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Study participants or study members
Results and Impact Evolutionary Quantitative Genetics course was a comprehensive review of modern concepts in Evolutionary Quantitative Genetics. The contents of the course are basic statistics, population genetics, quantitative genetics, evolutionary response in quantitative traits, estimating the fitness of traits and mixed models and their extensions. the instructor was Dr Bruce Walsh, Department of Ecology Evolutionary Biology, University of Arizona, and co-author of Genetics and Analysis of Quantitative Traits. The Course was hosted by Professor John Hickey at the Roslin Institute.
Year(s) Of Engagement Activity 2016
URL http://www.alphagenes.roslin.ed.ac.uk/bruce-walsh-visit/
 
Description Short course: The Search for Selection, Roslin Institute, University of Edinburgh 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Description
Biologists are obsessed (indeed, seduced) by the search for signatures of selection in organismal features of interest, ranging from specific traits to genome-wide signatures. A vast number of approaches have been suggested in this search for selection, including genomic-based signatures of recent or ongoing selection, tests based on either excessive amounts or nonrandom patterns of divergence (in both fossil sequences and functional genomics data) and the more classical Lande-Arnold fitness estimates (direct association of phenotypic values with fitness estimates) and their modern extensions (such as aster models). Given the breadth of such searches, a large amount of machinery has been developed, but is rarely presented in a unified fashion. This course presents an integrated overview of all these approaches, highlighting common themes and divergent assumptions.

The goal of this course is to expose investigators from all branches of biology to this rich menagerie of tests, applicable for population geneticists, genome biologists, evolutionary ecologists, paleontologists, functional morphologists, and just about any biologist who ponders on how to formally demonstrate that a feature (or features) of interest might have been shaped by selection.

Intended Audience.
The intended audience is advanced graduate students, postdocs, and faculty with an interest in searching for targets of selection, be they particular genomic sequences or specific traits. Given the breadth of this topic, the material is of interest to students from functional genomics, population and evolutionary genetics, ecology, paleobiology, functional morphology, and statistics (as well as other fields). Background required: some basic introduction to population and/or quantitative genetics.
Year(s) Of Engagement Activity 2018
URL https://wheat.pw.usda.gov/GG3/node/695
 
Description Teaching course: Next Generation Plant and Animal Breeding Programs, Animal Science Department, University of Nebraska, Lincoln. 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Series of the lectures and workshops on Plant and Animal Breeding Programs exploring current practices and future areas
of research. The course was designed and imparted by John Hickey and key members of his team.
Year(s) Of Engagement Activity 2016
URL http://animalscience.unl.edu/next-generation-plant-and-animal-breeding-programs
 
Description Training: Next generation plant breeding programs 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact Workshop to teach professionals in the crop breeding sector to use modern bioinformatics tools to process molecular data and simulate breeding programs in order to make these more efficient. The audience was very pleased with their acquired insights and skills, and considered the training extremely useful.
Year(s) Of Engagement Activity 2019
URL https://alphagenes.roslin.ed.ac.uk/wp/teaching-2/kwsgermany/
 
Description University of Ljubljana symposium on genetic improvement of livestock in honour of Professor Milena Kovac 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact One day symposium on genetic improvement of livestock in honour of Professor Milena Kovac took place in September 2017 at the Animal Science Department of Biotechnical Faculty (University of Ljubljana, Slovenia). I was invited to give a talk on "Optimising selection, maintenance of genetic diversity and logistic constraints". It showcased the role of operations research methods for practical breeding applications, which lead to a series of questions from the participating applied breeders.
Year(s) Of Engagement Activity 2017
 
Description World Congress on Genetics Applied to Livestock Production 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact In February 2018 World Congress on Genetics Applied to Livestock Production took place in Auckland (New Zealand). This is the most important conference in the field of genetic improvement of livestock with up to 2000 participants from public and private research organisations, governments and industry. I have presented a talk titled "Modelling segmental inheritance of complex traits in pedigreed and genotyped populations", which informed the participants about the strengths and weaknesses of different types of data and modelling choices on the genetic improvement of livestock.
Year(s) Of Engagement Activity 2018