šŸ“£ Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Future-Proofing the Sustainability of the MRC High Throughput Sequencing Hub in Scotland

Lead Research Organisation: University of Edinburgh
Department Name: Sch of Biological Sciences

Abstract

The GenePool is the leading next generation genomics facility in Scotland, and one of the largest academic facilities in the UK. The GenePool is funded by the MRC as a High Throughput Sequencing Hub, with specific remit to deliver collaborative access to next generation sequencing and informatics across the MRC science area. This application is for renewal for 24 months of the initial 3-year funding, to bed in the advances made in setting up the MRC Hub, and to move the facility to full cost-recovery. We will focus on delivering first-rate data and analysis on cutting edge science, to a wide range of collaborators both new and established, using the transformative tools of Illumina sequencing and high-throughput computing. We will build upon our MRC Hub and "spokes" model, fostering bioinformatics training and next generation genomics understanding across our target area.
We request core support for promotional and outreach activities, including a new bioinformatics staff member jointly appointed between the Liverpool and Edinburgh MRC Hubs who will organise and deliver advanced training in next generation genomics to PhD students and postdoctoral researchers. We will deliver proactive Next Generation Sequencing Clinics in academic research institutes and departments across Scotland. We will continue to support the Next Generation Bioinformatics User Group meetings, including inviting international speakers and delivering 2-day research workshops on selected technologies. The MRC network of 'spokes' bioinformaticians in institutions across Scotland will be expanded. We will produce promotional materials for distribution to potential collaborators and grant applicants, and use our web site and social media portals to promote the MRC HTS Hub approach to next generation genomics.
We will develop standard operating procedures for new techniques, and to explicitly and rigorously cross-check manufacturers' claims. We have good relationships with the major technology and reagent suppliers and will use these to negotiate early access to new products. We will perform in-house trials of new kits, in direct comparison to existing protocols, and make the results known to our collaborator base. We will work with companies and collaborators to bed in and develop new approaches to the generation of and analysis of next generation sequencing data.
The next generation sequencing revolution is based on two principles: transformative technologies and skilled staff. We have a mature charging model that includes elements of instrument depreciation and staff time, and generate a working surplus (for investment in the facility) each year. We have already transitioned several staff from core grant to cost recovery support. In the period of 24 months of the renewal, we will transition three further core MRC staff posts to full recovery (one bioinformatician and two high-level research technologists). Two new posts will be created to ensure best service to our collaborators, particularly given the vastly increased throughput of the Illumina HiSeq platform. A laboratory manager will ensure fitness-for-purpose of our LIMS installation, manage the wide portfolio of instrumentation (including robotics, qPCR, analytical platforms and other ancillary instrumentation), and provide oversight of the full laboratory workflow process. A business manager will administer the large number of active projects within the facility, in particular dealing with the financial side, managing the LIMS, and keeping collaborators up-to-date with project progress. Both of these new posts are planned to transition rapidly to full cost recovery in two years. Initiation of these posts will free the GenePool Scientific manager (Dr Karim Gharbi) and the Director (Prof Mark Blaxter) to focus on speeding the transformation of researchers' ideas into viable projects, well-analysed studies and high-hitting publications.

Technical Summary

During the first three years of the MRC High Throughput Sequencing Hubs programme, the GenePool invested in training highly-skilled wet-lab and bioinformatics teams and installing leading high-throughput technologies to deliver bespoke genomics collaborations across our science and geographical area. We propose further investment in personnel and in outreach and training to consolidate the Hub at the core of nextgen genomics in our area, to build nextgen genomics analytic capacity across Scotland and the UK, and to transition to a fully cost-recovered financial model.
We will develop a portfolio of training and outreach activities, including a new training post delivering master classes and summer schools in next gen genomics, the very successful Next Generation Bioinformatics User Group meetings and MRC 'spokes' bioinformaticians training. We will deliver proactive NextGen Clinics across Scotland, allowing prospective collaborators to discuss ideas directly with GenePool staff. The training bioinformatics post will be held jointly with our sister MRC Hub in Liverpool, linking the two facilities. We will produce promotional materials, and distribute these, including via our web and social media presences.
We will beta-test and develop standard operating procedures for new applications, including testing of new commercial kits and reagents, and working with collaborators to bed in new ways of using the technology.
We will enhance our collaborators' experience by improving the operation of the facility, appointing a laboratory manager and a business manager. This will streamline the processing of user data, freeing the scientific manager and director to focus on speeding the transformation of researchers' ideas into viable projects, well analysed studies and high-impact publications. At the end of the two year renewal period, the GenePool MRC HTS Hub will be fully transitioned to cost recovery on staff, reagents and equipment renewal.

Planned Impact

Investment by MRC in the GenePool will have a wide and significant range of impacts on academic research, biotechnological and healthcare industries, and clinical research, leading to impacts on clinical practice and policy. We will also have impacts on public understanding of science, and genomics in particular, and individual impact as we develop the skills of highly-motivated young researchers.
The direct impacts of the investment will be in enabling the collaborating research community to access and exploit next generation genomics. By leveraging investments by MRC, NERC, SULSA, the Darwin Trust and the University of Edinburgh the GenePool can deliver cutting edge science across the spectrum of genomic biology.
Impacts will be both institutional and individual. For institutions, access to the GenePool means that sorely-restricted funds can be effectively invested in complementary activities (other high-throughput analytical technologies, or staff and expertise in bioinformatics, for example). Collaborating institutions can rely on GenePool staying abreast of new developments and supporting their research portfolios in a directed and effective manner. For individual researchers, from senior principal investigators to postdoctoral students, access to the GenePool means they can focus on ground breaking biological and clinical research, leaving the details (and heavy lifting) of the genomics data generation and analysis to the Facility.
We have a record in supporting all researchers, no matter what the stage of their careers, and in particular in assisting genomics novices in understanding the platforms and the data, developing research ideas and being successful in grant and fellowship applications. These successes in turn mean that the research institutions and individuals are more successful in attracting additional investment and funding, and progressing their careers. Our proposed programme of master classes and summer schools will allow individuals at early stages in their careers to learn skills in next generation genomics, and prepare themselves for effective future research careers.
On our staff we have a cadre of young scientists who are, as is necessary in this fast moving field, learning on the job. The skills they are developing (in project management and collaborative science as well as in next generation sequencing or bioinformatics) will stand them in good stead in developing their research careers.
Our collaborative user base extends from university, to NHS clinical research and independent research institute staff. While most our collaborators are based in Scotland, there is a significant group from the remainder of the UK, and some from Europe and other countries. These include multinational collaborations investigating human disease (especially parasitic disease) and projects addressing pressing issues of food security in developing and emerging economies. On the commercial side, we collaborate closely in an academic setting with several research groupings, delivering academic genomics service to further specific commercial goals.
Collaborative research with clinical researchers may lead to changes in clinical practice and policy, either at the level of surveillance and screening (the development of tests for genetic or epigenetic changes associated with a particular disease) or in treatment (when a potentially causative genomic signature is identified and a possible treatment path developed). We will work closely with researchers in developing directed assays (such as targeted resequencing platforms, including for smaller-throughput technologies such as IonTorrent) that can be deployed in a clinical situation.
We will promote the work of the GenePool, and next generation genomics in general, to the wider scientific and general public through our website, through directed promotional materials and through presentations at open days and other forums within the University and elsewhere.

Publications

10 25 50
 
Title The genome of Caenorhabditis elegans 
Description The work is the complete genome of C. elegans printed in hardbound volumes, with each chromosome occupying two volumes, 12 volumes in all. The covers of the volumes tesselate to form a picture of the nematode, and the spinse align to make a picture of the nematode. The mitochondrial genome is supplied as an "erratum sheet". 
Type Of Art Artwork 
Year Produced 2014 
Impact The work has been on long term loan and is permanently exhibited in the Summerhall Arts Complex, Edinburgh, as part of a permanent exhibition curated by ASCUS. 
 
Title Transmissions exhibition 
Description The Blaxter lab collaborated closely wth artists-in-residence (see http://www.ascus.org.uk/ciie-micro-residency-artists-announced/) in the Centre for immunity Infection and Evolution to inspire and be part of the final exhibition "Transmissions". Mark Blaxter appears in the film work produced by Anne Milne, and the work of the lab inspired Jo hodges and Robbie Coleman to produce a piece dedicated to the lab. 
Type Of Art Artwork 
Year Produced 2014 
Impact 'Transmissions' was showcased to the general public within a group exhibition 'Parallel Perspectives' in Summerhall as part of the Edinburgh International Science Festival 2015 art programme, How The Light Gets In . This exhibition of work susequently travelled LifeSpace, Dundee, returning to Edinburgh to showcase at the Tent Gallery, Edinburgh College of Art. 
URL http://www.ascus.org.uk/ciie-micro-residency-2/
 
Description BBSRC Project Grant (Genome Databasing)
Amount £671,655 (GBP)
Funding ID BB/K020161/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 11/2013 
End 11/2016
 
Description BBSRC Project Grant (Ruff)
Amount £176,281 (GBP)
Funding ID Bb/J018791/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 09/2013 
End 09/2015
 
Description BBSRC project grant (Striga)
Amount £43,647 (GBP)
Funding ID BB/J011703/1 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 01/2013 
End 12/2015
 
Description BBSRC/DFID Project Grant (Ash Dieback)
Amount £162,746 (GBP)
Funding ID NORNEX 
Organisation Biotechnology and Biological Sciences Research Council (BBSRC) 
Sector Public
Country United Kingdom
Start 02/2014 
End 04/2015
 
Description European Commission FP6 (Filariasis)
Amount £120,500 (GBP)
Organisation Sixth Framework Programme (FP6) 
Sector Public
Country European Union (EU)
Start 08/2010 
End 08/2013
 
Description NERC Project grant (Genome evolution)
Amount £469,000 (GBP)
Funding ID NE/J011355/1 
Organisation Natural Environment Research Council 
Sector Public
Country United Kingdom
Start 09/2012 
End 09/2015
 
Description SynTax
Amount £19,775 (GBP)
Organisation Natural Environment Research Council 
Sector Public
Country United Kingdom
Start 05/2012 
End 05/2013
 
Description SynTax
Amount £26,587 (GBP)
Organisation Natural Environment Research Council 
Sector Public
Country United Kingdom
Start 05/2012 
End 05/2013
 
Description TSB (Atlantic salmon SNP chip)
Amount £323,669 (GBP)
Funding ID 100965 
Organisation Innovate UK 
Sector Public
Country United Kingdom
Start 07/2011 
End 07/2014
 
Description Wellcome Trust (RNASeq)
Amount £169,000 (GBP)
Organisation Wellcome Trust 
Sector Charity/Non Profit
Country United Kingdom
Start 09/2009 
End 08/2012
 
Title 2012 Installation of MiSeq platforms 
Description Following the successful installation and two Illumina HiSeq platforms in 2010, The GenePool installed two Illumina MiSeq instruments in 2012 to support small scale, quick turnaround projects. 
Type Of Material Improvements to research infrastructure 
Year Produced 2012 
Provided To Others? Yes  
Impact The MiSeq installation has been an immediate success with many R&D projects completed in weeks rather than months, and an expanding user base taking advantage of the quick turnaround time, high quality data. 
URL http://genomics.ed.ac.uk
 
Title 2013 Computational infrastructure upgrades 
Description The facility now runs a secure, powerful computer cluster running SGE, with several hundred cores and 2 terabytes (Tb) of RAM. The University of Edinburgh's HECTOR supercomputer and the Edinburgh Compute and Data Facilities (ECDF), a parallel compute cluster with over 3000 cores and several Tb of RAM, provide additional infrastructure. We have an advanced data store (with many hundreds of Tb of capacity) based on highly resilient technology to assure availability and security of data. 
Type Of Material Improvements to research infrastructure 
Provided To Others? No  
Impact The increase in the computational resources has increased our resilience and security, and allows the facility to take on ever larger projects. 
URL http://genomics.ed.ac.uk
 
Title 2013 Installation of four Beckman liquid handling robotic platforms 
Description The Beckman liquid handling robots are state-of-the art instruments for processing samples from purified nucleic acid to quantitated libraries ready for sequencing. 
Type Of Material Improvements to research infrastructure 
Provided To Others? No  
Impact The Beckman robotics platforms allow the facility to process large numbers of routine genomic and RNA samples efficiently and accurately, releasing skilled staff to deal with more difficult or bespoke projects. 
URL http://genomics.ed.ac.uk
 
Title 2013 Installation of six HiSeq 2500 instruments 
Description The facility has replaced its older HiSeq2000 instruments with the 2500 version. These offer higher throughput, faster turnaround, better quality data and improved reliability. 
Type Of Material Improvements to research infrastructure 
Provided To Others? No  
Impact Installation of the additional HiSeq2500 instruments across the two sites of the Edinburgh Genomics entity has made us the biggest in the UK (in data generation capacity) outside the Sanger Instiutute. The increased number of instruments, and changed ways of working with them allow us to respond rapidly to collaborator requests. 
URL http://genomics.ed.ac.uk
 
Title 2013 Installation of three MiSeq instruments 
Description The MiSeq is a key technology for both improving the flow of samples in the facility (it is used for sample/library QC) and delivering very rapid turnaround on critical samples (<24 hrs). 
Type Of Material Improvements to research infrastructure 
Provided To Others? No  
Impact The acquisition of three MiSeq has made the service offered more robust, and has allowed the facility to deliver to ever shorter timescales. in addition the increased read lengths of this instrument deliver to some key needs in the field. 
URL http://genomics.ed.ac.uk
 
Title Compute Infrastructure 2011 
Description The vast increase in data production from the facility required a significant upgrade in our informatics capacity. To this end we have upgraded our network within the facility, installed a 72 Tbyte high-availability data store, brought online several high-end compute clusters, and developed routines for use of the University of Edinburgh's grid computing environment ECDF. 
Type Of Material Improvements to research infrastructure 
Year Produced 2009 
Provided To Others? Yes  
Impact The improved compute infrastructure has allowed GenePool to keep abreast of increased data volumes, and offer more advanced bioinformatic support to collaborators. 
URL http://genomics.ed.ac.uk
 
Title HiSeq2500 
Description The HiSeq2500 is a key next generation sequencing platform that we have installed and offer to a wide range of research teams throughout the UK. While our roster of instruments is changed to suit requirements, we have had up to 5 HiSeq2500 instruments in the facility. 
Type Of Material Technology assay or reagent 
Year Produced 2014 
Provided To Others? Yes  
Impact The HiSeq2500 has generated data that underpin more than 300 papers published to date. 
URL http://genomics.ed.ac.uk
 
Title HiSeq4000 
Description The HiSeq4000 is a key next generation sequencing platform that we have installed and offer to a wide range of research teams throughout the UK. While our roster of instruments is changed to suit requirements, we currently have one HiSeq4000 instrument in the facility. 
Type Of Material Technology assay or reagent 
Year Produced 2015 
Provided To Others? Yes  
Impact The HiSeq4000 offers cheap, high quality sequence data for genome sequencing and resequencing, RNASeq and many other applications. 
URL http://genomics.ed.ac.uk
 
Title HiSeqX 
Description The HiSeqX is a key next generation sequencing platform that we have installed and offer to a wide range of research teams throughout the UK. While our roster of instruments is changed to suit requirements, we currently have 6 HiSeqX instruments in the facility. 
Type Of Material Technology assay or reagent 
Year Produced 2015 
Provided To Others? Yes  
Impact The HiSeqX allows the facility to sequence human and other genomes at very low per-base costs, promoting genetical and genomic research across the RC spectrum. 
URL http://genomics.ed.ac.uk
 
Title Installation of HiSeq2000 and associated robotics 2010 
Description The GenePool recieved delivery of an illumina HiSeq 2000in 2010. A second HiSeq 2000 was delivered in late 2010. Since then we have worked closely with Illumina UK to troubleshoot these instruments and move them from preproduction to production. The increased data production capacity of these instruments has required installation of robotic platforms for shearing, library preparatiion and presequencing amplification, and high-throughput QC/QA instrumentation. 
Type Of Material Improvements to research infrastructure 
Year Produced 2010 
Provided To Others? Yes  
Impact The improved QC/QA procedures made possible by the improved instrumentation has ensured timely delivery of data at high resilience. The robotics platforms have allowed us to take on larger projects and to make best use of the huge data production capacities of the HiSeqs. The HiSeqs are in production, generating 40 Gbase/day. 
URL http://genomics.ed.ac.uk
 
Title Installation of NovaSeq6000 
Description Edinburgh genomics installed the Illumina NovaSeq600 sequencer to replace its ageing HiSeq instruments. The NovaSeq is Illumina's latest platform, and offers higher, scalable throughput at lower cost. The NovaSeq was installed and operational within 1 month of delivery and has been used to service a wide range of UKRI-funded projects. 
Type Of Material Technology assay or reagent 
Year Produced 2018 
Provided To Others? Yes  
Impact The NovaSeq offers improved sequencing throughput at lower per base cost to users. 
URL http://www.genomics.ed.ac.uk
 
Title Installation of PromethION sequencer in Edinburgh genomics 
Description In early 2019, Edinburgh Genomics installed a PromethION sequencer, the latest long read sequencer from Oxford Nanopore. The sequencer has been tested on control and real world samples through an early adopter programme, and is now ready for open access use by the UK science community. 
Type Of Material Technology assay or reagent 
Year Produced 2019 
Provided To Others? Yes  
Impact The PromethION install is in its early days, but has already engendered a lot of interest in the community. 
URL http://www.genomics.ed.ac.uk
 
Title Installation of a second HiSeq 2000 
Description Second instrument purchased to meet demand for Illumina sequencing technology 
Type Of Material Improvements to research infrastructure 
Year Produced 2011 
Provided To Others? Yes  
Impact The installation of a second instrument effectively doubled the GenePool's sequencing capacity, allowing us to expand our user base and reduce turnaround times for all users 
URL http://genomics.ed.ac.uk
 
Title RAD Seq 
Description RAD seq is a new method for genotyping by sequencing. Edinburgh Genomics pioneered its use in the UK ecological and evolutionary genomics community, and continues to develop it. In 2014 Edinburgh Genomics signed the first European licence with KeyGene, holder of paptents in the RAD/GBS area, giving the facility security to deliver the technology in including new variations that improve throughput and accuracy. The facility has also developed software toolkits for RAD analyses and run RAD workshops and meetings.. 
Type Of Material Technology assay or reagent 
Year Produced 2009 
Provided To Others? Yes  
Impact RAD sequencing is now a tool of choice across biosciences. In particular it is excellent for discovering variants in new genomes, and then building high throughput assay tools. This has been made concrete in the development of a SNP assay chip for salmon with our industrial partners LandCatch. 
URL http://genomics.ed.ac.uk
 
Title Additional file 10: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S10. GO term enrichment for those subsets of genes whose highest PEM is for a given tissue. (XLSX 63 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_10_of_Combination_of_novel_and_public_R...
 
Title Additional file 10: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S10. GO term enrichment for those subsets of genes whose highest PEM is for a given tissue. (XLSX 63 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_10_of_Combination_of_novel_and_public_R...
 
Title Additional file 11: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S11. All-against-all correlation matrix for each tissue in the meta-dataset. (XLSX 79 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_11_of_Combination_of_novel_and_public_R...
 
Title Additional file 11: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S11. All-against-all correlation matrix for each tissue in the meta-dataset. (XLSX 79 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_11_of_Combination_of_novel_and_public_R...
 
Title Additional file 12: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S12. Tissues whose expression vectors are most strongly correlated with each other. (XLSX 18 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_12_of_Combination_of_novel_and_public_R...
 
Title Additional file 12: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S12. Tissues whose expression vectors are most strongly correlated with each other. (XLSX 18 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_12_of_Combination_of_novel_and_public_R...
 
Title Additional file 13: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S13. Clusters of co-expressed genes (obtained via network analysis of the RNA-seq meta-dataset), including candidate gene names for unannotated GalGal5 protein-coding genes. (XLSX 1432 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_13_of_Combination_of_novel_and_public_R...
 
Title Additional file 13: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S13. Clusters of co-expressed genes (obtained via network analysis of the RNA-seq meta-dataset), including candidate gene names for unannotated GalGal5 protein-coding genes. (XLSX 1432 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_13_of_Combination_of_novel_and_public_R...
 
Title Additional file 14: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S14. Proportion of genes in each co-expression cluster whose highest PEM is for a given tissue. (XLSX 19 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_14_of_Combination_of_novel_and_public_R...
 
Title Additional file 14: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S14. Proportion of genes in each co-expression cluster whose highest PEM is for a given tissue. (XLSX 19 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_14_of_Combination_of_novel_and_public_R...
 
Title Additional file 15: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S15. GO term enrichment for co-expression clusters containing >â =â 100 genes. (XLSX 42 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_15_of_Combination_of_novel_and_public_R...
 
Title Additional file 15: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S15. GO term enrichment for co-expression clusters containing >â =â 100 genes. (XLSX 42 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_15_of_Combination_of_novel_and_public_R...
 
Title Additional file 16: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S16. Correlation of expression profiles for genes with a known protein-protein interaction. (XLSX 4202 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_16_of_Combination_of_novel_and_public_R...
 
Title Additional file 16: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S16. Correlation of expression profiles for genes with a known protein-protein interaction. (XLSX 4202 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_16_of_Combination_of_novel_and_public_R...
 
Title Additional file 17: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S17. Clusters of co-expressed CAGE tags, obtained via network analysis of the Lizio, et al. dataset [53]. (XLSX 3452 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_17_of_Combination_of_novel_and_public_R...
 
Title Additional file 17: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S17. Clusters of co-expressed CAGE tags, obtained via network analysis of the Lizio, et al. dataset [53]. (XLSX 3452 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_17_of_Combination_of_novel_and_public_R...
 
Title Additional file 18: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S18. Comparison of co-expression clusters between the RNA-seq atlas and the Lizio, et al. CAGE dataset [53]. (XLSX 108 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_18_of_Combination_of_novel_and_public_R...
 
Title Additional file 18: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S18. Comparison of co-expression clusters between the RNA-seq atlas and the Lizio, et al. CAGE dataset [53]. (XLSX 108 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_18_of_Combination_of_novel_and_public_R...
 
Title Additional file 1: Table S1. of Integration of quantitated expression estimates from polyA-selected and rRNA-depleted RNA-seq libraries 
Description Gene expression level for ovine BMDMs pre- and post-LPS stimulation, quantified against the full Oar v3.1 reference transcriptome using polyA-selected and rRNA-depleted libraries. Table S2. Comparison of polyA+ and ribo-minus TPM distributions with different reference transcriptomes. Table S3. Genes detected in both polyA-selected and rRNA-depleted libraries, if quantified against the full Oar v3.1 reference transcriptome. Table S4. Genes detected only in polyA-selected libraries, if quantified against the full Oar v3.1 reference transcriptome. Table S5. Genes detected only in rRNA-depleted libraries, if quantified against the full Oar v3.1 reference transcriptome. Table S6. Number of pseudoalignments per sample, if quantified against the full Oar v3.1 reference transcriptome. Table S7. Differences in expression between polyA-selected and rRNA-depleted libraries when varying the reference transcriptome. Table S8. Protein-coding genes with the largest absolute difference between polyA+ and ribo-minus TPM. Table S9. GO term enrichment for the set of protein-coding genes with the largest absolute difference between polyA+ and ribo-minus TPM. Table S10. Human gene expression meta-atlas, generated using polyA+ and rRNA-depleted ENCODE RNA-seq data. Table S11. Coding potential of putative novel CDS. (XLSX 32632 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_1_Table_S1_of_Integration_of_qu...
 
Title Additional file 1: Table S1. of Integration of quantitated expression estimates from polyA-selected and rRNA-depleted RNA-seq libraries 
Description Gene expression level for ovine BMDMs pre- and post-LPS stimulation, quantified against the full Oar v3.1 reference transcriptome using polyA-selected and rRNA-depleted libraries. Table S2. Comparison of polyA+ and ribo-minus TPM distributions with different reference transcriptomes. Table S3. Genes detected in both polyA-selected and rRNA-depleted libraries, if quantified against the full Oar v3.1 reference transcriptome. Table S4. Genes detected only in polyA-selected libraries, if quantified against the full Oar v3.1 reference transcriptome. Table S5. Genes detected only in rRNA-depleted libraries, if quantified against the full Oar v3.1 reference transcriptome. Table S6. Number of pseudoalignments per sample, if quantified against the full Oar v3.1 reference transcriptome. Table S7. Differences in expression between polyA-selected and rRNA-depleted libraries when varying the reference transcriptome. Table S8. Protein-coding genes with the largest absolute difference between polyA+ and ribo-minus TPM. Table S9. GO term enrichment for the set of protein-coding genes with the largest absolute difference between polyA+ and ribo-minus TPM. Table S10. Human gene expression meta-atlas, generated using polyA+ and rRNA-depleted ENCODE RNA-seq data. Table S11. Coding potential of putative novel CDS. (XLSX 32632 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_1_Table_S1_of_Integration_of_qu...
 
Title Additional file 1: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S1. Data sources for creating an RNA-seq meta-atlas. (XLSX 16 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_1_of_Combination_of_novel_and_public_RN...
 
Title Additional file 1: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S1. Data sources for creating an RNA-seq meta-atlas. (XLSX 16 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_1_of_Combination_of_novel_and_public_RN...
 
Title Additional file 1: of Transcriptome-wide RNA processing kinetics revealed using extremely short 4tU labeling 
Description Tables S1â S14; figures S1â S7; and Supplementary Methods and References. (ZIP 6028 kb) 
Type Of Material Database/Collection of data 
Year Produced 2015 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_1_of_Transcriptome-wide_RNA_pro...
 
Title Additional file 1: of Transcriptome-wide RNA processing kinetics revealed using extremely short 4tU labeling 
Description Tables S1â S14; figures S1â S7; and Supplementary Methods and References. (ZIP 6028 kb) 
Type Of Material Database/Collection of data 
Year Produced 2015 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_1_of_Transcriptome-wide_RNA_pro...
 
Title Additional file 2: Table S1. of RNA-binding activity of TRIM25 is mediated by its PRY/SPRY domain and is required for ubiquitination 
Description Supplementary data showing protein names, SILAC ratios, peptide counts, and peptide intensities identified by SILAC MS in T7-TRIM25, T7-TRIM25Î RBD, and T7-TRIM25â +â RNases co-immunoprecipitation assays. (XLSX 22 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_2_Table_S1_of_RNA-binding_activ...
 
Title Additional file 2: Table S1. of RNA-binding activity of TRIM25 is mediated by its PRY/SPRY domain and is required for ubiquitination 
Description Supplementary data showing protein names, SILAC ratios, peptide counts, and peptide intensities identified by SILAC MS in T7-TRIM25, T7-TRIM25Î RBD, and T7-TRIM25â +â RNases co-immunoprecipitation assays. (XLSX 22 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_2_Table_S1_of_RNA-binding_activ...
 
Title Additional file 2: of Caenorhabditis monodelphis sp. n.: defining the stem morphology and genomics of the genus Caenorhabditis 
Description Details of software, versions and parameters used in analysis. (TSV 5 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_2_of___Caenorhabditis_monodelph...
 
Title Additional file 2: of Caenorhabditis monodelphis sp. n.: defining the stem morphology and genomics of the genus Caenorhabditis 
Description Details of software, versions and parameters used in analysis. (TSV 5 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_2_of___Caenorhabditis_monodelph...
 
Title Additional file 2: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S2. Independent datasets sequencing the same tissue/cell type. (XLSX 8 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_2_of_Combination_of_novel_and_public_RN...
 
Title Additional file 2: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S2. Independent datasets sequencing the same tissue/cell type. (XLSX 8 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_2_of_Combination_of_novel_and_public_RN...
 
Title Additional file 3: Table S2. of RNA-binding activity of TRIM25 is mediated by its PRY/SPRY domain and is required for ubiquitination 
Description Supplementary data showing protein names, SILAC ratios (more than two-fold), peptide counts, and peptide intensities identified by SILAC MS in wild-type and TRIM25 KO HeLa cells. (XLSX 33 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_3_Table_S2_of_RNA-binding_activ...
 
Title Additional file 3: Table S2. of RNA-binding activity of TRIM25 is mediated by its PRY/SPRY domain and is required for ubiquitination 
Description Supplementary data showing protein names, SILAC ratios (more than two-fold), peptide counts, and peptide intensities identified by SILAC MS in wild-type and TRIM25 KO HeLa cells. (XLSX 33 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_3_Table_S2_of_RNA-binding_activ...
 
Title Additional file 3: of Caenorhabditis monodelphis sp. n.: defining the stem morphology and genomics of the genus Caenorhabditis 
Description Accessions and links to genome-derived protein sequence data used in phylogenetic analysis. (TSV 4 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_3_of___Caenorhabditis_monodelph...
 
Title Additional file 3: of Caenorhabditis monodelphis sp. n.: defining the stem morphology and genomics of the genus Caenorhabditis 
Description Accessions and links to genome-derived protein sequence data used in phylogenetic analysis. (TSV 4 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_3_of___Caenorhabditis_monodelph...
 
Title Additional file 3: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S3. Exponents of the log-log plots after plotting the reverse cumulative distribution of TPM per gene on a log-log scale. (XLSX 21 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_3_of_Combination_of_novel_and_public_RN...
 
Title Additional file 3: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S3. Exponents of the log-log plots after plotting the reverse cumulative distribution of TPM per gene on a log-log scale. (XLSX 21 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_3_of_Combination_of_novel_and_public_RN...
 
Title Additional file 4: Table S3. of RNA-binding activity of TRIM25 is mediated by its PRY/SPRY domain and is required for ubiquitination 
Description Supplementary data showing gene names and number of RNA reads derived from three independent T7-TRIM25 CLIP-seq experiments. (XLSX 117 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_4_Table_S3_of_RNA-binding_activ...
 
Title Additional file 4: Table S3. of RNA-binding activity of TRIM25 is mediated by its PRY/SPRY domain and is required for ubiquitination 
Description Supplementary data showing gene names and number of RNA reads derived from three independent T7-TRIM25 CLIP-seq experiments. (XLSX 117 kb) 
Type Of Material Database/Collection of data 
Year Produced 2017 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_4_Table_S3_of_RNA-binding_activ...
 
Title Additional file 4: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S4. Number of genes with detectable expression, per tissue, after the first iteration of Kallisto. (XLSX 11 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_4_of_Combination_of_novel_and_public_RN...
 
Title Additional file 4: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S4. Number of genes with detectable expression, per tissue, after the first iteration of Kallisto. (XLSX 11 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_4_of_Combination_of_novel_and_public_RN...
 
Title Additional file 5: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S5. Transcripts not detectably expressed (at >â 1 TPM) in any tissue, after the first iteration of Kallisto. (XLSX 91 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_5_of_Combination_of_novel_and_public_RN...
 
Title Additional file 5: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S5. Transcripts not detectably expressed (at >â 1 TPM) in any tissue, after the first iteration of Kallisto. (XLSX 91 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_5_of_Combination_of_novel_and_public_RN...
 
Title Additional file 6: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S6. Chicken RNA-seq meta-dataset, after the second (and final) iteration of Kallisto. (XLSX 23600 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_6_of_Combination_of_novel_and_public_RN...
 
Title Additional file 6: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S6. Chicken RNA-seq meta-dataset, after the second (and final) iteration of Kallisto. (XLSX 23600 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_6_of_Combination_of_novel_and_public_RN...
 
Title Additional file 7: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S7. Proportion of RNA-seq reads retained by down-sampling the LPS-stimulated BMDM datasets. (XLSX 12 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_7_of_Combination_of_novel_and_public_RN...
 
Title Additional file 7: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S7. Proportion of RNA-seq reads retained by down-sampling the LPS-stimulated BMDM datasets. (XLSX 12 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_7_of_Combination_of_novel_and_public_RN...
 
Title Additional file 8: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S8. Number of detectably expressed genes after randomly down-sampling the LPS-stimulated BMDM datasets. (XLSX 13 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_8_of_Combination_of_novel_and_public_RN...
 
Title Additional file 8: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S8. Number of detectably expressed genes after randomly down-sampling the LPS-stimulated BMDM datasets. (XLSX 13 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_8_of_Combination_of_novel_and_public_RN...
 
Title Additional file 9: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S9. Range of expression estimates, and absolute difference between largest and smallest estimate, after randomly down-sampling the LPS-stimulated BMDM datasets. (XLSX 8074 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_9_of_Combination_of_novel_and_public_RN...
 
Title Additional file 9: of Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken 
Description Table S9. Range of expression estimates, and absolute difference between largest and smallest estimate, after randomly down-sampling the LPS-stimulated BMDM datasets. (XLSX 8074 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_9_of_Combination_of_novel_and_public_RN...
 
Title Assembly and validation of conserved long non-coding RNAs in the ruminant transcriptome 
Description mRNA-like long non-coding RNAs (lncRNA) are a significant component of mammalian transcriptomes, although most are expressed only at low levels, with high tissue-specificity and/or at specific developmental stages. This dataset demonstrates that few lncRNA are fully captured by biological replicates of the same RNA-seq library. In a transcriptional atlas of the domestic sheep (https://doi.org/10.1371/journal.pgen.1006997), 31 diverse tissues/cell types were sampled in each of 6 individual adults (3 females, 3 males, all unrelated virgin animals approximately 2 years of age). By taking a subset of 31 common tissues per individual, each of the 6 adults (f1, f2, f3, m1, m2, and m3) was represented by ~0.75 billion reads. In a typical lncRNA assembly pipeline, read alignments from all individuals are merged, to maximise the number of candidate gene models (using, for instance, StringTie --merge). With n = 6 adults (and ~0.75 billion reads per adult), there are (2^n)-1 = 63 possible combinations of data for which GTFs can be made with StringTie --merge. This dataset comprises those GTFs. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://datashare.ed.ac.uk/handle/10283/2995
 
Title Data from: Comparative population genomics reveals key barriers to dispersal in Southern Ocean penguins 
Description The mechanisms that determine patterns of species dispersal are important factors in the production and maintenance of biodiversity. Understanding these mechanisms helps to forecast the responses of species to environmental change. Here we used a comparative framework and genome-wide data obtained through RAD-seq to compare the patterns of connectivity among breeding colonies for five penguin species with shared ancestry, overlapping distributions, and differing ecological niches, allowing an examination of the intrinsic and extrinsic barriers governing dispersal patterns. Our findings show that at-sea range and oceanography underlie patterns of dispersal in these penguins. The pelagic niche of emperor (Aptenodytes forsteri), king (A. patagonicus), Adélie (Pygoscelis adeliae) and chinstrap (P. antarctica) penguins facilitates gene flow over thousands of kilometres. In contrast, the coastal niche of gentoo penguins (P. papua) limits dispersal, resulting in population divergences. Oceanographic fronts also act as dispersal barriers to some extent. We recommend that forecasts of extinction risk incorporate dispersal and that management units are defined by at-sea range and oceanography in species lacking genetic data. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://datadryad.org/stash/dataset/doi:10.5061/dryad.bs30388
 
Title Data from: Frequent fires prime plant developmental responses to burning 
Description Coping with temporal variation in fire requires plants to have plasticity in traits that promote persistence, but how plastic responses to current conditions are affected by past fire exposure remains unknown. We investigate phenotypic divergence between populations of four resprouting grasses exposed to differing experimental fire regimes (annually-burnt or unburnt for >35 years), and test whether divergence persists after plants are grown in a common environment for one year. Traits relating to flowering and biomass allocation were measured before plants were experimentally burnt, and their regrowth was tracked. Genetic differentiation between populations was investigated for a subset of individuals. Historic fire frequency influenced traits relating to flowering and below-ground investment. Previously burnt plants produced more inflorescences and invested proportionally more biomass below ground, suggesting greater capacity for recruitment and resprouting than unburnt individuals. Branch-scale regrowth rate did not differ between treatments, but prior fire exposure enhanced total regrown biomass in two species. We found no consistent genetic differences between populations suggesting trait differences arose from developmental plasticity. Grass development is influenced by prior fire exposure, independent of current environmental conditions. This priming response to fire, resulting in adaptive trait changes, may produce communities more resistant to future fire regime changes. 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? Yes  
URL https://datadryad.org/stash/dataset/doi:10.5061/dryad.7qr55jn
 
Title Data from: Genomic associations with bill length and disease reveal drift and selection across island bird populations 
Description Island species provide excellent models for investigating how selection and drift operate in wild populations, and for determining how these processes act to influence local adaptation and speciation. Here, we examine the role of selection and drift in shaping genomic and phenotypic variation across recently separated populations of Berthelot's pipit (Anthus berthelotii), a passerine bird endemic to three archipelagos in the Atlantic. We first characterised genetic diversity and population structuring, which supported previous inferences of a history of recent colonisations and bottlenecks. We then tested for regions of the genome associated with the ecologically important traits of bill length and malaria infection, both of which vary substantially across populations in this species. We identified a SNP associated with variation in bill length among individuals, islands and archipelagos; patterns of variation at this SNP suggest that both phenotypic and genotypic variation in bill length is largely shaped by founder effects. Malaria was associated with SNPs near/within genes involved in the immune response, but this relationship was not consistent among archipelagos, supporting the view that disease resistance is complex and rapidly evolving. Although we found little evidence for divergent selection at candidate loci for bill length and malaria resistance, genome scan analyses pointed to several genes related to immunity and metabolism as having important roles in divergence and adaptation. Our findings highlight the utility and challenges involved with combining association mapping and population genetic analysis in non-equilibrium populations, to disentangle the effects of drift and selection on shaping genotypes and phenotypes. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://datadryad.org/stash/dataset/doi:10.5061/dryad.9642b
 
Title Data from: Genomic landscapes of divergence among island bird populations: evidence of parallel adaptation but at different loci? 
Description When populations colonise new environments they may be exposed to novel selection pressures but also suffer from extensive genetic drift due to founder effects, small population sizes, and limited interpopulation gene flow. Genomic approaches enable us to study how these factors drive divergence, and disentangle neutral effects from differentiation at specific loci due to selection. Here, we investigate patterns of genetic diversity and divergence using whole-genome resequencing (> 22X coverage) in Berthelot's pipit (Anthus berthelotii), a passerine endemic to the islands of three north Atlantic archipelagos. Strong environmental gradients, including in pathogen pressure, across populations in the species range, make it an excellent system in which to explore traits important in adaptation and/or incipient speciation. Firstly, we quantify how genomic divergence accumulates across the speciation continuum, i.e., among Berthelot's pipit populations, between subspecies across archipelagos, and between Berthelot's pipit and its mainland ancestor, the tawny pipit (Anthus campestris). Across these colonisation timeframes (2.1 million - ca. 8,000 years ago), we identify highly differentiated loci within genomic islands of divergence and conclude that the observed distributions align with expectations for non-neutral divergence. Characteristic signatures of selection are identified in loci associated with craniofacial/bone and eye development, metabolism, and immune response between population comparisons. Interestingly, we find limited evidence for repeated divergence of the same loci across the colonisation range but do identify different loci putatively associated with the same biological traits in different populations, likely due to parallel adaptation. Incipient speciation across these island populations, in which founder effects and selective pressures are strong, may therefore be repeatedly associated with morphology, metabolism, and immune defence. 
Type Of Material Database/Collection of data 
Year Produced 2024 
Provided To Others? Yes  
URL https://datadryad.org/stash/dataset/doi:10.5061/dryad.1g1jwsv4b
 
Title Data from: Genomic variation, population history and within-archipelago adaptation between island bird populations 
Description Oceanic island archipelagos provide excellent models to understand evolutionary processes. Colonisation events and gene flow can interact with selection to shape genetic variation at different spatial scales. Landscape-scale variation in biotic and abiotic factors may drive fine-scale selection within islands, while long-term evolutionary processes may drive divergence between distantly related populations. Here, we examine patterns of population history and selection between recently diverged populations of the Berthelot's pipit (Anthus berthelotii), a passerine endemic to three North Atlantic archipelagos. First we use demographic trees and f3 statistics to show that genome-wide divergence across the species range is largely shaped by colonisation and bottlenecks, with evidence of very weak gene flow between populations. Then, using a genome scan approach, we identify signatures of divergent selection within-archipelagos at SNPs in genes potentially associated with craniofacial development and DNA repair. We did not detect within-archipelago selection at the same SNPs as were detected previously at broader spatial scales between archipelagos, but did identify signatures of selection at loci associated with similar biological functions. These findings suggest that similar ecological factors may repeatedly drive selection between recently separated populations, as well as at broad spatial scales across varied landscapes. 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
URL https://datadryad.org/stash/dataset/doi:10.5061/dryad.pc866t1kt
 
Title Data from: Reproductive workers show queen-like gene expression in an intermediately eusocial insect, the buff-tailed bumble bee Bombus terrestris 
Description Bumble bees represent a taxon with an intermediate level of eusociality within Hymenoptera. The clear division of reproduction between a single founding queen and the largely sterile workers is characteristic for highly eusocial species, whereas the morphological similarity between the bumble bee queen and the workers is typical for more primitively eusocial hymenopterans. Also, unlike other highly eusocial hymenopterans, division of labour among worker subcastes is plastic and not predetermined by morphology or age. We conducted a differential expression analysis based on RNA-seq data from 11 combinations of developmental stage and caste to investigate how a single genome can produce the distinct castes of queens, workers and males in the buff-tailed bumble bee Bombus terrestris. Based on expression patterns, we found males to be the most distinct of all adult castes (2411 transcripts differentially expressed compared to nonreproductive workers). However, only relatively few transcripts were differentially expressed between males and workers during development (larvae: 71 and pupae: 162). This indicates the need for more distinct expression patterns to control behaviour and physiology in adults compared to those required to create different morphologies. Among female castes, reproductive workers and their nonreproductive sisters displayed differential expression in over ten times more transcripts compared to the differential expression found between reproductive workers and their mother queen. This suggests a strong shift towards a more queenlike behaviour and physiology when a worker becomes fertile. This contrasts with eusocial species where reproductive workers are more similar to nonreproductive workers than the queen. 
Type Of Material Database/Collection of data 
Year Produced 2015 
Provided To Others? Yes  
URL https://datadryad.org/stash/dataset/doi:10.5061/dryad.sp048
 
Title Genomic associations with poxvirus across divergent island populations in Berthelot's pipit 
Description Understanding the mechanisms and genes that enable animal populations to adapt to pathogens is important from an evolutionary, health and conservation perspective. Berthelot's pipit (Anthus berthelotii) experiences extensive and consistent spatial heterogeneity in avian pox infection pressure across its range of island populations, thus providing an excellent system with which to examine how pathogen-mediated selection drives spatial variation in immunogenetic diversity. Here we test for evidence of genetic variation associated with avian pox at both an individual and population-level. At the individual level, we find no evidence that variation in MHC class I and TLR4 (both known to be important in recognising viral infection) was associated with pox infection within two separate populations. However, using genotype-environment association (Bayenv) in conjunction with genome-wide (ddRAD-seq) data, we detected strong associations between population-level avian pox prevalence and allele frequencies of single nucleotide polymorphisms (SNPs) at a number of sites across the genome. These sites were located within genes involved in cellular stress signalling and immune responses, many of which have previously been associated with responses to viral infection in humans and other animals. Consequently, our analyses indicates that pathogen-mediated selection may play a role in shaping genomic variation among relatively recently colonised island bird populations and highlights the utility of genotype-environment associations for identifying candidate genes potentially involved in host-pathogen interactions. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://datadryad.org/stash/dataset/doi:10.5061/dryad.6t1g1jx0r
 
Title MOESM1 of Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome 
Description Additional file 1. This file contains all supplementary tables relating to the characterisation of sheep lncRNAs. Table S1. Candidate sheep lncRNAs: a shortlist of novel gene models (plus independently confirmed known gene models) assessed for coding potential using CPC, CPAT, PLEK, blastp vs. Swiss-Prot, and HMMER vs. Pfam. Table S4. Sheep gene models considered as non-coding by either CPC, CPAT or PLEK but showing sequence homology to either a known protein (in Swiss-Prot) or protein domain (in Pfam-A). Table S6. Number of novel sheep lncRNA gene models identified per chromosome. Table S8. Number of novel sheep lncRNA gene models identified, by category. Table S10. Alignments of novel sheep lncRNA gene models to goat, cattle and human lncRNA. Table S19. Expression level estimates for 13,047 novel sheep lncRNAs, as transcripts per million (TPM), which were assessed using 71 adult RNA-seq libraries (11 tissues plus one cell type under two conditions, each sample is sequenced for six individuals). Table S21. Reproducibility of sheep lncRNA gene models when merging all combinations of data from six adults (three females and three males), for each individual a common set of RNA-seq libraries (comprising 31 tissues/cell types) was available. Table S22. Number of sheep expression atlas RNA-seq libraries (out of 429 in total) for which a candidate lncRNA gene model cannot be fully reconstructed. Table S23. Genes within each co-expression cluster, after network analysis of the sheep RNA-seq libraries. Table S24. Number of sheep lncRNAs co-expressed with protein-coding genes. Table S25. Distance between lncRNAs and protein-coding genes within the same co-expression cluster, on the same chromosome, for sheep. Table S26. Correlation between the expression profile of sheep lncRNAs and their nearest protein-coding genes, both 5' and 3'. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/MOESM1_of_Cross-species_inference_of_long_non-coding_RN...
 
Title MOESM1 of Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome 
Description Additional file 1. This file contains all supplementary tables relating to the characterisation of sheep lncRNAs. Table S1. Candidate sheep lncRNAs: a shortlist of novel gene models (plus independently confirmed known gene models) assessed for coding potential using CPC, CPAT, PLEK, blastp vs. Swiss-Prot, and HMMER vs. Pfam. Table S4. Sheep gene models considered as non-coding by either CPC, CPAT or PLEK but showing sequence homology to either a known protein (in Swiss-Prot) or protein domain (in Pfam-A). Table S6. Number of novel sheep lncRNA gene models identified per chromosome. Table S8. Number of novel sheep lncRNA gene models identified, by category. Table S10. Alignments of novel sheep lncRNA gene models to goat, cattle and human lncRNA. Table S19. Expression level estimates for 13,047 novel sheep lncRNAs, as transcripts per million (TPM), which were assessed using 71 adult RNA-seq libraries (11 tissues plus one cell type under two conditions, each sample is sequenced for six individuals). Table S21. Reproducibility of sheep lncRNA gene models when merging all combinations of data from six adults (three females and three males), for each individual a common set of RNA-seq libraries (comprising 31 tissues/cell types) was available. Table S22. Number of sheep expression atlas RNA-seq libraries (out of 429 in total) for which a candidate lncRNA gene model cannot be fully reconstructed. Table S23. Genes within each co-expression cluster, after network analysis of the sheep RNA-seq libraries. Table S24. Number of sheep lncRNAs co-expressed with protein-coding genes. Table S25. Distance between lncRNAs and protein-coding genes within the same co-expression cluster, on the same chromosome, for sheep. Table S26. Correlation between the expression profile of sheep lncRNAs and their nearest protein-coding genes, both 5' and 3'. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/MOESM1_of_Cross-species_inference_of_long_non-coding_RN...
 
Title MOESM1 of Temporal stability of the rumen microbiota in beef cattle, and response to diet and supplements 
Description Additional file 1. Table of operational taxonomic units with animal metadata and taxonomic classification. 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/MOESM1_of_Temporal_stability_of_the_rumen_microbiota_in...
 
Title MOESM2 of Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome 
Description Additional file 2. This file contains all supplementary tables relating to the characterisation of goat lncRNAs. Table S2. Candidate goat lncRNAs: a shortlist of novel gene models assessed for coding potential using CPC, CPAT, PLEK, blastp vs. Swiss-Prot, and HMMER vs. Pfam. Table S5. Goat gene models considered non-coding by either CPC, CPAT or PLEK but showing sequence homology to either a known protein (in Swiss-Prot) or protein domain (in Pfam-A). Table S7. Number of novel goat lncRNA gene models identified per chromosome. Table S9. Number of novel goat lncRNA gene models identified, by category. Table S11. Alignments of novel goat lncRNA gene models to sheep, cattle and human lncRNA. Table S20. Expression level estimates for 4392 novel goat lncRNAs, as transcripts per million (TPM), which were assessed using 54 RNA-seq libraries (20 tissues plus one cell type under two different conditions, for each sample four individuals were sequenced). 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/MOESM2_of_Cross-species_inference_of_long_non-coding_RN...
 
Title MOESM2 of Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome 
Description Additional file 2. This file contains all supplementary tables relating to the characterisation of goat lncRNAs. Table S2. Candidate goat lncRNAs: a shortlist of novel gene models assessed for coding potential using CPC, CPAT, PLEK, blastp vs. Swiss-Prot, and HMMER vs. Pfam. Table S5. Goat gene models considered non-coding by either CPC, CPAT or PLEK but showing sequence homology to either a known protein (in Swiss-Prot) or protein domain (in Pfam-A). Table S7. Number of novel goat lncRNA gene models identified per chromosome. Table S9. Number of novel goat lncRNA gene models identified, by category. Table S11. Alignments of novel goat lncRNA gene models to sheep, cattle and human lncRNA. Table S20. Expression level estimates for 4392 novel goat lncRNAs, as transcripts per million (TPM), which were assessed using 54 RNA-seq libraries (20 tissues plus one cell type under two different conditions, for each sample four individuals were sequenced). 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/MOESM2_of_Cross-species_inference_of_long_non-coding_RN...
 
Title MOESM3 of Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome 
Description Additional file 3. This file contains all supplementary tables relating to lncRNA identification via the conservation of synteny. Table S3. lncRNAs inferred in one species by the genomic alignment of a transcript assembled with the RNA-seq libraries from a related spdecies. Table S12. Presence of intergenic lncRNAs both in sheep and cattle, in regions of conserved synteny. Table S13. Presence of intergenic lncRNAs both in sheep and goat, in regions of conserved synteny. Table S14. Presence of intergenic lncRNAs both in cattle and goat, in regions of conserved synteny. Table S15. Presence of intergenic lncRNAs both in sheep and humans, in regions of conserved synteny. Table S16. Presence of intergenic lncRNAs both in goat and humans, in regions of conserved synteny. Table S17. Presence of intergenic lncRNAs both in cattle and humans, in regions of conserved synteny. Table S18. High-confidence lncRNA pairs, those conserved across species both sequentially and positionally. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/MOESM3_of_Cross-species_inference_of_long_non-coding_RN...
 
Title MOESM3 of Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome 
Description Additional file 3. This file contains all supplementary tables relating to lncRNA identification via the conservation of synteny. Table S3. lncRNAs inferred in one species by the genomic alignment of a transcript assembled with the RNA-seq libraries from a related spdecies. Table S12. Presence of intergenic lncRNAs both in sheep and cattle, in regions of conserved synteny. Table S13. Presence of intergenic lncRNAs both in sheep and goat, in regions of conserved synteny. Table S14. Presence of intergenic lncRNAs both in cattle and goat, in regions of conserved synteny. Table S15. Presence of intergenic lncRNAs both in sheep and humans, in regions of conserved synteny. Table S16. Presence of intergenic lncRNAs both in goat and humans, in regions of conserved synteny. Table S17. Presence of intergenic lncRNAs both in cattle and humans, in regions of conserved synteny. Table S18. High-confidence lncRNA pairs, those conserved across species both sequentially and positionally. 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/MOESM3_of_Cross-species_inference_of_long_non-coding_RN...
 
Title Table S1 from Contrasted histories of organelle and nuclear genomes underlying physiological diversification in a grass species 
Description Supplementary table provided as a xls file 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
URL https://rs.figshare.com/articles/dataset/Table_S1_from_Contrasted_histories_of_organelle_and_nuclear...
 
Title The mRNA derived MalH sRNA contributes to alternative carbon source utilization by tuning maltoporin expression in E. coli 
Description Previous high-throughput studies in Gram-negative bacteria identified a large number of 3'UTR fragments that potentially function as sRNAs. Here we extensively characterize the MalH sRNA. We show that MalH is a stable degradation intermediate derived from the 3' end of malG, which is part of the maltose uptake operon transcript malEFG. Unlike the majority of bacterial sRNAs, MalH is transiently expressed during the transition from the exponential to the stationary growth phase, suggesting that it contributes to adaptation to changes in nutrient availability. Over-expression of MalH reduces expression of general outer membrane porins and MicA, a repressor of the high-affinity maltose/maltodextrin transporter LamB. Disrupting MalH production and function significantly reduces lamB accumulation when maltose is the only available carbon source, presumably due to the accumulation of the MicA repressor. We propose that MalH is part of a regulatory network that, during the transition phase, directly or indirectly promotes accumulation of high-affinity maltose transporters in the outer membrane by dampening competing pathways. 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
URL https://tandf.figshare.com/articles/dataset/The_mRNA_derived_MalH_sRNA_contributes_to_alternative_ca...
 
Description Edinburgh Genomics 
Organisation ARK-Genomics
Country United Kingdom 
Sector Charity/Non Profit 
PI Contribution We have formally merged the two genomics facilities in Edinburgh to form Edinburgh genomics. By judicious forward investment in technology and staff we are now the largest academic sequencing facility in the UK after the Sanger Institute.
Collaborator Contribution We joined the infrastructure and staff of the GenePool with that of ARK-Genomics, a BBSRC core funded facility in the Roslin Institute.
Impact Increased capacity; Improved service delivery
Start Year 2013
 
Description Heliconius genome consortium 
Organisation University of Cambridge
Department Department of Zoology
Country United Kingdom 
Sector Academic/University 
PI Contribution Offering genomic sequencing and reseaquencing technologies; close involvement in experimental design.
Collaborator Contribution Joining the consortium has allowed GenePool to develop custom targeted resequencing technologies, and associated bioinformatics skiils, and made us much more visible across this area of science
Impact The Heliconius melpomene genome has been assembled and annotated. A manuscript describing this work is submitted for publication. On the basis of the GenePool's involvement in this consortium we have coordinated a ladybird genome consortium with 6 partner laboratories, and currently have 2 genomes in sequencing.
Start Year 2010
 
Description MRC hub-spokes network 
Organisation Medical Research Council (MRC)
Department MRC Human Genetics Unit
Country United Kingdom 
Sector Academic/University 
PI Contribution Training and support for spokes bioinformaticians at collaborating institutions
Collaborator Contribution Training workshops, collaborative projects
Impact Training workshops, collaborative projects with GenePool
Start Year 2009
 
Description MRC hub-spokes network 
Organisation University of Aberdeen
Department Institute of Medical Sciences
Country United Kingdom 
Sector Academic/University 
PI Contribution Training and support for spokes bioinformaticians at collaborating institutions
Collaborator Contribution Training workshops, collaborative projects
Impact Training workshops, collaborative projects with GenePool
Start Year 2009
 
Description MRC hub-spokes network 
Organisation University of Dundee
Country United Kingdom 
Sector Academic/University 
PI Contribution Training and support for spokes bioinformaticians at collaborating institutions
Collaborator Contribution Training workshops, collaborative projects
Impact Training workshops, collaborative projects with GenePool
Start Year 2009
 
Description MRC hub-spokes network 
Organisation University of Edinburgh
Department The Roslin Institute
Country United Kingdom 
Sector Academic/University 
PI Contribution Training and support for spokes bioinformaticians at collaborating institutions
Collaborator Contribution Training workshops, collaborative projects
Impact Training workshops, collaborative projects with GenePool
Start Year 2009
 
Description MRC hub-spokes network 
Organisation University of Glasgow
Department Polyomics Facility
Country United Kingdom 
Sector Academic/University 
PI Contribution Training and support for spokes bioinformaticians at collaborating institutions
Collaborator Contribution Training workshops, collaborative projects
Impact Training workshops, collaborative projects with GenePool
Start Year 2009
 
Description MRC/BBSRC Genome Centre Network 
Organisation Earlham Institute
Country United Kingdom 
Sector Academic/University 
PI Contribution Organisation of networking meetings and training workshops
Collaborator Contribution Organisation of networking meetings and training workshops
Impact Networking, exchange of best practices, development of collaborative projects and joint proposals, training workshops for facility staff
Start Year 2010
 
Description MRC/BBSRC Genome Centre Network 
Organisation University of Cambridge
Department Eastern Sequence and Informatics Hub (EASIH)
Country United Kingdom 
Sector Academic/University 
PI Contribution Organisation of networking meetings and training workshops
Collaborator Contribution Organisation of networking meetings and training workshops
Impact Networking, exchange of best practices, development of collaborative projects and joint proposals, training workshops for facility staff
Start Year 2010
 
Description MRC/BBSRC Genome Centre Network 
Organisation University of Liverpool
Department Liverpool Centre for Genomic Research
Country United Kingdom 
Sector Academic/University 
PI Contribution Organisation of networking meetings and training workshops
Collaborator Contribution Organisation of networking meetings and training workshops
Impact Networking, exchange of best practices, development of collaborative projects and joint proposals, training workshops for facility staff
Start Year 2010
 
Description MRC/BBSRC Genome Centre Network 
Organisation University of Oxford
Department Oxford Genomics Centre
Country United Kingdom 
Sector Academic/University 
PI Contribution Organisation of networking meetings and training workshops
Collaborator Contribution Organisation of networking meetings and training workshops
Impact Networking, exchange of best practices, development of collaborative projects and joint proposals, training workshops for facility staff
Start Year 2010
 
Description Marine Alliance for Science & Technology for Scotland 
Organisation National Oceanography Centre
Country United Kingdom 
Sector Academic/University 
PI Contribution Experimental support: assistance with understanding technology and experimental design; Delivery of sequencing service; Horizon scanning consultation to SAMS collaborators
Collaborator Contribution The MASTS collaboration is coordinating marine science in Scotland, and has recommended GenePool as its sequencing provider
Impact Several projects running in GenePool; participation in SAMS genomics workshop.
Start Year 2010
 
Description The Caenorhabditis Genomes Project 
Organisation Columbia University
Country United States 
Sector Academic/University 
PI Contribution We are sequencing assembling and annotating the genomes of the remaining unsequenced species in the genus Caenorhabditis.
Collaborator Contribution Marie-Anne Felix and laboratory have paid for some of the sequencing and provided DNA and RNA samples. Peter Sarkies and laboratory have paid for sequencing small RBNA samples. Erich Schwarz and collaborators have provided PacBio data. Asher Cutter has supported the project with dedicated staff time.
Impact Genome sequences of three species have been deposited in public databases and distributed widely.
Start Year 2012
 
Description The Caenorhabditis Genomes Project 
Organisation Imperial College London
Department Imperial College Trust
Country United Kingdom 
Sector Charity/Non Profit 
PI Contribution We are sequencing assembling and annotating the genomes of the remaining unsequenced species in the genus Caenorhabditis.
Collaborator Contribution Marie-Anne Felix and laboratory have paid for some of the sequencing and provided DNA and RNA samples. Peter Sarkies and laboratory have paid for sequencing small RBNA samples. Erich Schwarz and collaborators have provided PacBio data. Asher Cutter has supported the project with dedicated staff time.
Impact Genome sequences of three species have been deposited in public databases and distributed widely.
Start Year 2012
 
Description The Caenorhabditis Genomes Project 
Organisation University of Toronto
Country Canada 
Sector Academic/University 
PI Contribution We are sequencing assembling and annotating the genomes of the remaining unsequenced species in the genus Caenorhabditis.
Collaborator Contribution Marie-Anne Felix and laboratory have paid for some of the sequencing and provided DNA and RNA samples. Peter Sarkies and laboratory have paid for sequencing small RBNA samples. Erich Schwarz and collaborators have provided PacBio data. Asher Cutter has supported the project with dedicated staff time.
Impact Genome sequences of three species have been deposited in public databases and distributed widely.
Start Year 2012
 
Description The Caenorhabditis Genomes Project 
Organisation Ɖcole Normale SupĆ©rieure, Paris
Country France 
Sector Academic/University 
PI Contribution We are sequencing assembling and annotating the genomes of the remaining unsequenced species in the genus Caenorhabditis.
Collaborator Contribution Marie-Anne Felix and laboratory have paid for some of the sequencing and provided DNA and RNA samples. Peter Sarkies and laboratory have paid for sequencing small RBNA samples. Erich Schwarz and collaborators have provided PacBio data. Asher Cutter has supported the project with dedicated staff time.
Impact Genome sequences of three species have been deposited in public databases and distributed widely.
Start Year 2012
 
Title Genomic associations with poxvirus across divergent island populations in Berthelot's pipit 
Description Understanding the mechanisms and genes that enable animal populations to adapt to pathogens is important from an evolutionary, health and conservation perspective. Berthelot's pipit (Anthus berthelotii) experiences extensive and consistent spatial heterogeneity in avian pox infection pressure across its range of island populations, thus providing an excellent system with which to examine how pathogen-mediated selection drives spatial variation in immunogenetic diversity. Here we test for evidence of genetic variation associated with avian pox at both an individual and population-level. At the individual level, we find no evidence that variation in MHC class I and TLR4 (both known to be important in recognising viral infection) was associated with pox infection within two separate populations. However, using genotype-environment association (Bayenv) in conjunction with genome-wide (ddRAD-seq) data, we detected strong associations between population-level avian pox prevalence and allele frequencies of single nucleotide polymorphisms (SNPs) at a number of sites across the genome. These sites were located within genes involved in cellular stress signalling and immune responses, many of which have previously been associated with responses to viral infection in humans and other animals. Consequently, our analyses indicates that pathogen-mediated selection may play a role in shaping genomic variation among relatively recently colonised island bird populations and highlights the utility of genotype-environment associations for identifying candidate genes potentially involved in host-pathogen interactions. 
Type Of Technology Software 
Year Produced 2022 
Open Source License? Yes  
URL https://zenodo.org/record/6514262
 
Description 2018 Edinburgh genomics Training Workshops 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Edinburgh Genomics delivered a rich portfolio of training courses in 2018, ranging from a one day Interoduction to Linux (delivered three times) through to an intensive "spring school" in Bioinformatics for Genomics (one week). The courses were delivered in Edinburgh, and had from 12 to 50 participants. The participants came from across the UK HEI sector, including undergraduates, postgraduates and postdoctoral researchers, and also attracted overseas attendants (mainly from other European countries). The training strand is funded from student fees and from BBSRC and NERC dedicated sources. The training strand employs a full time Training Manager who both administers the scheme and develops and delivers courses.
Year(s) Of Engagement Activity 2018
 
Description Blaxter group presentations and outreach 2015 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Mark Blaxter and research team communication and outreach 2015

Globodera genomics and blobtools software
25/02/2015 JHI Postgraduate Student Competition 2015 James Hutton Institute, Aberdeen, UK A tale of Two Peaks: Analysing Genomic Data from Potato Cyst Nematodes Talk
26/03/2015 JHI Cell and Molecular Sciences (CMS) seminar James Hutton Institute, Invergowrie, Dundee, UK Frustration and happiness : (De)-constructing parasite genomes Talk
16/06/2015 JHI Dundee effector consortium (DEC) meeting 2015 Birnam Arts and Conference Centre, Birnam, UK Variation within the Globodera pallida species complex: preliminary results Talk
03/09/2015 Molecular and Cellular Biology of Helminth Parasites IX Bratsera Hotel, Hydra, Greece Inter- and intra-specific analyses of the effector complement in potato cyst nematodes Poster
18/09/2015 UoE Postgraduate Poster Day University of Edinburgh, Edinburgh, UK Inter- and intra-specific analyses of the effector complement in potato cyst nematodes Poster
26/09/2015 Edinburgh University Doors Open Day University of Edinburgh, Edinburgh, UK Potato Cyst Nematodes (PCN) - Nematode parasites of potatoes Poster
30/11/2015 NextGenBug University of Edinburgh, Edinburgh, UK Blobtools: Blobology 2.0 Talk
01/12/2015 UK pollinator genomics meeting Roslin Institute, Edinburgh, UK Bees and Blobs Talk

LepBase
06/03/2015 EMARES Cambridge, UK The Bicyclus Genome Project Talk
06/03/2015 EMARES Cambridge, UK An introduction to Lepbase Talk
17/06/2015 Arthropod Genomics Manhattan, Kansas, USA Lepbase - A multi genome database for the Lepidoptera Poster
24/07/2015 10th Heliconius Meeting Gamboa, Panama Lepbase - A multi genome database for the Lepidoptera (API demonstration) Workshop
24/07/2015 10th Heliconius Meeting Panama Lepbase - A multi genome database for the Lepidoptera Poster
26/07/2015 10th Heliconius Meeting Panama Lepbase Workshop Talk
04/09/2015 Edinburgh Bioinformatics Edinburgh, UK Lepbase - A multi genome database for the Lepidoptera Talk
26/09/2015 Open Doors Day "Make a butterfly" interactive exhibition
26/09/2015 Edinburgh University Doors Open Day Edinburgh, UK Lepbase Multiple Sequence Alignments game Poster+Game
28/10/2015 NextgenBUG Dundee, UK Lepbase - an Ensembl (and more) for the Lepidoptera Talk

Nematode genomics
24.06.2015 20th International C. elegans Meeting Los Angeles USA A new evolutionary framework for the phylum Nematoda: a case study of HOX cluster evolution Poster
24.06.2015 20th International C. elegans Meeting Los Angeles USA Caenorhabditis Genomes Project Workshop (organiser and chair) Talk
24.06.2015 20th International C. elegans Meeting Los Angeles USA Current status of the CGP in Edinburgh Talk

Meloidogyne genomics
10-14 August 2015 ESEB Lausanne-Switzerland Genomic consequences of hybridization and the loss of meiotic recombination in Root-knot nematodes poster
15-18 December 2015 PopGroup Edinburgh-UK Genomic consequences of hybridization and the loss of meiotic recombination in Root-knot nematodes talk
23 February 2016 NextGenBug Edinburgh-UK Genomics of Root-knot nematodes talk
Year(s) Of Engagement Activity 2015
 
Description Press releases and website 
Form Of Engagement Activity A magazine, newsletter or online publication
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact We have engaged actively with the University press office to promote press coverage of our research outcomes, particularly major publications (which have had coverage in national and international newspapers) and in blogs and other online media. We have also promoted major new initiatives such as additional core funding of the Edinburgh genomics facility.

Increased visibility of Edinburgh Genomics within the community; requests for comment by funders and government on matters pertaining to genomics.
Year(s) Of Engagement Activity 2009,2010,2011,2012,2013,2014,2015,2016