Drosophila ORFeome. In vitro and in vivo resource for the research community.

Lead Research Organisation: University of Dundee
Department Name: School of Life Sciences

Abstract

The fruitfly (Drosophila melanogaster) is an important model organism for the studies of normal development and different disease states. It has a relatively small but well annotated genome encoding for approximately 15 000 genes, of which 13500 are protein coding genes. Drosophila has been a successful model organism partly because the great majority (close to 80%) of human disease genes have homologous genes in fruitfly making it a very good model to study various diseases. The rapid life cycle and powerful methods to analyse gene function makes fruitfly an attractive system to study gene function even in a large scale.
Understanding the function of each and every gene and protein in an organism is the ultimate goal in biology, and this is likely to be significantly easier to achieve in Drosophila than in human. Currently, depending on the organism, up to half of the genes identified through genome sequencing studies are of unknown function and it is a major challenge in the future to understand the function and interactions of all these genes.
Another challenge in biology is to understand how to exploit the gene information to improve human health. There has been intensive research focus on human disease genes, which when mutated lead to disease. However, a disease mutation does not always lead to disease. Furthermore, very often a drug targeting such a disease gene fails to cure the disease due to unexpected effects. Thus we need to learn more about the other genes that are regulating these diseased genes and deciding factors if a disease gene mutation will cause disease. Some of these genes may not have been identified and that is why we need a new tool to identify such disease regulating genes.
Genetic research has traditionally used two approaches to study gene function which are complementary in nature. These are decreasing the amount of an expressed gene by mutations or gene silencing or by increasing the expression of the gene with additional copies of the gene. Whereas decreasing the amount of gene expression in a large scale has been relatively easy by generation and utilization of mutant collections or gene silencing, increasing the gene expression has been hampered due to lack of suitable gene collections.
We aim to isolate the majority of the Drosophila protein coding genes to form such a large-scale collection of genes, which can be expressed in both cultured cells and in live fruitflies to study the function of these genes in different biological processes such as cell growth and animal development. Such genome-scale collections have been or are being made for other species, including the collections of worm (C. elegans), mouse and human genes, but no collections are available for screens in live animals.
Our aim is that the Drosophila gene collection becomes a resource for the whole scientific community. Having a flexible and thoroughly characterized gene collection covering most of the genome would save significant amount of time and resources and enable researchers to enhance their scientific output. This resource would also be important for the identification of new genes regulating disease progression or any other biological processes.
Following the tradition in Drosophila research where openness and free distribution of research reagents and clones is promoted, this resource of genes, as plasmids and fly strains, will be made available for all researchers.

Technical Summary

We aim to generate a fully sequenced Drosophila melanogaster ORFeome collection in a plasmid format and as transgenic flies expressing these genes using site-specific phiC31 integration. Although ORF collections have been done for other species, there are no genome-scale metazoan in vivo collections. The collection would be a community resource to complement the large-scale RNA interference (RNAi) approaches by us and others in the recent years. Our laboratory is interested in using this resource for studying growth in cell culture assays and in vivo in the future.
The genome scale ORF collection would provide tools to systematically study the functions of individual genes in vitro, in cultured cells and in selected tissues using the GAL4/UAS system. In comparison to RNA interference libraries, an ORF collection is more versatile. It can also be used to study localization of the proteins in cells and tissues or to identify binding partners for the proteins using yeast two hybrid and tandem affinity purifications. Importantly, the ORF expression approach allows also identification of redundant functions which is not possible with RNAi and our preliminary in vivo analyses identify phenotypes for 15% of the genes which show no phenotype by RNAi.
To prepare the ORF collection to the highest standards, we are fully sequencing the original template cDNA using RNAseq and use that as a reference sequence to compare the sequences of the ORF clones. This allows us to distinct between true biological sequence variation and PCR generated mutations. This type of careful analysis has not been made for any other collection so far. Finally, we will set up a database with clone, gene and sequence information in a database format for easy access, analysis and distribution to researchers worldwide.

Planned Impact

This work is expected to directly and indirectly benefit research and society at multiple levels, not only in UK but also worldwide. Most if not all Drosophila researchers would be interested in being able to use a characterized plasmid collection and ready-made transgenic flies for their own research as this would save their time and funding. These effects would be expected to be immediate and sustained.
This collection is also expected to immediately boost new approaches to study large-scale biology and the datasets generated by using this resource would have a wide impact on the systems biology community in the next years to come and would be expected to last until "next generation" approaches are identified and realized. Within a 5-10 year scope, any findings relevant to human disease from the studies using the resource could boost translational research focusing on the development of new diagnostic or therapeutic means to improve human health. One potential application would be to study which genes decrease the fitness of the fly and extrapolate these findings to find new approached to decrease the spread of diseases like malaria in the developing countries.
It is not unconceivable that some people would utilize the resource for commercial activities. This could happen very rapidly in terms of commercial "research and development" type of activities. For example, the large-scale yeast two-hybrid Drosophila protein interaction map was done in collaboration with CuraGen Corporation and academic groups in US with the aim of identifying novel drug targets. However, it is difficult to predict what these activities might be, but they are likely to concentrate around the health sector.
Another immediate benefit of the project is the gain of knowledge and transferable skills obtained by the post-doctoral research assistant. In addition to immediate research skills learner, he/she would be able to learn project management and dissemination of results to research community and wider public. These skills are likely to be desired not only in research sector, but in any other future employment. Any other people working later on with the collection would learn modern research approaches to enhance the output of their laboratories and host institutions.
Having a standardized gene set which is widely used by the research community would also benefit BBSRC by strengthening the BBSCR brand as a supporter of internationally recognized key systems biology resources and as a supporter of international collaborations.
Finally, by having an impact on multiple levels of basic and translational research, this project would be expected to improve human health and well-being. These effects would be expected to be most indirect and take the longest amount of time to realize, the time scale in this would be likely 10-30 years before seeing any significant impact. However, given the track record of Drosophila genetic research on human health, these are very likely to occur.

Publications

10 25 50
publication icon
Bischof J (2014) Generation of a transgenic ORFeome library in Drosophila. in Nature protocols

 
Description We have cloned more than 5000 fruitfly genes into plasmid vectors. These plasmids are being used to generate fruitfly strains expressing these genes in controllable fashion using fly genetics. This collection forms a standardised resource for the Drosophila community for their research into various basic science and biomedical purposes.
Exploitation Route The Drosophila community is widely adopting the collection for their research as demonstrated by the increasing number of orders for the fly strains. Based on literature citations, genes from the collection have been studied in the context of developmental biology, evolution, cancer and neurology, demonstrating that this work has wide impact.
Sectors Healthcare,Pharmaceuticals and Medical Biotechnology,Other

 
Title Drosophila ORFeome collection 
Description The cloned open reading frames are being integrated into transgenic Drosophila strains. 
Type Of Material Model of mechanisms or symptoms - non-mammalian in vivo 
Year Produced 2013 
Provided To Others? Yes  
Impact The paper describing our methodology was highlighted at the Nature Protocols journal home page and the view/download statistics indicated that it is within the top 10 percent of papers in the journal indicating substantial impact of the project. Although the project is still incomplete as only part of the cloned Drosophila ORFs have been prepared as transgenic flies, the strains have been widely requested and used around the world. Our collaborators in Zurich who take care of the distribution of the collections can report that more than 12 000 vials of these ORF expressing strains have been sent out to 320 individual laboratories in 32 countries. The information related to these transgenic flies is also available from FlyBase, the main information site for Drosophila researchers as well as a dedicated website flyorf.ch, which can be used to order the strains with nominal cost. 
URL http://www.flyorf.ch/
 
Title Zurich ORFeome Project 
Description The publicly available website flyorf.ch describes the ORF sequences and the Drosophila strains expressing these and can be used to order any of these. 
Type Of Material Database/Collection of data 
Year Produced 2013 
Provided To Others? Yes  
Impact The paper describing our methodology was highlighted at the journal (Nature Protocols) home page and the view/download statistics indicated that it is within the top 10 percent of papers in the journal indicating visible impact of the project. Although the project is still incomplete as only part of the cloned Drosophila ORFs have been prepared as transgenic flies, the products have been widely requested and used around the world. Our collaborators in Zurich who take care of the distribution of the collections can report that more than 12 000 vials of these ORF expressing strains have been sent out to 320 individual laboratories in 32 countries. The information related to these transgenic flies is also available from FlyBase, the main information site for Drosophila researchers as well as a dedicated website flyorf.ch, which can be used to order the strains with nominal cost. 
URL http://flyorf.ch
 
Description Bellen 
Organisation Baylor College of Medicine
Department Department of Molecular and Human Genetics
Country United States 
Sector Academic/University 
PI Contribution In August 2015, my laboratory was specifically requested if we could provide a selected set of ORFs for a leading drosophila genetics laboratory lead by prof. Hugo Bellen.
Collaborator Contribution The collaborator lab has not yet received the strains due to the time constrains required to produce the ORFs and prepare them as fly strains. However, they will analyse the role of these genes related to mitochondrial function.
Impact na
Start Year 2015
 
Description TF set 
Organisation École normale supérieure de Lyon (ENS Lyon)
Country France 
Sector Academic/University 
PI Contribution We have provided a collection of Drosophila transcription factors cloned as fluorescent protein fusions for a team investigating transcription factors and their interactions with each other using bimolecular fluorescence complementation (BiFC) technology.
Collaborator Contribution The group has made preliminary tests and is trying to obtain further funding to perform large scale analyses.
Impact no outcomes yet
Start Year 2015
 
Description Newspaper article 
Form Of Engagement Activity A magazine, newsletter or online publication
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Neue Zürcher Zeitung, a leading newspaper in the german speaking world made an article about the Drosophila ORFeome collection. This appeared in 3rd of March 2015 both in print and online. The circulation of this paper is ~130 000 copies, potentially reaching a wide audience of general public with interest in science and medicine.
Year(s) Of Engagement Activity 2015
URL http://www.nzz.ch/nzzas/nzz-am-sonntag/bibliothek-der-fliegen-1.18492821
 
Description School visit (Grove Academy) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact In this school outreach (4 times in 2013 and 2014), typically 100 pupils each time visit the demonstration stall of microscopic examination of fly phenotypes and what flies can tell us about animal development and human disease. There are other similar stalls related to other LifeScience activities on the same day.

The recurrent invitations indicate that this activity sparks interest from pupils and teachers alike.
Year(s) Of Engagement Activity 2013,2014