Development of an Ontology for Drug Screening and Design

Lead Research Organisation: Aberystwyth University
Department Name: Computer Science

Abstract

As science progresses it is generating ever larger amounts of data, and in some fields this increase in data has become a deluge. The result is that science increasingly depends on computers to store, integrate, and analyse data. However, the full power of computers can only be efficiently exploited when the data they work with is formalised, i.e. put into a form where the meaning of the data is made explicit to the computer. The first step in formalising knowledge is to define an ontology, i.e. to describe what exists. Ontologies are increasingly used in biology to describe experiments, as this enables the results of the experiment to be more easily used by other scientists, and for the experiment to be more easily repeated - to ensure that the published results are correct. We have developed EXPO, the first general ontology of scientific experiments. This collects together the most basic concepts in experiments, and removes the need to duplicate them in ontologies for specialized sciences. Drug design experiments are arguably the most important type of experiments executed in the UK, thanks to their potential to find treatments and cures for diseases: which both helps the NHS, and makes money for some of the UK's largest companies. We propose to develop an ontology of drug design experiments. This will help standardize and formalise these experiments, and will make their results more easily exchanged and repeated. The ontology will include information on the chemicals themselves, the assays used to test new chemicals, the models, the equipment used, etc. A major technical difficulty in forming ontologies is to extract the necessary information out of expert humans, as they find it hard to articulate their experise. One radical way around this problem that has been developed in Aberystwyth is to use Robot Scientists. These are Artificial Intelligence systems that are capable of fully automating simple parts of science / cycles of hypothesis formation, experiment planning, experiment execution, and results analysis. In a Robot Scientist all aspects of a scientific experiment are explicit. In Aberystwyth we are developing a Robot Scientist designed for drug design, and we will use this to help design the drug design ontology. We will also consult widely with the pharmaceutical industry to ensure that the ontology captures the key concepts in drug design.

Technical Summary

Scientific knowledge should, ideally, be expressed in a formal logical language. Formal languages promote semantic clarity, which in turn supports the free exchange of scientific knowledge and simplifies scientific reasoning. The first step in formalising knowledge is to define an explicit ontology, i.e. to describe what exists. The use of ontologies is becoming increasingly important in scientific research, and biology is leading this development. We have developed the first common ontology of scientific experiments, EXPO, which formalises generic knowledge about scientific experimental design, methodology, and results representation. EXPO abstracts out the general features of experiments, this removes the need for these to be re-invented each time a specific domain ontology is created, and allows cross-disciplinary standards. We propose to build the top levels of an open source Ontology of Drug Screening and Design Experiments (EXPO-ODSDE). Drug Screening and Design Experiments are arguably the most important type of experiments executed in the UK, both in terms of social and economic impact. However, little ontological work has been done on these experiments. EXPO-ODSDE will link to EXPO and will formalise the most important entities and relations in drug screening and design. EXPO-ODSDE will help enable the scientific knowledge generated in drug design experiments to be more explicit, to help detect errors, to facilitate the sharing and reuse of common knowledge, and to promote the interchange and reliability of experimental methods and conclusions. A major technical difficulty in forming ontologies is the elucidation of domain expertise. One way to alleviate this problem is to use Robot Scientists. We are currently completing the plans for a Robot Scientist, Eve, designed to automate drug screening and design experiments.

Publications

10 25 50
publication icon
Qi D (2010) An ontology for description of drug discovery investigations. in Journal of integrative bioinformatics

publication icon
Olier I (2021) Transformational machine learning: Learning how to learn from many related scientific problems. in Proceedings of the National Academy of Sciences of the United States of America