A systems approach to the classification of genes impacting the cardiovascular phenome

Lead Research Organisation: University of Bristol
Department Name: Social Medicine

Abstract

Recent research has been very successful in identifying genetic factors involved in common diseases like heart disease, obesity and diabetes. The majority of findings are based on the association of a gene with a single disease or characteristic, which works well for genes that are very variable between people. Our proposal is to identify some of the genes that are less variable between people, by using multiple characteristics to identify groups of genes that are involved in particular diseases. By looking for patterns in several characteristics at the same time we aim to identify disease fingerprints in known disease genes, and use those fingerprints to find new disease genes. The newly identified genes will include potential candidates for drug development and diagnostic tests.

Technical Summary

Genome-wide association studies have transformed the world of genetic association studies, robustly identifying hundreds of loci involved in a wide range of common, complex diseases and traits. This approach tests each genetic variant against a single trait or disease. However, the typically small effect size (per genetic variant) on traits means that the importance of a particular gene in disease may be underestimated or overlooked due to an absence of genetic variants of major effect on the function of that gene (GWAS results only reflect function that is altered by genetic variation). We propose an alternative approach that uses the pattern of effect across several relevant phenotypes (a phenotypic profile or fingerprint ), rather than the magnitude of effect for a single phenotype, to classify which genes in the genome are functionally involved in a particular trait. By training a supervised learning classifier with phenotypic effect vectors from genes of known relevance we aim to be able to classify all other genes, thus identifying new causal pathways and potential drug targets. This systems approach of utilising multiple related phenotypes (eg the coagulation cascade) will provide new insights into genetic pathways and interactions not accessible within single SNP/single trait analyses. The expected outcomes will be novel, generalisable approaches to the identification of disease genes and classification of new genes involved in cardiovascular disease risk.

Publications

10 25 50