Integrated gene and brain mapping of language and reading abilities

Lead Research Organisation: University of Edinburgh

Department Name: Sch of Philosophy Psychology & Language

Abstract

The ability to communicate using language is unique to humans. Language is central to our social world, and associated reading and writing skills are central to education and daily life. People vary in how well they comprehend and use language, and in their abilities to read and write, our research aims to better understand the biology behind these individual differences in language and reading abilities in the general population. Most genetic research in this area has focussed on children who have difficulties acquiring language and reading skills, but these sample sizes are too small to find the common genes of very small individual effect that are likely to influence normal variation in these traits. Our research is novel because it collects language and reading measures in adults, and can therefore rapidly attain sample sizes large enough to make genetic discoveries. The sample that we focus on is special for three main reasons. 1) They already have genome-wide genotyping data available, so we can run our gene discovery analysis (genome-wide association; GWA) as soon as language and reading skills have been measured in this sample. 2) A subsample have recently had their brain scanned using magnetic resonance imaging (MRI), so we can investigate how their language and reading skills relate to structural features and networks of the brain. Importantly, some of this sample have historical measures of reading skill in childhood (7-11 years), so that we can test whether their childhood reading scores are related to the same brain structures and networks measured in adulthood. 3) The sample have genome-wide methylation data available. Methylation is a chemical mark that can attach to DNA and affect gene expression and modify gene function. This type of epigenetic (non-genetic influence on gene expression) marker has not previously been tested for its association with language and reading skills, our study will provide the first investigation of this. Our GWA results will be combined with those from international GWA cohorts to produce the largest GWA of language and reading abilities. We can further boost power of gene discovery by using brain MRI markers in a multivariate GWA meta-analysis with language/reading measures. Finally, we will perform an integrated analysis of gene expression, methylation, and genetic data to find further novel genes associated with language and reading abilities, and to better understand the role of variants that lie outside genes. Our novel approach of combining genetic, epigenetic, and brain imaging research streams will provide a comprehensive framework for understanding the biology of language and reading abilities, enabling the important link from genes to brain to behaviour.

Technical Summary

Genome-wide association (GWA) has revolutionised the discovery of genes and biological pathways influencing complex traits, but not in the area of language and reading abilities where small samples of children and adolescents limit the sample size and resulting power required for successful GWA. The proposed research aims to accelerate the discovery of genes and biological pathways influencing language and reading traits by three main methods. Firstly, by using a novel sampling strategy focussed on adults to achieve a large GWA sample (N ~ 10,000), which will then be meta-analysed with GWA results from international cohorts (Total N~ 36,400). Secondly, by identifying brain imaging structural and connectivity endophenotypes of language and reading abilities in a very large single magnetic resonance imaging (MRI) study (N ~ 1,200; a subsample of the GWA sample). These results will inform a multivariate GWA meta-analysis of language/reading and MRI traits to further increase statistical power for gene discovery. Thirdly, by investigating associations between genome-wide methylation and language and reading abilities to establish epigenetic associations. This will be the first epigenome-wide association study (EWAS) of these traits in a sample (N ~ 10,000) that exceeds current EWAS of other cognitive abilities. Importantly, the data generated by these three main research streams will be integrated with publicly available methylation and expression QTL data to find further novel genetic variants influencing language and reading abilities, and to better characterise the role of newly discovered GWA variants, especially those in noncoding regions. This multi-layered approach of genetics, neuroimaging, and epigenetics will transform our understanding of the biological pathways connecting genes and brain to language and reading behaviour.

Planned Impact

This project targets the biological origins of language and reading abilities, essential cognitive skills known to influence educational and occupational success, and linked to health. The first group of potential beneficiaries is therefore people with low reading ability. Increased understanding of the biological mechanisms underpinning reading and language abilities has the potential to improve the quality of life of those who struggle to develop these skills. Such people tend to be overrepresented in the prison system and have more limited occupational opportunities.
These individual and social outcomes also lead to economic impacts on national wealth. In European countries, the economic cost of illiteracy is roughly 2% of gross domestic product, for example, in the UK roughly £81 billion is lost to the economy each year (World literacy Foundation, 2015 economic report).
One specific example of the translational impact of this research is the potential to improve the prediction of those people who are genetically predisposed to language and reading difficulties. Reading interventions are most successful if they are implemented early, meaning that such diagnostic genetic tests could even be performed prior to reading instruction, maximising the success of remediation. The development of polygenic prediction models based on our meta-analysis genome-wide association (GWA) findings could commence on completion of our project and is a downstream project with relevance to education that we are very interested in pursuing. Our summary GWA results will also be made publicly available so that other research groups with such interests can construct polygenic scores in their own data. Polygenic prediction models could be improved by incorporating environmental predictors like quality of early social interaction and phonics instruction (for reading). This would enable optimal environments (including intervention) for language/reading acquisition to be provided earlier in a child's development. There might also be applications for improving the acquisition of these skills in the general population. Once we identify neuroimaging correlates of normal language and reading abilities, this will provide a knowledge base for downstream research to investigate whether particular learning strategies more efficiently build up/consolidate the brain networks involved in reading and writing. If such learning strategies were implemented within education, this would produce a more literate society, with associated benefits to employment and health, and lowered risk of delinquency. Thus improving a person's individual quality of life and the economic and social fabric of society as a whole.
The above outlines some of the longer term potential impacts, but there is also scope for people to benefit from the research during the life-cycle of the project itself. For example, health professionals who identify reading/language problems and educators who teach children how to read may benefit from the project's findings. A better understanding of genetics and reading more generally may enable these groups to discuss reading ability with parents more confidently. From previous engagement activities, the PI knows this is an area which is often misunderstood, with people sometimes being wary of what they perceive as a deterministic approach to literacy. This project offers the opportunity to open up a dialogue with educators and the public about how genetics data could be used to improve educational and social outcomes. More detail of how we will engage with these groups is given in the Pathways to Impact section.

Funded Value:

£806,759

Funded Period:

Feb 20 - Jan 25

Funder:

BBSRC

Project Status:

Active

Project Category:

Research Grant

Project Reference:

BB/T000813/1

Principal Investigator:

Michelle Luciano

Research Subject:

Genetics & development (25%)

Omic sciences & technologies (25%)

Psychology (50%)

Research Topic:

Biological Psychology (25%)

Cognitive Psychology (25%)

Epigenetics (25%)

Genomics (25%)

Organisations

People	ORCID iD
Michelle Luciano (Principal Investigator)	http://orcid.org/0000-0002-7306-3008
Heather Whalley (Co-Investigator)
Riccardo Marioni (Co-Investigator)
Timothy Bates (Co-Investigator)

Publications

Author Name Title

Publication Date Published

10 25 50

Bridges EC (2023) Longitudinal Reading Measures and Genome Imputation in the National Child Development Study: Prospects for Future Reading Research. in Twin research and human genetics : the official journal of the International Society for Twin Studies

Policy Influence
Further Funding
Research Databases and Models
Collaboration


Description	Specific Learning Difficulties (SpLD) Assessment Standards Committee (SASC) Consultation Paper
Geographic Reach	National
Policy Influence Type	Participation in a guidance/advisory committee
URL	https://sasc.org.uk/NewsItem.aspx?id=108


Description	College of Arts, Humanities, and Social Sciences KE Impact Grant
Amount	£3,470 (GBP)
Organisation	University of Edinburgh
Sector	Academic/University
Country	United Kingdom
Start	01/2023
End	07/2023


Description	Specific learning disorders in Scotland
Amount	£10,000 (GBP)
Organisation	Royal Society of Edinburgh (RSE)
Sector	Charity/Non Profit
Country	United Kingdom
Start	01/2022
End	01/2023


Title	Generation Scotland Reading and Language Data
Description	We are in the process of cleaning the new phenotypes in the Generation Scotland: Family Health Study cohort which will then be made available for use by other researchers on application. Data access will be available on the completion of our BBSRC grant. Our testing was completed on the 22/02/23 with a final sample of ~1500 participants.
Type Of Material	Database/Collection of data
Year Produced	2023
Provided To Others?	No
Impact	The data up until last year were used by 2 MSc students for their dissertations, and the postdoc on the project is currently cleaning the final dataset.


Description	Genetics of Stuttering Consortium
Organisation	The Walter and Eliza Hall Institute of Medical Research (WEHI)
Country	Australia
Sector	Academic/University
PI Contribution	My research team has run a genome-wide association analysis of stuttering in the 1958 cohort data for inclusion in a meta-analysis led by the Genetics of Stuttering Consortium.
Collaborator Contribution	Our collaborators provided a very detailed analysis pipeline that has been helpful to inform our own decisions about analytic strategies on the BBSRC reading and language grant.
Impact	A manuscript is in preparation, we have yet to receive a draft. The collaboration is multi-disciplinary in that it involves geneticists and speech/language clinicians/researchers.
Start Year	2020


Description	Natural language processing
Organisation	University of Tromso
Country	Norway
Sector	Academic/University
PI Contribution	I am contributing my expertise in working with Generation Scotland and my genetic analysis skills to this collaboration.
Collaborator Contribution	My collaborators have expertise in natural language processing analysis and have funding to collect speech recordings from Generation Scotland participants.
Impact	We are currently preparing an NHS ethics application for the study. The collaboration is multi-disciplinary in that my collaborators have expertise in machine language learning and natural speech processing whereas my expertise is in genetic analysis.
Start Year	2022

Abstract

Technical Summary

Planned Impact

Organisations

People

ORCID iD

Publications