Using phylogenetic analysis of large-scale Next-Gen HIV sequence datasets and computer simulation to assess the impact of high-risk populations on the

Lead Research Organisation: University of Edinburgh
Department Name: Sch of Molecular. Genetics & Pop Health

Abstract

Project Title: Using phylogenetic analysis of large-scale Next-Gen HIV sequence datasets and computer simulation to assess the impact of high-risk populations on the ART rollout in Uganda.

Background
A huge global expansion of the availability of antiretroviral (ARV) therapy for HIV-positive individuals has occurred in the last 10 years and now over 15 million, out of a total of 37 million, are on treatment1. Effective treatment stops HIV transmission and already there are annually 10% fewer new infections than 15 years ago although there are ~20% more HIV+ people. UNAIDS seeks to reduce HIV transmission by 25% by 2020 by increasing the proportion on treatment to 90% of those known to be infected2. However, it has been assuming that all communities in the "generalised" heterosexual HIV epidemics of sub-Saharan Africa are similar in transmission rate and access to care and this is far from the case.
Uganda has a strong reputation for managing its HIV epidemic well, but it has more than 1.5 million HIV+ individuals. Although in the rural population only 5% are HIV+, around Lake Victoria there are populations associated with the fishing industry, on the lake shore, or on islands, where up to 25% are HIV+. If, by transmission between communities, these populations contribute substantially to the national epidemic, the effectiveness of the national programme for ARV rollout will be threatened. This project will integrate HIV sequence analysis with evolutionary and epidemiologically driven mathematical and simulation models in order to estimate transmission between communities and predict its impact.

To understand how much difference the new data and methods will make the project will exploit an existing computational simulation model of an HIV epidemic that has been developed in the Leigh Brown group. By analysing available data for key features of the population contact structure (network structure and dynamics, as informed by our prior knowledge of HIV epidemiology), the model will be adjusted to match the known features of the populations studied, and will output virus genomes that arise from the type of population that is described. As the route by which they evolved, the "transmission tree" is known in the simulation, this allows testing of how good the methods of analysis are, and identifies key assumptions that could influence decision-making in regards to disease control and therefore require refinement.

Aims
Analysis of the genome data will inform the development of models of population contact. Making use of Prof Kao's expertise in network analysis of dense datasets, the aim will be to identify the influence that contact structure or mixing has on the observed infection rate, and in particular the relationship between individual level patterns of transmission, and the community level observations. This will inform our understanding of the importance of reducing infection rates in these communities for infection rates in the general population. As that may require greater expenditure per person than elsewhere, it will be important to demonstrate the national impact to persuade the agencies responsible to focus most effectively. The results of this work will be highly relevant to everyone involved in the rollout of ARV therapy in sub-Saharan Africa.

Training Outcomes
By the conclusion of the project the student will be expert in computational biology including programming in one or more languages, statistical analysis of sequence data (likelihood-based and Bayesian), epidemiological modeling in complex populations and familiar with issues around delivery of intervention strategies for global health.

Studentship Projects

Project Reference Relationship Related To Start End Student Name
MR/N013166/1 01/10/2016 30/09/2025
1938328 Studentship MR/N013166/1 01/09/2017 31/08/2021 Heather Grant
 
Description UK-Canada Globalink Doctoral Exchange Scheme
Amount £10,000 (GBP)
Funding ID NE/T014482/1 
Organisation Natural Environment Research Council 
Sector Public
Country United Kingdom
Start 04/2020 
End 04/2021
 
Title HIV Genome Dataset from 1986 
Description Full-length HIV Genomes from 1986 Genbank Numbers OP039379:OP039487 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact The largest historical HIV dataset from 1980's Africa 
URL https://www.ncbi.nlm.nih.gov/nuccore/OP039379
 
Description Historical HIV genomes from Uganda 
Organisation MRC/UVRI Uganda Research Unit on AIDS
Country Uganda 
Sector Public 
PI Contribution This collaboration involves further efforts to obtain full-length HIV genomes from samples stored in -80 freezers since 1986. These precious samples from Uganda will give phylodynamic insights into the history of the Ugandan HIV epidemic. They belong to the MRC/UVRI unit in Uganda and are currently stored at UCL.
Collaborator Contribution The UVRI will assist with meta-data, analysis, and interpretation, whilst UCL are undertaking the sequencing and will assist with the genome assembly.
Impact I will present this work as an oral presentation at HIV Dynamics 2020.
Start Year 2019
 
Description Historical HIV genomes from Uganda 
Organisation University College London
Department Division of Infection and Immunity
Country United Kingdom 
Sector Academic/University 
PI Contribution This collaboration involves further efforts to obtain full-length HIV genomes from samples stored in -80 freezers since 1986. These precious samples from Uganda will give phylodynamic insights into the history of the Ugandan HIV epidemic. They belong to the MRC/UVRI unit in Uganda and are currently stored at UCL.
Collaborator Contribution The UVRI will assist with meta-data, analysis, and interpretation, whilst UCL are undertaking the sequencing and will assist with the genome assembly.
Impact I will present this work as an oral presentation at HIV Dynamics 2020.
Start Year 2019
 
Description School visits with the "Bioinformatics for Schools" project [Cumbernauld Academy 17th December] and [12th Dec - St Margaret's Academy, Livingston] 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Schools
Results and Impact I volunteered with the bioinformatics for schools project, organised by Dr Barker & Dr Bain of Edinburgh University. Sixth form students are given an introduction to bioinformatics, including a mystery DNA sequence to identify using NCBI and a small Raspberry Pi computer.
Year(s) Of Engagement Activity 2019
URL https://4273pi.org/