Using phylogenetic analysis of large-scale Next-Gen HIV sequence datasets and computer simulation to assess the impact of high-risk populations on the
Lead Research Organisation:
University of Edinburgh
Department Name: Sch of Molecular. Genetics & Pop Health
Abstract
Project Title: Using phylogenetic analysis of large-scale Next-Gen HIV sequence datasets and computer simulation to assess the impact of high-risk populations on the ART rollout in Uganda.
Background
A huge global expansion of the availability of antiretroviral (ARV) therapy for HIV-positive individuals has occurred in the last 10 years and now over 15 million, out of a total of 37 million, are on treatment1. Effective treatment stops HIV transmission and already there are annually 10% fewer new infections than 15 years ago although there are ~20% more HIV+ people. UNAIDS seeks to reduce HIV transmission by 25% by 2020 by increasing the proportion on treatment to 90% of those known to be infected2. However, it has been assuming that all communities in the "generalised" heterosexual HIV epidemics of sub-Saharan Africa are similar in transmission rate and access to care and this is far from the case.
Uganda has a strong reputation for managing its HIV epidemic well, but it has more than 1.5 million HIV+ individuals. Although in the rural population only 5% are HIV+, around Lake Victoria there are populations associated with the fishing industry, on the lake shore, or on islands, where up to 25% are HIV+. If, by transmission between communities, these populations contribute substantially to the national epidemic, the effectiveness of the national programme for ARV rollout will be threatened. This project will integrate HIV sequence analysis with evolutionary and epidemiologically driven mathematical and simulation models in order to estimate transmission between communities and predict its impact.
To understand how much difference the new data and methods will make the project will exploit an existing computational simulation model of an HIV epidemic that has been developed in the Leigh Brown group. By analysing available data for key features of the population contact structure (network structure and dynamics, as informed by our prior knowledge of HIV epidemiology), the model will be adjusted to match the known features of the populations studied, and will output virus genomes that arise from the type of population that is described. As the route by which they evolved, the "transmission tree" is known in the simulation, this allows testing of how good the methods of analysis are, and identifies key assumptions that could influence decision-making in regards to disease control and therefore require refinement.
Aims
Analysis of the genome data will inform the development of models of population contact. Making use of Prof Kao's expertise in network analysis of dense datasets, the aim will be to identify the influence that contact structure or mixing has on the observed infection rate, and in particular the relationship between individual level patterns of transmission, and the community level observations. This will inform our understanding of the importance of reducing infection rates in these communities for infection rates in the general population. As that may require greater expenditure per person than elsewhere, it will be important to demonstrate the national impact to persuade the agencies responsible to focus most effectively. The results of this work will be highly relevant to everyone involved in the rollout of ARV therapy in sub-Saharan Africa.
Training Outcomes
By the conclusion of the project the student will be expert in computational biology including programming in one or more languages, statistical analysis of sequence data (likelihood-based and Bayesian), epidemiological modeling in complex populations and familiar with issues around delivery of intervention strategies for global health.
Background
A huge global expansion of the availability of antiretroviral (ARV) therapy for HIV-positive individuals has occurred in the last 10 years and now over 15 million, out of a total of 37 million, are on treatment1. Effective treatment stops HIV transmission and already there are annually 10% fewer new infections than 15 years ago although there are ~20% more HIV+ people. UNAIDS seeks to reduce HIV transmission by 25% by 2020 by increasing the proportion on treatment to 90% of those known to be infected2. However, it has been assuming that all communities in the "generalised" heterosexual HIV epidemics of sub-Saharan Africa are similar in transmission rate and access to care and this is far from the case.
Uganda has a strong reputation for managing its HIV epidemic well, but it has more than 1.5 million HIV+ individuals. Although in the rural population only 5% are HIV+, around Lake Victoria there are populations associated with the fishing industry, on the lake shore, or on islands, where up to 25% are HIV+. If, by transmission between communities, these populations contribute substantially to the national epidemic, the effectiveness of the national programme for ARV rollout will be threatened. This project will integrate HIV sequence analysis with evolutionary and epidemiologically driven mathematical and simulation models in order to estimate transmission between communities and predict its impact.
To understand how much difference the new data and methods will make the project will exploit an existing computational simulation model of an HIV epidemic that has been developed in the Leigh Brown group. By analysing available data for key features of the population contact structure (network structure and dynamics, as informed by our prior knowledge of HIV epidemiology), the model will be adjusted to match the known features of the populations studied, and will output virus genomes that arise from the type of population that is described. As the route by which they evolved, the "transmission tree" is known in the simulation, this allows testing of how good the methods of analysis are, and identifies key assumptions that could influence decision-making in regards to disease control and therefore require refinement.
Aims
Analysis of the genome data will inform the development of models of population contact. Making use of Prof Kao's expertise in network analysis of dense datasets, the aim will be to identify the influence that contact structure or mixing has on the observed infection rate, and in particular the relationship between individual level patterns of transmission, and the community level observations. This will inform our understanding of the importance of reducing infection rates in these communities for infection rates in the general population. As that may require greater expenditure per person than elsewhere, it will be important to demonstrate the national impact to persuade the agencies responsible to focus most effectively. The results of this work will be highly relevant to everyone involved in the rollout of ARV therapy in sub-Saharan Africa.
Training Outcomes
By the conclusion of the project the student will be expert in computational biology including programming in one or more languages, statistical analysis of sequence data (likelihood-based and Bayesian), epidemiological modeling in complex populations and familiar with issues around delivery of intervention strategies for global health.
People |
ORCID iD |
Andrew Leigh Brown (Primary Supervisor) | |
Heather Grant (Student) |
Publications


Grant HE
(2020)
Pervasive and non-random recombination in near full-length HIV genomes from Uganda.
in Virus evolution

Olabode AS
(2022)
Revisiting the recombinant history of HIV-1 group M with dynamic network community detection.
in Proceedings of the National Academy of Sciences of the United States of America

Pujol-Hodge E
(2022)
Detection of HIV-1 Transmission Clusters from Dried Blood Spots within a Universal Test-and-Treat Trial in East Africa.
in Viruses
Studentship Projects
Project Reference | Relationship | Related To | Start | End | Student Name |
---|---|---|---|---|---|
MR/N013166/1 | 30/09/2016 | 29/09/2025 | |||
1938328 | Studentship | MR/N013166/1 | 31/08/2017 | 30/08/2021 | Heather Grant |
Description | UK-Canada Globalink Doctoral Exchange Scheme |
Amount | £10,000 (GBP) |
Funding ID | NE/T014482/1 |
Organisation | Natural Environment Research Council |
Sector | Public |
Country | United Kingdom |
Start | 03/2020 |
End | 04/2021 |
Title | HIV Genome Dataset from 1986 |
Description | Full-length HIV Genomes from 1986 Genbank Numbers OP039379:OP039487 |
Type Of Material | Database/Collection of data |
Year Produced | 2022 |
Provided To Others? | Yes |
Impact | The largest historical HIV dataset from 1980's Africa |
URL | https://www.ncbi.nlm.nih.gov/nuccore/OP039379 |
Description | Historical HIV genomes from Uganda |
Organisation | MRC/UVRI Uganda Research Unit on AIDS |
Country | Uganda |
Sector | Public |
PI Contribution | This collaboration involves further efforts to obtain full-length HIV genomes from samples stored in -80 freezers since 1986. These precious samples from Uganda will give phylodynamic insights into the history of the Ugandan HIV epidemic. They belong to the MRC/UVRI unit in Uganda and are currently stored at UCL. |
Collaborator Contribution | The UVRI will assist with meta-data, analysis, and interpretation, whilst UCL are undertaking the sequencing and will assist with the genome assembly. |
Impact | I will present this work as an oral presentation at HIV Dynamics 2020. |
Start Year | 2019 |
Description | Historical HIV genomes from Uganda |
Organisation | University College London |
Department | Division of Infection and Immunity |
Country | United Kingdom |
Sector | Academic/University |
PI Contribution | This collaboration involves further efforts to obtain full-length HIV genomes from samples stored in -80 freezers since 1986. These precious samples from Uganda will give phylodynamic insights into the history of the Ugandan HIV epidemic. They belong to the MRC/UVRI unit in Uganda and are currently stored at UCL. |
Collaborator Contribution | The UVRI will assist with meta-data, analysis, and interpretation, whilst UCL are undertaking the sequencing and will assist with the genome assembly. |
Impact | I will present this work as an oral presentation at HIV Dynamics 2020. |
Start Year | 2019 |
Description | School visits with the "Bioinformatics for Schools" project [Cumbernauld Academy 17th December] and [12th Dec - St Margaret's Academy, Livingston] |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Schools |
Results and Impact | I volunteered with the bioinformatics for schools project, organised by Dr Barker & Dr Bain of Edinburgh University. Sixth form students are given an introduction to bioinformatics, including a mystery DNA sequence to identify using NCBI and a small Raspberry Pi computer. |
Year(s) Of Engagement Activity | 2019 |
URL | https://4273pi.org/ |