📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Geographic genetic profiling of human Plasmodium malaria

Lead Research Organisation: London School of Hygiene and Tropical Medicine
Department Name: Infectious and Tropical Diseases

Abstract

Malaria caused by Plasmodium falciparum kills about 600,000 people per year, and increased population mobility through international air travel carries further risks of re-introducing parasites to elimination areas and dispersing drug resistant parasites to new regions. A simple genetic marker that quickly and accurately identifies the geographic origin of infections would be a valuable tool for locating the source of outbreaks, and spotting the spread of drug resistant parasites from Asia into Africa. Genetic markers have proved extremely valuable in tracking and eradicating diseases, such as Polio. However, the previous candidates for malaria genetic barcodes have relied on identifying DNA markers found in the parasite nucleus, which shows too much genetic variation between individual parasites to be used accurately. Now, DNA sequences found outside the nucleus in organelles called the mitochondria and the apicoplast have been analysed. These are only inherited through maternal lines and therefore much more stable over generations than nuclear DNA sequences. The research outlined in this methodology proposal will create computational tools which will help to exploit use of mitochondria and the apicoplast sequences to create reliable genetic barcodes for tracking the geographical movement of malaria in an operational context.

Human malaria can be caused by one of 6 different Plasmodium species. We will develop genetic barcodes based on mitochondria and apicoplast sequences for each of the 6 species. We will develop new analytical approaches which can discriminate the different species even in mixed infections. We will also refine the exisiting barcoding methodology for discrimninating between infections originating from geographically distinct populations of the same species. Most crucially we will develop analytical software which can infer barcodes from complex mixed infections which are commonly found in malaria patients in many parts of the world.

We will create a publically available online resource to facilitate the widespread use of barcoding. It will be of practical use to malaria control agencies and research groups worldwide.

Technical Summary

We propose to develop:
1. A library of apicoplast and mitochondrial genomic sequence variants across multiple human Plasmodium species P. falciparum, P. vivax, P. ovale curtisi, P. ovale wallikeri, P. malariae, and P. knowlesi using existing raw genomic sequence data generated by collaborating investigators with external funding.
2. A statistical algorithm to infer informative SNP haplotypes within and between species from complex mixed infections. The perfect linkage disequilibrium or "perfect phylogeny" across the co-inherited organelle SNPs leads to an opportunity to construct phylogenetic trees that represent the relationship between haplotypes. Crucially this allows modelling approaches to disaggregate complex mixed infections.
3. Prototype barcodes based on newly generated mt/apico sequences for Pv, Po, Pm, and Pk, in partnership with collaborators;
4. A proof of principle. In collaboration with overseas research colleagues who have raw genomic sequence data suspected to contain mixed species co-infection (e.g. P. falciparum + P. malariae in Kenya, P. falciparum+P. vivax in Thailand). Colleagues at the National Institute of Parasitic Diseases at the Chinese Center for Disease Control and Prevention in Shanghai, China are genotyping local and putative imported infections from archived bloodspot samples and we will reanalyse these using the published Pf barcoding methodology of 23 SNPs developed using a classification and regression tree (CART) algorithm (Preston et al Nature Communications in press).
5. An online resource, which summarises the library of mt/api genomic variants and barcode haplotypes, and facilitates the input of sequence data for the rapid identification of species and the potential geographic source of (imported) infections.

Planned Impact

Malaria caused by Plasmodium falciparum kills about 600,000 people per year, and increased population mobility through international air travel carries further risks of re-introducing parasites to elimination areas and dispersing drug resistant parasites to new regions. A simple genetic marker that quickly and accurately identifies the geographic origin of infections would be a valuable tool for locating the source of outbreaks, and spotting the spread of drug resistant parasites from Asia into Africa. Genetic markers have proved extremely valuable in tracking and eradicating diseases, such as Polio. However, the previous candidates for malaria genetic barcodes have relied on identifying DNA markers found in the parasite nucleus, which shows too much genetic variation between individual parasites to be used accurately. DNA sequences found outside the nucleus in organelles called the mitochondria and the apicoplast are only inherited through maternal lines and therefore much more stable over generations than nuclear DNA sequences. The research outlined in this methodology proposal will create computational tools which will help to exploit use of mitochondria and the apicoplast sequences to create reliable genetic barcodes for tracking the geographical movement of malaria in an operational context. We aim to create an analytical framework which will support simple genetic barcoding for use by National Malaria Control Programmes who are engaged in malaria elimination. Other potential beneficiaries are all those engaged in malaria control and malaria treatment who's work will benefit from new knowledge about how malaria parasite populations are interconnected.

Publications

10 25 50

 
Title Additional file 1 of Large-scale genomic analysis of global Klebsiella pneumoniae plasmids reveals multiple simultaneous clusters of carbapenem-resistant hypervirulent strains 
Description Additional file 1: Figure S1. Analysis work flow, jpg image. 
Type Of Art Film/Video/Animation 
Year Produced 2023 
URL https://springernature.figshare.com/articles/figure/Additional_file_1_of_Large-scale_genomic_analysi...
 
Title Additional file 1 of Large-scale genomic analysis of global Klebsiella pneumoniae plasmids reveals multiple simultaneous clusters of carbapenem-resistant hypervirulent strains 
Description Additional file 1: Figure S1. Analysis work flow, jpg image. 
Type Of Art Film/Video/Animation 
Year Produced 2023 
URL https://springernature.figshare.com/articles/figure/Additional_file_1_of_Large-scale_genomic_analysi...
 
Title Additional file 10: Figure S9. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Non-neutral evolution for genes within Clusters of Orthologous Groups (COG*) categories. Boxplots are constructed using (-log10) p-values of non-neutral evolution for each gene. *ppe/Nâ =â pe/ppe genes annotated as COG category N, * COG categories: A RNA processing and modification, B Chromatin Structure and dynamics, C Energy production and conversion, D Cell cycle control and mitosis, E Amino Acid metabolism and transport, F Nucleotide metabolism and transport, G Carbohydrate metabolism and transport, H Coenzyme metabolism, I Lipid metabolism, J Translation, K Transcription, L Replication and repair, M Cell wall/membrane/envelope biogenesis, N Cell motility, O Post-translational modification, protein turnover, chaperone functions, P Inorganic ion transport and metabolism, Q Secondary Structure, T Signal Transduction, U Intracellular trafficking and secretion, Y Nuclear structure, Z Cytoskeleton, R General Functional Prediction only, S Function Unknown. (TIF 127Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_10_Figure_S9_of_Recombination_in...
 
Title Additional file 10: Figure S9. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Non-neutral evolution for genes within Clusters of Orthologous Groups (COG*) categories. Boxplots are constructed using (-log10) p-values of non-neutral evolution for each gene. *ppe/Nâ =â pe/ppe genes annotated as COG category N, * COG categories: A RNA processing and modification, B Chromatin Structure and dynamics, C Energy production and conversion, D Cell cycle control and mitosis, E Amino Acid metabolism and transport, F Nucleotide metabolism and transport, G Carbohydrate metabolism and transport, H Coenzyme metabolism, I Lipid metabolism, J Translation, K Transcription, L Replication and repair, M Cell wall/membrane/envelope biogenesis, N Cell motility, O Post-translational modification, protein turnover, chaperone functions, P Inorganic ion transport and metabolism, Q Secondary Structure, T Signal Transduction, U Intracellular trafficking and secretion, Y Nuclear structure, Z Cytoskeleton, R General Functional Prediction only, S Function Unknown. (TIF 127Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_10_Figure_S9_of_Recombination_in...
 
Title Additional file 10: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Sublineage frequency table, Numbers and percentage by lineage assigned to each sublineage. (PPTX 36 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_10_of_Genome-wide_analysis_of_Mycobacte...
 
Title Additional file 10: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Sublineage frequency table, Numbers and percentage by lineage assigned to each sublineage. (PPTX 36 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_10_of_Genome-wide_analysis_of_Mycobacte...
 
Title Additional file 1: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Variant Summary Tables, Summary tables of variants called in comparison to the H37rv reference, with monomorphic variants removed for each dataset. a Total numbers of variants by lineage; b Number of variants per sample; c Non-reference variant frequency summary; variants called in comparison to the H37rv reference. (PPTX 39 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_1_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 1: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Variant Summary Tables, Summary tables of variants called in comparison to the H37rv reference, with monomorphic variants removed for each dataset. a Total numbers of variants by lineage; b Number of variants per sample; c Non-reference variant frequency summary; variants called in comparison to the H37rv reference. (PPTX 39 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_1_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 2: Figure S1. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description The global distribution of geographic origin and lineage of the isolates. Lineages one to four are represented by blue, green, purple, and red, respectively. (PNG 265 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_2_Figure_S1_of___Mycobacterium_t...
 
Title Additional file 2: Figure S1. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description The global distribution of geographic origin and lineage of the isolates. Lineages one to four are represented by blue, green, purple, and red, respectively. (PNG 265 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_2_Figure_S1_of___Mycobacterium_t...
 
Title Additional file 2: Figure S1. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Allele frequency spectra for each lineage by synonymous (blue) and non-synonymous (red) mutations. The peaks at intermediate allele frequencies include sub-lineage defining SNPs (Lineage 1 Indo-Oceanic; Lineage 2 East-Asian (Beijing); Lineage 3 East-African-Indian; Lineage 4 Euro-American). (TIF 207Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_2_Figure_S1_of_Recombination_in_...
 
Title Additional file 2: Figure S1. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Allele frequency spectra for each lineage by synonymous (blue) and non-synonymous (red) mutations. The peaks at intermediate allele frequencies include sub-lineage defining SNPs (Lineage 1 Indo-Oceanic; Lineage 2 East-Asian (Beijing); Lineage 3 East-African-Indian; Lineage 4 Euro-American). (TIF 207Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_2_Figure_S1_of_Recombination_in_...
 
Title Additional file 2: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Non-reference variant frequency histogram, A histogram showing log10(frequency + 1) of non-reference alleles compared to the H37rv reference for a lineage 2 and b lineage4. (PPTX 69 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_2_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 2: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Non-reference variant frequency histogram, A histogram showing log10(frequency + 1) of non-reference alleles compared to the H37rv reference for a lineage 2 and b lineage4. (PPTX 69 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_2_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 3: Figure S2. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description SNP allele frequency spectrum. A large number of rare variants are observed. Peaks with higher allele frequency reflect the presence of lineage and sub-lineage specific SNPs. (PNG 33 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_3_Figure_S2_of___Mycobacterium_t...
 
Title Additional file 3: Figure S2. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description SNP allele frequency spectrum. A large number of rare variants are observed. Peaks with higher allele frequency reflect the presence of lineage and sub-lineage specific SNPs. (PNG 33 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_3_Figure_S2_of___Mycobacterium_t...
 
Title Additional file 3: Figure S2. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Gene-based nucleotide diversity (D ) for the 21 reference genomes. All genes with high nucleotide diversity (D â >â 0.0075) are labelled. (TIF 148Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_3_Figure_S2_of_Recombination_in_...
 
Title Additional file 3: Figure S2. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Gene-based nucleotide diversity (D ) for the 21 reference genomes. All genes with high nucleotide diversity (D â >â 0.0075) are labelled. (TIF 148Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_3_Figure_S2_of_Recombination_in_...
 
Title Additional file 3: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Population diversity within investigated strains, a Principal component 1 (PC1) by principal component 2 (PC2) for lineage 2, The first 10 principal components account for 71.9% of the variation in lineage 2; b Distance plot for lineage 2 showing pairwise number of variant differences between samples; c Principal component 1 (PC1) by principal component 2 (PC2) for lineage 4, the first 10 principal components account for 88.9% of the variation in lineage 4. d Distance plot for lineage 2 showing pairwise number of variant differences between samples. (PPTX 5650 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_3_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 3: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Population diversity within investigated strains, a Principal component 1 (PC1) by principal component 2 (PC2) for lineage 2, The first 10 principal components account for 71.9% of the variation in lineage 2; b Distance plot for lineage 2 showing pairwise number of variant differences between samples; c Principal component 1 (PC1) by principal component 2 (PC2) for lineage 4, the first 10 principal components account for 88.9% of the variation in lineage 4. d Distance plot for lineage 2 showing pairwise number of variant differences between samples. (PPTX 5650 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_3_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 4: Figure S3. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Phylogenetic tree constructed using 50,540 genome-wide SNPs. Clear clustering according to lineage can be seen (Lineage 1 (Indo-Oceanic, green), lineage 2 (East-Asian (Beijing), blue), lineage 3 (East-African-Indian, purple), lineage 4 (Euro-American, red)). Reference genomes are labelled. M. canetti is annotated in cyan. (TIF 69Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_4_Figure_S3_of_Recombination_in_...
 
Title Additional file 4: Figure S3. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Phylogenetic tree constructed using 50,540 genome-wide SNPs. Clear clustering according to lineage can be seen (Lineage 1 (Indo-Oceanic, green), lineage 2 (East-Asian (Beijing), blue), lineage 3 (East-African-Indian, purple), lineage 4 (Euro-American, red)). Reference genomes are labelled. M. canetti is annotated in cyan. (TIF 69Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_4_Figure_S3_of_Recombination_in_...
 
Title Additional file 4: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Scree plots for the principal component analyses, Scree plots showing the proportion of variation accounted for by the first ten principal components, calculated for the pairwise distances within a lineage 4 and b lineage 2. (PPTX 142 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_4_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 4: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Scree plots for the principal component analyses, Scree plots showing the proportion of variation accounted for by the first ten principal components, calculated for the pairwise distances within a lineage 4 and b lineage 2. (PPTX 142 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_4_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 5: Figure S4. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Identifying sites leading to differences in tree topologies based on all SNPs (Additional file 4: Figure S3a) and only those from pe/ppe genes (Additional file 4: Figure S3b). The Î Site wise log likelihood score (Î SSLS) is calculated for each SNP in the pe/ppe gene alignments. Negative differences indicate SNP positions favouring the pe/ppe tree. SNPs in pe_pgrs3, ppe57 and ppe60 produce strong phylogenetic signals supporting the pe/ppe tree. (TIF 113Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_5_Figure_S4_of_Recombination_in_...
 
Title Additional file 5: Figure S4. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Identifying sites leading to differences in tree topologies based on all SNPs (Additional file 4: Figure S3a) and only those from pe/ppe genes (Additional file 4: Figure S3b). The Î Site wise log likelihood score (Î SSLS) is calculated for each SNP in the pe/ppe gene alignments. Negative differences indicate SNP positions favouring the pe/ppe tree. SNPs in pe_pgrs3, ppe57 and ppe60 produce strong phylogenetic signals supporting the pe/ppe tree. (TIF 113Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_5_Figure_S4_of_Recombination_in_...
 
Title Additional file 5: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Drug-resistance phenotype frequency table, Drug-resistance phenotype frequency table by lineage. 'Totals' shows the number and percentage of each lineage with a known drug-resistance phenotype. (PPTX 45 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_5_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 5: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Drug-resistance phenotype frequency table, Drug-resistance phenotype frequency table by lineage. 'Totals' shows the number and percentage of each lineage with a known drug-resistance phenotype. (PPTX 45 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_5_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 6: Figure S5. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Phylogenetic tree created using only SNPs from pe_pgrs3. No clear clustering by lineage is observed. However there are two major clades, one consistent with H37Rv (bottom-left). (TIF 126Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_6_Figure_S5_of_Recombination_in_...
 
Title Additional file 6: Figure S5. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Phylogenetic tree created using only SNPs from pe_pgrs3. No clear clustering by lineage is observed. However there are two major clades, one consistent with H37Rv (bottom-left). (TIF 126Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_6_Figure_S5_of_Recombination_in_...
 
Title Additional file 6: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Cross-resistance phenotype table, Cross-Resistance Table upper diagonal shows proportion of samples phenotyped for both vertical and horizontal phenotype, that test positive for vertical phenotype. Diagonal shows number of samples with each phenotype. Lower diagonal shows number of samples with phenotype for both horizontal and vertical phenotype. (PPTX 45 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_6_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 6: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Cross-resistance phenotype table, Cross-Resistance Table upper diagonal shows proportion of samples phenotyped for both vertical and horizontal phenotype, that test positive for vertical phenotype. Diagonal shows number of samples with each phenotype. Lower diagonal shows number of samples with phenotype for both horizontal and vertical phenotype. (PPTX 45 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_6_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 7: Figure S6. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description Molecular interactions established by wild-type residues in katG and rpoB residues. (A) The interactions established by Ser315 in katG. Given the proximity of the residue to the ligands INH and HEM, mutations to Asn and Thr, with slightly larger side chains, would potentially cause steric clashes. (B) The interactions of Asp435 in rpoB. It directly interacts with RMP via polar interactions that would be disrupted by mutations to Val. (C) Thr400 in rpoB is at the end of an alpha helix establishing intra molecular interactions. Giving its distance to RMP, it would be expected that its mutation to Ala would be a lower impact, which would arise from alosteric changes. (D) Ser450 establishes strong intra molecular interactions in the RMP binding site. Mutations to larger residues (Trp and Leu) could disrupt the packing of the region and therefore binding. (E). Ile491 performs hydrophobic interactions with RMP and its neighbouring residues. Mutations to Phe or Val would compromise packing, either inducing steric clashes or compromising packing. (F). His445 performs strong intra molecular interactions, including a donor-pi (blue dashes) and hydrogen bond (red dashes). Mutations to residues Asp, Tyr or Arg would imply in the loss of the pi interaction as well as potential introduction of steric clashes. (PNG 749 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_7_Figure_S6_of___Mycobacterium_t...
 
Title Additional file 7: Figure S6. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description Molecular interactions established by wild-type residues in katG and rpoB residues. (A) The interactions established by Ser315 in katG. Given the proximity of the residue to the ligands INH and HEM, mutations to Asn and Thr, with slightly larger side chains, would potentially cause steric clashes. (B) The interactions of Asp435 in rpoB. It directly interacts with RMP via polar interactions that would be disrupted by mutations to Val. (C) Thr400 in rpoB is at the end of an alpha helix establishing intra molecular interactions. Giving its distance to RMP, it would be expected that its mutation to Ala would be a lower impact, which would arise from alosteric changes. (D) Ser450 establishes strong intra molecular interactions in the RMP binding site. Mutations to larger residues (Trp and Leu) could disrupt the packing of the region and therefore binding. (E). Ile491 performs hydrophobic interactions with RMP and its neighbouring residues. Mutations to Phe or Val would compromise packing, either inducing steric clashes or compromising packing. (F). His445 performs strong intra molecular interactions, including a donor-pi (blue dashes) and hydrogen bond (red dashes). Mutations to residues Asp, Tyr or Arg would imply in the loss of the pi interaction as well as potential introduction of steric clashes. (PNG 749 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_7_Figure_S6_of___Mycobacterium_t...
 
Title Additional file 7: Figure S6. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Lineage-specific recombination hotspots. Manhattan plots showing genes that are likely to be recombination hotspots in each lineage (Lineage 1 Indo-Oceanic; Lineage 2 East-Asian (Beijing); Lineage 3 East-African-Indian; Lineage 4 Euro-American). The (â log10) p-value for the phi statistic is plotted against genome position. All genes with p-valuesâ 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_7_Figure_S6_of_Recombination_in_...
 
Title Additional file 7: Figure S6. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Lineage-specific recombination hotspots. Manhattan plots showing genes that are likely to be recombination hotspots in each lineage (Lineage 1 Indo-Oceanic; Lineage 2 East-Asian (Beijing); Lineage 3 East-African-Indian; Lineage 4 Euro-American). The (â log10) p-value for the phi statistic is plotted against genome position. All genes with p-valuesâ 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_7_Figure_S6_of_Recombination_in_...
 
Title Additional file 7: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Variant Position Table, Table detailing variants at all positions with at least one non-synonymous variant found to be significantly associated with a phenotype in any of the variant-based analyses. (PPTX 52 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_7_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 7: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Variant Position Table, Table detailing variants at all positions with at least one non-synonymous variant found to be significantly associated with a phenotype in any of the variant-based analyses. (PPTX 52 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_7_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 8: Figure S7. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Evidence of recombination at a gene level in the 21 reference genomes. A Manhattan plot showing genes that are likely to be recombination hotspots. The (â log10) p-value for the phi statistic is plotted against genome position. Genes with p-values less than 0.05 are shown. (TIF 120Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_8_Figure_S7_of_Recombination_in_...
 
Title Additional file 8: Figure S7. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Evidence of recombination at a gene level in the 21 reference genomes. A Manhattan plot showing genes that are likely to be recombination hotspots. The (â log10) p-value for the phi statistic is plotted against genome position. Genes with p-values less than 0.05 are shown. (TIF 120Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_8_Figure_S7_of_Recombination_in_...
 
Title Additional file 8: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Locus Comparison Table, Locus comparison table showing which analyses and in which lineage each loci was identified. An 'x' indicates a locus which was not identified by the method of analysis in question. Loci without a known association with the phenotype are highlighted in bold. (PPTX 44 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_8_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 8: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Locus Comparison Table, Locus comparison table showing which analyses and in which lineage each loci was identified. An 'x' indicates a locus which was not identified by the method of analysis in question. Loci without a known association with the phenotype are highlighted in bold. (PPTX 44 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_8_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 9: Figure S8. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Selection dN/dS values for each gene within Clusters of Orthologous Groups (COG*) categories. *ppe/Nâ =â pe/ppe genes annotated as COG category N, * COG categories: A RNA processing and modification, B Chromatin Structure and dynamics, C Energy production and conversion, D Cell cycle control and mitosis, E Amino Acid metabolism and transport, F Nucleotide metabolism and transport, G Carbohydrate metabolism and transport, H Coenzyme metabolism, I Lipid metabolism, J Translation, K Transcription, L Replication and repair, M Cell wall/membrane/envelope biogenesis, N Cell motility, O Post-translational modification, protein turnover, chaperone functions, P Inorganic ion transport and metabolism, Q Secondary Structure, T Signal Transduction, U Intracellular trafficking and secretion, Y Nuclear structure, Z Cytoskeleton, R General Functional Prediction only, S Function Unknown. (TIF 124Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_9_Figure_S8_of_Recombination_in_...
 
Title Additional file 9: Figure S8. of Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages 
Description Selection dN/dS values for each gene within Clusters of Orthologous Groups (COG*) categories. *ppe/Nâ =â pe/ppe genes annotated as COG category N, * COG categories: A RNA processing and modification, B Chromatin Structure and dynamics, C Energy production and conversion, D Cell cycle control and mitosis, E Amino Acid metabolism and transport, F Nucleotide metabolism and transport, G Carbohydrate metabolism and transport, H Coenzyme metabolism, I Lipid metabolism, J Translation, K Transcription, L Replication and repair, M Cell wall/membrane/envelope biogenesis, N Cell motility, O Post-translational modification, protein turnover, chaperone functions, P Inorganic ion transport and metabolism, Q Secondary Structure, T Signal Transduction, U Intracellular trafficking and secretion, Y Nuclear structure, Z Cytoskeleton, R General Functional Prediction only, S Function Unknown. (TIF 124Â kb) 
Type Of Art Film/Video/Animation 
Year Produced 2016 
URL https://springernature.figshare.com/articles/figure/Additional_file_9_Figure_S8_of_Recombination_in_...
 
Title Additional file 9: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Study frequency table, Study frequency table, showing numbers and percentage of strains from each study by lineage. (PPTX 40 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_9_of_Genome-wide_analysis_of_Mycobacter...
 
Title Additional file 9: of Genome-wide analysis of Mycobacterium tuberculosis polymorphisms reveals lineage-specific associations with drug resistance 
Description Study frequency table, Study frequency table, showing numbers and percentage of strains from each study by lineage. (PPTX 40 kb) 
Type Of Art Film/Video/Animation 
Year Produced 2019 
URL https://springernature.figshare.com/articles/Additional_file_9_of_Genome-wide_analysis_of_Mycobacter...
 
Description NIH Research grant
Amount $2,100,000 (USD)
Funding ID 2R01AI103629 - 04A1 
Organisation National Institutes of Health (NIH) 
Sector Public
Country United States
Start 07/2017 
End 08/2021
 
Description Newton Institutional Links Grant
Amount £279,000 (GBP)
Funding ID 261868591 
Organisation British Council 
Sector Charity/Non Profit
Country United Kingdom
Start 03/2017 
End 04/2019
 
Description Newton Researcher Links Workshop Grants (Infectious Disease 'Omics (Philippines) )
Amount £60,000 (GBP)
Funding ID 2017-RLWK8-10671 
Organisation Newton Fund 
Sector Public
Country United Kingdom
Start 01/2018 
End 12/2018
 
Description Newton Researcher Links Workshop Grants (Infectious Disease 'Omics (Philippines) )
Amount £60,000 (GBP)
Funding ID Ref. 2017-RLWK9-110970 
Organisation Newton Fund 
Sector Public
Country United Kingdom
Start 03/2018 
End 12/2018
 
Title Molecular barcode for Plasmodia 
Description A software tool to call malaria parasite species from whole genome sequencing data. Ongoing work is developing a molecular assay for a field setting. 
Type Of Material Improvements to research infrastructure 
Year Produced 2018 
Provided To Others? No  
Impact It will have impact. A manuscript is under review, and a molecular assay is under development. 
 
Title New barcode for Plasmodium vivax 
Description We have now developed a molecular barcode for Plasmodium vivax, and an informatics tool to translate sequence data into a geographical and transmission profile. 
Type Of Material Improvements to research infrastructure 
Year Produced 2020 
Provided To Others? Yes  
Impact There is no impact as yet. 
 
Title Sequence data analysis pipelines 
Description We have established bioinformatic pipelines to process large numbers of sample sequences, and identify informative genomic variants. 
Type Of Material Improvements to research infrastructure 
Year Produced 2015 
Provided To Others? Yes  
Impact More rapid and accurate analysis by ourselves and collaborators, leading to new research insights. We propose to publish our methods and make them available to the research community. 
 
Title Additional file 1: of Identifying mixed Mycobacterium tuberculosis infections from whole genome sequence data 
Description Strain information and full results table for clinically-derived Malawi samples, in silico replicate samples and Portuguese replicate samples. Lineage, total number of SNPs, number of heterozygous sites and the mixture analysis result for both Bayesian clustering and heterozygous sites approaches is included for each sample. (XLSX 162 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_1_of_Identifying_mixed_Mycobacterium_tu...
 
Title Additional file 1: of Identifying mixed Mycobacterium tuberculosis infections from whole genome sequence data 
Description Strain information and full results table for clinically-derived Malawi samples, in silico replicate samples and Portuguese replicate samples. Lineage, total number of SNPs, number of heterozygous sites and the mixture analysis result for both Bayesian clustering and heterozygous sites approaches is included for each sample. (XLSX 162 kb) 
Type Of Material Database/Collection of data 
Year Produced 2018 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_1_of_Identifying_mixed_Mycobacterium_tu...
 
Title Additional file 1: of Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs 
Description Data S1. Mutations in the library (CSV 51 kb) 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_1_of_Integrating_informatics_tools_and_...
 
Title Additional file 1: of Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs 
Description Data S1. Mutations in the library (CSV 51 kb) 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/Additional_file_1_of_Integrating_informatics_tools_and_...
 
Title Additional file 2 of Large-scale genomic analysis of global Klebsiella pneumoniae plasmids reveals multiple simultaneous clusters of carbapenem-resistant hypervirulent strains 
Description Additional file 2: Data S1. Samples AMR profile. Data S2. Plasmid sequences and their distribution. Data S3. Assignments of samples to clusters. Data S4. Contig IDs and Replicons. Data S5. Murray Collection Replicons 
Type Of Material Database/Collection of data 
Year Produced 2023 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_2_of_Large-scale_genomic_analys...
 
Title Additional file 2 of Large-scale genomic analysis of global Klebsiella pneumoniae plasmids reveals multiple simultaneous clusters of carbapenem-resistant hypervirulent strains 
Description Additional file 2: Data S1. Samples AMR profile. Data S2. Plasmid sequences and their distribution. Data S3. Assignments of samples to clusters. Data S4. Contig IDs and Replicons. Data S5. Murray Collection Replicons 
Type Of Material Database/Collection of data 
Year Produced 2023 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_2_of_Large-scale_genomic_analys...
 
Title Additional file 3 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 3. All BLASTN hits from AaegL5 assembly. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_3_of_Flavivirus_integrations_in...
 
Title Additional file 3 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 3. All BLASTN hits from AaegL5 assembly. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_3_of_Flavivirus_integrations_in...
 
Title Additional file 4 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 4. Accension IDs of all used flavivirus genomes. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_4_of_Flavivirus_integrations_in...
 
Title Additional file 4 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 4. Accension IDs of all used flavivirus genomes. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_4_of_Flavivirus_integrations_in...
 
Title Additional file 4: Figure S3. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description Population structure analysis of the 144 isolates show clustering by lineage (Lineages one to four are represented by blue, green, purple, and red points, respectively). (a) A phylogenetic tree rooted with M. canetti. (b) First two principal components represent 33Â % and 30.5Â % of the variation explained between isolates, respectively. (ZIP 105 kb) 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_4_Figure_S3_of___Mycobacterium_...
 
Title Additional file 4: Figure S3. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description Population structure analysis of the 144 isolates show clustering by lineage (Lineages one to four are represented by blue, green, purple, and red points, respectively). (a) A phylogenetic tree rooted with M. canetti. (b) First two principal components represent 33Â % and 30.5Â % of the variation explained between isolates, respectively. (ZIP 105 kb) 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_4_Figure_S3_of___Mycobacterium_...
 
Title Additional file 5 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 5. All BLASTN hits from AaloF2 assembly. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_5_of_Flavivirus_integrations_in...
 
Title Additional file 5 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 5. All BLASTN hits from AaloF2 assembly. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_5_of_Flavivirus_integrations_in...
 
Title Additional file 5: Figure S4. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description The relationship between the total number of non-synonymous SNPs in candidate loci and the MIC values. The size of the circle represents the number of isolates. a) Ethambutol (embB, embA, embA promoter, embC, embR and ubiA). b) Streptomycin (rpsL, rrs). The size of the circles is proportional to the frequency. The MIC values tend to increase with the number of non-synonymous mutations (ethambutol: rho = 0.24, slope = 0.29, p = 0.003; streptomycin: rho = 0.48, slope = 3.59, p = 1.65 × 10-8). The horizontal blue lines refer to the resistance cut-offs. (ZIP 92 kb) 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_5_Figure_S4_of___Mycobacterium_...
 
Title Additional file 5: Figure S4. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description The relationship between the total number of non-synonymous SNPs in candidate loci and the MIC values. The size of the circle represents the number of isolates. a) Ethambutol (embB, embA, embA promoter, embC, embR and ubiA). b) Streptomycin (rpsL, rrs). The size of the circles is proportional to the frequency. The MIC values tend to increase with the number of non-synonymous mutations (ethambutol: rho = 0.24, slope = 0.29, p = 0.003; streptomycin: rho = 0.48, slope = 3.59, p = 1.65 × 10-8). The horizontal blue lines refer to the resistance cut-offs. (ZIP 92 kb) 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_5_Figure_S4_of___Mycobacterium_...
 
Title Additional file 6 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 6. EVE coordinates and sequences identified in AaloF2. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_6_of_Flavivirus_integrations_in...
 
Title Additional file 6 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 6. EVE coordinates and sequences identified in AaloF2. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_6_of_Flavivirus_integrations_in...
 
Title Additional file 6: Figure S5. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description Percentage of the variation in MIC values explained by each mutated codon in candidate genes. Bars in red represent significant independent associations with increased MIC (pâ 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_6_Figure_S5_of___Mycobacterium_...
 
Title Additional file 6: Figure S5. of Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance 
Description Percentage of the variation in MIC values explained by each mutated codon in candidate genes. Bars in red represent significant independent associations with increased MIC (pâ 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_6_Figure_S5_of___Mycobacterium_...
 
Title Additional file 7 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 7. EVE bedfiles for AaegL5 and AaloF2 assemblies; phylogenetic trees and corresponding metadata for Ae. aegypti EVEs and cox1 gene. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_7_of_Flavivirus_integrations_in...
 
Title Additional file 7 of Flavivirus integrations in Aedes aegypti are limited and highly conserved across samples from different geographic regions unlike integrations in Aedes albopictus 
Description Additional file 7. EVE bedfiles for AaegL5 and AaloF2 assemblies; phylogenetic trees and corresponding metadata for Ae. aegypti EVEs and cox1 gene. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/Additional_file_7_of_Flavivirus_integrations_in...
 
Title Genomic evidence supporting the clonal expansion of extensively drug-resistant tuberculosis bacteria belonging to a rare proto-Beijing genotype 
Description Tuberculosis disease (TB), caused by Mycobacterium tuberculosis, is a major public health issue in Thailand. The high prevalence of modern Beijing (Lineage 2.2.1) strains has been associated with multi- and extensively drug-resistant infections (MDR-, XDR-TB), complicating disease control. The impact of rarer proto-Beijing (L2.1) strains is less clear. In our study of thirty-seven L2.1 clinical isolates spanning thirteen years, we found a high prevalence of XDR-TB cases (32.4%). With = 12 pairwise SNP distances, 43.2% of L2.1 patients belong to MDR-TB or XDR-TB transmission clusters suggesting a high level of clonal expansion across four Thai provinces. All XDR-TB (100%) were likely due to transmission rather than inadequate treatment. We found a 47 mutation signature and a partial deletion of the fadD14 gene in the circulating XDR-TB cluster, which can be used for surveillance of this rare and resilient M. tuberculosis strain-type that is causing increasing health burden. We also detected three novel deletion positions, a deletion of 1285 bp within desA3 (Rv3230c), large deletions in the plcB, plcA, and ppe38 gene which may play a role in the virulence, pathogenesis or evolution of the L2.1 strain-type. 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
URL https://tandf.figshare.com/articles/dataset/Genomic_evidence_supporting_the_clonal_expansion_of_exte...
 
Title Genomic evidence supporting the clonal expansion of extensively drug-resistant tuberculosis bacteria belonging to a rare proto-Beijing genotype 
Description Tuberculosis disease (TB), caused by Mycobacterium tuberculosis, is a major public health issue in Thailand. The high prevalence of modern Beijing (Lineage 2.2.1) strains has been associated with multi- and extensively drug-resistant infections (MDR-, XDR-TB), complicating disease control. The impact of rarer proto-Beijing (L2.1) strains is less clear. In our study of thirty-seven L2.1 clinical isolates spanning thirteen years, we found a high prevalence of XDR-TB cases (32.4%). With = 12 pairwise SNP distances, 43.2% of L2.1 patients belong to MDR-TB or XDR-TB transmission clusters suggesting a high level of clonal expansion across four Thai provinces. All XDR-TB (100%) were likely due to transmission rather than inadequate treatment. We found a 47 mutation signature and a partial deletion of the fadD14 gene in the circulating XDR-TB cluster, which can be used for surveillance of this rare and resilient M. tuberculosis strain-type that is causing increasing health burden. We also detected three novel deletion positions, a deletion of 1285 bp within desA3 (Rv3230c), large deletions in the plcB, plcA, and ppe38 gene which may play a role in the virulence, pathogenesis or evolution of the L2.1 strain-type. 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
URL https://tandf.figshare.com/articles/dataset/Genomic_evidence_supporting_the_clonal_expansion_of_exte...
 
Title High clustering rate and genotypic drug-susceptibility screening for the newly recommended anti-tuberculosis drugs among global extensively drug-resistant Mycobacterium tuberculosis isolates 
Description Multidrug-resistant tuberculosis (MDR-TB) and extensively drug-resistant tuberculosis (XDR-TB) make TB difficult to control. Global susceptibility data for six newly recommended anti-TB drugs against M/XDR-TB are still limited. Using publicly available whole-genome sequences, we determined the proportion of 513 phenotypically XDR-TB isolates that carried mutations associated with resistance against these drugs (bedaquiline, clofazimine, linezolid, delamanid, pretomanid and cycloserine). Mutations of Rv0678 and Rv1979c were detected in 69/513 isolates (13.5%) for bedaquiline resistance and 79/513 isolates (15.4%) for clofazimine resistance with additional mmpL5 mutations. Mutations conferring resistance to delamanid were detected in fbiB and ddn genes for 11/513 isolates (2.1%). For pretomanid, a mutation was detected in the ddn gene for 3/513 isolates (0.6%). Nineteen mutations of pykA, cycA, ald, and alr genes, conferring resistance to cycloserine, were found in 153/513 isolates (29.8%). No known mutations associated with linezolid resistance were detected. Cluster analysis showed that 408/513 isolates fell within 99 clusters and that 354 of these isolates were possible primary drug-resistant TB (292 XDR-TB, 57 pre-XDR-TB and 5 MDR-TB). Clonal transmission of primary XDR isolates might contribute significantly to the high prevalence of DR-TB globally. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://tandf.figshare.com/articles/dataset/High_clustering_rate_and_genotypic_drug-susceptibility_s...
 
Title High clustering rate and genotypic drug-susceptibility screening for the newly recommended anti-tuberculosis drugs among global extensively drug-resistant Mycobacterium tuberculosis isolates 
Description Multidrug-resistant tuberculosis (MDR-TB) and extensively drug-resistant tuberculosis (XDR-TB) make TB difficult to control. Global susceptibility data for six newly recommended anti-TB drugs against M/XDR-TB are still limited. Using publicly available whole-genome sequences, we determined the proportion of 513 phenotypically XDR-TB isolates that carried mutations associated with resistance against these drugs (bedaquiline, clofazimine, linezolid, delamanid, pretomanid and cycloserine). Mutations of Rv0678 and Rv1979c were detected in 69/513 isolates (13.5%) for bedaquiline resistance and 79/513 isolates (15.4%) for clofazimine resistance with additional mmpL5 mutations. Mutations conferring resistance to delamanid were detected in fbiB and ddn genes for 11/513 isolates (2.1%). For pretomanid, a mutation was detected in the ddn gene for 3/513 isolates (0.6%). Nineteen mutations of pykA, cycA, ald, and alr genes, conferring resistance to cycloserine, were found in 153/513 isolates (29.8%). No known mutations associated with linezolid resistance were detected. Cluster analysis showed that 408/513 isolates fell within 99 clusters and that 354 of these isolates were possible primary drug-resistant TB (292 XDR-TB, 57 pre-XDR-TB and 5 MDR-TB). Clonal transmission of primary XDR isolates might contribute significantly to the high prevalence of DR-TB globally. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://tandf.figshare.com/articles/dataset/High_clustering_rate_and_genotypic_drug-susceptibility_s...
 
Title MOESM1 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 1: Table S1. SNPs identified in the three lines 3D7A, A4 and F12. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM1_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM1 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 1: Table S1. SNPs identified in the three lines 3D7A, A4 and F12. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM1_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM2 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 2: Table S2. Insertions (INS) and deletions (DEL) identified in the three lines 3D7A, A4 and F12. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM2_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM2 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 2: Table S2. Insertions (INS) and deletions (DEL) identified in the three lines 3D7A, A4 and F12. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM2_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM3 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 3: Table S3. SNPs and Indels identified in the PF3D7_1222600 gene among P. falciparum field isolates from Africa and Asia. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM3_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM3 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 3: Table S3. SNPs and Indels identified in the PF3D7_1222600 gene among P. falciparum field isolates from Africa and Asia. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM3_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM4 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 4: Table S4. Structural variants identified in the three lines 3D7A, A4 and F12. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM4_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM4 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 4: Table S4. Structural variants identified in the three lines 3D7A, A4 and F12. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM4_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM7 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 7: Table S5. A scan for motifs in gene sequences (coding and UTR regions). 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM7_of_Genomic_variation_in_two_gametocyte_n...
 
Title MOESM7 of Genomic variation in two gametocyte non-producing Plasmodium falciparum clonal lines 
Description Additional file 7: Table S5. A scan for motifs in gene sequences (coding and UTR regions). 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://springernature.figshare.com/articles/dataset/MOESM7_of_Genomic_variation_in_two_gametocyte_n...
 
Title Malaria sequencing database 
Description We have processed sequences for over 2000 malaria samples, and all genomic variants detected have been assembled in a database. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
Impact It has allowed our collaborators to investigate their genes of interest. We plan to make the database accessible to the research community after further testing and confirmation of variants. 
 
Title Neglected malaria sequence data 
Description To fill in gaps in Plasmodium genomics, and to improve our geographical and species barcodes, we have sequenced >500 P. vivax, P. malaria and P. ovale parasites. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? No  
Impact We have a new molecular barcode for malaria speciation, geographical source and drug resistance. This has been implemented within a new software tool ("Malaria_Profiler"), which we will release in the next few months via a manuscript in preparation. 
 
Description Brazil Falciparum & Vivax - Simone Santos Silva Oliveira 
Organisation Oswaldo Cruz Foundation (Fiocruz)
Country Brazil 
Sector Public 
PI Contribution Analysis of genetic data.
Collaborator Contribution Contributing malaria samples to sequence.
Impact Sequence data is being generated.
Start Year 2015
 
Description Brazil Falciparum and Vivax - Claudio Marinho 
Organisation Universidade de São Paulo
Department Department of Parasitology
Country Brazil 
Sector Academic/University 
PI Contribution Supported visitor that came to work in the laboratory.
Collaborator Contribution DNA Samples. A technician visited to work on samples.
Impact A draft manuscript. Samples genotyped.
Start Year 2016
 
Description Brazil Vivax - Marcelo Urbano Ferreira 
Organisation Universidade de São Paulo
Country Brazil 
Sector Academic/University 
PI Contribution Analysis of P.vivax sequence data for the collaborator.
Collaborator Contribution Contributing P.vivax sequence data to the MRC funded barcoding project.
Impact A manuscript describing P.vivax diversity is in preparation.
Start Year 2016
 
Description Cambodia Vivax - Rich Fairhurst 
Organisation National Institutes of Health (NIH)
Country United States 
Sector Public 
PI Contribution Sequencing of malaria samples
Collaborator Contribution Malaria samples to sequence.
Impact Samples are being prepared.
Start Year 2015
 
Description Malaria genotyping - Jonathan Curry 
Organisation LGC Ltd
Country United Kingdom 
Sector Private 
PI Contribution Samples provided for genotyping
Collaborator Contribution Genotyping of 200 samples
Impact Genotyping data, currently being written up for a publication.
Start Year 2016
 
Description Malaysia - Knowlesi 
Organisation Menzies School of Health Research
Country Australia 
Sector Academic/University 
PI Contribution Sequencing of malaria samples.
Collaborator Contribution Malaria samples for sequencing.
Impact Samples have been sent.
Start Year 2015
 
Description Pakistan Vivax - Nazma Habib Khan 
Organisation University of Oklahoma
Department Department of Zoology
Country United States 
Sector Academic/University 
PI Contribution Sequencing of P.vivax samples
Collaborator Contribution Contributed P.vivax samples.
Impact Malaria samples are about to be sequenced.
Start Year 2016
 
Description Sequencing - GIS 
Organisation Agency for Science, Technology and Research (A*STAR)
Department Genome Institute of Singapore
Country Singapore 
Sector Academic/University 
PI Contribution Samples for pacino sequencing
Collaborator Contribution Sequencing data.
Impact Sequence data, and scientific publications.
Start Year 2016
 
Description Capacity building workshop in LSHTM 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Study participants or study members
Results and Impact 35 participants (including collaborators and partners) attended a genomic data workshop where malaria data (including generated by the project) was analysed. Future projects were discussed.
Year(s) Of Engagement Activity 2017,2018