SA-DISCNet: A collaborative data science training network across southern Africa and southern UK

Lead Research Organisation: University of Portsmouth
Department Name: Institute of Cosmology and Gravitation

Abstract

Data intensive science is a major global growth area, as the volume, complexity and rate of digital data within governments and companies continues to rapidly increase. At the same time, powerful analysis techniques continue to evolve for obtaining radical insights into large datasets, including finding clusters and anomalies, as well as detecting and predicting dominant trends and correlations in such data. This data intensive science comes at a crucial time for global development. Major worldwide challenges, as encapsulated in the United Nations' Sustainable Development Goals (SDGs), require multidisciplinary solutions, many of which include data science. Moreover, the South African National Development Plan (NDP) for 2030 recognises the need to "sharpen its innovative edge and continue contributing to global scientific and technological advancement" and "shift to a more knowledge-intensive economy".

We therefore propose to build a training network in data intensive science between universities in southern UK and partners in southern Africa to help address these SDGs and NDP priorities. The cornerstones of this network will be the `Data Intensive Science Centre in SEPnet' (DISCnet) and the African Institute of Mathematical Science (AIMS) South Africa. Together, we will pilot an innovative course of training and internships for the next generation of data analysts, focusing on solving SDG-related questions in South Africa and acting as a driver of the country's economy in the 21st century.

Our aim with this pilot training programme is to equip and send students to solve data science problems associated with sustainable development goals (SDGs) in SA and beyond. The specific goals of the pilot programme are to: (i) Deliver an initial cohort of at least 10 highly trained African data scientists; (ii) Provide a world-class data science school to African students, leveraging existing DISCnet training material; (iii) Prime-pump a new 8-week hand-on data science training course at AIMS with contributions from DISCnet; (iv) Contribute to the sustainable development goals via 3 month strategic student internships with South African organisations and companies, focusing on economic development and welfare; (v) Understand the details of managing an extended, sustainable training network across southern Africa.

This pilot leverages considerable investment from STFC, our university partners, and the Royal Society (RS). Our long-term ambition is to create a sustainable network of comparable scale to DISCnet, e.g. approximately 25 African STEM students per year receiving our specialist training. These students will become the future data science leaders in Africa.

Planned Impact

There are several groups who will benefit from the impact of this project:

a) Students from South Africa and other African nations will benefit by being trained in data intensive science through our school and workshop, and will have their training put into practice by participating in our internship programme, where they will solve real-world problems associated with sustainable development goals. Such trained students will be highly sought-after for data science positions in Southern Africa, e.g. we would hope that some of our students will return to their host companies to continue their careers. Overall, we expect them to become leaders in data science in the future.

b) Companies and organisations in South Africa who participate in our internment programme will gain new insights into their problems and data, with the aim of improving infrastructure, equal opportunities and innovation in South Africa in line with the SDGs.

c) We will strengthen ties between AIMS and DISCnet for data science-related training. Our SA proposers will gain from the large range of transferable training materials from the DISCnet courses.

d) DISCnet will learn from the good practice of AIMS in training students from diverse backgrounds in data science. DISCnet will also benefit from the experience of the existing AIMS internship programme.

e) Everyone involved will benefit from increased collaborations in data intensive science, both for academic studies (e.g. SKA and LSST) and applications to real-world problems (innovation and SDGs).

f) We will extend our programme to include larger student cohorts, more internships, and in nodes across Africa in future years (Cameroon, Ghana, Rwanda, Senegal and Tanzania). Through demonstrating the value and excitement of our programme via this pilot, we will seek further funding from GCRF and other UK and SA opportunites.

Publications

10 25 50
 
Description Data Intensive Science training
Geographic Reach Africa 
Policy Influence Type Influenced training of practitioners or researchers
Impact We have provided training and experience in Data Intensive Science for graduate students from African countries. We have held a data science school, an 8-week immersive workshop, and placements with local organisations and companies working on SDG-related projects. Our initial school has trained 40 graduate students from African DAC countries in data science. Students are now applying their training in areas including the financial market, water treatment, and pattern recognition. Feedback is very positive, with 100% keen to recommend the school to colleagues, and 100% wanting a follow-up school. Our 8-week intensive workshop has invested in 16 students from African DAC countries to prepare them to be data science leaders. The students have benefited from coaching by experts from AIMS (Cape Town), SEPnet (UK), the ESRC Business and Data Research Centre (U. Essex) and Netflix. Internships have included / are including work with the University of Pretoria, Cape Town Transport department, Zoona, Siyavula, Naspers, Conversion Science, Grailabs and RLabs; see 'Collaborations and Partnerships' and 'Other Outputs' for details. DSI participants are now being offered opportunities in data science careers, including: two Square Kilometre Array research fellowships; a permanent position at RLabs; a part-time position at Conversion Science; an internship with Data Prophet. Full career outcomes will be assessed later this year.
 
Description STFC ODA institutional Award
Amount £15,000 (GBP)
Organisation Science and Technologies Facilities Council (STFC) 
Sector Academic/University
Country United Kingdom
Start 01/2018 
End 12/2018
 
Description Collaboration between DISCnet and AIMS 
Organisation African Institute for Mathematical Sciences
PI Contribution I have acted as Principal Investigator for the SA-DISCnet collaboration. This collaboration was included as the focus of our grant application, linking the UK Universities participating in the STFC-funded DISCnet doctoral training scheme with the African Institute for Mathematical Sciences (AIMS) in Cape Town, South Africa. My contribution has included managing the whole programme of training students from across Africa in Data Science, including the initial training school, an 8 week intensive data science training programme, and internships with companies in Cape Town. I also acted as a lecturer for the initial school, teaching advanced statistics for data science applications.
Collaborator Contribution The African Institute of Mathematical Studies has acted as local organiser for the programme. They have hosted the initial training school, communicating with participants beforehand, providing the lecture theatre, organising meals, and providing a computer lab for daily practicals. They also provided one of the lecturers (Michelle Lochner) teaching Machine Learning, together with tutors for the practicals. AIMS were local organisers of the 8 week Data Science Intensive programme, organising venue and communicating with participants. Professor Bruce Bassett from AIMS contributed to the training throughout the DSI programme. AIMS have also acted as local organisers of the internship programme, communicating with participants and internship organisations. The University of Essex provided two lecturers for the DSI programme (Haider Raza and Ana Matran-Fernandez), and QMUL provided one lecturer (Alkistis Pourtsidou). University of Southampton, University of Sussex and Open University have contributed to proposal writing and management of the project.
Impact The key outcomes of this partnership are as follows (details provided under the relevant sections of this form): - A 3 day training school for students from across Africa, providing training in Data Science; - An 8 week intensive training programme for students across Africa, enabling practical training and experience in solving data science problems, preparing these participants for internships; - Ongoing internships with organisations and businesses in South Africa, applying data science to problems associated with UN sustainable development goals.
Start Year 2018
 
Description Collaboration between DISCnet and AIMS 
Organisation Open University
Country United Kingdom 
Sector Academic/University 
PI Contribution I have acted as Principal Investigator for the SA-DISCnet collaboration. This collaboration was included as the focus of our grant application, linking the UK Universities participating in the STFC-funded DISCnet doctoral training scheme with the African Institute for Mathematical Sciences (AIMS) in Cape Town, South Africa. My contribution has included managing the whole programme of training students from across Africa in Data Science, including the initial training school, an 8 week intensive data science training programme, and internships with companies in Cape Town. I also acted as a lecturer for the initial school, teaching advanced statistics for data science applications.
Collaborator Contribution The African Institute of Mathematical Studies has acted as local organiser for the programme. They have hosted the initial training school, communicating with participants beforehand, providing the lecture theatre, organising meals, and providing a computer lab for daily practicals. They also provided one of the lecturers (Michelle Lochner) teaching Machine Learning, together with tutors for the practicals. AIMS were local organisers of the 8 week Data Science Intensive programme, organising venue and communicating with participants. Professor Bruce Bassett from AIMS contributed to the training throughout the DSI programme. AIMS have also acted as local organisers of the internship programme, communicating with participants and internship organisations. The University of Essex provided two lecturers for the DSI programme (Haider Raza and Ana Matran-Fernandez), and QMUL provided one lecturer (Alkistis Pourtsidou). University of Southampton, University of Sussex and Open University have contributed to proposal writing and management of the project.
Impact The key outcomes of this partnership are as follows (details provided under the relevant sections of this form): - A 3 day training school for students from across Africa, providing training in Data Science; - An 8 week intensive training programme for students across Africa, enabling practical training and experience in solving data science problems, preparing these participants for internships; - Ongoing internships with organisations and businesses in South Africa, applying data science to problems associated with UN sustainable development goals.
Start Year 2018
 
Description Collaboration between DISCnet and AIMS 
Organisation Queen Mary University of London
Country United Kingdom 
Sector Academic/University 
PI Contribution I have acted as Principal Investigator for the SA-DISCnet collaboration. This collaboration was included as the focus of our grant application, linking the UK Universities participating in the STFC-funded DISCnet doctoral training scheme with the African Institute for Mathematical Sciences (AIMS) in Cape Town, South Africa. My contribution has included managing the whole programme of training students from across Africa in Data Science, including the initial training school, an 8 week intensive data science training programme, and internships with companies in Cape Town. I also acted as a lecturer for the initial school, teaching advanced statistics for data science applications.
Collaborator Contribution The African Institute of Mathematical Studies has acted as local organiser for the programme. They have hosted the initial training school, communicating with participants beforehand, providing the lecture theatre, organising meals, and providing a computer lab for daily practicals. They also provided one of the lecturers (Michelle Lochner) teaching Machine Learning, together with tutors for the practicals. AIMS were local organisers of the 8 week Data Science Intensive programme, organising venue and communicating with participants. Professor Bruce Bassett from AIMS contributed to the training throughout the DSI programme. AIMS have also acted as local organisers of the internship programme, communicating with participants and internship organisations. The University of Essex provided two lecturers for the DSI programme (Haider Raza and Ana Matran-Fernandez), and QMUL provided one lecturer (Alkistis Pourtsidou). University of Southampton, University of Sussex and Open University have contributed to proposal writing and management of the project.
Impact The key outcomes of this partnership are as follows (details provided under the relevant sections of this form): - A 3 day training school for students from across Africa, providing training in Data Science; - An 8 week intensive training programme for students across Africa, enabling practical training and experience in solving data science problems, preparing these participants for internships; - Ongoing internships with organisations and businesses in South Africa, applying data science to problems associated with UN sustainable development goals.
Start Year 2018
 
Description Collaboration between DISCnet and AIMS 
Organisation University of Essex
Country United Kingdom 
Sector Academic/University 
PI Contribution I have acted as Principal Investigator for the SA-DISCnet collaboration. This collaboration was included as the focus of our grant application, linking the UK Universities participating in the STFC-funded DISCnet doctoral training scheme with the African Institute for Mathematical Sciences (AIMS) in Cape Town, South Africa. My contribution has included managing the whole programme of training students from across Africa in Data Science, including the initial training school, an 8 week intensive data science training programme, and internships with companies in Cape Town. I also acted as a lecturer for the initial school, teaching advanced statistics for data science applications.
Collaborator Contribution The African Institute of Mathematical Studies has acted as local organiser for the programme. They have hosted the initial training school, communicating with participants beforehand, providing the lecture theatre, organising meals, and providing a computer lab for daily practicals. They also provided one of the lecturers (Michelle Lochner) teaching Machine Learning, together with tutors for the practicals. AIMS were local organisers of the 8 week Data Science Intensive programme, organising venue and communicating with participants. Professor Bruce Bassett from AIMS contributed to the training throughout the DSI programme. AIMS have also acted as local organisers of the internship programme, communicating with participants and internship organisations. The University of Essex provided two lecturers for the DSI programme (Haider Raza and Ana Matran-Fernandez), and QMUL provided one lecturer (Alkistis Pourtsidou). University of Southampton, University of Sussex and Open University have contributed to proposal writing and management of the project.
Impact The key outcomes of this partnership are as follows (details provided under the relevant sections of this form): - A 3 day training school for students from across Africa, providing training in Data Science; - An 8 week intensive training programme for students across Africa, enabling practical training and experience in solving data science problems, preparing these participants for internships; - Ongoing internships with organisations and businesses in South Africa, applying data science to problems associated with UN sustainable development goals.
Start Year 2018
 
Description Collaboration between DISCnet and AIMS 
Organisation University of Southampton
Country United Kingdom 
Sector Academic/University 
PI Contribution I have acted as Principal Investigator for the SA-DISCnet collaboration. This collaboration was included as the focus of our grant application, linking the UK Universities participating in the STFC-funded DISCnet doctoral training scheme with the African Institute for Mathematical Sciences (AIMS) in Cape Town, South Africa. My contribution has included managing the whole programme of training students from across Africa in Data Science, including the initial training school, an 8 week intensive data science training programme, and internships with companies in Cape Town. I also acted as a lecturer for the initial school, teaching advanced statistics for data science applications.
Collaborator Contribution The African Institute of Mathematical Studies has acted as local organiser for the programme. They have hosted the initial training school, communicating with participants beforehand, providing the lecture theatre, organising meals, and providing a computer lab for daily practicals. They also provided one of the lecturers (Michelle Lochner) teaching Machine Learning, together with tutors for the practicals. AIMS were local organisers of the 8 week Data Science Intensive programme, organising venue and communicating with participants. Professor Bruce Bassett from AIMS contributed to the training throughout the DSI programme. AIMS have also acted as local organisers of the internship programme, communicating with participants and internship organisations. The University of Essex provided two lecturers for the DSI programme (Haider Raza and Ana Matran-Fernandez), and QMUL provided one lecturer (Alkistis Pourtsidou). University of Southampton, University of Sussex and Open University have contributed to proposal writing and management of the project.
Impact The key outcomes of this partnership are as follows (details provided under the relevant sections of this form): - A 3 day training school for students from across Africa, providing training in Data Science; - An 8 week intensive training programme for students across Africa, enabling practical training and experience in solving data science problems, preparing these participants for internships; - Ongoing internships with organisations and businesses in South Africa, applying data science to problems associated with UN sustainable development goals.
Start Year 2018
 
Description Collaboration between DISCnet and AIMS 
Organisation University of Sussex
Country United Kingdom 
Sector Academic/University 
PI Contribution I have acted as Principal Investigator for the SA-DISCnet collaboration. This collaboration was included as the focus of our grant application, linking the UK Universities participating in the STFC-funded DISCnet doctoral training scheme with the African Institute for Mathematical Sciences (AIMS) in Cape Town, South Africa. My contribution has included managing the whole programme of training students from across Africa in Data Science, including the initial training school, an 8 week intensive data science training programme, and internships with companies in Cape Town. I also acted as a lecturer for the initial school, teaching advanced statistics for data science applications.
Collaborator Contribution The African Institute of Mathematical Studies has acted as local organiser for the programme. They have hosted the initial training school, communicating with participants beforehand, providing the lecture theatre, organising meals, and providing a computer lab for daily practicals. They also provided one of the lecturers (Michelle Lochner) teaching Machine Learning, together with tutors for the practicals. AIMS were local organisers of the 8 week Data Science Intensive programme, organising venue and communicating with participants. Professor Bruce Bassett from AIMS contributed to the training throughout the DSI programme. AIMS have also acted as local organisers of the internship programme, communicating with participants and internship organisations. The University of Essex provided two lecturers for the DSI programme (Haider Raza and Ana Matran-Fernandez), and QMUL provided one lecturer (Alkistis Pourtsidou). University of Southampton, University of Sussex and Open University have contributed to proposal writing and management of the project.
Impact The key outcomes of this partnership are as follows (details provided under the relevant sections of this form): - A 3 day training school for students from across Africa, providing training in Data Science; - An 8 week intensive training programme for students across Africa, enabling practical training and experience in solving data science problems, preparing these participants for internships; - Ongoing internships with organisations and businesses in South Africa, applying data science to problems associated with UN sustainable development goals.
Start Year 2018
 
Description Internships with businesses and organisations in South Africa 
Organisation African Institute for Mathematical Sciences
PI Contribution We have provided 11 internships for 9 participants in our DSI programme, all from African DAC countries, together with 2 students from the UK; the interns have worked with businesses and organisations in Cape Town to carry out work in data science focussed on problems associated with sustainable development goals. SA-DISCnet has selected participants and found internship organisations, have organised travel and subsistence for the participants, and continued to communicate with the participants to check progress.
Collaborator Contribution The businesses and organisations have engaged with the interns to explain their current data science challenges, and have interacted with the interns regularly to discuss progress. Four representatives from these organisations attended our internship report day; two provided talks during the Data Intensive Science training programme.
Impact Internships have included / are including work with: the University of Pretoria to deliver a machine learning algorithm to detect malaria with good accuracy from a hand scanner; Cape Town Transport department analysing city population and travel behaviour for low income commuters; Zoona, a payments company partnering with small emerging entrepreneurs and small businesses in Zambia, Malawi and Mozambique; Siyavula, an innovative edutech startup that offers SA high school students personalised automated training; Naspers, an internet and media group; Conversion Science, a search engine marketing and analytics agency; Grailabs, providing machine learning and artificial intelligence services to industry in SA; and RLabs, a Cape Town non-profit aiming to bring social change through information and communication technologies. Note that internships are continuing through to March 2019, and we will be able to assess outcomes fully at that point. AIMS will be continuing its association with Cape Town transport, with plans for further internships and joint research projects. AIMS have also created a new industry contact with Conversion Science, who has offered a paid internship to an AIMS Masters student.
Start Year 2018
 
Description Internships with businesses and organisations in South Africa 
Organisation Zoona
PI Contribution We have provided 11 internships for 9 participants in our DSI programme, all from African DAC countries, together with 2 students from the UK; the interns have worked with businesses and organisations in Cape Town to carry out work in data science focussed on problems associated with sustainable development goals. SA-DISCnet has selected participants and found internship organisations, have organised travel and subsistence for the participants, and continued to communicate with the participants to check progress.
Collaborator Contribution The businesses and organisations have engaged with the interns to explain their current data science challenges, and have interacted with the interns regularly to discuss progress. Four representatives from these organisations attended our internship report day; two provided talks during the Data Intensive Science training programme.
Impact Internships have included / are including work with: the University of Pretoria to deliver a machine learning algorithm to detect malaria with good accuracy from a hand scanner; Cape Town Transport department analysing city population and travel behaviour for low income commuters; Zoona, a payments company partnering with small emerging entrepreneurs and small businesses in Zambia, Malawi and Mozambique; Siyavula, an innovative edutech startup that offers SA high school students personalised automated training; Naspers, an internet and media group; Conversion Science, a search engine marketing and analytics agency; Grailabs, providing machine learning and artificial intelligence services to industry in SA; and RLabs, a Cape Town non-profit aiming to bring social change through information and communication technologies. Note that internships are continuing through to March 2019, and we will be able to assess outcomes fully at that point. AIMS will be continuing its association with Cape Town transport, with plans for further internships and joint research projects. AIMS have also created a new industry contact with Conversion Science, who has offered a paid internship to an AIMS Masters student.
Start Year 2018
 
Description Businesses visit 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact On February 16th 2019, we held a workshop where all 9 African internship participants presented the results of their placements so far. Four industry representatives were invited to hear about the results, and then all attendees engaged in discussion about the future landscape of Artificial Intelligence and its impact on Africa. All students and business representatives were from African countries on the DAC list.
Year(s) Of Engagement Activity 2019