Automating reaction condition prediction for supramolecular organic assemblies

Lead Research Organisation: Imperial College London
Department Name: Chemistry

Abstract

Supramolecular synthesis is a powerful strategy for assembling complex molecules, but doing this by targeted design is challenging. Whilst there is some success in targeting specific molecules using chemical intuition, knowledge of how other synthetic variables such as solvent, temperature, and concentration, control the reaction outcome is limited, and arguably left to chance. We would like to move towards a data-driven approach to predicting the reaction outcomes but are hampered by the lack of high quality data in a sufficiently high quantity. Here, we propose to use high-throughput automation to carry out large numbers of reaction screens using both commercially available and simple synthetically viable precursors. This will fill the gap in data by generating thousands of data points across different precursor combinations for dynamic covalent chemistry, solvent choice, temperature choice, and reaction concentration, all of which will be recorded in an electronic lab notebook. With the available data, we will use supervised machine learning approaches to attempt to first predict the best synthetic conditions for a successful, "clean" reaction, and then to predict the reaction outcome (i.e. molecular mass). We will also apply atomistic modelling where possible using available software within the Jelfs group, to both determine which hits are the most promising to scale up and to investigate the physical reasons behind observed design rules.

Planned Impact

Academic impact:
Recent advances in data science and digital technology have a disruptive effect on the way synthetic chemistry is practiced. Competence in computing and data analysis has become increasingly important in preparing chemistry students for careers in industry and academic research.

The CDT cohort will receive interdisciplinary training in an excellent research environment, supported by state-of-the-art bespoke facilities, in areas that are currently under-represented in UK Chemistry graduate programmes. The CDT assembles a team of 74 Academics across several disciplines (Chemistry, Chemical Engineering, Bioengineering, Maths and Computing, and pharmaceutical manufacturing sciences), further supported by 16 industrial stakeholders, to deliver the interdisciplinary training necessary to transform synthetic chemistry into a data-centric science, including: the latest developments in lab automation, the use of new reaction platforms, greater incorporation of in-situ analytics to build an understanding of the fundamental reaction pathways, as well as scaling-up for manufacturing.

All of the research data generated by the CDT will be captured (by the use of a common Electronic Lab Notebook) and made openly accessible after an embargo period. Over time, this will provide a valuable resource for the future development of synthetic chemistry.

Industrial and Economic Impact:
Synthetic chemistry is a critical scientific discipline that underpins the UK's manufacturing industry. The Chemicals and Pharmaceutical industries are projected to generate a demand for up to 77,000 graduate recruits between 2015-2025. As the manufacturing industry becomes more digitised (Industry 4.0), training needs to evolve to deliver a new generation of highly-skilled workers to protect the manufacturing sector in the UK. By expanding the traditional skill sets of a synthetic chemist, we will produce highly-qualified personnel who are more resilient to future challenges. This CDT will produce synthetic chemists with skills in automation and data-management skills that are highly prized by employers, which will maintain the UK's world-leading expertise and competitiveness and encourage inward investment.

This CDT will improve the job-readiness of our graduate students, by embedding industrial partners in our training programme, including the delivery of training material, lecture courses, case studies, and offers of industrial placements. Students will be able to exercise their broadened fundamental knowledge to a wide range of applied and industrial problems and enhance their job prospects.

Societal:
The World's population was estimated to be 7.4 billion in August 2016; the UN estimated that it will further increase to 11.2 billion in the year 2100. This population growth will inevitably place pressure on the world's finite natural resources. Novel molecules with improved effectiveness and safety will supersede current pharmaceuticals, agrochemicals, and fine chemicals used in the fabrication of new materials.

Recent news highlights the need for certain materials (such as plastics) to be manufactured and recycled in a sustainable manner, and yet their commercial viability of next-generation manufacturing processes will depend on their cost-effectiveness and the speed which they can be developed. The CDT graduates will act as ambassadors of the chemical science, engaging directly with the Learned Societies, local council, general public (including educational activities), as well as politicians and policymakers, to champion the importance of the chemical science in solving global challenges.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/S023232/1 01/04/2019 30/09/2027
2454812 Studentship EP/S023232/1 01/10/2020 30/09/2024 Annabel Basford