GFlowNets for molecule generation with guaranteed synthesis pathways

Lead Research Organisation: UNIVERSITY OF CAMBRIDGE
Department Name: Chemistry

Abstract

Recent breakthroughs in generative modelling have led to a number of works proposing molecular generation models for drug discovery. While these models perform well at capturing drug-like motifs such as rings, they are known to produce synthetically inaccessible molecules. This is because these models are trained to compose atoms or fragments in a way that approximates the training distribution, but these models are not explicitly aware of the synthesis constraints what come with making molecules in the laboratory. To help tackle this, we introduce Synthesis Flow Networks (SynFlowNet), a GFlowNet model whose action space is to build molecules from reactions and reactants. We find that SynFlowNet consistently samples synthetically feasible molecules, while still being able to find diverse and high-utility candidates. Furthermore, we found that molecules designed with SynFlowNet were 85% more synthesisable than those sampled from a GFlowNet which builds molecules using molecular fragments, according to SA score.

Publications

10 25 50

Studentship Projects

Project Reference Relationship Related To Start End Student Name
EP/S024220/1 31/05/2019 30/11/2027
2895016 Studentship EP/S024220/1 30/09/2023 29/09/2027 Miruna Cretu