Active Learning for design of chemical libraries

SUPR uses JavaScript for certain functions. We cannot guarantee that you will be able to use the system with JavaScript disabled.

Dnr:

NAISS 2024/22-841

Type:

NAISS Small Compute

Principal Investigator:

Simon Johansson

Affiliation:

Chalmers tekniska högskola

Start Date:

2024-07-01

End Date:

2025-07-01

Primary Classification:

10499: Other Chemistry Topics

Webpage:

Allocation

Alvis at C3SE: 250 GPU-h/month
Mimer at C3SE: 250 GiB

Abstract

A problem that arises with data-driven generative models is that their capacity to generate molecules in drug discovery settings far exceeds the capacity for experimental validation. Faced with millions of possible molecules but only having a practical capacity of producing experimental results for thousands of molecules leads to the following research question: which selection process should a decision maker use to design a wet lab experiment, when presented with a large set of generated molecules and a limited experimental budget? We have previously constructed a framework for selecting a set of molecules of specified size for experimentation. With this project we aim to simulate a full drug discovery cycle of iterative experimentation. To our knowledge this is the first attempt at a multistep simulation of a Design-Make-Test-Analyze (DMTA) cycle incorporating suggestions made by molecular generation modeling.