Phenotypic drug discovery produces large-scale datasets of cellular images under molecular or genetic perturbations. These datasets are typically used to infer relationships between compounds and genes based on observable cellular phenotypes. In this project, we propose to investigate whether phenotypic data can be used for molecular optimization. Specifically, we aim to train models that solve the inverse problem: given an image of a cellular phenotype, predict the molecule that caused it.
To achieve this, we plan to leverage recent advances in Vision-Language Models (VLMs). These models are well suited to our task: cellular phenotypes are naturally represented as images, while perturbations can be encoded as strings (e.g., SMILES for molecular structures, gene identifiers for genetic perturbations).
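To make the string-encoding side concrete, molecules serialized as SMILES can be split into tokens for a VLM's text decoder. The sketch below is a simplified, regex-based tokenizer of the kind commonly used in chemistry language models; the exact pattern is our assumption and is not exhaustive over the full SMILES grammar:

```python
import re

# Simplified SMILES tokenizer pattern (assumption: a reduced variant of
# patterns used in chemistry language models; not a full SMILES grammar).
SMILES_TOKEN_PATTERN = re.compile(
    r"\[[^\]]+\]"              # bracket atoms, e.g. [NH3+], [C@@H]
    r"|Br|Cl"                  # two-letter organic-subset atoms
    r"|@@|@"                   # chirality markers
    r"|%\d{2}"                 # two-digit ring-closure labels, e.g. %12
    r"|[A-Za-z=#()+\-./\\\d]"  # single-character tokens
)

def tokenize_smiles(smiles: str) -> list[str]:
    """Split a SMILES string into tokens for a text decoder."""
    tokens = SMILES_TOKEN_PATTERN.findall(smiles)
    # Sanity check: tokenization must be lossless (cover the full string).
    assert "".join(tokens) == smiles, f"untokenizable SMILES: {smiles!r}"
    return tokens

# Example: aspirin.
print(tokenize_smiles("CC(=O)Oc1ccccc1C(=O)O"))
```

In a VLM setup, these tokens would be mapped to vocabulary ids and predicted autoregressively by the language head, conditioned on the image encoder's output.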
This project opens several research directions:
- How should we evaluate the performance of such models in this domain?
- What strategies are effective for adapting VLMs to biological image data and molecular or genetic representations?
- Can we extend this framework beyond inverse prediction to Visual Question Answering (VQA) over experimental data, enabling researchers to ask free-form questions about cellular responses and molecular effects?
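On the evaluation question above, one practical starting point (our assumption, not a settled protocol for this domain) is to score predicted molecules against the ground truth with fingerprint-based Tanimoto similarity, and to report retrieval-style top-k accuracy when the model ranks candidate molecules. The fingerprints here are assumed to be precomputed sets of substructure hashes (e.g., produced by a cheminformatics toolkit such as RDKit):

```python
def tanimoto(fp_a: set[int], fp_b: set[int]) -> float:
    """Tanimoto (Jaccard) similarity between two fingerprint bit sets."""
    if not fp_a and not fp_b:
        return 1.0  # convention: two empty fingerprints count as identical
    return len(fp_a & fp_b) / len(fp_a | fp_b)

def top_k_accuracy(ranked_predictions: list[list[str]],
                   targets: list[str],
                   k: int) -> float:
    """Fraction of cases where the true molecule appears in the top-k list."""
    hits = sum(target in preds[:k]
               for preds, target in zip(ranked_predictions, targets))
    return hits / len(targets)

# Toy example with hypothetical fingerprints and predictions.
print(tanimoto({1, 2, 3}, {2, 3, 4}))                       # overlap 2 of 4
print(top_k_accuracy([["a", "b"], ["c", "d"]], ["b", "x"], k=2))
```

Exact string match on SMILES is too brittle on its own (the same molecule has many valid SMILES), which is why similarity- and retrieval-based metrics are worth considering alongside it.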