The intent of the project is to explore discrete diffusion and flow-matching paradigms for LLMs in order to improve text generation speed at inference time.
The request is made in order to scale existing working prototypes of these models across multiple GPUs, first with torch.distributed and later with NVSHMEM.
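
As a rough illustration of the first scaling step, below is a minimal torch.distributed sketch using DistributedDataParallel. The model, dimensions, and hyperparameters are placeholders standing in for the actual prototypes, not the project's real training code; it assumes a launch via torchrun (e.g. torchrun --nproc_per_node=4 train.py), which sets the rank environment variables.

import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun provides RANK, LOCAL_RANK, and WORLD_SIZE in the environment
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # Placeholder model standing in for the diffusion-LLM prototype
    model = torch.nn.Linear(1024, 1024).to(f"cuda:{local_rank}")
    model = DDP(model, device_ids=[local_rank])

    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
    x = torch.randn(8, 1024, device=f"cuda:{local_rank}")

    optimizer.zero_grad()
    loss = model(x).pow(2).mean()  # dummy loss; DDP all-reduces gradients in backward
    loss.backward()
    optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()

The later NVSHMEM step would move beyond these host-initiated NCCL collectives toward device-initiated communication, which torch.distributed does not expose directly.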
The longer-term goal is to experiment with various losses and masking/noise schemes in order to determine what works well for dLLMs at scale, and why.
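
As one concrete point in that design space, here is a minimal sketch of an absorbing-state (masking) noise scheme with a weighted cross-entropy loss, in the spirit of recent masked-diffusion LLM objectives. The model interface, mask token id, and the 1/t loss weighting are illustrative assumptions, not fixed design choices of the project.

import torch
import torch.nn.functional as F

def mask_tokens(x0, mask_id, t):
    # Absorbing-state forward process: each token is independently
    # replaced by mask_id with probability t (the noise level).
    noise = torch.rand_like(x0, dtype=torch.float)
    mask = noise < t  # t has shape (batch, 1), broadcast over the sequence
    xt = torch.where(mask, torch.full_like(x0, mask_id), x0)
    return xt, mask

def masked_diffusion_loss(model, x0, mask_id):
    # Cross-entropy on masked positions only, weighted by 1/t --
    # one common choice of weighting in masked-diffusion objectives.
    t = torch.rand(x0.size(0), 1, device=x0.device).clamp(min=1e-3)
    xt, mask = mask_tokens(x0, mask_id, t)
    logits = model(xt)  # expected shape: (batch, seq, vocab)
    ce = F.cross_entropy(logits.transpose(1, 2), x0, reduction="none")  # (batch, seq)
    per_seq = (ce * mask).sum(dim=1) / t.squeeze(1)
    return per_seq.mean()

Variants of the noise schedule (e.g. how t is sampled) and of the loss weighting are exactly the axes the project intends to compare at scale.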
Main Supervisor:
Michael Doggett
Associate Professor
Lund University