In this project, we will extend our recent work on transformer models for abstract symbolic mathematics in particle physics (https://arxiv.org/pdf/2501.09729). Our previous NAISS allocation enabled us to produce a paper demonstrating, for the first time, that transformers can accurately predict Lagrangians from lists of particles. This work was presented at several conferences and has led to multiple new collaborations.
Building on those results, we aim to refine our transformer models to handle realistic scenarios relevant to state-of-the-art phenomenology studies. We will also broaden our approach beyond particle-physics Lagrangians to a wider class of symbolic mathematics tasks, leveraging recent advances in structured mathematical reasoning and equation manipulation.
During the previous NAISS project, our models performed better than expected, allowing us to shift to larger-scale training ahead of schedule; additional Google Cloud research credits further reduced the resources we required from NAISS. As a result, our NAISS usage was lower than initially projected. Having now confirmed that NAISS is the best platform for our needs, and with multiple new collaborations and more ambitious projects underway, we anticipate significantly higher usage in the coming allocation cycle.