SUPR
Advance Data Products Alma Pipeline (ADPAlmaP)
Dnr: NAISS 2025/23-347
Type: NAISS Small Storage
Principal Investigator: Borja Montoro Molina
Affiliation: Chalmers tekniska högskola
Start Date: 2025-06-09
End Date: 2026-07-01
Primary Classification: 10305: Astronomy, Astrophysics, and Cosmology

Webpage:

Allocation

Abstract

ADPAlmaP is a project proposed and approved under ESO. It aims to develop a pipeline that generates ALMA Advanced Data Products (ADPs), based on tools originally designed for large-scale studies of high-intensity spectral lines (HI) and adapted to ALMA spectral line data. The pipeline is at an advanced stage of development, and project members are now running a testing campaign to evaluate the implemented features. The campaign uses a wide and diverse range of datasets, both public datasets from the ALMA archive and private ones. So far, we have successfully run the pipeline on our personal machines using relatively lightweight datasets (<2 GB). However, because the ultimate goal is for the pipeline to process any dataset that has passed QA2, it is essential to test it on much larger datasets (up to and beyond 100 GB), which pose challenges to certain parts of the pipeline. This testing requires high-performance computing resources and enough storage capacity to hold all the resulting outputs. For this initial round of testing with large datasets, we request the maximum storage allowed under this proposal, 5000 GiB, to continue the pipeline validation campaign. We would need the storage to remain available for the next year and a half.