NAISS
SUPR
SUPR
LiU Berzelius 2021

Decided

This round has been closed as all proposals have been handled.

Apply to this round for a project on the Berzelius SuperPOD at NSC. Berzelius is a non SNIC system donated to NSC by Knut and Alice Wallenberg foundation (KAW). Monthly evaluation of proposals during the year.

To apply, you must be a scientist in Swedish academia, at the level of PhD student or higher.

Resources

Resource Centre Total
Requested
Upper
Limit
Default
Storage
Available Unit Note
Berzelius Ampere NSC 601 820 14 400 345 600 GPU-h/month Applications are normally evaluated during the last week each month.
Submit your proposal at least one week before the end of a month to be considered for an allocation from the first of the following month. Received proposals will be evaluated against each other and time that become available as project ends at the end of a month will be allocated to the proposed projects accordingly.

Berzelius Ampere is an NVIDIA® SuperPOD consisting of 94 NVIDIA® DGX-A100 compute nodes supplied by Atos/Eviden and 8 CPU nodes also supplied by Eviden. The original 60 "thin" DGX-A100 nodes are each equipped with 8 NVIDIA® A100 Tensor Core GPUs, 2 AMD Epyc™ 7742 CPUs, 1 TB RAM and 15 TB of local NVMe SSD storage. The A100 GPUs have 40 GB on-board HBM2 VRAM. The 34 newer DGX-A100 nodes "fat" are each equipped with 8 NVIDIA® A100 Tensor Core GPUs, 2 AMD Epyc™ 7742 CPUs, 2 TB RAM and 30 TB of local NVMe SSD storage. The A100 GPUs have 80 GB on-board HBM2 VRAM. The CPU nodes are each equipped with 2 AMD Epyc™ 9534 CPUs, 1.1 TB RAM and 6.4 TB of local NVMe SSD storage.

Fast compute interconnect is provided via 8x NVIDIA® Mellanox® HDR per DGX connected in a non-blocking fat-tree topology. In addition, every node is equipped with NVIDIA® Mellanox® HDR dedicated storage interconnect.

All nodes have a local disk where applications can store temporary files. The size of this disk (available to jobs as `/scratch/local`) is 15 TB on "thin" nodes, 30 TB on "fat" nodes, and 6.4 TB on CPU nodes, and is shared between all jobs using the node.

Berzelius Storage NSC 736 358 931 322 GiB Applications are normally evaluated during the last week each month.
Submit your proposal at least one week before the end of a month to be considered for an allocation from the first of the following month. Received proposals will be evaluated against each other and time that become available as project ends at the end of a month will be allocated to the proposed projects accordingly.

Shared, central storage accessible from all Berzelius Ampere and Berzelius Hopper compute and login nodes is provided by a storage cluster from VAST Data consisting of 8 CBoxes and 3 DBoxes using an NVMe-oF architecture. The storage servers are connected end-to-end to the GPUs using a high bandwidth interconnect separate from the East-West compute interconnect. The installed physical storage capacity is 3 PB, but due to compression and deduplication this will be higher in practice.

NSC centre storage (as available on Tetralith) is not accessible on Berzelius.


Click above to show more information about the resource.