Network minimization

SUPR uses JavaScript for certain functions. We cannot guarantee that you will be able to use the system with JavaScript disabled.

Dnr:

NAISS 2026/4-813

Type:

NAISS Small

Principal Investigator:

Ida-Maria Sintorn

Affiliation:

Uppsala universitet

Start Date:

2026-04-29

End Date:

2027-05-01

Primary Classification:

10210: Artificial Intelligence

Webpage:

Allocation

Alvis at C3SE: 1000 GPU-h/month
Mimer at C3SE: 500 GiB
Arrhenius GPU at NAISS: 400 GPU-h/month
Arrhenius Disk at NAISS: 250 GiB

Abstract

The overall purpose of the project is to investigate adapting NNs to different complexities and limited resources. A secondary objective is to develop a framework for pruning networks and evaluate importance of parameters for different pruning strategies and applications. Our project investigates gradual neural network pruning with redeployment into a progressively smaller network between pruning stages, producing a final model that is functionally equivalent to the pruned full-size network but substantially smaller in parameter count, memory footprint, and inference/carbon cost. The core step is minimization: an exact structural rewrite that converts a pruned network with mask-zeros into a smaller dense network with an identical forward function. In our preliminary experiments, iterating a prune–minimize–re-enable–fine-tune cycle until no further units can be removed yields a ~40× parameter reduction on a fully-connected model on MNIST and a ~10× reduction on a single, incomplete ConvNeXt-Tiny run on CIFAR-10, both at matched or slightly higher accuracy than a single prune-and-shrink pass. We expect the ConvNeXt figure to improve with longer runs. The required compute cost is described below.