Structural Simplicity in Neural Networks

SUPR uses JavaScript for certain functions. We cannot guarantee that you will be able to use the system with JavaScript disabled.

Dnr:

NAISS 2025/5-212

Type:

NAISS Medium Compute

Principal Investigator:

Claudio Altafini

Affiliation:

Linköpings universitet

Start Date:

2025-10-01

End Date:

2026-10-01

Primary Classification:

10202: Information Systems (Social aspects at 50804)

Webpage:

Allocation

Alvis at C3SE: 2000 GPU-h/month
Centre Storage at NSC: 500 GiB
Mimer at C3SE: 500 GiB
Tetralith at NSC: 15 x 1000 core-h/month

Abstract

The aim of this project is to understand from a new point of view the ability of deep neural networks to successfully learn from data and generalize. There is a tendency of neural networks (trained in presence or absence of explicit regularization) to represent a given input-output relation in a structurally “simple” manner, in the sense that many parts of the network encode for the same, most informative feature(s) of the input. We speculate that the degree of simplicity, which is related to the ability of the models to learn the high-information modes of data and disregard the noise, can be quantified using graph theoretic measures on trained neural networks. So far we have observed the tendency to simplicity in the structure of small-scale networks of different architectures, e.g., ANNs and CNNs, trained on real data, and medium-scale pretrained networks. To test the hypothesis further, we need to train and run experiments on medium to large-scale networks, which require significant computational resources.