Multi-camera video processing

SUPR uses JavaScript for certain functions. We cannot guarantee that you will be able to use the system with JavaScript disabled.

Dnr:

NAISS 2026/4-884

Type:

NAISS Small

Principal Investigator:

Erik Brorsson

Affiliation:

Chalmers tekniska högskola

Start Date:

2026-06-01

End Date:

2027-06-01

Primary Classification:

10207: Computer graphics and computer vision (System engineering aspects at 20208)

Webpage:

Allocation

Arrhenius Disk at NAISS: 3000 GiB
Arrhenius GPU at NAISS: 250 GPU-h/month

Abstract

I'm a 4th year PhD student at Chalmers working on fundamental research in computer vision and autonomous systems. My interests include image and video analysis, e.g., semantic segmentation and neural network architectures dedicated to video processing, learning based sensor fusion strategies, e.g., neural network architectures designed to process the input of multiple sensors, and different strategies to train the neural networks used for these problems, e.g., unsupervised domain adaptation (UDA) and semi-supervised learning (SSL). In my first year, I studied UDA semantic segmentation, wherein a neural network makes semantic segmentation predictions for images provided by a single camera. In my second and third year, I developed training methods for multi-view object detection using a neural network capable of taking multiple images as input. The findings were central to my Licentiate thesis, defended on October 7, 2025. In my fourth year (the current project), I will investigate principled methods for making multi-view neural networks invariant to the camera poses and calibration. Such invariance would constitute an important step to widespread deployment of multi-view neural networks as it allows them to generalize to any camera setup without additional training.