I'm a third-year PhD student at Chalmers working on fundamental research in computer vision and autonomous systems. My interests fall into three areas: image and video analysis, such as semantic segmentation and neural network architectures dedicated to video processing; learning-based sensor fusion, such as neural network architectures designed to process input from multiple sensors; and strategies for training the neural networks used for these problems, such as unsupervised domain adaptation (UDA) and semi-supervised learning (SSL).
In my first year, I studied UDA for semantic segmentation, in which a neural network predicts a semantic segmentation of images from a single camera. In my second year, I developed training methods for multi-view object detection, using a neural network that takes multiple images as input. In my third year (the current project), I will investigate methods for processing videos from multiple cameras, with a particular focus on making use of computer vision foundation models.