State-of-the-art neural networks offer powerful capabilities but are often too resource-intensive to deploy on Internet-of-Things (IoT) devices. This project proposes a split YOLO network architecture for distributed processing across resource-constrained IoT devices and servers. By running the first portion of the network on the device itself, the approach avoids transmitting raw input data to the server and thereby reduces communication overhead. To further improve communication efficiency, the project quantizes both the network weights and the intermediate feature maps: weight quantization lightens computation on the resource-limited IoT device, while feature-map quantization shrinks the data transmitted between the device and the server.
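
The sketch below illustrates the general split-inference idea in PyTorch: a small head network runs on the device, its output feature map is quantized to 8 bits before transmission, and a tail network on the server dequantizes and finishes the inference. The tiny CNN, the split point, and the affine uint8 scheme are illustrative assumptions, not the project's exact YOLO partition or quantizer.

```python
import torch
import torch.nn as nn

# Minimal sketch of split inference with feature-map quantization.
# The layers below stand in for a YOLO-style backbone; the split point
# and the 8-bit scheme are assumptions for illustration only.

class DeviceHead(nn.Module):
    """First few layers, run on the IoT device."""
    def __init__(self):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        )

    def forward(self, x):
        return self.layers(x)

class ServerTail(nn.Module):
    """Remaining layers, run on the server."""
    def __init__(self):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 80),
        )

    def forward(self, x):
        return self.layers(x)

def quantize_uint8(t: torch.Tensor):
    """Affine-quantize a float feature map to uint8 for transmission."""
    t_min, t_max = t.min(), t.max()
    scale = (t_max - t_min).clamp(min=1e-8) / 255.0
    q = ((t - t_min) / scale).round().clamp(0, 255).to(torch.uint8)
    return q, scale.item(), t_min.item()

def dequantize_uint8(q: torch.Tensor, scale: float, zero: float):
    """Recover an approximate float feature map on the server side."""
    return q.to(torch.float32) * scale + zero

if __name__ == "__main__":
    device_net, server_net = DeviceHead().eval(), ServerTail().eval()
    image = torch.rand(1, 3, 320, 320)           # raw input stays on the device

    with torch.no_grad():
        feat = device_net(image)                 # on-device partial inference
        q, scale, zero = quantize_uint8(feat)    # 4x smaller payload than float32
        # --- q, scale, zero are what would be sent over the network ---
        feat_hat = dequantize_uint8(q, scale, zero)
        out = server_net(feat_hat)               # server finishes the inference

    print(f"float32 payload: {feat.numel() * 4} bytes, "
          f"uint8 payload: {q.numel()} bytes")
```

In this setup only the quantized feature map plus two scalars cross the network, which is the communication saving the abstract describes; weight quantization of the on-device head would be applied separately to reduce its compute and memory footprint.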