SUPR
Partitioning and quantization of object detection neural networks for efficient implementation
Dnr:

NAISS 2024/22-549

Type:

NAISS Small Compute

Principal Investigator:

Oscar Berg

Affiliation:

Mittuniversitetet

Start Date:

2024-04-11

End Date:

2025-05-01

Primary Classification:

10206: Computer Engineering

Webpage:

Allocation

Abstract

State-of-the-art neural networks offer powerful capabilities but can be too resource-intensive for deployment on Internet-of-Things (IoT) devices. This project proposes a split YOLO network architecture for distributed processing across resource-constrained IoT devices and servers. This approach reduces communication overhead by processing a portion of the network on the device itself, eliminating the need to send raw data to the server. To further optimize communication efficiency, the project implements weight and intermediate feature map quantization. This reduces the size of data transmitted between the device and server, enabling faster processing on the resource-limited IoT device.