Communicating Medicine (Swemper)
Dnr: NAISS 2025/22-1273
Type: NAISS Small Compute
Principal Investigator: Matts Lindström
Affiliation: Uppsala universitet
Start Date: 2025-09-26
End Date: 2026-10-01
Primary Classification: 60104: History of Science and Ideas

Webpage:

Allocation

Abstract

The Swemper project aims to develop and deploy AI models for efficient, accurate metadata extraction and image description generation. This will improve our ability to process large visual datasets of historical medical print, supporting better search and automated content organization. Our core objectives are:

- Object Detection for Metadata Extraction: To train robust object detection models that identify and categorize key image elements, automating metadata extraction. This involves fine-tuning pre-trained object detection models from frameworks and architectures such as Detectron2 and Co-DETR.
- Image Description Generation using Vision-Language Models (VLMs): To fine-tune VLMs and run inference with them to generate human-like image descriptions. Access to the Alvis cluster will allow us to explore various open-source state-of-the-art (SOTA) VLMs.