AI Agents for clinical healthcare dataset creation

SUPR uses JavaScript for certain functions. We cannot guarantee that you will be able to use the system with JavaScript disabled.

Dnr:

NAISS 2025/22-715

Type:

NAISS Small Compute

Principal Investigator:

Fredrik Carlsson

Affiliation:

Karolinska Institutet

Start Date:

2025-05-15

End Date:

2026-06-01

Primary Classification:

30299: Other Clinical Medicine

Webpage:

Allocation

Cloud at SSC: 30000 Coins
Mimer at C3SE: 500 GiB
Alvis at C3SE: 250 GPU-h/month

Abstract

The development and validation of AI models in healthcare are significantly constrained by the scarcity of high-quality, diverse, and standardized datasets. Manual data curation by healthcare professionals is time-consuming, costly, and difficult to scale, limiting the broader adoption and robust evaluation of AI solutions in clinical settings. This study will investigate the use of AI agents as an alternative mechanism for generating benchmarking datasets, with a focus on medical imaging and clinical documentation. We hypothesize that AI agents can produce datasets of comparable quality to those curated by trained clinicians, enabling scalable, efficient, and privacy-conscious data collection. The datasets generated by AI agents will be rigorously evaluated using two complementary methods: - Prompt compliance - Quality-based assessments