Understanding the role of the human microbiota in health and disease requires insight into its taxonomic composition and functional capabilities. Sequencing of the 16S ribosomal RNA (rRNA) gene is widely used for bacterial taxonomic classification and thus for characterizing microbiota composition. This project develops a 16S rDNA sequencing data lake from 20,000 historical clinical samples from sterile body sites, such as cerebrospinal fluid (CSF), bronchoalveolar lavage (BAL), and abscesses, that were previously sequenced at the Department of Clinical Microbiology, Karolinska University Hospital. The purpose of the project is to improve microbial identification and real-time diagnostics for infections in sterile body sites. Since sequencing is already used clinically for this purpose, the project aims to optimize and standardize its application and to gain more knowledge about the relationships between microbial profiles and clinical metadata in Sweden.