Large-scale high-throughput data is the basis of modern bioinformatics. This data comes in many forms, including, but not limited to: genome sequencing, transcriptomics, proteomics, and metagenomics. The analysis of these types of data requires the use of advanced statistical and computational methods, as well as large-scale data mining, and thus demands a significant amount of computational power.
This storage project is requested in association with the C3SE 2025/1-11 “High-throughput microbiology” compute project. It will be used to temporarily store raw and processed metagenomic data while these analyses are conducted on Vera. Specifically, the metagenomic samples originate from over a hundred sampling locations worldwide, and will be datamined for bacterial genes with functional characterizations related to point-mutation-induced antibiotic resistance.