This project applies computational methods to characterise prostate cancers using RNA sequencing (RNA-seq) and targeted DNA sequencing (GMS560 panel) data from approximately 600 patient tissue samples. The overarching aim is to develop molecular classification models that can contribute to treatment stratification and an improved understanding of tumour biology.
The analytical workflow consists of two major phases. First, sequencing data is pre-processed (including alignment and variant calling) via established pipelines at Clinical Genomics Umeå. The resulting data will be imported to Bianca for downstream analysis. Second, these multi-omics feature sets will be integrated to explore and validate different classification frameworks and statistical models for tumour subtyping and biomarker discovery. This includes evaluating the relationship between specific DNA mutations and global gene expression patterns across the 600-patient cohort.
Because the data include sensitive personal information derived from human biobank samples and linked health records, all processing must occur within a secure computing environment that complies with GDPR and the ethical approvals governing the study. The NAISS SENS resource Bianca at UPPMAX provides the required security infrastructure."