Developing and evaluating de novo transcriptome assembly in spruce

Dnr:

NAISS 2024/5-15

Type:

NAISS Medium Compute

Principal Investigator:

Olof Emanuelsson

Affiliation:

Kungliga Tekniska högskolan

Start Date:

2024-01-29

End Date:

2025-02-01

Primary Classification:

10203: Bioinformatics (Computational Biology) (applications to be 10610)

Allocation

Dardel at PDC: 80 x 1000 core-h/month

Abstract

Transcriptome assembly from RNA-sequencing data in species without a reliable reference genome has to be performed de novo, but studies have shown that de novo methods often reconstruct transcript isoforms inadequately. This impedes the study of alternative splicing, particularly for lowly expressed isoforms. In this project, we develop and evaluate a de novo transcript isoform assembler that clusters a set of guiding contigs by similarity, aligns short reads to the guiding contigs, and assembles the short reads of each cluster separately. We need to test our method on real datasets and will do so using stranded and non-stranded RNA-seq data from six eukaryotic species.
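
The first two steps described in the abstract (clustering guiding contigs by similarity, then collecting the short reads belonging to each cluster) can be illustrated with a minimal sketch. This is not the project's implementation: the k-mer Jaccard similarity, the single-linkage clustering, the threshold values, and the function names cluster_contigs and bin_reads are all assumptions chosen for illustration, and shared k-mer counting stands in for real read alignment. The final step, assembling each bin of reads separately, is left out; in practice each bin would be handed to an assembler.

```python
from collections import defaultdict


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def cluster_contigs(contigs, k=4, min_jaccard=0.3):
    """Single-linkage clustering of contigs by k-mer Jaccard similarity.

    contigs maps a contig name to its sequence. The result maps each
    contig name to the name of its cluster representative.
    Similarity measure and threshold are illustrative assumptions.
    """
    names = sorted(contigs)
    parent = {n: n for n in names}

    def find(n):
        # Union-find lookup with path halving.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    sets = {n: kmers(contigs[n], k) for n in names}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            union = sets[a] | sets[b]
            if union and len(sets[a] & sets[b]) / len(union) >= min_jaccard:
                parent[find(b)] = find(a)  # merge the two clusters
    return {n: find(n) for n in names}


def bin_reads(reads, contigs, clusters, k=4):
    """Group reads by the cluster of the contig they share most k-mers with.

    Shared k-mer counting is a stand-in for a real aligner (assumption).
    Reads sharing no k-mer with any contig are dropped.
    """
    sets = {n: kmers(s, k) for n, s in contigs.items()}
    bins = defaultdict(list)
    for read in reads:
        rk = kmers(read, k)
        best = max(sorted(sets), key=lambda n: len(rk & sets[n]))
        if rk & sets[best]:
            bins[clusters[best]].append(read)
    return dict(bins)


# Toy data: c1 and c2 resemble two isoforms of one gene, c3 is unrelated.
contigs = {
    "c1": "ACGTACGGTTCA",
    "c2": "ACGTACGGTTGG",
    "c3": "TTTTCCCCAAAA",
}
reads = ["TACGGTT", "CCCCAAA", "GGGGGGG"]

clusters = cluster_contigs(contigs)
bins = bin_reads(reads, contigs, clusters)
```

With this toy input, c1 and c2 fall into one cluster and c3 into another, so the reads end up in two bins that could be assembled independently. Binning reads per cluster is what would let a lowly expressed isoform be assembled without competing against reads from unrelated, highly expressed genes.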