Earth Biogenome Project collaborations

NAISS 2023/6-236


NAISS Medium Storage

Principal Investigator:

Henrik Lantz


Uppsala universitet

Start Date:


End Date:


Primary Classification:

10610: Bioinformatics and Systems Biology (methods development to be 10203)

Secondary Classification:

10203: Bioinformatics (Computational Biology) (applications to be 10610)

Tertiary Classification:

10615: Evolutionary Biology




This storage project will be used for active projects performed by the genome assembly and annotation experts at NBIS (SciLifeLab Bioinformatics platform). All of the projects contribute to the international Earth Biogenome Project which aims to assemble genomes of all eukaryotes on Earth. The projects we are working on are from three different sources, but all with a similar structure and similar type of data. VR-EBP - Received funding in the 2020 VR-call "Increased accessibility to existing infrastructures". The title of the application was "A Swedish Earth Biogenome Project platform: building a pipeline and proof of principle studies. The project is driven by the NBIS and NGI platforms at SciLifeLab together with several researchers in Uppsala and Stockholm. There are also external partners in the project where the efforts of "increased accessibility" are aimed, including SVA and the Swedish Agency for Marine and Water management. The funding is for 4 years (2020-2024). The aim is to develop workflows and pipelines to better allow SciLifeLab to support these types of projects in the future. 8 different eukaryotes (1 fish, 1 mammal, 3 bees, 3 fungi) were selected, and all are close to completion. Apart from genome assembly and annotation there is also a population genomics component to these projects, and this will also require active storage. European Reference Genome Atlas (ERGA) - ERGA is a European branch of the Earth Biogenome Project and is financed through Horizon Europe (EU). The funded part (BGE) started officially Sep 1 2022, but before this there was an also an ERGA pilot project where a limited selection of organisms were analyzed with other funding. The ERGA Pilot includes 5 organisms and we expect 6 new projects this autumn. Swedish projects - We are regularly contacted by Swedish research groups that need help with genome projects, and these projects will also be analyzed and stored in this storage project. The data used will be mostly be long read PacBio Hifi data, and Illumina short reads to be used in Hi-C, annotation, and population genomics.