This project aims to generate a high-quality draft genome assembly and molecular marker dataset for Leucocoryne spp., a geophyte genus endemic to the “Desierto Florido” in northern Chile. Using a hybrid sequencing strategy, PacBio HiFi long reads (~23 GB BAM file) will be combined with Illumina NovaSeq PE150 paired-end data from 17 individuals (~2 GB each).
Genome assembly, polishing, and quality control will be performed using the nf-core/genomeassembler pipeline implemented with Nextflow, ensuring reproducibility and scalability on NAISS HPC resources. The workflow will include de novo assembly (Hifiasm), hybrid polishing (Pilon), and genome completeness assessment (BUSCO, QUAST, Merqury).
The final outcome will be a high-quality contig library suitable for molecular marker discovery (SNPs, SSRs, and ORFs) to support taxonomic identification and conservation genomics of native xerophytic species. The assembled genome will represent the first genomic reference for Leucocoryne, contributing to comparative studies in Amaryllidaceae.