The genome sequence of the snout, Hypena proboscidalis (Linnaeus, 1758)

We present a genome assembly from an individual female Hypena proboscidalis (the snout; Arthropoda; Insecta; Lepidoptera; Erebidae). The genome sequence is 637 megabases in span. The majority of the assembly is scaffolded into 31 chromosomal pseudomolecules, with the Z sex chromosome assembled.


Introduction
Caterpillars of Hypena proboscidalis (the snout) are specialised herbivores of nettle plants; the common and binomial names reference the prominent labial palps of the adult. The genome of H. proboscidalis was sequenced as part of the Darwin Tree of Life Project, a collaborative effort to sequence all of the named eukaryotic species in the Atlantic Archipelago of Britain and Ireland. Here we present a chromosomally complete genome sequence for H. proboscidalis, based on one female specimen from Wytham Woods, Oxfordshire, UK.

Genome sequence report
The genome was sequenced from a single female H. proboscidalis collected from Wytham Woods, Oxfordshire, UK (latitude 51.772, longitude -1.338). A total of 23-fold coverage in Pacific Biosciences single-molecule long reads (N50 18 kb) and 55-fold coverage in 10X Genomics read clouds were generated. Primary assembly contigs were scaffolded with chromosome conformation Hi-C data. Manual assembly curation corrected 34 missing/misjoins and removed 14 haplotypic duplications, reducing the assembly length by 2.17% and the scaffold number by 28.57%, and increasing the scaffold N50 by 14.21%. The final assembly has a total length of 637 Mb in 56 sequence scaffolds with a scaffold N50 of 22 Mb (Table 1). The majority, 98.3%, of assembly sequence was assigned to 31 chromosomal-level scaffolds, representing 30 autosomes (numbered by sequence length), and the Z sex chromosome (Figure 1- Figure 4; Table 2). The assumed sex chromosome karyotype is Z0. The assembly has a BUSCO v5. 1.2 (Simão et al., 2015) completeness of 98.7% using the lepidop-tera_odb10 reference set. While not fully phased, the assembly deposited is of one haplotype. Contigs corresponding to the second haplotype have also been deposited.

Methods
A female H. proboscidalis, ilHypProb1, and a second specimen of unknown sex, ilHypProb2, were collected from Wytham Woods, Oxfordshire, UK (latitude 51.772, longitude -1.338) by Douglas Boyes, University of Oxford using a light trap. The specimens were snap-frozen in dry ice using a Cool-Rack before transferring to the Wellcome Sanger Institute (WSI).
DNA was extracted from head and thorax tissue of ilHyp-Prob1 by the Scientific Operations core at the WSI using the Qiagen MagAttract HMW DNA kit, according to the manufacturer's instructions. RNA was extracted from ilHypProb2 in the Tree of Life Laboratory at the WSI using TRIzol (Invitrogen), according to the manufacturer's instructions. RNA was then eluted in 50 μl RNAse-free water and its concentration RNA assessed using a Nanodrop spectrophotometer and Qubit Fluorometer using the Qubit RNA Broad-Range (BR) Assay kit. Analysis of the integrity of the RNA was done using Agilent RNA 6000 Pico Kit and Eukaryotic Total RNA assay.
Pacific Biosciences HiFi circular consensus and 10X Genomics read cloud DNA sequencing libraries, in addition to PolyA RNA-Seq libraries, were constructed according to the manufacturers' instructions. Sequencing was performed by the Scientific Operations core at the WSI on Pacific Biosciences SEQUEL II (HiFi) Illumina HiSeq X (10X) and Illumina HiSeq 4000 (RNA-Seq) instruments. Hi-C data were   By agreeing with and signing up to the Sampling Code of Practice, the Darwin Tree of Life Partner agrees they will meet the legal and ethical requirements and standards set out within this document in respect of all samples acquired for, and    The genome sequence is released openly for reuse. The H. proboscidalis genome sequencing initiative is part of the Darwin Tree of Life (DToL) project. All raw sequence data and the assembly have been deposited in INSDC databases. The genome will be annotated using the RNA-Seq data and presented through the Ensembl pipeline at the European Bioinformatics Institute. Raw data and assembly accession identifiers are reported in Table 1.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.