Introduction to long-read genome assembly
Newer genome sequencing technologies produce much longer reads than earlier short-read platforms. This workshop combines an introductory presentation on the theory of de novo genome assembly with hands-on practice. We will use a cut-down data set of bacterial FASTQ reads from PacBio (long-read) sequencing. Using command-line tools, we will assemble the reads with Canu and then correct (polish) the assembly with short-read Illumina data.
Please note: this is a short workshop, so we won’t cover eukaryote data, nanopore reads, or a comparison of different assembly tools.
By the end of this training, participants will be able to use command-line tools for long-read genome assembly and polishing on simple bacterial data sets.
This workshop requires a basic familiarity with Unix and genomics concepts.
Workshop attendees must bring a laptop that can connect to WiFi.