Gap-filling, assembly correction and quantitative assessment of genome scaffold assembly

Darlon V. Lantican; Susan R. Strickler; Alma O. Canama; Roanne R. Gardoce; Lukas A. Mueller; Hayde F. Galvez


Concise Method


This method is extracted from research article: G3 (Bethesda), Jun 2019

De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species

DOI: 10.1534/g3.119.400215


The resulting scaffolds were post-processed with the gap-filling function of the PBJelly software package (English et al. 2012). The PacBio SMRT sequence data were used to anchor the scaffolds, further improve their contiguity, and reduce the number of ambiguous bases (‘N’). Further correction was achieved with the Pilon automated genome assembly improvement tool (Walker et al. 2014). The pre-processed Illumina MiSeq paired-end (PE) reads were mapped to the scaffold assembly with Bowtie2 (Langmead and Salzberg 2012), and the resulting binary alignment map (BAM) file served as input for Pilon assembly correction. The quality of the resulting assembly was assessed with a local Perl script, as previously described, and by TopHat2 (Kim et al. 2013) alignment of quality-trimmed RNA-seq reads (SRR1173229); trimming used Trimmomatic v0.36 (SLIDINGWINDOW:5:30; LEADING:5; TRAILING:5; MINLEN:100; Bolger et al. 2014). Quality was further evaluated with the Benchmarking Universal Single-Copy Orthologs (BUSCO) program using the plant-specific OrthoDB dataset of 1440 genes (Simão et al. 2015).
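The effect of gap-filling on contiguity and on the number of ambiguous bases can be quantified with a few summary statistics. The sketch below is an illustration only: it is not the authors' local Perl script, whose contents are not given here, and the function names (`read_fasta`, `n50`, `assembly_stats`) and the choice of reported metrics are assumptions made for this example. It computes scaffold count, total length, N50, the number and percentage of ‘N’ bases, and the number of gaps (runs of consecutive ‘N’) from a FASTA file.

```python
import re
from typing import Dict, Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs from FASTA-formatted lines."""
    name = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].split()
            name = header[0] if header else ""
            chunks = []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def n50(lengths: List[int]) -> int:
    """Length of the scaffold at which the cumulative sum of lengths,
    taken from longest to shortest, first reaches half the total."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty assembly


def assembly_stats(records: Iterable[Tuple[str, str]]) -> Dict[str, float]:
    """Summarize contiguity and ambiguous-base content of an assembly."""
    seqs = [seq.upper() for _, seq in records]
    lengths = [len(s) for s in seqs]
    total = sum(lengths)
    n_bases = sum(s.count("N") for s in seqs)
    # A gap is counted as one maximal run of consecutive N characters.
    gaps = sum(len(re.findall(r"N+", s)) for s in seqs)
    return {
        "scaffolds": len(seqs),
        "total_length": total,
        "n50": n50(lengths),
        "n_bases": n_bases,
        "gaps": gaps,
        "percent_n": 100.0 * n_bases / total if total else 0.0,
    }
```

Running `assembly_stats(read_fasta(open(path)))` on the scaffold FASTA before and after gap-filling (the file paths are whatever the user's pipeline produced) gives a direct comparison: a successful gap-filling step should lower `n_bases` and `gaps`, and N50 should stay the same or increase.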

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
