DArT Sequencing and Genotyping

Dung Ho My Nguyen; Thitipong Panthum; Jatupong Ponjarat; Nararat Laopichienpong; Ekaphan Kraichak; Worapong Singchat; Syed Farhan Ahmad; Narongrit Muangmai; Surin Peyachoknagul; Uthairat Na-Nakorn; Kornsorn Srikulnath

This method is extracted from research article: Front Genet, Jan 2021

An Investigation of ZZ/ZW and XX/XY Sex Determination Systems in North African Catfish (Clarias gariepinus, Burchell, 1822)

DOI: 10.3389/fgene.2020.562856

A detailed description of the DArTseq™ methodology can be found in Jaccoud et al. (2001). The method typically produces sequences 69 base pairs (bp) long. Multiple loci were genotyped using DArTseq™ (Diversity Arrays Technology Pty Ltd., Canberra, Australian Capital Territory, Australia), scoring both SNP loci and in silico DArT loci (presence/absence, PA, of restriction fragments in the representation; PA loci) to identify candidate sex-specific loci between male and female individuals.

Approximately 100 ng of DNA from each sample was used for the development of DArTseq™ arrays. DNA samples were subjected to digestion/ligation reactions as described by Kilian et al. (2012), with digestion by PstI and a second restriction endonuclease, SphI. Ligation reactions used two adaptors: a PstI-compatible adaptor consisting of an Illumina flow-cell attachment sequence, a primer sequence, and a unique barcode sequence; and an SphI-compatible adaptor consisting of an Illumina flow-cell attachment region. Ligated fragments were then amplified by PCR using the following parameters: initial denaturation at 94°C for 1 min, followed by 30 cycles of 94°C for 20 s, 58°C for 30 s, and 72°C for 45 s, with a final extension step at 72°C for 7 min. Equimolar amounts of amplification products from each individual were pooled and subjected to Illumina's proprietary cBot bridge PCR, followed by sequencing on the Illumina HiSeq 2000 platform. Single-read sequencing was run for 77 cycles.
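As a quick check on the amplification step, the PCR parameters above can be written down as a small data structure and summed. This is only an illustrative sketch: the step labels (denaturation, annealing, extension) follow the usual reading of a three-temperature cycle and are not named in the text, and the total excludes ramp times between temperatures.

```python
# PCR program from the text: (step label, temperature in °C, hold time in s).
# Step labels for the cycle are an assumption based on standard PCR usage.
INITIAL = ("initial denaturation", 94, 60)
CYCLE = [
    ("denaturation", 94, 20),
    ("annealing", 58, 30),
    ("extension", 72, 45),
]
N_CYCLES = 30
FINAL = ("final extension", 72, 7 * 60)


def total_hold_seconds():
    """Sum of programmed hold times; ramping between temperatures is ignored."""
    per_cycle = sum(duration for _, _, duration in CYCLE)
    return INITIAL[2] + N_CYCLES * per_cycle + FINAL[2]


if __name__ == "__main__":
    seconds = total_hold_seconds()
    print(f"{seconds} s ({seconds / 60:.1f} min)")
```

Each cycle holds for 95 s, so the programmed hold time is 60 + 30 × 95 + 420 = 3330 s, or about 55.5 min before ramping is taken into account.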

Sequences were processed using proprietary DArTseq™ analytical pipelines (Ren et al., 2015). Initially, the HiSeq 2000 output (FASTQ file) was processed to filter out poor-quality sequences. Two different quality thresholds were applied. For the barcode region (which allows sequences to be parsed into specific sample libraries), we applied stringent selection (minimum Phred pass score of 30, minimum pass length percentage of 75). For the remainder of the sequence, relaxed thresholds were applied (minimum Phred pass score of 10, minimum pass length percentage of 50). Approximately 2,000,000 sequences per individual were identified and used in marker calling. Finally, identical sequences were combined into “fastqcoll” files, which were used in the secondary proprietary pipeline (DArTsoft14) for SNP and PA loci calling. To this end, we used the “reference-free” algorithm implemented in DArTsoft14. The sequence clusters were then parsed into SNP and in silico DArTseq™ markers using a range of metadata parameters derived from the quantity and distribution of each sequence across all samples in the analysis. Multiple libraries of the same individual were included in the DArTseq™ genotyping process, enabling reproducibility scores to be calculated for each candidate marker. Outputs from DArTsoft14 were then filtered on the basis of reproducibility values, average count for each sequence (sequencing depth), balance of average counts for each SNP allele, and call rate (proportion of samples for which the marker was scored).
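The DArTseq™ pipelines are proprietary, so the following is not their implementation. It is a minimal sketch of how the two-tier quality filter, the collapsing of identical sequences, and the call rate described above could be expressed. Several things are assumptions: "pass length percentage" is read as the percentage of bases in a region whose Phred score meets the minimum, qualities are taken to be Phred+33 encoded, and `barcode_len` and all function names are hypothetical.

```python
from collections import Counter

PHRED_OFFSET = 33  # assumption: Phred+33 FASTQ quality encoding


def pass_fraction(quality, min_phred):
    """Fraction of bases in a quality string with Phred score >= min_phred."""
    if not quality:
        return 0.0
    passed = sum(1 for ch in quality if ord(ch) - PHRED_OFFSET >= min_phred)
    return passed / len(quality)


def read_passes(quality, barcode_len):
    """Stringent threshold on the barcode region, relaxed on the remainder."""
    barcode_q = quality[:barcode_len]
    remainder_q = quality[barcode_len:]
    return (pass_fraction(barcode_q, 30) >= 0.75
            and pass_fraction(remainder_q, 10) >= 0.50)


def collapse_identical(reads, barcode_len):
    """Count identical sequences among (sequence, quality) reads that pass."""
    return Counter(seq for seq, qual in reads if read_passes(qual, barcode_len))


def call_rate(genotypes):
    """Proportion of samples in which a marker was scored (None = not scored)."""
    return sum(g is not None for g in genotypes) / len(genotypes)


if __name__ == "__main__":
    # Toy reads with a hypothetical 4-base barcode; 'I' = Q40, '5' = Q20,
    # '+' = Q10, '#' = Q2 under Phred+33.
    reads = [
        ("ACGTACGT", "IIII++++"),  # passes both thresholds
        ("ACGTACGT", "IIII+###"),  # fails the relaxed remainder threshold
        ("ACGTACGT", "I555IIII"),  # fails the stringent barcode threshold
        ("ACGTACGT", "III5++##"),  # passes exactly at both thresholds
    ]
    print(collapse_identical(reads, barcode_len=4))
    print(call_rate([0, 1, None, 2]))
```

Real barcodes may vary in length between samples, in which case the split between barcode and remainder would need to be determined per read rather than with a single `barcode_len`.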

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
