2.7. Sequencing Data Quality Control

Karen Leth Nielsen, Markus Harboe Olsen, Albert Pallejá, Søren Røddik Ebdrup, Nikolaj Sørensen, Oksana Lukjancenko, Rasmus L. Marvig, Kirsten Møller, Niels Frimodt-Møller, Frederik Boëtius Hertz

Quality control of the raw FASTQ files was performed using KneadData v. 0.6.1. Reads were quality trimmed with Trimmomatic v. 0.36 by removing Nextera adapter sequences, leading and trailing bases with a Phred score below 20, and the 3' end of each read from the point at which the mean Phred score over a sliding window of 4 bases dropped below 20. Trimmed reads shorter than 100 bases were discarded. Reads that mapped to the human reference genome GRCh38 (Bowtie2 v. 0.2.3.2, default settings) were then discarded as host-derived. Only read pairs in which both reads passed filtering were retained; these were classified as high-quality non-host (HQNH) reads.
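
The trimming and pair-retention rules described above can be illustrated with a minimal Python sketch. It is not the KneadData or Trimmomatic implementation: the function names `trim_read` and `pair_passes` are hypothetical, the sliding-window step is simplified (it cuts at the start of the first window whose mean quality is below the threshold, which may differ from Trimmomatic's exact cut position), and adapter clipping and Bowtie2 host mapping are not modeled. Only the thresholds (Phred 20, window of 4, minimum length 100) come from the text.

```python
from statistics import mean

MIN_PHRED = 20    # quality threshold stated in the protocol
WINDOW = 4        # sliding-window size stated in the protocol
MIN_LENGTH = 100  # minimum read length after trimming


def trim_read(quals, min_phred=MIN_PHRED, window=WINDOW):
    """Return (start, end) of the slice kept from a list of Phred scores.

    Simplified illustration; not Trimmomatic's exact algorithm.
    """
    start, end = 0, len(quals)
    # Drop leading bases below the threshold.
    while start < end and quals[start] < min_phred:
        start += 1
    # Drop trailing bases below the threshold.
    while end > start and quals[end - 1] < min_phred:
        end -= 1
    # Scan 5'->3' and cut at the first window whose mean is below the threshold.
    for i in range(start, end - window + 1):
        if mean(quals[i:i + window]) < min_phred:
            end = i
            break
    return start, end


def pair_passes(quals_r1, quals_r2, min_length=MIN_LENGTH):
    """Keep a read pair only if both trimmed mates meet the length cutoff."""
    for quals in (quals_r1, quals_r2):
        start, end = trim_read(quals)
        if end - start < min_length:
            return False
    return True
```

For example, `pair_passes([30] * 150, [30] * 90)` rejects the pair because the second mate is shorter than 100 bases, even though the first mate passes on its own. In Trimmomatic's own step syntax the quality and length criteria correspond to `LEADING:20 TRAILING:20 SLIDINGWINDOW:4:20 MINLEN:100`; the exact option string the authors passed through KneadData is not given in the text.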
