Data processing and OTU clustering for ITS amplicons

RS Rafael Soares Correa de Souza
VO Vagner Katsumi Okura
JA Jaderson Silveira Leite Armanhi
BJ Beatriz Jorrín
NL Núria Lozano
MS Márcio José da Silva
MG Manuel González-Guerrero
LA Laura Migliorini de Araújo
NV Natália Cristina Verza
HB Homayoun Chaichian Bagheri
JI Juan Imperial
PA Paulo Arruda
request Request a Protocol
ask Ask a question
Favorite

The ITS targeted ribosomal gene region is longer46 than the 16S V4 sequences. An initial analysis indicated that many paired-end reads from ITS libraries did not overlap. This result is a indication that read length might not be enough to cover the entire ITS region of some microbial groups. In order to avoid bias, we decided to use only the first read of the paired-end sequences to perform the ITS amplicon analysis. Single reads were tail trimmed to 175 bp and filtered for a 0.5 expected error threshold using the “fastq_filter” from the USEARCH command. Sugarcane ITS sequences were trimmed using DUK software with parameters “-k 20 -c 2” and a reference sequence obtained from an in-house sugarcane reference assemblage. Reads with no recognizable primer sequences were discarded (primer filter) and the remaining reads were tail primed to the primer sequence (tail trimming). The proceeding steps of dereplicating, discarding singletons and clustering was done using the UPARSE pipeline.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

0/150

tip Tips for asking effective questions

+ Description

Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.

post Post a Question
0 Q&A