For each metagenomic dataset, both quality-controlled (QC) and assembled (contigs) reads were downloaded from IMG/M ER in August 2017. To obtain coverage profile of contigs from each metagenomic assembly, the QC reads were mapped to contigs using BWA v0.7.15 with the bwa-mem algorithm (Li & Durbin, 2009). This generated SAM files that were subsequently converted into BAM files using SAMtools v1.3 (Li et al., 2009). Combining coverage profile and tetranucleotide frequency information, genomes were reconstructed from each metagenome with MetaBAT version 2.10.2 with default parameters (Kang et al., 2019). Quality of the reconstructed genomes was estimated with CheckM v1.0.8 (Parks et al., 2015). Only MAGs that were at least 50% complete and with less than 10% contamination, were retained for subsequent analyses. These MAGs have been deposited at the Sequence Read Archive (SRA) under the BioProject accession number PRJNA560329; genomes are available with accession numbers SRR9983610–SRR9984198 (Table S2). Additionally, the MAGs have been deposited at the NCBI’s Assembly Database under the accessions WQRH00000000–WRNX00000000 (Table S2).
For each MAG, CheckM was also used to extract 16S rRNA gene sequences as well as a set of 43 phylogenetically informative marker genes consisting primarily of 29 ribosomal proteins (PF00466, PF03946, PF00298, PF00572, PF00238, PF00252, PF00861, PF00687, PF00237, PF00276, PF00831, PF00297, PF00573, PF00281, PF00673, PF00411, PF00164, PF00312, PF00366, PF00203, PF00318, PF00189, PF03719, PF00333, PF00177, PF00410, PF00380, PF03947, PF00181), nine RNA polymerase domains (PF04563, PF04997, PF00623, PF05000, PF04561, PF04565, PF00562, PF04560, PF01192), two tRNA ligases (TIGR00344 and TIGR00422), a signal peptide binding domain (PF02978), a translation-initiation factor 2 (PF11987) and a TruB family pseudouridylate synthase (PF01509). Finally, CheckM was also used for a preliminary taxonomic classification of the MAGs by phylogenetic placement of the MAGs into the CheckM reference genome tree.
Do you have any questions about this protocol?
Post your question to gather feedback from the community. We will also invite the authors of this article to respond.