Bioinformatic processing and statistical analysis of microbiome data

Simon Bahrndorff; Aritz Ruiz-González; Nadieh de Jonge; Jeppe Lund Nielsen; Henrik Skovgård; Cino Pertoldi

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Bioinformatic processing and statistical analysis of microbiome data

SB Simon Bahrndorff

AR Aritz Ruiz-González

NJ Nadieh de Jonge

JN Jeppe Lund Nielsen

HS Henrik Skovgård

CP Cino Pertoldi

This method is extracted from research article: BMC Genomics, Jan 2020

Integrated genome-wide investigations of the housefly, a global vector of diseases reveal unique dispersal patterns and bacterial communities across farms

DOI: 10.1186/s12864-020-6445-z

Request a Protocol

Ask a question

Favorite

The obtained sequence libraries were subjected to quality control using trimmomatic (v0.32) [53]. Reads were merged using FLASH (v1.2.7) [54]. Reads were formatted for use with the UPARSE workflow [55], prior to chimeric read removal, de-replication and clustering into Operational Taxonomic Units (OTUs) at 97% sequence similarity using USEARCH7. Taxonomy was assigned using RDP classifier [56] as implemented in QIIME [57], using Silva release 132 as the reference database [58].

The statistical analysis and visualization of microbial community data was performed in R version 3.5.1 [59] via RStudio version 1.1.463 (http://www.rstudio.com), using the R packages ampvis2, vegan and ggplot2 [60–62]. Beta diversity was calculated for microbiome comparison between housefly from different locations using Bray-Curtis dissimilarity [63], and visualized using non-metric multi-dimensional scaling (NMDS). The microbial community structure was visualized using heatmaps. Relationships between prevalence of potential pathogenic organisms based on literature and the sampled populations were explored using hierarchical clustering using Bray-Curtis distances.

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol