Weighted correlation network analysis (WGCNA) for metabolome and transcriptome

Sofia M. Murga-Garrido; Qilin Hong; Tzu-Wen L. Cross; Evan R. Hutchison; Jessica Han; Sydney P. Thomas; Eugenio I. Vivas; John Denu; Danilo G. Ceschin; Zheng-Zheng Tang; Federico E. Rey

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Weighted correlation network analysis (WGCNA) for metabolome and transcriptome

SM Sofia M. Murga-Garrido

QH Qilin Hong

TC Tzu-Wen L. Cross

EH Evan R. Hutchison

JH Jessica Han

ST Sydney P. Thomas

EV Eugenio I. Vivas

JD John Denu

DC Danilo G. Ceschin

ZT Zheng-Zheng Tang

FR Federico E. Rey

This method is extracted from research article: Microbiome, May 2021

Gut microbiome variation modulates the effects of dietary fiber on host metabolism

DOI: 10.1186/s40168-021-01061-6

Request a Protocol

Ask a question

Favorite

In order to group the biochemicals that were highly correlated, we built the co-expression network using WGCNA [102]. The WGCNA is an efficient and robust method in grouping metabolomic and transcriptomic data [103, 104] and allowed us to summarize each module by its module eigenvalue. A one-sided Fisher test was used to determine if a pathway was enriched within the turquoise and blue modules in metabolomic data. P values were then adjusted using Benjamini-Hochberg method, and a cut-off of P < 0.05 and FDR adjusted-P < 0.05 were chosen to determine if a pathway was significantly enriched. We used Pearson’s correlation between expression profile of each gene and module eigenvalue to identify module membership. Using the module eigenvalue, the module-traits relationships were estimated by calculating Pearson’s correlations between the module eigenvalue and the traits of interest. We considered 0.90 as a correlation cut-off to choose soft-thresholding power and set the minimal module size as 20. For metabolome, the metabolites were clustered into 8 modules plus 43 unclustered metabolites. The transformed values of the unclustered metabolites were combined with standardized module eigenvalues in the following analysis. For the transcriptome data, 14 modules (defined as clusters of highly interconnected genes) were identified by using DynamicTree Cut algorithm. WGCNA led to 14 different modules by using DynamicTree Cut algorithm. Over-representation of genes in the blue module was characterized based on gene ontology biological process and KEGG pathways using clusterProfiler [105].

Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol