Statistical analysis

Marion Régnier; Arnaud Polizzi; Sarra Smati; Céline Lukowicz; Anne Fougerat; Yannick Lippi; Edwin Fouché; Frédéric Lasserre; Claire Naylies; Colette Bétoulières; Valentin Barquissau; Etienne Mouisel; Justine Bertrand-Michel; Aurélie Batut; Talal Al Saati; Cécile Canlet; Marie Tremblay-Franco; Sandrine Ellero-Simatos; Dominique Langin; Catherine Postic; Walter Wahli; Nicolas Loiseau; Hervé Guillou; Alexandra Montagner


This method is extracted from research article: Sci Rep, Apr 2020

Hepatocyte-specific deletion of Pparα promotes NAFLD in the context of obesity

DOI: 10.1038/s41598-020-63579-3

Biochemical, qPCR and phenotypic data were analysed using GraphPad software. Differential effects were assessed on log2-transformed data by ANOVA followed by Sidak post-hoc tests. P-values < 0.05 were considered significant.
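As a rough illustration of the multiple-comparison step, the Sidak adjustment for a family of m comparisons can be written as p_adj = 1 − (1 − p)^m. The sketch below is in Python rather than the software used in the study, and it covers only the log2 transform and the adjustment formula, not the ANOVA itself; the input values and the name `sidak_adjust` are hypothetical.

```python
import math

def sidak_adjust(p_values):
    """Return Sidak-adjusted p-values for a family of m comparisons."""
    m = len(p_values)
    return [1.0 - (1.0 - p) ** m for p in p_values]

# Hypothetical measurements, log2 transformed before testing.
raw_values = [8.0, 16.0, 32.0]
log2_values = [math.log2(v) for v in raw_values]

# Hypothetical unadjusted p-values from three pairwise comparisons.
raw_p = [0.01, 0.02, 0.5]
adj_p = sidak_adjust(raw_p)

# Apply the 0.05 significance threshold to the adjusted p-values.
significant = [p < 0.05 for p in adj_p]
```

With three comparisons, an unadjusted p-value of 0.02 no longer passes the 0.05 threshold after adjustment, which is the intended effect of the correction.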

Hierarchical clustering of lipid quantification data was performed using R (R Development Core Team, 2018) with the heatmap.2 function from the gplots package. Data were log2 transformed, then centred and scaled by lipid. Hierarchical clustering was applied to the samples and the lipids using 1 − Pearson correlation coefficient as the distance and Ward’s criterion (ward.D2) for agglomeration. All lipids shown on the heatmap had adjusted p-values < 0.05 in at least one of the comparisons tested by analysis of variance.
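The preprocessing and distance used for the clustering can be sketched as follows. This is a minimal Python illustration, not the R code used in the study: it log2 transforms each lipid profile, centres and scales it (sample standard deviation, as R's scale does by default), and computes 1 − Pearson correlation between profiles. The lipid names and abundances are hypothetical, and the Ward agglomeration step is not shown.

```python
import math

def zscore(values):
    """Centre to mean 0 and scale to unit sample standard deviation."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
    return [(v - mean) / sd for v in values]

def pearson_distance(x, y):
    """Return 1 - Pearson correlation coefficient between two profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return 1.0 - cov / math.sqrt(var_x * var_y)

# Hypothetical lipid abundances: one row per lipid, one column per sample.
lipids = {
    "lipid_A": [1.0, 2.0, 4.0, 8.0],
    "lipid_B": [2.0, 4.0, 8.0, 16.0],
    "lipid_C": [8.0, 4.0, 2.0, 1.0],
}

# log2 transform, then centre and scale each lipid across samples.
scaled = {
    name: zscore([math.log2(v) for v in row]) for name, row in lipids.items()
}

d_ab = pearson_distance(scaled["lipid_A"], scaled["lipid_B"])  # same trend
d_ac = pearson_distance(scaled["lipid_A"], scaled["lipid_C"])  # opposite trend
```

The distance ranges from 0 for perfectly correlated profiles to 2 for perfectly anti-correlated ones, so lipids are grouped by the shape of their profile rather than by absolute abundance.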

Microarray data were analyzed using R and Bioconductor packages (www.bioconductor.org, v 3.0), as described in GEO accession GSE123354. Raw data (median signal intensity) were filtered, log2 transformed, corrected for batch effects (microarray washing bath) and normalized using the quantile method⁶⁴.
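To make the quantile normalization step concrete, here is a minimal Python sketch of the basic idea: every sample is forced onto the same reference distribution, obtained by averaging the k-th smallest value across samples. It is an illustration only, not the Bioconductor implementation used in the study; the function name and the toy data are hypothetical, and ties are simply broken by input order rather than averaged.

```python
def quantile_normalize(columns):
    """Quantile-normalize equal-length sample columns (ties broken by order)."""
    n_rows = len(columns[0])
    sorted_cols = [sorted(col) for col in columns]
    # Reference distribution: mean of the k-th smallest value across samples.
    reference = [
        sum(col[k] for col in sorted_cols) / len(columns) for k in range(n_rows)
    ]
    normalized = []
    for col in columns:
        # Indices of this sample's probes, from smallest to largest signal.
        order = sorted(range(n_rows), key=lambda i: col[i])
        out = [0.0] * n_rows
        for rank, idx in enumerate(order):
            out[idx] = reference[rank]
        normalized.append(out)
    return normalized

# Hypothetical log2 intensities: two samples, three probes each.
samples = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]
normalized = quantile_normalize(samples)
```

After normalization both samples contain exactly the same set of values (1.5, 3.5, 5.5), assigned according to each probe's rank within its own sample.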

A linear model was fitted using the limma lmFit function⁶⁵, with array weights estimated by the arrayWeights function. Pairwise comparisons between biological conditions were made using specific contrasts. P-values were corrected for multiple testing using the Benjamini-Hochberg procedure⁶⁶ to control the False Discovery Rate (FDR). Probes with FDR ≤ 0.05 were considered differentially expressed between conditions.
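The Benjamini-Hochberg adjustment itself is short enough to sketch. The Python below is an illustration, not the limma code used in the study: each p-value is multiplied by m divided by its rank, and monotonicity is then enforced from the largest p-value downwards. The p-values and the function name are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted

# Hypothetical unadjusted p-values for four probes.
raw_p = [0.01, 0.04, 0.03, 0.20]
fdr = benjamini_hochberg(raw_p)

# Apply the FDR <= 0.05 threshold stated in the text.
differential = [q <= 0.05 for q in fdr]
```

In this toy example only the first probe passes the threshold, even though three unadjusted p-values are below 0.05.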

In addition to the differential analysis, a multivariate exploratory analysis was performed. A Sparse Partial Least Squares Discriminant Analysis⁶⁷ (sPLS-DA) was conducted using the mixOmics package⁶⁸ to select the most discriminative variables (genes), that is, those whose expression values best classify the samples according to their experimental conditions. Twenty iterations of 5-fold cross-validation were used to evaluate model performance and to select the number of informative components (6 components chosen) using all the variables (PLS-DA). A “sparse” PLS-DA model was then parametrized by selecting the 100, 120, 80, 40, 100 and 20 most discriminant variables on components 1 to 6, respectively (numbers chosen according to the performance results of 20 iterations of 5-fold cross-validation).
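The resampling scheme behind the model evaluation (20 repeats of 5-fold cross-validation) can be sketched as below. This Python snippet only generates the train/test index splits; it does not fit a PLS-DA model and is not the mixOmics implementation. The sample count of 30, the seed and the function name are hypothetical, and the folds are not stratified by experimental condition.

```python
import random

def repeated_kfold(n_samples, n_folds=5, n_repeats=20, seed=0):
    """Yield (repeat, fold, train_idx, test_idx) for repeated k-fold CV."""
    rng = random.Random(seed)
    for repeat in range(n_repeats):
        # Reshuffle the samples for every repeat so the folds differ.
        indices = list(range(n_samples))
        rng.shuffle(indices)
        folds = [indices[k::n_folds] for k in range(n_folds)]
        for fold, test_idx in enumerate(folds):
            test_set = set(test_idx)
            train_idx = [i for i in indices if i not in test_set]
            yield repeat, fold, train_idx, sorted(test_idx)

# Hypothetical data set of 30 samples: 20 repeats x 5 folds = 100 splits.
splits = list(repeated_kfold(n_samples=30))
```

Each model configuration (for example, a given number of components) would be fitted on every training set and scored on the matching test set, and the error rates averaged over the 100 splits.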

Hierarchical clustering was applied to the samples and the differentially expressed probes using 1 − Pearson correlation coefficient as the distance and Ward’s criterion for agglomeration. The clustering results are illustrated as a heatmap of expression signals. Gene network and KEGG pathway enrichment analyses were performed using either the online software STRING v11⁶⁹ or Metascape⁷⁰. The correlation chart was generated using the chart.Correlation function from the PerformanceAnalytics package.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
