Multivariate analysis

Ilona Dudka; Kristina Lundquist; Pernilla Wikström; Anders Bergh; Gerhard Gröbner

Concise Method

This method is extracted from research article: J Transl Med, Nov 2023

Metabolomic profiles of intact tissues reflect clinically relevant prostate cancer subtypes

DOI: 10.1186/s12967-023-04747-7

The NMR-derived spectral dataset was further analyzed using the multivariate analysis methods provided by SIMCA V17 (Umetrics, Umeå, Sweden). Because metabolomic data, and NMR spectral data in particular, are characterized by a high degree of collinearity, we applied two multivariate methods: principal component analysis (PCA) and orthogonal partial least squares discriminant analysis (OPLS-DA). These methods take correlations between metabolites into account and have been widely used to identify biomarkers in metabolomics studies [29, 30].

PCA was used to generate a first overview of the information contained in the data, since it reduces the dimensionality of such datasets to increase interpretability while minimizing information loss. The original data can thus be described in a lower-dimensional space defined by the principal components, which are ordered according to the share of the total variance they capture. The score values are the coordinates of the samples in this lower-dimensional space. Plotting the scores in a two-dimensional score plot visualizes the distribution and grouping of the samples in the new variable space [29]. Inspecting the score plot therefore allows the homogeneity of the samples to be evaluated and makes any trends or outliers among the samples visible.

Thereafter, a supervised multivariate analysis, OPLS-DA, was performed to identify the discriminatory features for each comparison of the assigned groupings. Significant metabolites were selected based on p(corr) > 0.5 from the OPLS-DA models, where p(corr) is defined as the loadings rescaled as a correlation coefficient between the original data and the scores, thereby standardizing the range from −1.0 to 1.0. There is no consensus on which p(corr) cutoff represents significance, but an absolute p(corr) > 0.4–0.5 is commonly used [31–33].
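
The p(corr) selection described above can be illustrated outside SIMCA with a minimal Python sketch. It assumes that a score vector is already available from a fitted model; it does not fit an OPLS-DA model itself. The function names, the variable names `var_a` to `var_c`, the toy values and the use of the absolute value in the cutoff are illustrative assumptions, not part of the original analysis.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant variable carries no correlation information
    return cov / math.sqrt(var_x * var_y)


def p_corr(data, scores):
    """Correlate every variable (column of `data`) with the score vector."""
    return [pearson(list(column), scores) for column in zip(*data)]


def select_features(names, pcorr_values, cutoff=0.5):
    """Keep variables whose absolute p(corr) exceeds the cutoff (assumed rule)."""
    return [name for name, r in zip(names, pcorr_values) if abs(r) > cutoff]


# Hypothetical toy input: six samples, three variables, one score per sample.
toy_scores = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
toy_data = [
    [1.0, 5.0, 1.0],
    [2.0, 4.0, -1.0],
    [2.5, 3.5, 0.0],
    [3.5, 2.5, 0.0],
    [4.0, 2.0, -1.0],
    [5.0, 1.0, 1.0],
]
toy_names = ["var_a", "var_b", "var_c"]

toy_pcorr = p_corr(toy_data, toy_scores)
selected = select_features(toy_names, toy_pcorr)
print(selected)
```

In this toy input the first variable rises with the scores, the second falls with them and the third is unrelated, so only the first two pass the cutoff.
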
The quality of the OPLS-DA models was evaluated using the default sevenfold cross-validation in SIMCA and the built-in permutation plot (in short: permuting the y-variable 200 times and subsequently correlating these results with those of the original models). Analysis of variance of cross-validated predictive residuals (CV-ANOVA) was used to assess the significance of the OPLS-DA models, where a p-value lower than 0.05 indicates a significant model.
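
The idea behind sevenfold cross-validation combined with 200 label permutations can be sketched in plain Python. This is only an illustration of the principle: a nearest-centroid classifier and cross-validated accuracy stand in for OPLS-DA and its quality metrics, and neither SIMCA's permutation plot nor CV-ANOVA is reproduced. All function names, the round-robin fold assignment, the fixed seed and the toy data are assumptions.

```python
import random


def seven_fold_indices(n_samples, n_folds=7):
    """Assign every sample index to one of n_folds groups in round-robin order."""
    return [list(range(fold, n_samples, n_folds)) for fold in range(n_folds)]


def centroid(rows):
    """Column-wise mean of a list of samples."""
    return [sum(column) / len(rows) for column in zip(*rows)]


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cv_accuracy(data, labels, n_folds=7):
    """Cross-validated accuracy of a nearest-centroid classifier (stand-in model)."""
    correct = 0
    for test_idx in seven_fold_indices(len(data), n_folds):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(data)) if i not in held_out]
        # Fit: one centroid per class, using training samples only.
        centroids = {}
        for cls in sorted(set(labels[i] for i in train_idx)):
            members = [data[i] for i in train_idx if labels[i] == cls]
            centroids[cls] = centroid(members)
        # Predict each held-out sample as the class of its nearest centroid.
        for i in test_idx:
            predicted = min(
                centroids, key=lambda c: squared_distance(data[i], centroids[c])
            )
            correct += int(predicted == labels[i])
    return correct / len(data)


def permutation_test(data, labels, n_permutations=200, seed=0):
    """Compare the observed CV accuracy with accuracies from shuffled labels."""
    rng = random.Random(seed)
    observed = cv_accuracy(data, labels)
    at_least_as_good = 0
    for _ in range(n_permutations):
        shuffled = list(labels)
        rng.shuffle(shuffled)
        if cv_accuracy(data, shuffled) >= observed:
            at_least_as_good += 1
    # The add-one correction keeps the empirical p-value above zero.
    p_value = (at_least_as_good + 1) / (n_permutations + 1)
    return observed, p_value


# Hypothetical toy data: two well-separated groups of seven samples each.
example_data = [[0.1 * i, 0.2 * i] for i in range(7)] + [
    [5.0 + 0.1 * i, 5.0 + 0.2 * i] for i in range(7)
]
example_labels = [0] * 7 + [1] * 7

observed, p_value = permutation_test(example_data, example_labels)
print(observed, p_value)
```

If models trained on shuffled labels rarely perform as well as the model trained on the true labels, the observed separation is unlikely to be a chance result, which is the same reasoning the permutation plot supports.
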

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
