Calculation of compositional means, metric variances, and Aitchison distances

Dieter M. Tourlousse; Koji Narita; Takamasa Miura; Mitsuo Sakamoto; Akiko Ohashi; Keita Shiina; Masami Matsuda; Daisuke Miura; Mamiko Shimamura; Yoshifumi Ohyama; Atsushi Yamazoe; Yoshihito Uchino; Keishi Kameyama; Shingo Arioka; Jiro Kataoka; Takayoshi Hisada; Kazuyuki Fujii; Shunsuke Takahashi; Miho Kuroiwa; Masatomo Rokushima; Mitsue Nishiyama; Yoshiki Tanaka; Takuya Fuchikami; Hitomi Aoki; Satoshi Kira; Ryo Koyanagi; Takeshi Naito; Morie Nishiwaki; Hirotaka Kumagai; Mikiko Konda; Ken Kasahara; Moriya Ohkuma; Hiroko Kawasaki; Yuji Sekiguchi; Jun Terauchi

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Calculation of compositional means, metric variances, and Aitchison distances

DT Dieter M. Tourlousse

KN Koji Narita

TM Takamasa Miura

MS Mitsuo Sakamoto

AO Akiko Ohashi

KS Keita Shiina

MM Masami Matsuda

DM Daisuke Miura

MS Mamiko Shimamura

YO Yoshifumi Ohyama

AY Atsushi Yamazoe

YU Yoshihito Uchino

KK Keishi Kameyama

SA Shingo Arioka

JK Jiro Kataoka

TH Takayoshi Hisada

KF Kazuyuki Fujii

ST Shunsuke Takahashi

MK Miho Kuroiwa

MR Masatomo Rokushima

MN Mitsue Nishiyama

YT Yoshiki Tanaka

TF Takuya Fuchikami

HA Hitomi Aoki

SK Satoshi Kira

RK Ryo Koyanagi

TN Takeshi Naito

MN Morie Nishiwaki

HK Hirotaka Kumagai

MK Mikiko Konda

KK Ken Kasahara

MO Moriya Ohkuma

HK Hiroko Kawasaki

YS Yuji Sekiguchi

JT Jun Terauchi

This method is extracted from research article: Microbiome, Apr 2021

Validation and standardization of DNA extraction and library construction methods for metagenomics-based human fecal microbiome measurements

DOI: 10.1186/s40168-021-01048-3

Request a Protocol

Ask a question

Favorite

Microbiome community compositions are given by a vector x = [ x₁, …, x_D ] of D strictly non-negative elements representing the abundances of each part (species, genes, …) in the community, subject to a total sum constraint.

Following standard concepts and definitions [47, 48], the central tendency (center or compositional mean) of a compositional data set X = [ x₁, …, x_n ], where x_j = [ x_1,j, …, x_D,j ] represents one of n individual compositions, was calculated as the closed geometric mean:

where g_i is the geometric mean of the abundance of part i across the n compositions and clo represents the closure operation:

where κ is the closure constant, usually set to 1 or 100%.

Dispersion of a compositional data set X is known as the metric (or total) variance, denoted as mvar(X), and can be calculated based on the variation matrix, denoted as varmat(X), of all possible logratio variances:

For calculation, we used the functions variation and mvar in the R package compositions v2.0 [49] to obtain variation matrices and metric variances, respectively. Based on the variation matrix, we also calculated the contribution of each logratio variance to the metric variance.

The distance between two compositions x = [ x₁, …, x_D ] and y = [ y₁, …, y_D ] is known as the Aitchison distance (d_A), calculated as:

This is equivalent to the Euclidean distance after centered log ratio (clr) transformation:

where g(x) is the geometric mean of the abundances across parts of x. Accordingly, compositional principal component analysis (PCA) was performed using R’s stats prcomp function, after clr transformation of the abundance data.

Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol