The Poincaré ball model represents the hyperbolic space as the interior of a unit ball in the Euclidean space: $\mathbb{B}^n = \{x \in \mathbb{R}^n : \|x\| < 1\}$, where $n$ is the dimension. The distance between two points $x, y \in \mathbb{B}^n$ is defined as:

$$d(x, y) = \cosh^{-1}\left(1 + 2\,\frac{\|x - y\|^2}{(1 - \|x\|^2)(1 - \|y\|^2)}\right)$$
where $\cosh^{-1}$ is the inverse hyperbolic cosine function, which is monotonically increasing for arguments $\geq 1$. The symbol $\|\cdot\|$ represents the Euclidean norm. Notice that $\cosh(t) = (e^t + e^{-t})/2$, which approximates $1 + t^2/2$ when $t \to 0$ and $e^t/2$ for large $t$. When both $x$ and $y$ are close to the origin with zero norm, $d(x, y) \approx 2\|x - y\|$. Therefore, the Poincaré ball model resembles Euclidean geometry near the center of the unit hyperball. The induced norm of a point $x$ is

$$d(0, x) = \cosh^{-1}\left(1 + \frac{2\|x\|^2}{1 - \|x\|^2}\right) = 2\tanh^{-1}(\|x\|) = \ln\frac{1 + \|x\|}{1 - \|x\|}$$
As $x$ moves away from the origin and approaches the border with $\|x\| \to 1$, the induced norm grows without bound, so a fixed Euclidean step near the border covers an exponentially larger hyperbolic distance than the same step near the origin. Hyperbolic geometry is useful to represent data with an underlying approximate hierarchical structure.
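As a concrete illustration, here is a minimal NumPy sketch of the Poincaré distance and the induced norm (the function names are ours, not from the protocol's codebase):

```python
import numpy as np

def poincare_distance(x, y):
    """Hyperbolic distance between two points inside the unit ball."""
    sq_diff = np.sum((x - y) ** 2)
    denom = (1.0 - np.sum(x ** 2)) * (1.0 - np.sum(y ** 2))
    return np.arccosh(1.0 + 2.0 * sq_diff / denom)

def poincare_norm(x):
    """Induced norm d(0, x) = 2 * artanh(||x||)."""
    return 2.0 * np.arctanh(np.linalg.norm(x))

# Near the origin the geometry is almost Euclidean: d(x, y) ~ 2 ||x - y||
x, y = np.array([0.01, 0.0]), np.array([0.0, 0.01])
print(poincare_distance(x, y))  # close to 2 * ||x - y||
```

The closed-form norm $2\tanh^{-1}(\|x\|)$ agrees with evaluating the distance function against the origin, which is a convenient correctness check.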
The Lorentz model $\mathbb{L}^n$ is a model of the hyperbolic space and points $x \in \mathbb{R}^{n+1}$ of this model satisfy $\langle x, x\rangle_{\mathcal{L}} = -1$ with $x_0 > 0$, where $\langle x, y\rangle_{\mathcal{L}} = -x_0 y_0 + \sum_{i=1}^{n} x_i y_i$ is the Lorentzian inner product (also known as the Minkowski inner product). The special one-hot vector $\mu_0 = (1, 0, \ldots, 0)^{\mathsf{T}}$ is the origin of the hyperbolic space. The distance between two points of the Lorentz model is defined as:

$$d_{\mathcal{L}}(x, y) = \cosh^{-1}\!\left(-\langle x, y\rangle_{\mathcal{L}}\right)$$
The tangent space of $\mathbb{L}^n$ at point $\mu$ is defined as $T_\mu\mathbb{L}^n = \{v : \langle v, \mu\rangle_{\mathcal{L}} = 0\}$, i.e., all the vectors that pass through point $\mu$ and are orthogonal to vector $\mu$ based on the Lorentzian inner product. A point $x = (x_0, x_1, \ldots, x_n)$ in the Lorentz model can be conveniently mapped to the Poincaré ball21 for visualization:

$$x_{\mathbb{B}} = \frac{(0, x_1, \ldots, x_n)}{x_0 + 1}$$
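The Lorentz-model quantities above can be sketched in NumPy as follows (function names are illustrative; the identities checked in the comments follow directly from the definitions):

```python
import numpy as np

def lorentz_inner(x, y):
    """Lorentzian inner product <x, y>_L = -x0*y0 + x1*y1 + ... + xn*yn."""
    return -x[0] * y[0] + np.dot(x[1:], y[1:])

def lorentz_distance(x, y):
    """Geodesic distance on the hyperboloid."""
    return np.arccosh(-lorentz_inner(x, y))

def lorentz_to_poincare(x):
    """Map a Lorentz-model point to the Poincare ball (leading zero dropped)."""
    return x[1:] / (x[0] + 1.0)

# A point on the hyperboloid at geodesic distance t from the origin mu0
t = 1.5
mu0 = np.array([1.0, 0.0, 0.0])
x = np.array([np.cosh(t), np.sinh(t), 0.0])
print(lorentz_inner(x, x))       # -1: the hyperboloid constraint
print(lorentz_distance(mu0, x))  # t
```

A useful consistency check is that the mapped point has Euclidean norm $\tanh(t/2)$, whose Poincaré induced norm $2\tanh^{-1}(\tanh(t/2))$ equals $t$, so distances agree between the two models.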
We discard the first element as it is constantly zero.
We used wrapped normal priors and wrapped normal posteriors defined in the Lorentz model to embed cells into a hyperbolic space25,34,79. A wrapped normal distribution in $\mathbb{L}^n$ is constructed by first defining a normal distribution on the tangent space (a Euclidean subspace in $\mathbb{R}^{n+1}$) at the origin $\mu_0$ of the hyperbolic space. Samples from a normal distribution on the tangent space are parallel-transported to desired locations and further projected onto the final hyperbolic space25.
We used a set of invertible functions to transform samples from a normal distribution in $\mathbb{R}^n$ to samples from a wrapped normal distribution in $\mathbb{L}^n$ with mean of $\mu$, where $\sigma_1, \ldots, \sigma_n$ are the standard deviations of components $1$ to $n$, respectively, and $I_n$ is the identity matrix in $\mathbb{R}^{n \times n}$25,55. First, let $v = (0, \tilde{v})$, which can be considered as a sample vector from $T_{\mu_0}\mathbb{L}^n$, where $\tilde{v}$ is sampled from $\mathcal{N}(0, \mathrm{diag}(\sigma_1^2, \ldots, \sigma_n^2))$. Next, $v$ is parallel-transported to a vector $u$ in the tangent space at $\mu$, in a parallel manner (i.e., $v$ and $u$ pointing in the same direction relative to the geodesic between $\mu_0$ and $\mu$) and vector norm preserving (i.e., $\langle u, u\rangle_{\mathcal{L}} = \langle v, v\rangle_{\mathcal{L}}$)25,80:

$$u = \mathrm{PT}_{\mu_0 \to \mu}(v) = v + \frac{\langle \mu - \alpha\mu_0, v\rangle_{\mathcal{L}}}{\alpha + 1}\,(\mu_0 + \mu)$$
with $\alpha = -\langle \mu_0, \mu\rangle_{\mathcal{L}}$.
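The parallel transport step can be sketched as follows (NumPy, illustrative function names); the two printed checks confirm that the transported vector is tangent at $\mu$ and that the Lorentzian norm is preserved:

```python
import numpy as np

def lorentz_inner(x, y):
    """Lorentzian inner product <x, y>_L."""
    return -x[0] * y[0] + np.dot(x[1:], y[1:])

def parallel_transport(v, mu0, mu):
    """Transport tangent vector v from T_{mu0} to T_{mu} along the geodesic."""
    alpha = -lorentz_inner(mu0, mu)
    return v + lorentz_inner(mu - alpha * mu0, v) / (alpha + 1.0) * (mu0 + mu)

mu0 = np.array([1.0, 0.0, 0.0])                     # origin of L^2
mu = np.array([np.cosh(1.0), np.sinh(1.0), 0.0])    # a point on the hyperboloid
v = np.array([0.0, 0.3, -0.4])                      # tangent at mu0 (leading 0)
u = parallel_transport(v, mu0, mu)
print(lorentz_inner(u, mu))                         # ~0: u is tangent at mu
print(lorentz_inner(u, u) - lorentz_inner(v, v))    # ~0: norm preserved
```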
Finally, the exponential map24,25,79 projects vector $u$ in the tangent space back to the hyperbolic space by:

$$z = \exp_\mu(u) = \cosh(\|u\|_{\mathcal{L}})\,\mu + \sinh(\|u\|_{\mathcal{L}})\,\frac{u}{\|u\|_{\mathcal{L}}}$$

where $\|u\|_{\mathcal{L}} = \sqrt{\langle u, u\rangle_{\mathcal{L}}}$,
such that the vector norm is preserved: $d_{\mathcal{L}}(\mu, z) = \|u\|_{\mathcal{L}}$.
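The exponential map can be sketched as follows (NumPy, illustrative names); the checks confirm that the result lies on the hyperboloid and that the geodesic distance equals the tangent norm:

```python
import numpy as np

def lorentz_inner(x, y):
    """Lorentzian inner product <x, y>_L."""
    return -x[0] * y[0] + np.dot(x[1:], y[1:])

def exp_map(u, mu):
    """Project tangent vector u at mu onto the hyperboloid along a geodesic."""
    r = np.sqrt(lorentz_inner(u, u))  # Lorentzian norm ||u||_L
    return np.cosh(r) * mu + np.sinh(r) * u / r

mu = np.array([np.cosh(0.8), np.sinh(0.8), 0.0])
# A tangent vector at mu: <u, mu>_L = 0 by construction
u = np.array([0.5 * np.sinh(0.8), 0.5 * np.cosh(0.8), 0.2])
z = exp_map(u, mu)
print(lorentz_inner(z, z))                # -1: z is on the hyperboloid
print(np.arccosh(-lorentz_inner(mu, z)))  # equals ||u||_L
```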
The likelihood after the invertible transformations can be calculated by the change-of-variable formula (parallel transport preserves volume, and the determinant of the exponential map's Jacobian is $(\sinh(\|u\|_{\mathcal{L}})/\|u\|_{\mathcal{L}})^{n-1}$):

$$\log p(z) = \log p(\tilde{v}) - (n - 1)\log\frac{\sinh(\|u\|_{\mathcal{L}})}{\|u\|_{\mathcal{L}}}$$
The encoder outputs a vector $\mu' \in \mathbb{R}^n$ in the tangent space at the origin $\mu_0$ (taking $v = (0, \mu')$, so $\langle v, \mu_0\rangle_{\mathcal{L}} = 0$) and $\mu'$ can be mapped to $\mathbb{L}^n$ using the exponential map (the first zero element of $v$ is omitted from the encoder output) to get $\mu$:

$$\mu = \exp_{\mu_0}(v) = \left(\cosh(\|\mu'\|),\ \sinh(\|\mu'\|)\,\frac{\mu'}{\|\mu'\|}\right)$$
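A minimal sketch of mapping the encoder output to the hyperboloid (illustrative names; the small-norm guard returning the origin is our addition to avoid division by zero):

```python
import numpy as np

def tangent_to_hyperboloid(mu_prime):
    """Map encoder output mu' (tangent coordinates at the origin) to L^n."""
    r = np.linalg.norm(mu_prime)
    if r < 1e-12:  # mu' at the origin of the tangent space maps to mu0
        return np.concatenate(([1.0], np.zeros_like(mu_prime)))
    return np.concatenate(([np.cosh(r)], np.sinh(r) * mu_prime / r))

mu = tangent_to_hyperboloid(np.array([0.6, -0.2]))
print(-mu[0] ** 2 + np.sum(mu[1:] ** 2))  # -1: mu lies on the hyperboloid
```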
Given a sample $z$ from the wrapped normal distribution, we need to evaluate its density for calculating the KL-divergence term of the ELBO. We can use the inverse exponential map and the inverse parallel transport to compute the corresponding $u$ and $v$, respectively, for evaluating the density:

$$u = \exp_{\mu}^{-1}(z) = \frac{\cosh^{-1}(\alpha)}{\sqrt{\alpha^2 - 1}}\,(z - \alpha\mu)$$

$$v = \mathrm{PT}_{\mu \to \mu_0}(u) = u + \frac{\langle \mu_0 - \beta\mu, u\rangle_{\mathcal{L}}}{\beta + 1}\,(\mu + \mu_0)$$
where $\alpha = -\langle \mu, z\rangle_{\mathcal{L}}$ and $\beta = -\langle \mu_0, \mu\rangle_{\mathcal{L}}$. We now have all the ingredients to compute Eq. (1) for each training point.
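Putting the pieces together, here is a sketch of the inverse maps with a round-trip check (illustrative names; the round trip recovers the tangent sample whose Euclidean density is needed for the ELBO):

```python
import numpy as np

def lorentz_inner(x, y):
    """Lorentzian inner product <x, y>_L."""
    return -x[0] * y[0] + np.dot(x[1:], y[1:])

def parallel_transport(v, frm, to):
    """Transport tangent vector v from T_{frm} to T_{to}; the inverse
    transport is simply the transport in the opposite direction."""
    alpha = -lorentz_inner(frm, to)
    return v + lorentz_inner(to - alpha * frm, v) / (alpha + 1.0) * (frm + to)

def exp_map(u, mu):
    """Project tangent vector u at mu onto the hyperboloid."""
    r = np.sqrt(lorentz_inner(u, u))
    return np.cosh(r) * mu + np.sinh(r) * u / r

def inv_exp_map(z, mu):
    """Map point z on the hyperboloid back to the tangent vector at mu."""
    alpha = -lorentz_inner(mu, z)
    return np.arccosh(alpha) / np.sqrt(alpha ** 2 - 1.0) * (z - alpha * mu)

# Round trip: v (tangent at origin) -> u -> z, then invert z -> u -> v
mu0 = np.array([1.0, 0.0, 0.0])
mu = np.array([np.cosh(0.7), np.sinh(0.7), 0.0])
v = np.array([0.0, 0.2, 0.5])
z = exp_map(parallel_transport(v, mu0, mu), mu)
v_rec = parallel_transport(inv_exp_map(z, mu), mu, mu0)
print(np.allclose(v_rec, v))  # True
```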