The Variational Autoencoder (VAE) addresses the limitations of autoencoder neural networks, which cannot generate new data on their own and struggle to determine the true distribution of the feature information in the hidden layers. By introducing a latent variable Z in the hidden layer and controlling its distribution, the VAE makes the output controllable [33]. The study explores various statistical feature extraction algorithms and ultimately focuses on the Deep Bayesian Network for analysis. Such a network can represent relationships between variables using neural networks and can effectively analyze complex structured data to identify feature information accurately.
VAE is a probabilistic model based on variational inference; its aim is to build a generative model rather than merely a reconstruction network. By extracting feature information through an approximate model function and reducing the resulting error, VAE improves computational efficiency. Constraints must be imposed on the network to ensure convergence during training and to prevent the latent variables from distorting the final prediction results. By adding these conditional restrictions, VAE overcomes the drawbacks of autoencoder neural networks and captures the relationship between the hidden variable Z and the visible variable X. Overall, VAE generates effective and reasonable feature information within the network, with the visible variable X being generated by the hidden variable Z. As shown in Figure 2, z follows a Gaussian distribution N(0, 1). Sampling z from the prior p(z), data are generated through p(x|z). Thus, the observable variable x is generated by the latent variable z, and p(x|z) represents the generative model which, from the perspective of an autoencoder, acts as the decoder; p(x|z) can be implemented with a neural network. q(z|x) is the recognition model, similar to the encoder in an autoencoder. The overall structure of a VAE, shown in Figure 3, differs from that of a standard autoencoder in that the VAE imposes additional constraints on the hidden layer, making it controllable.
Figure 2. Internal operation of the VAE.
Figure 3. The structure of the VAE.
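As a concrete illustration of the encoder/decoder structure described above, the following is a minimal sketch of a VAE in PyTorch. The framework, the class name, and all layer sizes (x_dim, h_dim, z_dim) are assumptions for illustration only; the recognition model q(z|x) plays the role of the encoder, the generative model p(x|z) the decoder, and new data are generated by sampling z from N(0, 1).

```python
# Minimal VAE sketch (PyTorch assumed); layer sizes are illustrative, not from the paper.
import torch
import torch.nn as nn

class VAE(nn.Module):
    def __init__(self, x_dim=784, h_dim=256, z_dim=20):
        super().__init__()
        # Recognition model q(z|x): maps x to the mean and log-variance of z (the "encoder").
        self.enc = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
        self.mu = nn.Linear(h_dim, z_dim)
        self.logvar = nn.Linear(h_dim, z_dim)
        # Generative model p(x|z): maps a latent z back to data space (the "decoder").
        self.dec = nn.Sequential(nn.Linear(z_dim, h_dim), nn.ReLU(),
                                 nn.Linear(h_dim, x_dim), nn.Sigmoid())

    def encode(self, x):
        h = self.enc(x)
        return self.mu(h), self.logvar(h)

    def decode(self, z):
        return self.dec(z)

    def generate(self, n=16):
        # Generation: sample z from the prior N(0, I) and push it through p(x|z).
        z = torch.randn(n, self.mu.out_features)
        return self.decode(z)
```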
After obtaining p and q, q needs to be as close to p as possible in order to achieve a good result; the key is how to measure the gap between them. The VAE uses the Kullback-Leibler (KL) divergence to measure the difference between q and p: a smaller KL value indicates that the two distributions are closer. In the VAE, the parameters of the generative model must be estimated, so an unknown distribution is assumed that satisfies the following relationship:

D_KL(q || p) = ∫ q(x) log ( q(x) / p(x) ) dx   (1)
Formula (1) above is called the relative entropy, Kullback-Leibler divergence, or KL divergence between q and p. Since the formula is not symmetric in structure, D_KL(q || p) ≠ D_KL(p || q).
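The asymmetry noted above can be checked numerically. The short sketch below evaluates Formula (1) in both directions for two made-up discrete distributions (the values are illustrative assumptions, not data from the paper):

```python
# Numerical illustration of Formula (1) and its asymmetry.
import numpy as np

def kl(q, p):
    # D_KL(q || p) = sum_i q_i * log(q_i / p_i)
    q, p = np.asarray(q, float), np.asarray(p, float)
    return float(np.sum(q * np.log(q / p)))

q = [0.7, 0.2, 0.1]
p = [0.4, 0.4, 0.2]
print(kl(q, p))  # ≈ 0.184
print(kl(p, q))  # ≈ 0.192  -> D_KL(q || p) != D_KL(p || q)
```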
The derivation in the previous section shows that the VAE needs to reduce the gap between p and q. In practical applications, for most values of the latent variable z in the hidden layer, p(x|z) will be close to 0, whereas q(z|x) represents the distribution that z needs to satisfy when generating the data set x, so evaluating the expectation of p(x|z) under q(z|x) instead of under p(z) does not affect the result. The core idea of the VAE is to sample z from q(z|x) and compute p(x|z) from the sampled z; q(z|x) should therefore be as close as possible to the true posterior p(z|x). The Kullback-Leibler divergence between q(z|x) and p(z|x) is:

D_KL(q(z|x) || p(z|x)) = E_{z~q(z|x)}[ log q(z|x) − log p(z|x) ]   (2)
Applying Bayes’ rule to p(z|x), i.e. p(z|x) = p(x|z) p(z) / p(x), and substituting p(x|z) and p(z) into Formula (2) allows the KL divergence to be transformed into:

log p(x) − D_KL(q(z|x) || p(z|x)) = E_{z~q(z|x)}[ log p(x|z) ] − D_KL(q(z|x) || p(z))   (3)
Formula (3) is the foundation of the VAE. In order to make q as close to p as possible, the KL divergence on the left-hand side must be minimized, which is equivalent to maximizing the left-hand side of the equation. Once the form of q(z|x) is chosen, a stochastic gradient descent algorithm can be used to optimize the right-hand side. Therefore, instead of relying directly on z, X is predicted through training q(z|x). The VAE compresses high-dimensional data into a low-dimensional z, and the generative network then produces a distribution that is as similar as possible to the original data.
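To show how the right-hand side of Formula (3) is optimized in practice, the sketch below computes a per-batch training loss. It assumes the VAE class sketched earlier, inputs scaled to [0, 1] with a Bernoulli (binary cross-entropy) decoder, and the reparameterization trick to keep the sampling of z differentiable; these modelling choices are assumptions for illustration, not details stated in the paper.

```python
# Sketch of the training objective from Formula (3): maximize
# E_{z~q(z|x)}[log p(x|z)] - KL(q(z|x) || p(z)) via stochastic gradient descent.
import torch
import torch.nn.functional as F

def elbo_loss(model, x):
    mu, logvar = model.encode(x)
    # Reparameterization: z = mu + sigma * eps keeps the sampling step differentiable.
    z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
    x_hat = model.decode(z)
    # Reconstruction term: -E_q[log p(x|z)] for a Bernoulli decoder (inputs in [0, 1]).
    recon = F.binary_cross_entropy(x_hat, x, reduction="sum")
    # KL(q(z|x) || N(0, I)) in closed form for a diagonal Gaussian q(z|x).
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl  # minimizing this loss maximizes the right-hand side of (3)
```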