Variational Autoencoders and Maximum Likelihood Estimation
In my previous blog, we explored maximum likelihood estimation (MLE) and how it can be used to derive commonly used loss functions. It also turns out that MLE is widely being used in generative models like Variational Autoencoders (VAE) and Diffusion models (DDPM). In this blog, we will explore how the loss function of Variational Autoencoders are derived. VAEs are latent variable generative models. They can solve a few tasks: Act as a generative model that mimics the data distribution that it was trained on. Approximate posterior inference of the latent variable $z$ given an observed variable $x$. In other words, it can be used to learn lower dimensional representations of the data it was trained on. Preliminary Information In this section, let us explore the tools necessary to derive a tractable form of the log-likelihood that we need to optimize. ...