Overview #

Diffusion Models #

Forward Noising Steps

  1. Given an image go to random noise
  2. Progressively add more noise
    • Starting at the first timestamp T_0, progressively add more and more noise until you get to 100% noise

Reverse Denoising Steps

  1. Given an image, can we learn to estimate the image at T-1 (the previous timestamp)
    • This is the reverse denoising step
    • This is the reverse of the forward noising step
    • All the two steps differ by is the noise function
      • The noise applied is different at each timestamp determined by a variance schedule

Sampling New Instances #

Beyond Images #

Latent Diffusion Models #

The process is as follows:

  1. Encode input into a latent space representation
  2. Sample from the latent space representation
  3. Go through the diffusion process on this latent space representation to genearte a high-quality image

Since we are operating on the latent space rather than raw pixels, any input (e.g. text) can be encoded into the latent space for generation