https://jalammar.github.io/illustrated-stable-diffusion/

https://www.youtube.com/watch?v=NhdzGfB1q74&ab_channel=rupertai

https://scale.com/guides/diffusion-models-guide#getting-started-with-diffusion-models

The goal of any diffusion model is to create an image from random gaussian noise based on its training and the input parameters

Going from the Latent space to the Pixel space-It goes from numeric representations of inputs and training data(vectors) and then uses an encoder-decoder model to generate the corresponding output in pixels, this process is repeated until a final image is generated

U-Net Architecture

Understanding Diffusion Models and U-Net Architecture

Notes from YouTube Video: "Understanding Diffusion Models" by rupertai

  1. Introduction to Diffusion Models:
  2. Forward Diffusion Process:
  3. Reverse Diffusion Process:
  4. Training the Model:
  5. Applications:

U-Net Architecture

The U-Net architecture is a type of convolutional neural network (CNN) originally designed for biomedical image segmentation. It has since been widely adopted in various fields, including diffusion models, due to its powerful ability to capture spatial information and perform detailed image generation tasks.

Key Features of U-Net:

  1. Encoder-Decoder Structure:
  2. Skip Connections: