Skip to main content

Diffusion Models




What is the Diffusion model?

Diffusion models lie in the domain of Computer Vision. Diffusion is the model of deep learning that deals with latent or hidden variables in an image by adding or removing noise.

The diffusion model is a computational framework used in psychology and neuroscience to describe decision-making processes and response times in tasks involving perceptual discrimination or categorization. It proposes that decision-making involves accumulating evidence over time from noisy sensory inputs until a threshold is reached, at which point a decision is made.

What is noise?

Unwanted information that disrupts the clarity and accuracy of the picture.

the context of the diffusion model, "noise" refers to random variability or fluctuations in the accumulation of evidence over time. These fluctuations can arise from various sources, including variability in sensory inputs, neural noise, and random fluctuations in cognitive processing.





Stable Diffusion:

Stable diffusion is a type of Diffusion model used to generate text to image.

How Diffusion Models are made?

Diffusion models are made to make the image noisy, then, by removing that noise and getting back the original image.

In simpler terms, stable diffusion implies that the decision-making process has settled into a consistent pattern, and further accumulation of evidence does not significantly alter the outcome.

Why are Latent Diffusion models made?

In the beginning, Stable diffusion was used but, DM is very expensive and more computational. Therefore, Latent Diffusion Models are made. LDM converts the picture into latent(hidden)  representation (vector representation) due to which its computations decrease, and then apply diffusion model concept on it. LDM is 48 times faster than Diffusion( Pixel by Pixel).

How many types of Diffusion process?

Two types of Diffusion process are there;

  1. Forward Diffusion Process: In this process, noise is added into the image continuously.

  2. Reverse/ Backward Diffusion Process: In this process, noise is removed from the image until it gets the original image.  

How to add a noise in an image

There are two ways of adding noise into the picture.

  • By unit time:

When you gradually add noise to an image over time, it means you're slowly making the image more distorted. Instead of adding all the noise at once, you do it little by little. This helps you control how much the image changes. By adjusting how quickly you add the noise and how much you add each time, you can decide how blurry or messy you want the image to look. This method is useful when you want to carefully change an image without making it too messy."
  • By CoSin:

This method uses a special math trick, often using something called cosine transformations, to add fuzziness to a picture. By using this math trick, we can add fuzziness in a way that we can control, making the picture look different in the way we want it to.

Some popular AI Image Generator Tools:

Here are some popular AI image generator tools:

  1. DeepArt.io:

  2. DeepArt uses neural networks to apply artistic styles to images, allowing users to create unique artworks by combining different styles with their own photos.


  3. RunwayML:

  4. RunwayML offers a range of AI-powered tools for creatives, including image generation, style transfer, and image manipulation. It provides an intuitive interface for experimenting with various AI models.


  5. DeepDream Generator:

  6. DeepDream Generator uses Google's DeepDream algorithm to generate psychedelic and surreal images. Users can upload their own images and customize parameters to create unique visual effects.

    1. DALL-E:

    2. Developed by OpenAI, DALL-E is an AI model capable of generating images from textual descriptions. It can create novel and realistic images based on specific prompts provided by users.


    3. Artbreeder:

    4. Artbreeder allows users to blend and morph images using AI algorithms, creating new and unique artworks. It offers a range of features for exploring and manipulating visual content.


    5. GANPaint Studio:

    6. GANPaint Studio uses generative adversarial networks (GANs) to edit and manipulate images in various ways. Users can add or remove objects, change colors, and perform other modifications using AI-powered tools.


    7. DeepFaceLab:

    8. DeepFaceLab is a powerful tool for face swapping and facial reenactment using deep learning techniques. It enables users to create realistic face swaps in videos and images with advanced customization options.

  7. Here are some generic prompt with generated AI images examples. Use these prompts and change words or phrases to your desired output from AI tools. 
Prompt:
Image of 3D cartoonish backside of a beautiful girl with long and brown hair in pink shiny gown under the tree alone;
Negative prompt:
Blurriness and fade colors:





Prompt:
Draw an image of black and white scenery with waterfall.
Negative Prompt:
Nothing










Comments

Popular posts from this blog

Fundamentals of Artificial Intelligence

Fundamentals of Artificial Intelligence   What is Artificial Intelligence ? Artificial Intelligence is a branch of computer science that deals with the development of methods that allow computers to learn without explicit programming. Artificial Intelligence is a way of making a digital brain by teaching computers different methods without human interference. In Artificial Intelligence computers learn themselves on the basis of prior experience and learning. Classical AI and Today's AI:   In previous times, AI was used manually or with human intervention. A person used to find a problem and solution by himself, then train the computer after setting rules and solutions related to that particular problem. Today's AI reduced the human effort and hard work. Nowadays, a person has to ask the problem only, AI gives all possible solutions related to that problem. Goal of AI: The goal of AI is to perform tasks as humans can do. For example: Making decisions and judge etc. Value Creati...

Machine Learning Vs Deep Learning

Basics of Machine Learning, Deep learning or Neural Networks:   Machine learning: Machine Learning is a great part of the Artificial intelligence field and is a popular way to execute AI projects through different methods. Normally in ML, there is no need to instruct computers, data is given to the system and according to prior experience and data, rules are made to apply further.  In the Machine Learning domain, models are trained on learning how to create rules for themselves to perform lookalike tasks by taking specific inputs by their desired possible outputs. ML typically requires processed or organized data with the involvement of humans. What is Deep Learning Study of neural networks is called deep learning.  DL is the study of doing ML with neural networks. In 2012, Neural Networks made up of basic and fundamental mathematical functions like addition, subtraction, multiplication and division etc. these networks  are like neurons in the human brain to make ...