Concept Intermediate · 3 min read

What is a denoising diffusion model

Q: What is a denoising diffusion model

A denoising diffusion model is a type of generative AI model that learns to create data by gradually removing noise from a noisy input. It works by reversing a diffusion process that adds noise step-by-step, enabling it to generate high-quality images, audio, or other data from random noise.

Quick answer

A denoising diffusion model is a type of generative AI model that learns to create data by gradually removing noise from a noisy input. It works by reversing a diffusion process that adds noise step-by-step, enabling it to generate high-quality images, audio, or other data from random noise.

Denoising diffusion model is a generative AI model that creates data by learning to reverse a noise-adding process, effectively denoising step-by-step to produce realistic outputs.

How it works

A denoising diffusion model starts with pure noise and learns to reverse a process that gradually adds noise to data. Imagine a photo slowly getting fuzzier and fuzzier until it’s just static noise. The model trains to undo this fuzziness step-by-step, recovering the original image. This is like learning to unscramble a scrambled puzzle by reversing each scrambling step.

Technically, the model learns the conditional probability of denoising at each step, using a neural network to predict the noise component and subtract it. Over many steps, it transforms random noise into a coherent sample that resembles the training data distribution.

Concrete example

Here is a simplified Python example using the diffusers library to generate an image with a pretrained denoising diffusion model:

python

import os
from diffusers import DDPMPipeline

# Load a pretrained denoising diffusion model pipeline
pipeline = DDPMPipeline.from_pretrained("google/ddpm-cifar10-32")

# Generate an image starting from noise
image = pipeline().images[0]

# Save or display the image
image.save("generated_image.png")
print("Image generated and saved as generated_image.png")

output

Image generated and saved as generated_image.png

When to use it

Use denoising diffusion models when you need high-quality generative outputs such as images, audio, or video synthesis. They excel at producing diverse, realistic samples and are robust to mode collapse compared to GANs. However, they require more computation and inference time due to iterative denoising steps.

Do not use them when you need ultra-fast generation or when simpler models suffice, as diffusion models trade speed for quality and diversity.

Key terms

Term	Definition
Diffusion process	A gradual noise-adding procedure that corrupts data step-by-step.
Denoising	The process of removing noise to recover original data.
Generative model	An AI model that can create new data samples resembling training data.
Neural network	A machine learning model used to predict noise and denoise data.
Iterative sampling	Generating data by repeatedly applying denoising steps starting from noise.

✅

Key Takeaways

Denoising diffusion models generate data by learning to reverse a noise corruption process step-by-step.
They produce high-quality, diverse outputs but require multiple inference steps, making them slower than some alternatives.
Use them for tasks like image and audio synthesis where quality and diversity are critical.
The core mechanism involves predicting and removing noise iteratively using a neural network.
Pretrained diffusion pipelines like those in the diffusers library enable easy experimentation with these models.

Verified 2026-04 · google/ddpm-cifar10-32

Verify ↗