What is autograd in PyTorch
Autograd is PyTorch's automatic differentiation engine. It records the operations performed on tensors and uses that record to compute gradients for backpropagation, enabling efficient gradient-based training of neural networks without hand-coded derivatives.
How it works
Autograd tracks all operations on tensors that have requires_grad=True by building a dynamic computational graph. Each tensor operation creates nodes in this graph representing functions and their inputs. When you call backward() on a tensor, autograd traverses this graph in reverse, applying the chain rule to compute gradients automatically. This is like keeping a detailed recipe of all steps taken to produce a result, then retracing those steps backward to find how each ingredient (input) affects the final dish (output).
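The graph-building behavior described above can be observed directly: each tensor produced by a tracked operation carries a `grad_fn` attribute pointing at the graph node that created it. A minimal sketch, assuming PyTorch is installed:

```python
import torch

x = torch.tensor(2.0, requires_grad=True)
y = x * 3        # multiplication is recorded as a node in the graph
z = y + 1        # addition is recorded as another node

# Each intermediate tensor points to the function that created it.
print(y.grad_fn)  # e.g. a MulBackward0 node
print(z.grad_fn)  # e.g. an AddBackward0 node

z.backward()      # traverse the graph in reverse, applying the chain rule
print(x.grad)     # dz/dx = 3.0
```

Note that `x` itself, being a user-created leaf tensor, has no `grad_fn`; only tensors produced by operations do.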
Concrete example
```python
import torch

# Create tensors with requires_grad=True to track computation
x = torch.tensor(2.0, requires_grad=True)
y = torch.tensor(3.0, requires_grad=True)

# Define a function of x and y
z = x * y + y ** 2

# Compute gradients
z.backward()

print(f"dz/dx: {x.grad}")  # dz/dx = y = 3.0
print(f"dz/dy: {y.grad}")  # dz/dy = x + 2y = 8.0
```

Output:

```
dz/dx: 3.0
dz/dy: 8.0
```
When to use it
Use autograd when training neural networks or any model that relies on gradient-based optimization, such as backpropagation in deep learning. It automates derivative calculations, saving time and reducing errors. Manual gradient computation is only worthwhile for educational purposes, or for custom gradient functions whose derivatives autograd cannot infer on its own.
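For the custom-gradient case mentioned above, PyTorch provides `torch.autograd.Function`, which lets you define both the forward computation and its backward rule explicitly. A minimal sketch (the `Square` class is a hypothetical example, not part of PyTorch):

```python
import torch

class Square(torch.autograd.Function):
    """Computes x**2 with a hand-written backward rule."""

    @staticmethod
    def forward(ctx, inp):
        ctx.save_for_backward(inp)  # stash the input for the backward pass
        return inp * inp

    @staticmethod
    def backward(ctx, grad_output):
        (inp,) = ctx.saved_tensors
        return grad_output * 2 * inp  # chain rule: d(x^2)/dx = 2x

x = torch.tensor(4.0, requires_grad=True)
out = Square.apply(x)
out.backward()
print(x.grad)  # 2 * 4.0 = 8.0
```

This pattern is typically reserved for operations autograd cannot differentiate, such as calls into external libraries or numerically unstable formulas that need a hand-tuned gradient.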
Key terms
| Term | Definition |
|---|---|
| Autograd | PyTorch's automatic differentiation engine for computing gradients. |
| Tensor | Multi-dimensional array with optional gradient tracking. |
| Computational graph | Dynamic graph recording tensor operations for backpropagation. |
| backward() | Method to compute gradients by traversing the computational graph in reverse. |
| requires_grad | Tensor attribute that enables gradient tracking. |
Key Takeaways
- Autograd automates gradient computation by dynamically building a computational graph during tensor operations.
- Set `requires_grad=True` on tensors to track operations for gradient calculation.
- Call `backward()` on the output tensor to compute gradients for all inputs involved.
- Autograd is essential for training neural networks with gradient descent optimization.
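The takeaways above come together in a training loop. A minimal sketch of one-parameter gradient descent using the forward / backward / update / zero-grad cycle (the toy data and learning rate are illustrative assumptions):

```python
import torch

# Toy least-squares problem: learn w such that w * x ≈ 2 * x.
w = torch.tensor(0.0, requires_grad=True)
x_data = torch.tensor([1.0, 2.0, 3.0])
y_data = 2.0 * x_data  # targets, so w should approach 2.0

lr = 0.1
for _ in range(50):
    loss = ((w * x_data - y_data) ** 2).mean()  # forward pass
    loss.backward()                             # autograd computes dloss/dw
    with torch.no_grad():
        w -= lr * w.grad                        # manual SGD update, untracked
    w.grad.zero_()                              # clear the accumulated gradient

print(round(w.item(), 2))  # converges toward 2.0
```

Note the `w.grad.zero_()` call: `backward()` accumulates gradients into `.grad` rather than overwriting them, so they must be cleared each iteration.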