What is dropout in neural networks?
Dropout is a regularization technique that randomly disables a fraction of neurons during training to prevent overfitting. In PyTorch, `torch.nn.Dropout` applies this by zeroing out neuron outputs with a specified probability, improving model generalization.

How it works
Dropout works by randomly setting a subset of neuron activations to zero during each training iteration, effectively "dropping out" those neurons. This prevents neurons from co-adapting too much, forcing the network to learn more robust and distributed representations. During inference, dropout is disabled and all neurons contribute; PyTorch uses "inverted" dropout, scaling the surviving activations by 1/(1-p) during training so that expected activation values match at inference without any extra scaling step.
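Both behaviors can be observed directly on a standalone `nn.Dropout` layer. The sketch below (the tensor of ones is purely illustrative) shows that in training mode surviving elements are scaled by 1/(1-p), while in evaluation mode the layer is a no-op:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
drop = nn.Dropout(p=0.5)
x = torch.ones(10)

drop.train()          # training mode: dropout is active
y_train = drop(x)     # survivors are scaled by 1/(1-p) = 2.0
print(y_train)        # a mix of 0.0 and 2.0

drop.eval()           # evaluation mode: dropout is disabled
y_eval = drop(x)
print(y_eval)         # all ones, unchanged
```

Note that no rescaling is needed at evaluation time, because the 1/(1-p) factor was already applied during training.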
Think of it like a team where some members randomly sit out each practice, so the team doesn't rely too heavily on any single player and becomes stronger overall.
Concrete example
```python
import torch
import torch.nn as nn

class SimpleNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(10, 50)
        self.dropout = nn.Dropout(p=0.3)  # 30% dropout
        self.fc2 = nn.Linear(50, 1)

    def forward(self, x):
        x = torch.relu(self.fc1(x))
        x = self.dropout(x)  # randomly zero some activations during training
        x = self.fc2(x)
        return x

model = SimpleNN()
model.train()  # enable dropout
input_tensor = torch.randn(5, 10)
output = model(input_tensor)
print(output)
```

Example output (exact values vary with random initialization):

```
tensor([[ 0.1234],
        [-0.5678],
        [ 0.2345],
        [ 0.3456],
        [-0.4567]], grad_fn=<AddmmBackward0>)
```
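Since the example uses `p=0.3`, roughly 30% of the activations after `fc1` are zeroed on each forward pass. A quick empirical check on a standalone layer (an illustrative sketch, using a large tensor of ones so the sample fraction is stable):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
drop = nn.Dropout(p=0.3)
drop.train()                               # dropout active

x = torch.ones(100_000)
y = drop(x)
frac_zero = (y == 0).float().mean().item()
print(f"{frac_zero:.3f}")                  # close to 0.3
```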
When to use it
Use dropout when training deep neural networks prone to overfitting, especially with limited data. It is most effective in fully connected layers and is sometimes applied to convolutional layers. Do not apply dropout during evaluation or inference; switch the model to evaluation mode so the full network capacity is used. Dropout is also less useful if you have a very large dataset or already rely on other strong regularization methods.
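For convolutional layers, PyTorch also provides `nn.Dropout2d`, which drops entire feature maps (channels) rather than individual activations, since neighboring pixels within a channel are strongly correlated. A brief sketch (shapes are illustrative):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
drop2d = nn.Dropout2d(p=0.5)
drop2d.train()                 # channel dropout active in training mode

x = torch.ones(1, 8, 4, 4)     # (batch, channels, height, width)
y = drop2d(x)

# Each channel is either dropped (all zeros) or kept whole
# (all elements scaled by 1/(1-p) = 2.0).
for c in range(8):
    print(c, y[0, c].unique().tolist())
```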
Key terms
| Term | Definition |
|---|---|
| Dropout | Randomly zeroing neuron outputs during training to prevent overfitting. |
| Overfitting | When a model learns noise or details specific to training data, reducing generalization. |
| Regularization | Techniques to reduce overfitting and improve model generalization. |
| Inference | Using a trained model to make predictions on new data. |
| Activation | Output value of a neuron after applying its function. |
Key Takeaways
- Use `torch.nn.Dropout` to randomly disable neurons during training and improve generalization.
- Dropout should be enabled only during training, not during inference or evaluation.
- Typical dropout rates range from 0.1 to 0.5 depending on model complexity and data size.