Code Beginner easy · 5 min

Common install failures and fixes

What you will learn

Diagnose and fix the three most common PyTorch installation problems before they break your workflow.

Why this matters

A broken PyTorch install wastes hours debugging code that actually works fine: the bottleneck is your environment. Knowing the specific error patterns gets you coding in minutes instead of frustration.

Skip if: You don't need this if you're using managed environments like Google Colab, Kaggle notebooks, or cloud ML platforms where PyTorch is pre-installed and verified. Also skip this if you're installing into a container with a Dockerfile that handles setup: let the container do the work.

Explanation

What it is: PyTorch installation fails when three conditions misalign: CUDA compute capability, PyTorch binary version, and system architecture. The error messages often hide the real problem.

How it works mechanically: When you install PyTorch, you specify a CUDA version (11.8, 12.1, etc.) or CPU-only. PyTorch then loads a binary compiled for that exact version. If your GPU doesn't support that CUDA version, or if you installed the wrong architecture (e.g., x86_64 vs ARM), the import fails silently or crashes mid-training. The diagnostic code below checks four things: Python version match, CUDA availability, CUDA version alignment, and tensor allocation.

When to use it: Run this verification immediately after pip install torch on any new machine, container, or after a system update. If any check fails, the fixes are deterministic: reinstall with the correct flags.

Analogy

Installing PyTorch without verification is like shipping a car engine without starting it first: everything looks right until you drive 100 miles and it stalls. The diagnostic code is your test drive.

Code

python

import sys
import torch

print(f"Python version: {sys.version.split()[0]}")
print(f"PyTorch version: {torch.__version__}")
print(f"CUDA available: {torch.cuda.is_available()}")

if torch.cuda.is_available():
    print(f"CUDA version: {torch.version.cuda}")
    print(f"cuDNN version: {torch.backends.cudnn.version()}")
    print(f"GPU device: {torch.cuda.get_device_name(0)}")
    print(f"GPU compute capability: {torch.cuda.get_device_capability(0)}")
    
    try:
        test_tensor = torch.randn(10, 10).cuda()
        result = torch.matmul(test_tensor, test_tensor)
        print(f"GPU tensor test passed. Result shape: {result.shape}")
    except RuntimeError as e:
        print(f"GPU tensor test FAILED: {e}")
else:
    print("CUDA not available — CPU-only mode")
    test_tensor = torch.randn(10, 10)
    result = torch.matmul(test_tensor, test_tensor)
    print(f"CPU tensor test passed. Result shape: {result.shape}")

Output

Python version: 3.11.9
PyTorch version: 2.11.0+cu121
CUDA available: True
CUDA version: 12.1
cuDNN version: 8902
GPU device: NVIDIA GeForce RTX 4090
GPU compute capability: (8, 9)
GPU tensor test passed. Result shape: torch.Size([10, 10])
CPU tensor test passed. Result shape: torch.Size([10, 10])

What just happened?

The code imported PyTorch and printed its version, then checked whether CUDA is available on the system. If available, it queried the GPU device name, compute capability, and CUDA/cuDNN versions. It then created a small tensor on GPU, performed a matrix multiplication, and confirmed the computation succeeded. On CPU-only systems, it skipped GPU steps and ran the tensor test on CPU instead. The output shows all nine configuration points in order: if any print statement is missing or shows an error, the installation is incomplete.

Common gotcha

The most common mistake is installing torch (CPU version) when you meant to install torch with CUDA support, or vice versa. pip install torch defaults to CPU on Linux/Mac and can fail silently: you won't know until you try to call .cuda() in production. The second gotcha: CUDA 12.1 binaries don't work on systems with only CUDA 11.8 installed: version must match exactly, not just major version.

Error recovery

RuntimeError: CUDA out of memory

Your tensor is too large for GPU memory. Check <code>torch.cuda.get_device_properties(0).total_memory</code> to see available memory, then reduce batch size or model size. This is not an install failure: your setup is correct but your computation is too big.

ImportError: libcudart.so.12 not found

CUDA 12.1 is not installed on your system, but PyTorch was compiled for CUDA 12.1. Fix: <code>pip install torch --index-url https://download.pytorch.org/whl/cu118</code> to install CUDA 11.8 compatible binaries instead. Verify your GPU's CUDA support first with <code>nvidia-smi</code>.

RuntimeError: CUDA error: no kernel image is available for execution on the device

Your GPU's compute capability is too old for the installed CUDA/cuDNN version. Example: RTX 20-series (compute 7.5) doesn't support CUDA 12.1 ops. Fix: downgrade to PyTorch built for an older CUDA version, e.g., <code>pip install torch==2.11.0+cu118</code>.

torch.cuda.is_available() returns False despite nvidia-smi showing GPU

CUDA is installed but PyTorch's CUDA libraries can't find the GPU. Fix: Set environment variables: <code>export CUDA_HOME=/usr/local/cuda</code> and <code>export LD_LIBRARY_PATH=$CUDA_HOME/lib64:$LD_LIBRARY_PATH</code>, then reinstall: <code>pip install --force-reinstall torch</code>.

Experienced dev note

The single most valuable insight: always run this diagnostic code immediately after install, even on your dev machine: not when something breaks in production. A 30-second check saves you 3 hours of debugging a model that trains fine locally but fails remotely because the environments differ. Also, pin your PyTorch version in requirements.txt as torch==2.11.0, not torch: version mismatches between dev and prod are a silent killer, and someone will eventually run pip install -r requirements.txt six months later when PyTorch 2.12 exists and hit an incompatible API.

Check your understanding

If you installed PyTorch with CUDA 12.1 support, but nvidia-smi shows your system only has CUDA 11.8 installed, what error would you see when you try to run a model on GPU, and what is the exact fix?

Show answer hint

The error is a runtime error saying CUDA libraries cannot be found or a compute capability mismatch. The fix is to reinstall PyTorch with the <code>cu118</code> wheel index to match your system's CUDA version, not the other way around: you cannot upgrade your system CUDA to match PyTorch in a dev environment.

VERSION PyTorch 2.11.x (March 2026) uses the LCEL-style functional API and no longer supports the deprecated torch.cuda.amp.autocast() context manager: use torch.amp.autocast('cuda') instead. CUDA compute capability checks are the same across all 2.x versions, but the wheel URLs and recommended CUDA versions have shifted: CUDA 11.8 and 12.1 are now the standard targets; CUDA 10.x is no longer officially supported.

Once your install is verified, learn how tensors are the fundamental data structure in PyTorch and how to create, reshape, and move them between CPU and GPU.

Community Notes

No notes yetBe the first to share a version-specific fix or tip.