High severity beginner · Fix: 2-5 min

Wrong language specified / Language mismatch

openai.whisper - Language parameter mismatch or incorrect language code

What this error means

Whisper transcribes audio in the wrong language because the language parameter doesn't match the actual audio content, or the language code format is invalid.

Stack trace

traceback

Transcription result in wrong language detected.
Audio content: English speech
Specified language: 'es' (Spanish)
Transcription output: Spanish text interpretation of English audio

OR

whisper.load_model('base')
result = model.transcribe('audio.mp3', language='invalid_code')
KeyError: 'invalid_code' not found in supported languages

OR

No language specified, Whisper auto-detects and chooses wrong language (e.g., detects Portuguese instead of Spanish)

QUICK FIX

Always specify language explicitly with ISO-639-1 code: model.transcribe('audio.mp3', language='en') instead of relying on auto-detection.

Why it happens

Whisper auto-detects language when none is specified, and this detection can fail on short audio clips, background noise, or similar-sounding languages. When you explicitly specify a language parameter with an invalid code (e.g., 'spanish' instead of 'es') or a code that doesn't match the audio content, Whisper forces transcription in that language, producing gibberish or incorrect text. The language parameter must use ISO-639-1 two-letter codes (es, en, fr, pt, etc.): longer codes or language names cause errors.

Detection

Compare the transcribed text language to your expected language before accepting the result. Add language confidence checks by examining the detected_language field in Whisper's output, or use a secondary language detection library (textblob, langdetect) on the transcription result to verify accuracy.

Causes & fixes

Language parameter uses invalid format (e.g., 'english' or 'Spanish' instead of ISO-639-1 code)

✓ Fix

Convert language names to two-letter ISO-639-1 codes: 'es' for Spanish, 'en' for English, 'fr' for French, 'pt' for Portuguese, 'de' for German, 'it' for Italian, 'ja' for Japanese, 'zh' for Chinese, 'ru' for Russian, 'ar' for Arabic. See full list at https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes

No language specified and Whisper auto-detects wrong language (common with short clips or low-quality audio)

✓ Fix

Explicitly specify the language parameter: model.transcribe('audio.mp3', language='en') to force transcription in the correct language and bypass unreliable auto-detection.

Audio actually contains speech in a different language than expected due to content error or file mix-up

✓ Fix

Verify the audio file content by listening to it first. If multi-language audio, use Whisper's detected_language field from the output to identify what language was actually detected and transcribe accordingly.

Using deprecated Whisper model versions that have different language support or detection behavior

✓ Fix

Update to whisper-large-v3 (latest) which has improved language detection accuracy: model = whisper.load_model('large-v3'). For production, use OpenAI's Whisper API (whisper-1) which is always up-to-date.

Code: broken vs fixed

Broken - triggers the error

python

import whisper
import os

# BROKEN: Using language name instead of ISO code
model = whisper.load_model('base')
result = model.transcribe(
    'speech.mp3',
    language='English'  # ❌ WRONG: 'English' is not valid, must be 'en'
)
print(result['text'])

# BROKEN: Relying on auto-detection which fails on Spanish-sounding English
result2 = model.transcribe('ambiguous_audio.mp3')  # ❌ No language specified — may detect Portuguese instead of Spanish
print(result2['text'])

Fixed - works correctly

python

import whisper
import os

# FIXED: Using correct ISO-639-1 language code
model = whisper.load_model('large-v3')  # ✅ Use latest model for better accuracy
result = model.transcribe(
    'speech.mp3',
    language='en'  # ✅ FIXED: 'en' is valid ISO-639-1 code for English
)
print(f"Transcription: {result['text']}")
print(f"Detected language: {result.get('language', 'unknown')}")

# FIXED: Explicitly specify language to bypass unreliable auto-detection
result2 = model.transcribe(
    'ambiguous_audio.mp3',
    language='es'  # ✅ FIXED: Explicitly force Spanish transcription
)
print(f"Transcription: {result2['text']}")

# ALTERNATIVE: Use OpenAI API for production (always up-to-date)
from openai import OpenAI
client = OpenAI(api_key=os.environ.get('OPENAI_API_KEY'))

with open('speech.mp3', 'rb') as f:
    transcript = client.audio.transcriptions.create(
        model='whisper-1',
        file=f,
        language='en'  # ✅ Explicit language parameter
    )
print(f"API Transcription: {transcript.text}")

Changed 'English' to 'en' (valid ISO-639-1 code) and explicitly specified language to prevent auto-detection errors. Used whisper-large-v3 for better accuracy and showed both local model and OpenAI API approaches.

⚠

Workaround

If you cannot immediately fix the language code format, wrap transcription in try/except and catch KeyError for invalid codes. On error, fall back to auto-detection or use a language detection library on the input audio first: use librosa or pydub to extract a sample, detect language with langdetect, then pass the detected code to Whisper. Alternatively, manually verify the audio language before calling transcribe().

✓

Prevention

Always validate language codes against a whitelist of ISO-639-1 codes before calling transcribe(). Build a mapping function: VALID_LANGUAGES = {'en': 'English', 'es': 'Spanish', 'fr': 'French', ...} and check user input against it. For production, use OpenAI's Whisper API (whisper-1) with explicit language parameters: never rely on auto-detection for critical workflows. Log the detected_language field from Whisper output to audit transcription accuracy.

Python 3.8+ · openai-whisper >=20231117 · tested on 20240314

Verified 2026-04 · whisper-1, whisper-large-v3

Verify ↗

Community Notes

No notes yetBe the first to share a version-specific fix or tip.