How to handle streaming events in the OpenAI Chat Completions API
Pass the stream=True parameter in your client.chat.completions.create() call to receive streaming events from the OpenAI API, then iterate over the returned stream to process partial responses in real time.

Why this happens
Developers often call the Chat Completions endpoint without enabling streaming, so the entire response is returned only after generation completes. Attempting to iterate over a non-streaming response, or setting stream=True but never iterating over the returned stream, means no partial data is received, leading to blocking calls or missing real-time updates.
Typical broken code example:
from openai import OpenAI
import os

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello"}]
    # Missing stream=True
)

# Trying to access streaming events incorrectly
for event in response:  # Fails: a non-streaming response is a single object
    print(event.choices[0].delta.content)

# TypeError: 'ChatCompletion' object is not iterable
The fix
Enable streaming by passing stream=True to chat.completions.create(). The method then returns a stream of chunk objects; iterate over it to receive partial message deltas as they arrive, enabling real-time processing and display.
This approach reduces latency and improves user experience in chat applications.
from openai import OpenAI
import os

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello"}],
    stream=True
)

for event in response:
    delta = event.choices[0].delta
    if delta.content:  # content is None on role-only and final chunks
        print(delta.content, end="", flush=True)
print()

# Output: Hello, how can I assist you today?
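In a chat application you usually also need the complete reply once the stream ends, not just the printed deltas. A minimal sketch of accumulating chunks into the full message (the fake_chunk helper is a stand-in so the example runs without an API key; with a real client you would pass the object returned by chat.completions.create(stream=True) instead):

```python
from types import SimpleNamespace

def collect_stream(stream):
    """Print streamed deltas as they arrive and return the full reply.

    `stream` is any iterable of chunk objects shaped like those yielded
    by chat.completions.create(stream=True): each chunk exposes
    .choices[0].delta.content (a str, or None on role-only/final chunks).
    """
    parts = []
    for chunk in stream:
        delta = chunk.choices[0].delta
        if delta.content:
            print(delta.content, end="", flush=True)
            parts.append(delta.content)
    print()
    return "".join(parts)

# Stand-in chunks so this sketch runs offline (assumed shape, not a
# real API object):
def fake_chunk(text):
    return SimpleNamespace(
        choices=[SimpleNamespace(delta=SimpleNamespace(content=text))]
    )

reply = collect_stream([fake_chunk("Hello"), fake_chunk(", world"), fake_chunk(None)])
print(repr(reply))  # -> 'Hello, world'
```

Accumulating while printing lets you log or post-process the complete message without a second, non-streaming request.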
Preventing it in production
Implement robust error handling and retries around streaming calls to handle network interruptions gracefully. Validate that stream=True is set when streaming is desired. Use timeouts and backoff strategies to maintain reliability.
Consider fallback to non-streaming calls if streaming fails. Monitor streaming latency and partial data completeness to ensure smooth user experience.
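The retry-with-backoff and non-streaming fallback described above can be sketched as a small wrapper. stream_with_fallback, start_stream, and fallback are hypothetical names, not part of the OpenAI SDK; the two callables are assumed to wrap your actual streaming and non-streaming calls and return the full reply text:

```python
import time

def stream_with_fallback(start_stream, fallback, retries=3, base_delay=1.0):
    """Try the streaming call up to `retries` times with exponential
    backoff; fall back to the non-streaming call if all attempts fail.

    `start_stream` and `fallback` are zero-argument callables (assumed
    wrappers around your API calls) returning the full reply text.
    """
    for attempt in range(retries):
        try:
            return start_stream()
        except Exception:
            # Backoff: base_delay, 2*base_delay, 4*base_delay, ...
            time.sleep(base_delay * (2 ** attempt))
    return fallback()

# Demo: a flaky streaming call that drops once, then succeeds.
calls = {"n": 0}
def flaky_stream():
    calls["n"] += 1
    if calls["n"] < 2:
        raise ConnectionError("connection dropped mid-stream")
    return "streamed reply"

print(stream_with_fallback(flaky_stream, lambda: "non-streamed reply", base_delay=0))
# -> streamed reply
```

Catching a broad Exception keeps the sketch short; in production you would narrow it to the network and API errors you actually expect, and log each failed attempt.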
Key Takeaways
- Always set stream=True to receive streaming events from the OpenAI API.
- Iterate over the response stream to process partial message chunks in real time.
- Implement retries and error handling to maintain streaming reliability in production.