OpenAI WebSocket Mode for Responses API
About OpenAI WebSocket Mode for Responses API
Launched March 1, 2026
On every agent turn, you’re resending the full context. Again. That overhead compounds fast. WebSocket Mode for the Responses API keeps a persistent connection open, sends only the new inputs for each turn, and cuts end-to-end latency by up to 40% on tool-call-heavy workflows.
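
To see why resending compounds, here is a rough sketch that counts upload bytes only. It is not the actual Responses API wire format: the item shape, the 20-turn workload, and the helper names `full_resend_bytes` and `incremental_bytes` are all assumptions made for illustration, and no network calls are made.

```python
import json


def full_resend_bytes(turn_inputs):
    """Total bytes uploaded if every turn re-sends the whole history so far."""
    total = 0
    history = []
    for item in turn_inputs:
        history.append(item)
        # Each turn serializes and sends everything accumulated up to now.
        total += len(json.dumps(history).encode("utf-8"))
    return total


def incremental_bytes(turn_inputs):
    """Total bytes uploaded if each turn sends only its new item."""
    return sum(len(json.dumps([item]).encode("utf-8")) for item in turn_inputs)


# Hypothetical workload: 20 tool-call turns, each adding a 500-character output.
turns = [
    {"type": "tool_output", "turn": i, "output": "x" * 500}
    for i in range(20)
]

full = full_resend_bytes(turns)
incr = incremental_bytes(turns)
print(f"full resend: {full} bytes, incremental: {incr} bytes")
```

With full resends the upload grows roughly with the square of the number of turns, while incremental sends grow linearly. This only models payload size; it does not measure latency and does not reproduce the 40% figure.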