Skip to content

swooby/openai-realtime-push-to-talk

 
 

Repository files navigation

OpenAI Realtime Console: Push-To-Talk (PTT)

Demo


This fork of https://github.com/openai/openai-realtime-console was created on 2025/02/07 and is a work in progress for a hosted demo Push-To-Talk test webapp for the OpenAI Realtime WebRTC [and maybe future retro WebSocket] API.


This is an example application showing how to use the OpenAI Realtime API with WebRTC.

Installation and usage

Before you begin, you'll need an OpenAI API key - create one at https://platform.openai.com/settings/api-keys with the following permissions:

  • Models = Read
  • Model capabilities = Write

Save the generated key in a safe place and enter it in the "OpenAI API Key" field in the main page.

This [fully open source] app only ever uses your OpenAI API Key to directly request a one-minute ephemeral key from api.openai.com.
The key is otherwise never logged, stored, or sent anywhere else.

Code pushed to main automatically deploys to the cloud, so the code running in the cloud IS the code that is on main.

THIS IS SAFE! There is effectively zero difference in risk between:

  1. You entering your [dangerous] OpenAI API Key in an open source cloud run client only app.
    -and/or-
  2. You entering that same key in an .env file for a locally run open source server + client app.

You still have to review and trust the code either way; You can always run the client only code locally if you don't trust the cloud run version. If you trust running the code locally on your machine and not from the cloud then you may have trust issues.


Running this application locally requires Node.js to be installed. Install dependencies for the application with:

npm install

Start the application server with:

npm run dev

This should start the console application on http://localhost:3000.

This application is a minimal template that uses vite to build the React frontend contained in the /client folder.

This application shows how to send and receive Realtime API events over the WebRTC data channel and configure client-side function calling. You can also view the JSON payloads for client and server events using the logging panel in the UI.

For a more comprehensive example, see the OpenAI Realtime Agents demo built with Next.js, using an agentic architecture inspired by OpenAI Swarm.

Previous WebSockets version

The previous version of this application that used WebSockets on the client (not recommended in browsers) can be found here.

License

MIT

About

React app for inspecting, building and debugging with the Realtime API

Resources

License

Stars

Watchers

Forks

Languages

  • JavaScript 90.6%
  • CSS 7.9%
  • HTML 1.5%