Skip to content

Copy of the cbh123/narrator project, but in NodeJS with Sockets

Notifications You must be signed in to change notification settings

leo1994/ai-narrator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ChatGPT Vision and ElevenLabs

Someone at my work shared a Tweet about this project here, and I decided to make a clone in Node.JS

demo.mov

Requirements

How it works

The difference between my project and the original project is that I am using socket.io to Stream the audio to the client using Socket.io. How it works is that the client uses the webcam and captures it, after which it converts the image to base64 and sends it to the server. The server receives the image and sends it to the ChatGPT Vision API to get a description of the image based on the prompt. After that, it sends the description to the ElevenLabs API to get the audio file of the description, and finally, the server sends the audio file to the client using Socket.io.

How to use

  1. Clone the repo.
  2. Run npm install.
  3. Create a .env file using the .env.example file as a template.
  4. Run npm start:client to start the client.
  5. Run npm start:server to start the server.
  6. Go to http://localhost:8080 and enjoy!

About

Copy of the cbh123/narrator project, but in NodeJS with Sockets

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published