Skip to content

Gemini Multimodal Live App with Next.js Framework, welcome to my YouTube channel for more interesting projects.

Notifications You must be signed in to change notification settings

yeyu2/gemini-nextjs

Repository files navigation

Multimodal Realtime App with Gemini 2.0 by Next.js Framework

A demonstration project showing how to build a realtime multimodal application using Google's Gemini 2.0 API and Next.js. This app can process audio, video, and generate transcripts in realtime.

Watch Demo Video

Watch the Tutorial Video

Features

  • Realtime audio/video(image) interaction with Gemini 2.0 Multimodal Live API
  • Live transcription by Gemini 1.5/2.0 GenerativeAI API
  • Built with Next.js for optimal performance

Architecture

Block Diagram

Prerequisites

  • Node.js 18+ installed
  • API key for Gemini 2.0 Model

Getting Started

  1. Clone the repository
git clone https://github.com/yeyu2/gemini-nextjs.git
cd gemini-nextjs
  1. Install dependencies
npm install
# or
yarn install
  1. Set up environment variables
cp .env.example .env.local

Add your Gemini API key to .env.local:

GEMINI_API_KEY=your_api_key_here
  1. Run the development server
npm run dev
# or
yarn dev

Open http://localhost:3000 with your browser to see the application.

About

Gemini Multimodal Live App with Next.js Framework, welcome to my YouTube channel for more interesting projects.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published