AutoShow

Outline

Project Overview
Setup
Run AutoShow Node Scripts
Project Structure
Contributors

Project Overview

AutoShow automates the processing of audio and video content from various sources, including YouTube videos, playlists, podcast RSS feeds, and local media files. It leverages advanced transcription services and language models (LLMs) to perform transcription, summarization, and chapter generation.

Prompts and Content Formats

AutoShow can generate diverse content formats including:

Summaries and Chapters:
- Concise summaries
- Detailed chapter descriptions
- Bullet-point summaries
- Chapter titles with timestamps
Social Media Posts:
- X (Twitter)
- Facebook
- LinkedIn
Creative Content:
- Rap songs
- Rock songs
- Country songs
Educational and Informative Content:
- Key takeaways
- Comprehension questions
- Frequently asked questions (FAQs)
- Curated quotes
- Blog outlines and drafts

Key Features

Support for multiple input types (YouTube links, RSS feeds, local video and audio files)
Integration with various:
- LLMs (ChatGPT, Claude, Gemini, Deepseek, Fireworks, Together)
- Transcription services (Whisper.cpp, Deepgram, Assembly)
Local LLM support with Ollama
Customizable prompts for generating titles, summaries, chapter titles/descriptions, key takeaways, and questions to test comprehension
Markdown output with metadata and formatted content

AutoShow Pipeline

The AutoShow workflow includes the following steps that feed sequentially into each other:

The user provides a content input (video URL, playlist, RSS feed, or local file) and front matter is created based on the content's metadata.
The audio is downloaded (if necessary).
Transcription is performed using the selected transcription service.
A customizable prompt is inserted containing instructions for the show notes or other content forms.
The transcript is processed by the selected LLM service to generate the desired output based on the selected prompts.

Setup

scripts/setup.sh checks to ensure a .env file exists, Node dependencies are installed, and the whisper.cpp repository is cloned and built. Run the script with the setup script in package.json.

npm run setup

Run AutoShow Node Scripts

Run on a single YouTube video.

npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk"

Run on a YouTube playlist.

npm run as -- --playlist "https://www.youtube.com/playlist?list=PLCVnrVv4KhXPz0SoAVu8Rc1emAdGPbSbr"

Run on a list of arbitrary URLs.

npm run as -- --urls "content/examples/example-urls.md"

Run on a local audio or video file.

npm run as -- --file "content/examples/audio.mp3"

Run on a podcast RSS feed.

npm run as -- --rss "https://ajcwebdev.substack.com/feed"

Use local LLM.

npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk" --ollama

Use 3rd party LLM providers.

npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk" --chatgpt
npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk" --claude
npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk" --gemini
npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk" --deepseek
npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk" --fireworks
npm run as -- --video "https://www.youtube.com/watch?v=MORMZXEaONk" --together

Example commands for all available CLI options can be found in docs/examples.md.

Project Structure

Root Level Configuration

Root-Level Files
- tsconfig.json: TypeScript configuration file specifying compiler options.
- railway.json: Configuration for Railway deployment.
- package.json: Contains project dependencies, scripts, and metadata.
- .env.example: Example environment variables file for configuration.
- .dockerignore: Specifies files/folders ignored by Docker during builds.
Database Schema (prisma)
- prisma/schema.prisma: Defines the Prisma ORM schema for database structure and models.
- prisma/migrations/20250303081347_shownotes/migration.sql: SQL script detailing schema migrations for Prisma.
Shared Resources (shared)
- shared/constants.ts: Globally shared constants across multiple modules.

Node CLI and Server Backend

Main Entry Points (src)
- commander.ts: CLI setup using Commander.js.
- commander-utils.ts: Helper functions for CLI usage.
- db.ts: Initializes the database connection via Prisma.
- fastify.ts: Sets up and configures the Fastify web server.

Process Commands and Process Steps

Process Commands (src/process-commands)
- file.ts: Processes local audio/video files.
- video.ts: Processes single YouTube videos.
- urls.ts: Processes videos listed in a URL file.
- playlist.ts: Processes YouTube playlists.
- channel.ts: Processes all videos from YouTube channels.
- channel-utils.ts: Helpers specific to YouTube channel processing.
- rss.ts: Processes podcast RSS feeds.
- rss-utils.ts: Helpers for RSS feed processing.
Process Steps (src/process-steps)
- 01-generate-markdown.ts: Creates initial markdown file with metadata.
- 01-generate-markdown-utils.ts: Utility functions for the markdown generation step.
- 02-download-audio.ts: Downloads audio from YouTube videos.
- 02-download-audio-utils.ts: Utility functions for the audio download step.
- 03-run-transcription.ts: Manages transcription processes.
- 03-run-transcription-utils.ts: Utility functions for the transcription step.
- 04-select-prompt.ts: Defines prompts for summarization and chapter creation.
- 05-run-llm.ts: Runs language model processes based on prompts.
- 05-run-llm-utils.ts: Utility functions for running language models.

Transcription and LLM Services

Transcription Services (src/transcription)
- whisper.ts: Implements transcription with Whisper.cpp.
- deepgram.ts: Integration with Deepgram API for transcription.
- assembly.ts: Integration with AssemblyAI API for transcription.
Language Models (src/llms)
- ollama.ts: Integration with local Ollama models.
- chatgpt.ts: Integration with OpenAI's GPT models.
- claude.ts: Integration with Anthropic's Claude models.
- gemini.ts: Integration with Google's Gemini models.
- fireworks.ts: Integration with Fireworks open-source models.
- together.ts: Integration with Together open-source models.
- deepseek.ts: Integration with DeepSeek AI models.

Utility Files

Utility Files (src/utils)
- create-clips.ts: Utility to create video/audio clips.
- logging.ts: Reusable logging utilities using Chalk for colorized output.
- types.ts: Commonly used TypeScript types.
- dash-documents.ts: Helpers or scripts related to Dash payments.
- node-utils.ts: Node.js-specific utilities.
Embeddings Utilities (src/utils/embeddings)
- create-embed.ts: Functions for creating embeddings.
- query-embed.ts: Functions for querying embeddings.
Image Generation Utilities (src/utils/images)
- black-forest-labs-generator.ts: Integration for image generation with Black Forest Labs.
- dalle-generator.ts: Integration for OpenAI's DALL·E image generation.
- stability-ai-generator.ts: Integration for Stability AI image generation.
- combined-generator.ts: Combines multiple image generators.
- utils.ts: Common image-related helper functions.
- index.ts: Centralized exports for image utilities.

Astro Web Frontend

Web Frontend Configuration Files (web Module):
- astro.config.ts: Configuration for Astro web application.
- package.json: Dependencies and scripts for web frontend.
- tsconfig.json: TypeScript configuration for web module.
Web Source Files (web/src):
- env.d.ts: Type declarations for environment variables.
- site.config.ts: Site-specific configuration settings.
- types.ts: Shared TypeScript types.
- styles/global.css: Global CSS styles.
Pages (web/src/pages):
- index.astro: Homepage.
- 404.astro: 404 error page.
- show-notes/[id].astro: Dynamic pages for individual show notes.
Layouts (web/src/layouts):
- Base.astro: Base layout used across pages.
Components (web/src/components):
- BaseHead.astro: Common HTML head elements.
App Components (web/src/components/app):
- App.tsx
- Form.tsx
- ShowNote.tsx
- ShowNotes.tsx
Grouped Components (web/src/components/app/groups):
- LLMService.tsx
- ProcessType.tsx
- Prompts.tsx
- TranscriptionService.tsx

Contributors

✨Hello beautiful human! ✨Jenn Junod host of Teach Jenn Tech & Shit2TalkAbout

Name	Name	Last commit message	Last commit date
Latest commit ajcwebdev Merge pull request #152 from ajcwebdev/staging Mar 9, 2025 d79af6d · Mar 9, 2025 History 591 Commits
.github	.github	tests	Mar 5, 2025
content	content	update tests	Mar 6, 2025
docs	docs	update cli tests	Mar 6, 2025
prisma	prisma	dash integration and cli docs	Mar 3, 2025
shared	shared	change cost to cents	Mar 5, 2025
src	src	change cost to cents	Mar 5, 2025
test	test	fix test files	Mar 6, 2025
web	web	fix test files	Mar 6, 2025
.dockerignore	.dockerignore	optimize docker compose build	Nov 5, 2024
.env.example	.env.example	update readme project structure	Mar 1, 2025
.gitignore	.gitignore	update tests	Mar 6, 2025
LICENSE	LICENSE	mit ftw	Jun 26, 2024
README.md	README.md	update tests	Mar 6, 2025
package-lock.json	package-lock.json	dash contract	Mar 3, 2025
package.json	package.json	fix script	Mar 6, 2025
railway.json	railway.json	railway config file	Feb 18, 2025
tsconfig.json	tsconfig.json	create dir for clips	Feb 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AutoShow

Outline

Project Overview

Prompts and Content Formats

Key Features

AutoShow Pipeline

Setup

Run AutoShow Node Scripts

Project Structure

Root Level Configuration

Node CLI and Server Backend

Process Commands and Process Steps

Transcription and LLM Services

Utility Files

Astro Web Frontend

Contributors

About

Sponsor this project

Packages

Contributors 2

Languages

License

ajcwebdev/autoshow

Folders and files

Latest commit

History

Repository files navigation

AutoShow

Outline

Project Overview

Prompts and Content Formats

Key Features

AutoShow Pipeline

Setup

Run AutoShow Node Scripts

Project Structure

Root Level Configuration

Node CLI and Server Backend

Process Commands and Process Steps

Transcription and LLM Services

Utility Files

Astro Web Frontend

Contributors

About

Topics

Resources

License

Stars

Watchers

Forks

Sponsor this project

Packages 0

Contributors 2

Languages

Packages