Skip to content

Orchestrate Modal and OpenAI workloads with Dagster

Notifications You must be signed in to change notification settings

dagster-io/dagster-modal-demo

 
 

Repository files navigation

Dagster × Modal Demo

Dagster Modal Banner

Orchestrate your flexible compute workloads with the combined forces of Dagster and Modal.

In this example project we show how you can write a pipeline that automatically detects newly published podcasts, transcribes them using the power of GPUs, and notifies you with the summary so that you can focus on the things that truly matter (not listening to people talk into fancy microphones).

Architecture

Architecture Diagram

Example Asset Lineage

Screenshot Dagster Lineage

Getting started

Install the project dependencies:

pip install -e ".[dev]"

Run Dagster:

dagster dev

Open http://localhost:3000 in your browser.

References

Modal

Dagster

OpenAI

Miscellaneous

More Screenshots

Screenshot of R2 Storage

Screenshot R2 Storage

Screenshot of Summary Asset

Screenshot Dagster Summary Asset

Example Podcast Feeds

Podcasts are sourced from RSS feeds as; a table containing a sample of feeds is below:

Title RSS Feed
Practical AI: Machine Learning, Data Science, LLM RSS Feed
The Data Exchange with Ben Lorica RSS Feed
Hub & Spoken: Data RSS Feed
Making Data Simple RSS Feed
The Data Chief RSS Feed

Languages

  • Python 99.5%
  • Makefile 0.5%