Friends-Trivia-Chatbot

Overview

This is an AI chatbot for fans of the sitcom Friends. It combines Retrieval-Augmented Generation (RAG) with a Large Language Model (LLM), LLaMA 2, which has been fine-tuned using Replicate. This setup lets the application give precise, context-aware answers to detailed questions about the show’s content, plot, and characters. The app is built with Streamlit and includes session chat history and a choice of several LLaMA 2 API endpoints on Replicate.

Ask anything about the Friends series on our deployed app: friends-rag.streamlit.app

How to Start

Prerequisites

- Relevant API key(s) (optional; e.g. for the embedding model)
- Python 3.11 or higher
- Git Large File Storage (LFS) for handling large datasets and model files

Installation

Install dependencies.
- [Optional but recommended]
  - Create a Python virtual environment with
```
   python -m venv .venv
```
  - Activate it with
```
   source .venv/bin/activate
```
- Install dependencies with
```
   pip install -r requirements.txt
```
Create the Chroma DB:
```
   python populate_database.py
```

Set up inference before querying:
- Case 1: To run the base LLaMA 2 model locally, install Ollama and run ollama serve in a separate terminal.
- Case 2: To run inference locally against our models hosted on Replicate, set REPLICATE_API_TOKEN as an environment variable.
- Case 3: To skip local setup, try the deployed app on Streamlit: friends-rag.streamlit.app.
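
Before choosing between Case 1 and Case 2, it can help to check which prerequisites are present. The snippet below is a minimal sketch and not part of this repository: the function name inference_backend_status is hypothetical, and it only checks that an ollama executable is on the PATH and that REPLICATE_API_TOKEN is non-empty. It does not verify that ollama serve is actually running or that the token is valid.

```python
import os
import shutil


def inference_backend_status(env=None):
    """Report which inference setups look usable on this machine.

    Hypothetical helper for illustration; not part of the repository.
    """
    env = os.environ if env is None else env
    return {
        # Case 1: the `ollama` executable must be on PATH
        # (this does not check that `ollama serve` is running).
        "ollama_installed": shutil.which("ollama") is not None,
        # Case 2: the token must be set and non-blank.
        "replicate_token_set": bool(env.get("REPLICATE_API_TOKEN", "").strip()),
    }


if __name__ == "__main__":
    for name, ok in inference_backend_status().items():
        print(f"{name}: {'yes' if ok else 'no'}")
```

If neither check passes, Case 3 (the deployed app) needs no local setup.
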
Run a test query against the Chroma DB. The command below returns an answer based on RAG and the selected model:
```
   python query_data.py "Which role does Adam Goldberg play?"
```

Start the app locally:
```
   streamlit run app.py
```

Setup & Configurations

For fine-tuning, we created our own dataset of question-answer pairs relevant to the domain, which enhances the effectiveness of RAG.
Domain-specific files, including trivia.txt and s1_s2.jsonl, are stored in the data folder. LangChain indexes this data into a vector database in the chroma folder, which can be extended with additional content as needed.
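
To illustrate what indexing text into a vector database and querying it involves, here is a toy, dependency-free sketch. It is not the project's code: the real pipeline uses LangChain with Chroma and a proper embedding model, whereas this example assumes a simple word-count "embedding" and an in-memory list. All names (embed, ToyVectorIndex, build_prompt) are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class ToyVectorIndex:
    """In-memory stand-in for a vector database such as Chroma."""

    def __init__(self):
        self.entries = []  # list of (chunk_text, vector) pairs

    def add(self, chunks):
        self.entries.extend((chunk, embed(chunk)) for chunk in chunks)

    def search(self, query, k=2):
        """Return the k chunks most similar to the query."""
        q = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[1]), reverse=True)
        return [text for text, _ in ranked[:k]]


def build_prompt(question, context_chunks):
    """Combine retrieved chunks and the question into one prompt for the LLM."""
    context = "\n---\n".join(context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


index = ToyVectorIndex()
index.add([
    "Ross Geller works as a paleontologist.",
    "Monica Geller works as a chef.",
    "Central Perk is the coffee house where the friends hang out.",
])

question = "Who is a paleontologist?"
print(build_prompt(question, index.search(question, k=1)))
```

Adding more content to the index corresponds to calling add with new chunks; in the actual project, that role is played by re-running populate_database.py after placing new files in the data folder.
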
Streamlit handles the user interface and deployment of the application.
Users can choose among several LLaMA 2 chat API endpoints: base LLaMA 2, fine-tuned LLaMA 2, base with RAG, and fine-tuned with RAG.
Each version of the model (base, fine-tuned, with and without RAG) is hosted on Replicate.
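
The four endpoint choices can be thought of as two independent switches: base vs. fine-tuned, and with vs. without RAG. The sketch below shows one possible way to model that selection. It is an assumption for illustration only: the labels, the ModelVariant class, and prepare_prompt are hypothetical, and the retrieve argument stands in for whatever function queries the Chroma DB.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelVariant:
    label: str       # text shown to the user (hypothetical labels)
    finetuned: bool  # True if the fine-tuned LLaMA 2 endpoint is used
    use_rag: bool    # True if retrieved context is added to the prompt


VARIANTS = {
    v.label: v
    for v in (
        ModelVariant("Base LLaMA 2", finetuned=False, use_rag=False),
        ModelVariant("Fine-tuned LLaMA 2", finetuned=True, use_rag=False),
        ModelVariant("Base LLaMA 2 + RAG", finetuned=False, use_rag=True),
        ModelVariant("Fine-tuned LLaMA 2 + RAG", finetuned=True, use_rag=True),
    )
}


def prepare_prompt(label, question, retrieve):
    """Return the prompt for the chosen variant.

    `retrieve` is any callable mapping a question to a list of context strings.
    """
    variant = VARIANTS[label]
    if not variant.use_rag:
        return question  # no retrieval: send the question as-is
    context = "\n".join(retrieve(question))
    return f"Context:\n{context}\n\nQuestion: {question}"
```

Keeping the two switches as separate fields makes it straightforward to compare, for example, the base and fine-tuned models on the same retrieved context.
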

ltaoming/Friends-Trivia-Chatbot
