GitHub - jonaskahn/asktube: AskTube - An AI-powered YouTube video summarizer and QA assistant powered by Retrieval Augmented Generation (RAG) 🤖. Run it entirely on your local machine with Ollama, or cloud-based models like Claude, OpenAI, Gemini, Mistral, and more.

AskTube - An AI-powered YouTube video summarizer and QA assistant powered by Retrieval Augmented Generation (RAG) 🤖

Run it entirely on your local machine with Ollama, or cloud-based models like Claude, OpenAI, Gemini, Mistral, and more

🤷🏽 Why does this project exist?

I’ve seen several GitHub repositories offering AI-powered summaries for YouTube videos, but none include Q&A functionality.
I want to implement a more comprehensive solution while also gaining experience with AI to build my own RAG application.

🔨 Technology

Language: Python, JS
Server: Python@v3.10, Bun@v1
Framework/Lib: Sanic, Peewee, Pytubefix, Sentence Transformers, Sqlite, Chroma, NuxtJs/DaisyUI, etc.
Embedding Provider (Analysis Provider):
- OpenAI
- Gemini
- VoyageAI
- Mistral
- Sentence Transformers (Local)
AI Provider:
- OpenAI
- Claude
- Gemini
- Mistral
- Ollama (Local)
Speech To Text:

🗓️ Next Todo Tasks

🚀 How to run ?

For the first time running, the program maybe a bit slow due they need to install local models.

Run on your machine

Ensure you installed:
- Python 3.10
  - Windows User, please download here
  - Linux, MacOS User, please use homebrew or your install package command (apt, dnf, etc)
  - Or use conda
- Poetry
  - Windows User open Powershell and run:
```
(Invoke-WebRequest -Uri https://install.python-poetry.org -UseBasicParsing).Content | py -
```
  - Linux, MacOS User open Terminal and run:
```
curl -sSL https://install.python-poetry.org | python3 -
```
- Bun
- ffmpeg
  - MacOS User
```
brew install ffmpeg
```
  - Linux User
```
# Ubuntu
sudo apt install ffmpeg
# Fedora
sudo dnf install -y ffmpeg
```
  - Windows, please follow this tutorial Install ffmpeg for Windows

Clone repostiory

git clone https://github.com/jonaskahn/asktube.git

Create file .env in asktube/engine directory:
- Locally
- Free with some limitations
Run program
- You may need to run first:
```
poetry env use python
```
- Open terminal/cmd/powershell in asktube/engine directory, then run:
```
poetry install && poetry python engine/server.py
```
- Open terminal/cmd/powershell in asktube/web directory, then run:
```
bun install && bun run dev
```
Open web: http://localhost:3000

With docker (In process)

Before You Start

I built these services to docker images, but if you want to build local images, please run build.local.bat for Windows or build.local.amd64.sh or build.local.aarch64.sh for MacOS, Linux

If you have a GPU (cuda or rocm), please refer ENV settings above, change params like above

Locally

Use local.yaml compose file to start
Open terminal/cmd/powershell in asktube directory

docker compose -f compose/local.yaml pull && docker compose -f compose/local.yaml up -d

After run, you need install Ollama model qwen2 and llama3.1 for QA

docker run ollama ollama run qwen2
docker run ollama ollama run llama3.1

Free (with rate limit)

You need to go Google Gemini and VoyageAI to register account and generate your own API keys:
- Gemini is free with your Google Account
- VoyageAI (recommended by Claude) gives you free 50M tokens (a huge amount) but you need to add your credit card first.
Replace your ENV setting in docker file free and start docker
Open terminal/cmd/powershell in asktube directory

docker compose -f compose/free.yaml pull && docker compose -f compose/free.yaml up -d

Ideal

Using VoyageAI for embedding texts
Using OpenAI and Claude for QA, register account and generate your own API keys
Replace your ENV setting in docker file ideal and start docker
Open terminal/cmd/powershell in asktube directory

docker compose -f compose/ideal.yaml pull && docker compose -f compose/ideal.yaml up -d

Result

Open web: http://localhost:8080

💡 Architecture

The real implementation might differ from this art due to its complexity.

1️⃣ Extract data from given URL

2️⃣ Storing embedding chapter subtitles

3️⃣ Asking (included enrich question)

🪧 Notice

Do not use this for production. This aimed for end-users on their local machines.

Do not request any advanced features for management.

Name		Name	Last commit message	Last commit date
Latest commit History 194 Commits
compose		compose
docs		docs
engine		engine
web		web
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
RELEASE		RELEASE
build.local.aarch64.sh		build.local.aarch64.sh
build.local.amd64.sh		build.local.amd64.sh
build.local.bat		build.local.bat
build.sh		build.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤷🏽 Why does this project exist?

🔨 Technology

🗓️ Next Todo Tasks

🚀 How to run ?

Run on your machine

With docker (In process)

💡 Architecture

1️⃣ Extract data from given URL

2️⃣ Storing embedding chapter subtitles

3️⃣ Asking (included enrich question)

🪧 Notice

🏃🏽‍➡️ Demo & Screenshot

Start from v0.2.2

Watch "AskTube First Demo" on YouTube

✍🏿 For development

⁉️ FAQ and Troubleshooting

About

Releases 6

Packages

Languages

License

jonaskahn/asktube

Folders and files

Latest commit

History

Repository files navigation

🤷🏽 Why does this project exist?

🔨 Technology

🗓️ Next Todo Tasks

🚀 How to run ?

Run on your machine

With docker (In process)

💡 Architecture

1️⃣ Extract data from given URL

2️⃣ Storing embedding chapter subtitles

3️⃣ Asking (included enrich question)

🪧 Notice

🏃🏽‍➡️ Demo & Screenshot

Start from v0.2.2

Watch "AskTube First Demo" on YouTube

✍🏿 For development

⁉️ FAQ and Troubleshooting

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 6

Packages 0

Languages

Packages