🎙️ RAG Voice Assistant project

Prerequisites

Before you begin, ensure you have the following installed:

Python: Version 3.10 or higher (Download Python)
Node.js: Version 18 or higher (Download Node.js)
pnpm: Install globally using npm: npm install -g pnpm
API Keys: Obtain necessary API keys for:
- OpenAI
- LiveKit
- Deepgram

Follow these steps to set up and run the project:

1. 📥 Clone the Repository

git clone https://github.com/your-repo-name.git 
cd your-repo-name

2. 📦 Install Dependencies

Install the required Python libraries:

pip install -r requirements.txt

3. 🔑 Configure Environment Variables

Set up your environment variables by copying the example file and adding your API keys:

Copy .env.example to .env.local in the project root:
```
cp .env.example .env.local
```
Edit .env.local and provide the values for:
- LIVEKIT_URL
- LIVEKIT_API_KEY
- LIVEKIT_API_SECRET
- OPENAI_API_KEY
- DEEPGRAM_API_KEY (required if using Deepgram)

4. ⚙️ Set Up the Backend

Navigate to the backend directory:

cd backend

Vector Store Setup (for RAG):

You need a vector store for Retrieval-Augmented Generation (RAG). Choose one of the following options:

Option 1: Generate Locally (you can modify vectorstore.py to use your own dataset)
```
bash scripts/vectorstore.sh
```

Option 2: Download Pre-built Store for our dataset

pip install gdown
gdown --folder https://drive.google.com/drive/folders/1TlG4wPt0vxXO938jI3UMDO270ttn-VNk?usp=sharing

Run the Backend Agent:

Once the vector store is ready, start the backend agent:

python agent.py dev

Evaluate the Backend Agent: To evaluate the backend agent's performance with RAG, follow these steps:

Connect to Hugging Face by running the following command in your terminal:
```
huggingface-cli login
```
RAGAS Evaluation: We test the GPT-4o language model performance with RAG using the RAGAS library. Run the evaluation script:
```
bash scripts/text_benchmark.sh
```
You can vary parameters such as chunk size, embedding models, and the number of relevant documents retrieved.
Latency Benchmarking: We also measure the latency of the Real-time model (Multimodal) compared with a traditional STT-LLM-TTS pipeline (VoicePipelineAgent). Please set up the frontend (instructions in part 5) and run:
```
python voice_benchmark.py dev
```

5. 🖥️ Set Up the Frontend (Web)

Open a new terminal window, navigate to the frontend directory, and install dependencies:

cd frontend
pnpm install

Start the Development Server:

pnpm dev

Your frontend application should now be running and accessible in your browser (typically at http://localhost:3000).

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
backend		backend
frontend		frontend
mobile-frontend		mobile-frontend
.env.example		.env.example
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎙️ RAG Voice Assistant project

Prerequisites

1. 📥 Clone the Repository

2. 📦 Install Dependencies

3. 🔑 Configure Environment Variables

4. ⚙️ Set Up the Backend

5. 🖥️ Set Up the Frontend (Web)

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

HoangP8/rag-realtime

Folders and files

Latest commit

History

Repository files navigation

🎙️ RAG Voice Assistant project

Prerequisites

1. 📥 Clone the Repository

2. 📦 Install Dependencies

3. 🔑 Configure Environment Variables

4. ⚙️ Set Up the Backend

5. 🖥️ Set Up the Frontend (Web)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages