
A real-time speech recognition system powered by Groq and ElevenLabs, designed to listen for audio input, recognize speech, and respond with AI-driven dialogue. Customize the trigger word and personality for dynamic conversations. Includes speech synthesis for vocal replies and a live console interface with Rich library visuals.


official-alex/AI-Assistant


🎤 Speech Recognition AI Conversation System

This repository contains a real-time speech recognition system powered by Groq and ElevenLabs. It listens for audio input 🎧, processes it using an AI personality 🤖, and responds via speech synthesis 🎙️.


🌟 Features:

  • 🛠️ Custom Trigger Word: Set your own trigger word for AI activation.
  • 🧠 AI Personality: Customize the AI's personality and response style.
  • 🔊 Speech Synthesis: Uses the ElevenLabs API to convert text responses into speech.

🚀 Setup Instructions

  1. 📥 Clone the Repository:

    git clone https://github.com/official-alex/AI-Assistant.git
    cd AI-Assistant
  2. 📦 Install Dependencies: Ensure Python 3.8+ is installed, then run:

    pip install -r requirements.txt
  3. 🔑 Set Up Environment Variables: Edit the .env file in the project root and add the following:

    GROQ_API_KEY=your_groq_api_key_here
    ELEVENLABS_API_KEY=your_elevenlabs_api_key_here
    
    • For the Groq API key, go to Groq Console 🔗.
    • For the ElevenLabs API key, log in to ElevenLabs, click your profile (bottom left), and navigate to "API Keys" 🔑.
  4. ▶️ Run the Application: Start the live speech recognition system with:

    python main.py
  5. ⚙️ Modify AI Personality & Trigger Word: Change the trigger_word and personality in the LiveSpeechRecognition class to customize how the AI interacts with you.
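Before running the application, it can help to confirm that both keys from step 3 are actually set. The following is a minimal sketch using only the standard library; the helper names `parse_env_text` and `missing_keys` are hypothetical and not part of this repository, and the parser assumes plain `KEY=value` lines as shown above.

```python
import os

# The two variable names come from step 3 of the setup instructions.
REQUIRED_KEYS = ("GROQ_API_KEY", "ELEVENLABS_API_KEY")


def parse_env_text(text):
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values, required=REQUIRED_KEYS):
    """Return required keys that are absent, empty, or still a placeholder."""
    return [
        key for key in required
        if not values.get(key) or values[key].startswith("your_")
    ]


if __name__ == "__main__":
    env_path = ".env"  # assumed location: the project root
    if os.path.exists(env_path):
        with open(env_path, encoding="utf-8") as handle:
            problems = missing_keys(parse_env_text(handle.read()))
    else:
        problems = list(REQUIRED_KEYS)
    print("Missing or placeholder keys:", problems or "none")
```

Run it from the project root; an empty result means both keys have been filled in with something other than the placeholder text.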


📝 Usage

  • ▶️ Start Listening: The system will start listening for your input.
  • 🗣️ Trigger Word: Say the trigger word (default: bob) to activate AI processing.
  • 🛑 Stop: You can say "stop" to halt the recognition.
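The flow above can be illustrated with a small routing function. This is only a sketch of the idea, not the logic in the LiveSpeechRecognition class: the function name `classify_utterance` is hypothetical, and it assumes that "stop" halts recognition only when it is the whole utterance and that the text after the trigger word is what gets sent to the AI.

```python
def classify_utterance(text, trigger_word="bob"):
    """Classify recognized text as ("stop", ""), ("prompt", rest), or ("ignore", "")."""
    original_words = text.split()
    # Lowercase and strip common punctuation so "Bob," matches "bob".
    cleaned = [word.lower().strip(".,!?") for word in original_words]

    # Assumption: a bare "stop" ends the session.
    if cleaned == ["stop"]:
        return ("stop", "")

    trigger = trigger_word.lower()
    if trigger in cleaned:
        # Everything after the first trigger word is treated as the prompt.
        index = cleaned.index(trigger)
        return ("prompt", " ".join(original_words[index + 1:]))

    # No trigger word: the utterance is transcribed but not sent to the AI.
    return ("ignore", "")
```

For example, `classify_utterance("Hey Bob, what's the weather?")` yields a `"prompt"` result carrying `"what's the weather?"`, while an utterance without the trigger word is ignored.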

💾 Save Transcriptions and Errors

Upon termination, the system saves a transcript of recognized speech to transcription_history.txt and logs any errors in error_log.txt.
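A rough sketch of what such a save step might look like is shown below. The two file names come from the description above; the function name `save_session`, the one-entry-per-line layout, and the choice to append rather than overwrite are assumptions made for illustration.

```python
from pathlib import Path


def save_session(transcript, errors, directory="."):
    """Append transcript and error lines to their files; return the paths written."""
    out_dir = Path(directory)
    out_dir.mkdir(parents=True, exist_ok=True)

    targets = {
        "transcription_history.txt": transcript,
        "error_log.txt": errors,
    }
    written = []
    for filename, lines in targets.items():
        path = out_dir / filename
        # Assumption: append, so earlier sessions are not lost.
        with path.open("a", encoding="utf-8") as handle:
            for line in lines:
                handle.write(line + "\n")
        written.append(path)
    return written
```

Calling this from a `finally` block around the listening loop would make sure the files are written even when the session ends with Ctrl+C.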
