    Repositories list

    • mup

      Public
      maximal update parametrization (µP)
      Jupyter Notebook
      MIT License
      93001 Updated Sep 5, 2024
    • Python
      MIT License
      0210 Updated Sep 5, 2024
    • AI powered speech denoising and enhancement
      Python
      MIT License
      1341.3k391 Updated Jun 21, 2024
    • resemble.ai API SDK
      TypeScript
      MIT License
      3821 Updated Jun 5, 2024
    • Python
      0010 Updated May 8, 2024
    • PyTSMod

      Public
      An open-source Python library for audio time-scale modification.
      Python
      GNU General Public License v3.0
      27400 Updated Apr 10, 2024
    • aiortc

      Public
      WebRTC and ORTC implementation for Python using asyncio
      Python
      BSD 3-Clause "New" or "Revised" License
      759000 Updated Mar 27, 2024
    • aioice

      Public
      asyncio-based Interactive Connectivity Establishment (RFC 5245)
      Python
      BSD 3-Clause "New" or "Revised" License
      51000 Updated Feb 15, 2024
    • TypeScript
      2700 Updated Dec 16, 2023
    • Go
      1200 Updated Nov 13, 2023
    • Run OpenAI Whisper as a Cog model
      Python
      Apache License 2.0
      37100 Updated Nov 8, 2023
    • Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
      Python
      Apache License 2.0
      149000 Updated Oct 25, 2023
    • A Python package to analyze and compare voices with deep learning
      Python
      Apache License 2.0
      4242.7k401 Updated Oct 12, 2023
    • A Heroku buildpack for ffmpeg that always downloads the latest static build
      Shell
      MIT License
      720000 Updated Aug 21, 2023
    • g2pW

      Public
      Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)
      Python
      Apache License 2.0
      38000 Updated Jul 8, 2023
    • univnet

      Public
      Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
      Python
      BSD 3-Clause "New" or "Revised" License
      46000 Updated May 19, 2023
    • NeMo

      Public
      NeMo: a toolkit for conversational AI
      Python
      Apache License 2.0
      2.4k900 Updated Jan 18, 2023
    • espeak-ng

      Public
      eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.
      C
      GNU General Public License v3.0
      883200 Updated Nov 29, 2022
    • Simple text to phonemes converter for multiple languages
      Python
      GNU General Public License v3.0
      1662001 Updated Nov 21, 2022
    • whisper

      Public
      Robust Speech Recognition via Large-Scale Weak Supervision
      Jupyter Notebook
      MIT License
      8.1k100 Updated Oct 4, 2022
    • Monotonic Alignment Search
      Cython
      MIT License
      148310 Updated Sep 6, 2022
    • reLaugh

      Public
      Supplementary materials for "Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations"
      HTML
      0100 Updated Jun 25, 2022
    • Benchmark Arabic text diacritization dataset
      Python
      MIT License
      18400 Updated Oct 8, 2021
    • Dockerfile
      7000 Updated Sep 1, 2021
    • Automatically deploy your project to GitHub Pages using GitHub Actions. This action can be configured to push your production-ready code into any branch you'd like.
      TypeScript
      MIT License
      356000 Updated Aug 3, 2021
    • This utility cuts multiple clips from one or more audio files.
      Python
      MIT License
      10500 Updated May 17, 2021
    • Deep Learning Examples
      Jupyter Notebook
      3.2k400 Updated Apr 29, 2021
    • GitHub Action for executing Helm commands on EKS (using aws-iam-authenticator)
      Dockerfile
      MIT License
      61100 Updated Apr 14, 2021
    • Resemble's voice cloning engine within Unity
      C#
      2716210 Updated Feb 28, 2021
    • Sample code for an Alexa skill that uses realistic voice cloning powered by Resemble AI's text-to-speech API and OpenAI's GPT-3 engine.
      Python
      MIT License
      208410 Updated Feb 12, 2021