Video Scribe

Video Scribe is a web application that processes video files, extracting audio, transcribing speech, and capturing key frames. It provides a visual interface for analyzing video content and generating markdown-based transcripts with embedded images.

Features

  • Video upload and processing
  • Audio extraction and transcription
  • Key frame extraction
  • Interactive transcript editing
  • Markdown export with embedded images
  • Dark mode support

Prerequisites

  • Node.js (v14 or later)
  • FFmpeg
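Both prerequisites can be checked before starting the server. The following is a minimal sketch, not part of the project: the helper names `commandWorks`, `nodeVersionAtLeast`, and `checkPrerequisites` are hypothetical, and it assumes FFmpeg is expected on the `PATH` as `ffmpeg`.

```javascript
const { spawnSync } = require("child_process");

// Hypothetical helper: true if `command` can be launched and exits with status 0.
function commandWorks(command, args) {
  const result = spawnSync(command, args, { stdio: "ignore" });
  // result.error is set when the executable cannot be started (e.g. not found).
  return !result.error && result.status === 0;
}

// Hypothetical helper: true if a version string such as "14.17.0"
// has a major version of at least `minMajor`.
function nodeVersionAtLeast(version, minMajor) {
  const major = parseInt(version.split(".")[0], 10);
  return major >= minMajor;
}

// Reports which of the README's prerequisites are satisfied on this machine.
function checkPrerequisites() {
  return {
    node: nodeVersionAtLeast(process.versions.node, 14),
    // Assumption: the FFmpeg binary is reachable on PATH as "ffmpeg".
    ffmpeg: commandWorks("ffmpeg", ["-version"]),
  };
}
```

Calling `checkPrerequisites()` returns an object with a boolean `node` and a boolean `ffmpeg` entry; a `false` value points to the prerequisite that still needs installing.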

Installation

Clone the repository and install dependencies:

git clone https://github.com/gramener/videoscribe.git
cd videoscribe
npm install

Usage

  1. Start the server:
     npm start
  2. Open your browser and navigate to http://localhost:3000.
  3. Upload a video file and interact with the interface to process the video.
  4. Wait for the processing to complete. You'll see progress updates in the UI.
  5. Once processing is done, you can:
    • View and edit the transcript
    • Play the extracted audio
    • Toggle key frames on/off
    • Export the result as a markdown file
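
To illustrate what a markdown export with embedded images might look like, here is a rough sketch that interleaves transcript text and key frames by timestamp. This is not the project's actual export code. The data shapes (`{ start, text }` for transcript segments, `{ time, src }` for key frames, both in seconds) and the function names `formatTime` and `toMarkdown` are assumptions made for illustration.

```javascript
// Format a time in seconds as mm:ss, e.g. 65 becomes "01:05".
function formatTime(seconds) {
  const m = Math.floor(seconds / 60);
  const s = Math.floor(seconds % 60);
  return `${String(m).padStart(2, "0")}:${String(s).padStart(2, "0")}`;
}

// Hypothetical export: merge segments and key frames into one markdown string.
// segments:  [{ start: <seconds>, text: <string> }]   (assumed shape)
// keyframes: [{ time: <seconds>, src: <image path> }] (assumed shape)
function toMarkdown(segments, keyframes) {
  const items = [
    // Key frames are listed first so that, when timestamps tie, the stable
    // sort below keeps the image ahead of the text spoken at that moment.
    ...keyframes.map((k) => ({
      time: k.time,
      line: `![Frame at ${formatTime(k.time)}](${k.src})`,
    })),
    ...segments.map((s) => ({ time: s.start, line: s.text.trim() })),
  ];
  items.sort((a, b) => a.time - b.time);
  // Separate entries with blank lines so each renders as its own paragraph.
  return items.map((item) => item.line).join("\n\n") + "\n";
}
```

Disabling a key frame in the UI would then roughly correspond to leaving it out of the `keyframes` array before exporting.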

API Endpoints

  • POST /audio: Extract audio from uploaded video
  • POST /keyframes: Extract key frames from uploaded video

Both endpoints accept multipart form data with a file field containing the video file.
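
As a sketch of how a client might call these endpoints, the snippet below posts a video under the `file` field. It assumes the server is running at http://localhost:3000 as described under Usage, and it relies on the global `fetch`, `FormData`, and `Blob` available in browsers and in Node.js 18 or later, which is newer than the minimum version listed under Prerequisites. The helper names `buildUploadForm` and `postVideo` are hypothetical, and the response format is not documented here, so the raw response is returned.

```javascript
// Assumption: default address taken from the Usage section.
const BASE_URL = "http://localhost:3000";

// Build a multipart form with the video bytes under the `file` field.
function buildUploadForm(videoBytes, filename) {
  const form = new FormData();
  form.append("file", new Blob([videoBytes]), filename);
  return form;
}

// Hypothetical client helper. `endpoint` is "/audio" or "/keyframes".
async function postVideo(endpoint, videoBytes, filename) {
  const response = await fetch(BASE_URL + endpoint, {
    method: "POST",
    // fetch sets the multipart Content-Type header (with boundary) itself.
    body: buildUploadForm(videoBytes, filename),
  });
  if (!response.ok) {
    throw new Error(`${endpoint} failed with HTTP ${response.status}`);
  }
  // The response body format is not documented, so it is left to the caller.
  return response;
}
```

For example, `postVideo("/keyframes", fs.readFileSync("talk.mp4"), "talk.mp4")` would upload a local file named `talk.mp4` (a placeholder name), provided `fs` has been loaded with `require("fs")`.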

Technologies Used

  • Backend: Node.js, Express.js
  • Frontend: HTML, CSS, JavaScript (ES6+)
  • UI Framework: Bootstrap 5
  • Templating: lit-html
  • Audio Processing: FFmpeg
  • Transcription: Groq API (distil-whisper-large-v3-en model)

License

This project is licensed under the MIT License.
