Post-Call Analytics System

This repository outlines the architecture and data flow for our post-call analytics system. We aim to transform raw Call Detail Records (CDRs) and associated audio into actionable insights, stored and analyzed in ClickHouse.

System Architecture Diagram

The diagram below shows the end-to-end pipeline: how data moves through Apache Pulsar topics, MinIO storage, and the specialized processing modules.

flowchart TD
    A[FreeSWITCH CDR] --> B[speechriv-insight-record]
    B --> C[Record Processor]
    C --> D[MinIO Storage]
    C --> E[speechriv-insight-localaudio]
    E --> F[AI LAN Record Machine]
    F --> G[Download Audio from MinIO]
    G --> H[speechriv-insight-audioprocess]
    H --> I[Audio Processor]
    I --> J{speechriv_lang present?}
    J -- No --> K[speechriv-insight-lang]
    K --> L[Language Detector]
    L --> M{Detected Language}
    J -- Yes --> M
    M -- English --> N[speechriv-insight-eng-asr]
    M -- Other --> O[speechriv-insight-multi-asr]
    N --> P[English ASR Transcriber]
    P --> Q[speechriv-insight-eng-llm]
    O --> R[Multi-lang ASR Transcriber]
    R --> S[speechriv-insight-multi-llm]
    Q --> T[English LLM]
    S --> U[Multi-lang LLM]
    T --> V[speechriv-insight-clickhouse]
    U --> V
    V --> W[ClickHouse]
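
To make the branching in the flowchart easier to check, the edges can be written as a plain adjacency map and walked programmatically. The following is a minimal sketch: node names are copied from the diagram, while `PIPELINE`, `paths`, and `topics_on` are hypothetical helper names introduced here for illustration and are not part of the repository.

```python
# Adjacency map of the flowchart; keys and values are the diagram's node labels.
PIPELINE = {
    "FreeSWITCH CDR": ["speechriv-insight-record"],
    "speechriv-insight-record": ["Record Processor"],
    "Record Processor": ["MinIO Storage", "speechriv-insight-localaudio"],
    "MinIO Storage": [],
    "speechriv-insight-localaudio": ["AI LAN Record Machine"],
    "AI LAN Record Machine": ["Download Audio from MinIO"],
    "Download Audio from MinIO": ["speechriv-insight-audioprocess"],
    "speechriv-insight-audioprocess": ["Audio Processor"],
    "Audio Processor": ["speechriv_lang present?"],
    "speechriv_lang present?": ["speechriv-insight-lang", "Detected Language"],
    "speechriv-insight-lang": ["Language Detector"],
    "Language Detector": ["Detected Language"],
    "Detected Language": ["speechriv-insight-eng-asr", "speechriv-insight-multi-asr"],
    "speechriv-insight-eng-asr": ["English ASR Transcriber"],
    "English ASR Transcriber": ["speechriv-insight-eng-llm"],
    "speechriv-insight-multi-asr": ["Multi-lang ASR Transcriber"],
    "Multi-lang ASR Transcriber": ["speechriv-insight-multi-llm"],
    "speechriv-insight-eng-llm": ["English LLM"],
    "speechriv-insight-multi-llm": ["Multi-lang LLM"],
    "English LLM": ["speechriv-insight-clickhouse"],
    "Multi-lang LLM": ["speechriv-insight-clickhouse"],
    "speechriv-insight-clickhouse": ["ClickHouse"],
    "ClickHouse": [],
}


def paths(node, goal="ClickHouse"):
    """Return every route from `node` to `goal` as a list of node labels."""
    if node == goal:
        return [[node]]
    # Dead ends (such as MinIO Storage) contribute no routes.
    return [[node] + rest for nxt in PIPELINE[node] for rest in paths(nxt, goal)]


def topics_on(path):
    """Keep only the Pulsar topics on a route, in traversal order."""
    return [n for n in path if n.startswith("speechriv-insight-")]


if __name__ == "__main__":
    for route in paths("FreeSWITCH CDR"):
        print(" -> ".join(topics_on(route)))
```

Running the script prints one line per route, which shows which topics a call crosses depending on whether language detection is needed and which ASR branch is taken.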

Data Flow Description

Our post-call analytics system is designed with a modular and scalable architecture, primarily leveraging Apache Pulsar for efficient message passing between services.

CDR Ingestion
- FreeSWITCH Server (CDR Source): Generates CDRs and audio.
- Pulsar Topic: speechriv-insight-record: Entry point for raw CDRs.

Audio Storage and Initial Routing
- Record Processor: Consumes CDRs from speechriv-insight-record, uploads the audio to MinIO, and forwards a reference to it on speechriv-insight-localaudio.
- MinIO: Stores all raw audio files.
- Pulsar Topic: speechriv-insight-localaudio: Carries the MinIO reference to the AI LAN Record Machine.
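
As an illustration of the "forward a reference" step, the sketch below builds the payload the Record Processor might publish once the upload has finished. The message schema is not documented in this repository, so every field name (`call_id`, `bucket`, `object_key`), the default bucket name, and the `build_audio_reference` helper are assumptions made for the example. Only `speechriv_lang` and the topic name come from the text above.

```python
import json

LOCALAUDIO_TOPIC = "speechriv-insight-localaudio"  # topic name from this README


def build_audio_reference(cdr: dict, bucket: str = "call-audio") -> bytes:
    """Build the JSON payload that points consumers at the uploaded audio.

    Assumed schema: the CDR carries a `call_id`, and the audio object is
    stored in MinIO under `<call_id>.wav`.
    """
    reference = {
        "call_id": cdr["call_id"],
        "bucket": bucket,
        "object_key": f"{cdr['call_id']}.wav",
    }
    # Pass the language hint through only when the CDR already has one.
    if "speechriv_lang" in cdr:
        reference["speechriv_lang"] = cdr["speechriv_lang"]
    return json.dumps(reference, sort_keys=True).encode("utf-8")
```

Sending only a reference keeps the Pulsar messages small. The audio itself stays in MinIO until the AI LAN Record Machine downloads it.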

Audio Processing and Language Detection
- AI LAN Record Machine: Downloads audio from MinIO.
- Pulsar Topic: speechriv-insight-audioprocess: Sends audio for processing.
- Audio Processor: Splits the stereo recording into separate channels and checks whether speechriv_lang is set. If it is missing, the message goes to speechriv-insight-lang.
- Language Detector: Consumes from speechriv-insight-lang and determines the language when speechriv_lang is missing.
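
The stereo split performed by the Audio Processor can be sketched with the Python standard library alone. This example assumes the recording is an uncompressed PCM WAV file with two interleaved channels. The actual audio format and the real implementation are not described in this repository, and `split_stereo` is a hypothetical name.

```python
import io
import wave
from typing import Tuple


def split_stereo(wav_bytes: bytes) -> Tuple[bytes, bytes]:
    """Split a stereo PCM WAV into two mono WAV files (left, right)."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as src:
        if src.getnchannels() != 2:
            raise ValueError("expected a stereo recording")
        width = src.getsampwidth()   # bytes per sample
        rate = src.getframerate()
        frames = src.readframes(src.getnframes())

    # Each stereo frame is one left sample followed by one right sample.
    step = 2 * width
    left = b"".join(frames[i:i + width] for i in range(0, len(frames), step))
    right = b"".join(frames[i + width:i + step] for i in range(0, len(frames), step))

    def to_wav(samples: bytes) -> bytes:
        # Wrap raw mono samples in a WAV container with the source settings.
        buf = io.BytesIO()
        with wave.open(buf, "wb") as dst:
            dst.setnchannels(1)
            dst.setsampwidth(width)
            dst.setframerate(rate)
            dst.writeframes(samples)
        return buf.getvalue()

    return to_wav(left), to_wav(right)
```

Which channel holds which party depends on how the recording is configured, so the sketch only labels the outputs left and right.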
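
Automatic Speech Recognition (ASR)
- English ASR Transcriber: Consumes from speechriv-insight-eng-asr.
- Multi-lang ASR Transcriber: Consumes from speechriv-insight-multi-asr, which receives every language other than English.
- Each transcriber publishes its transcript to the matching LLM topic.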
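
The decision that feeds these two ASR topics can be expressed as one small function. The sketch below follows the two decision nodes in the flowchart. The topic names come from this README, but the set of values treated as English and the `next_topic` helper are assumptions, since the format of speechriv_lang is not specified here.

```python
from typing import Optional

LANG_TOPIC = "speechriv-insight-lang"
ENG_ASR_TOPIC = "speechriv-insight-eng-asr"
MULTI_ASR_TOPIC = "speechriv-insight-multi-asr"

# Assumption: these spellings all mean English; adjust to the real field format.
ENGLISH_VALUES = {"en", "eng", "english"}


def next_topic(speechriv_lang: Optional[str]) -> str:
    """Pick the topic a message should go to after audio processing."""
    lang = (speechriv_lang or "").strip().lower()
    if not lang:
        # No language yet: hand the message to the Language Detector.
        return LANG_TOPIC
    if lang in ENGLISH_VALUES:
        return ENG_ASR_TOPIC
    # Every other language goes to the multi-language transcriber.
    return MULTI_ASR_TOPIC
```

For example, `next_topic(None)` selects the language detection topic, while a message already tagged with a non-English language skips detection and goes straight to the multi-language ASR topic.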

Large Language Model (LLM) Processing
- English LLM: Consumes from speechriv-insight-eng-llm.
- Multi-lang LLM: Consumes from speechriv-insight-multi-llm.
- Both LLMs publish their output to speechriv-insight-clickhouse.

Final Storage
- ClickHouse: Fast analytics-ready database for structured insights.

This detailed flow ensures efficient, secure, and scalable transformation of call recordings into structured, searchable insights.
