Language Tool Pipeline

Language Tool Pipeline:

  • Job Queue
  • Error Handling & Monitoring
  • REST API (ideally with returning progress)

Services that we need on long term:

  • Audio Extraction
  • Language Detection
  • ASR with Whisper
  • Text Summarization
  • NLP Term Extraction
  • ELG Implementation (mainly for TTT Translation Models)
  • Waveform Generation
  • Music/Speech Distinction
  • Audio Extraction
  • Diarisation
  • Generate subtitle formats: WebVTT, SRT, etc.
  • Audio improvement (e.g. normalization, reset dc offset, multiband compression, limiting, etc.)
  • Audio transcoding (FFMPEG/FREEAC: different formats, different qualities)
  • Video transcoding (FFMPEG: MP4 to HLS)

What we need first:

  • ASR
  • ELG Implementation
  • Music/Speech Distinction
  • Language Detection
  • accept json to translate key/value pairs (eg EBU metadata)
Edited by Leindecker Ingo