Language Tool Pipeline
Language Tool Pipeline:
- Job Queue
- Error Handling & Monitoring
- REST API (ideally with progress reporting)
Services that we need in the long term:
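
The three items above (job queue, error handling, progress reporting) could fit together roughly as in the following minimal Python sketch. All names here (Job, JobQueue, the report callback, the status fields) are hypothetical and chosen for illustration only. A REST endpoint is assumed to call submit() and status(), and a worker is assumed to call run_next() in a loop.

```python
import queue
import threading
import uuid
from dataclasses import dataclass
from typing import Callable, Dict, Optional

# Hypothetical sketch: none of these names come from an existing code base.
Handler = Callable[[dict, Callable[[float], None]], dict]


@dataclass
class Job:
    job_id: str
    service: str
    payload: dict
    status: str = "queued"          # queued | running | done | failed
    progress: float = 0.0           # fraction between 0.0 and 1.0
    result: Optional[dict] = None
    error: Optional[str] = None


class JobQueue:
    def __init__(self, handlers: Dict[str, Handler]) -> None:
        self._handlers = handlers
        self._jobs: Dict[str, Job] = {}
        self._pending: "queue.Queue[str]" = queue.Queue()
        self._lock = threading.Lock()

    def submit(self, service: str, payload: dict) -> str:
        """Register a job and return its id (what a POST endpoint might return)."""
        job = Job(job_id=uuid.uuid4().hex, service=service, payload=payload)
        with self._lock:
            self._jobs[job.job_id] = job
        self._pending.put(job.job_id)
        return job.job_id

    def status(self, job_id: str) -> dict:
        """Snapshot of a job (what a GET endpoint might return)."""
        with self._lock:
            job = self._jobs[job_id]
            return {"id": job.job_id, "status": job.status,
                    "progress": job.progress, "error": job.error}

    def run_next(self) -> bool:
        """Run one queued job; return False if the queue is empty."""
        try:
            job_id = self._pending.get_nowait()
        except queue.Empty:
            return False
        with self._lock:
            job = self._jobs[job_id]
            job.status = "running"

        def report(fraction: float) -> None:
            # Handlers call this to publish progress, clamped to [0, 1].
            with self._lock:
                job.progress = max(0.0, min(1.0, fraction))

        try:
            result = self._handlers[job.service](job.payload, report)
            with self._lock:
                job.result, job.status, job.progress = result, "done", 1.0
        except Exception as exc:  # record the failure instead of crashing the worker
            with self._lock:
                job.status, job.error = "failed", str(exc)
        return True
```

In this sketch a failing or unknown service marks the job as "failed" and stores the error message, so a monitoring component could poll status() for failed jobs.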
- Audio Extraction
- Language Detection
- ASR with Whisper
- Text Summarization
- NLP Term Extraction
- ELG Implementation (mainly for TTT Translation Models)
- Waveform Generation
- Music/Speech Distinction
- Diarisation
- Generate subtitle formats: WebVTT, SRT, etc.
- Audio improvement (e.g. normalization, DC offset removal, multiband compression, limiting)
- Audio transcoding (FFmpeg/fre:ac: different formats, different qualities)
- Video transcoding (FFmpeg: MP4 to HLS)
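
As an illustration of the subtitle generation service listed above, the following Python sketch turns timed text segments into SRT and WebVTT. It assumes the ASR step delivers segments as (start seconds, end seconds, text) tuples; the function names are hypothetical. The only format details relied on are the SRT timestamp form HH:MM:SS,mmm, the WebVTT form HH:MM:SS.mmm, and the "WEBVTT" header line.

```python
from typing import Iterable, Tuple

# Assumed input shape: (start in seconds, end in seconds, text).
Segment = Tuple[float, float, str]


def _timestamp(seconds: float, sep: str) -> str:
    """Format seconds as HH:MM:SS<sep>mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{sep}{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """SRT: numbered cues, comma as millisecond separator."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        timing = f"{_timestamp(start, ',')} --> {_timestamp(end, ',')}"
        blocks.append(f"{index}\n{timing}\n{text.strip()}\n")
    return "\n".join(blocks)


def to_webvtt(segments: Iterable[Segment]) -> str:
    """WebVTT: 'WEBVTT' header, dot as millisecond separator."""
    cues = []
    for start, end, text in segments:
        timing = f"{_timestamp(start, '.')} --> {_timestamp(end, '.')}"
        cues.append(f"{timing}\n{text.strip()}\n")
    return "WEBVTT\n\n" + "\n".join(cues)
```

Keeping the segment list as the shared intermediate form means further subtitle formats only need another small formatter, not a change to the ASR service.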
What we need first:
- ASR
- ELG Implementation
- Music/Speech Distinction
- Language Detection
- Accept JSON to translate key/value pairs (e.g. EBU metadata)
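
The JSON translation item could work roughly as in the following Python sketch. It is an assumption that keys stay untouched and only string values are translated, with numbers, booleans and nulls passed through. The translate callable is a hypothetical stand-in for whichever translation model is used, and the field names in the usage note are illustrative, not actual EBU field names.

```python
import json
from typing import Any, Callable


def translate_values(node: Any, translate: Callable[[str], str]) -> Any:
    """Recursively translate string values; keep keys and non-string values."""
    if isinstance(node, dict):
        return {key: translate_values(value, translate) for key, value in node.items()}
    if isinstance(node, list):
        return [translate_values(item, translate) for item in node]
    if isinstance(node, str):
        return translate(node)
    return node  # numbers, booleans and None pass through unchanged


def translate_json(document: str, translate: Callable[[str], str]) -> str:
    """Parse a JSON string, translate its string values, serialize it again."""
    translated = translate_values(json.loads(document), translate)
    return json.dumps(translated, ensure_ascii=False)
```

For example, translate_json('{"title": "news", "duration": 12}', str.upper) uses str.upper as a placeholder "translator" and returns the same structure with only the title value changed.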
Edited by Leindecker Ingo