Improve quality of transcriptions and translations

Some recent developments in transcription and translation might be of interest to us:

Meta's SeamlessM4T (transcription and text-to-text translation for ~100 languages in a single large multilingual model)

A Whisper implementation that reduces GPU load by 3x or more:

Wav2Vec