Multilingual Automatic Speech Recognition for Low-resource Languages

Automatic Speech Recognition (ASR) models transcribe speech to text by probabilistically predicting the transcription from the speech audio. Despite recent advances in cross-lingual and unsupervised approaches, building ASR systems for low-resource languages remains an open challenge.
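
As a minimal sketch, the probabilistic prediction described above can be written as a decision rule. The symbols are notation assumed here for illustration and do not come from the original text: $X$ is the sequence of acoustic features extracted from the audio, $W$ is a candidate transcription, and $\hat{W}$ is the transcription the model outputs.

```latex
\hat{W} = \arg\max_{W} \, P(W \mid X)
```

Read this way, the challenge for low-resource languages is plausibly that little transcribed audio is available from which to estimate $P(W \mid X)$.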