PreTrain Wav2Vec2 in Dhivehi

PreTrain Wav2Vec2 in Dhivehi

There is currently only a multilingually pretrained model for Dhivehi Wav2Vec2. We would like to make a Wav2Vec2 only pre-trained on Dhivehi.

Model

A randomly initialized Wav2Vec2 model

Datasets

  • commonvoice has 18hrs in the last released dataset. [ 32hrs+ if mid 2021 dataset released in time]
  • podcast data [30hr]
  • others

Available training scripts

FlaxWav2Vec2 will be merged soon: [Flax] Add wav2vec2 by patrickvonplaten · Pull Request #12271 · huggingface/transformers · GitHub and a pretraining script should be relatively easy to be merged.

(Optional) Desired project outcome

The best Dhivehi ASR model

(Optional) Challenges

scraping publicly available Dhivehi audio from various sources

Am interested and would like to join this project

Awesome! Let’s finalize it directly

This is a very interesting project. I always wanted to work on speech recognition task. This is a great opportunity to learn and contribute. Looking forward to be a part of this project.