Open Speech and Language Resources



nicolingua-0004-west-african-va-asr-corpus

Identifier: SLR106

Summary: West African Virtual Assistant Speech Recognition Corpus

Category: Speech

License: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Downloads (use a mirror closer to you):
nicolingua-0004-west-african-va-asr-corpus.tgz [254M]   Mirrors: [US]   [EU]   [CN]

About this resource:

This dataset contains 10,083 utterances in French, Maninka, Pular and Susu, recorded on a variety of devices by 49 speakers (16 female and 33 male) aged 5 to 76.

Please see our paper for more details on this dataset. Additional resources can be found in the following git repository: https://github.com/mdoumbouya/nicolingua

You can cite our work using the following BibTeX entry.

@inproceedings{doumbouya2021usingradio,
  title={Using Radio Archives for Low-Resource Speech Recognition: Towards an Intelligent Virtual Assistant for Illiterate Users},
  author={Doumbouya, Moussa and Einstein, Lisa and Piech, Chris},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={35},
  year={2021}
}

External URL: https://nicolingua.s3.eu-west-2.amazonaws.com/nicolingua-0004-west-african-va-asr-corpus.tgz
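As a minimal sketch, the archive at the External URL above could be fetched and inspected with the Python standard library. The internal layout of the archive is not described on this page, so the example only counts archive members by file extension rather than assuming any directory or file naming. The function names `download` and `count_members_by_suffix` and the local file name are hypothetical, chosen for illustration.

```python
import tarfile
import urllib.request
from collections import Counter
from pathlib import Path, PurePosixPath

# External URL listed on this page.
CORPUS_URL = (
    "https://nicolingua.s3.eu-west-2.amazonaws.com/"
    "nicolingua-0004-west-african-va-asr-corpus.tgz"
)


def download(url, dest):
    """Download url to dest unless the file already exists (hypothetical helper)."""
    dest = Path(dest)
    if not dest.exists():
        urllib.request.urlretrieve(url, dest)
    return dest


def count_members_by_suffix(archive):
    """Count regular files in a gzip-compressed tar archive by file extension.

    Nothing is extracted to disk; only the archive index is read.
    """
    counts = Counter()
    with tarfile.open(archive, "r:gz") as tar:
        for member in tar:
            if member.isfile():
                suffix = PurePosixPath(member.name).suffix.lower()
                counts[suffix] += 1
    return counts


if __name__ == "__main__":
    # Assumed local file name; the download is roughly 254 MB.
    archive = download(CORPUS_URL, "nicolingua-0004-west-african-va-asr-corpus.tgz")
    for suffix, n in count_members_by_suffix(archive).most_common():
        print(f"{suffix or '(no extension)'}: {n}")
```

Listing members before extracting gives a quick view of which file types the corpus ships without committing disk space or trusting the paths inside the archive.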