Open Speech and Language Resources



EmoV_DB

Identifier: SLR115

Summary: a database of emotional speech intended to be open-sourced and used for synthesis and generation purpose. It contains data for male and female actors in English (https://github.com/numediart/EmoV-DB)

Category: Speech

License: Click here to view the license

Downloads (use a mirror closer to you):
bea_Amused.tar.gz [102M]   ()   Mirrors: [US]   [EU]   [CN]  
bea_Angry.tar.gz [97M]   ()   Mirrors: [US]   [EU]   [CN]  
bea_Disgusted.tar.gz [132M]   ()   Mirrors: [US]   [EU]   [CN]  
bea_Neutral.tar.gz [105M]   ()   Mirrors: [US]   [EU]   [CN]  
bea_Sleepy.tar.gz [193M]   ()   Mirrors: [US]   [EU]   [CN]  
jenie_Amused.tar.gz [38M]   ()   Mirrors: [US]   [EU]   [CN]  
jenie_Angry.tar.gz [91M]   ()   Mirrors: [US]   [EU]   [CN]  
jenie_Disgusted.tar.gz [29M]   ()   Mirrors: [US]   [EU]   [CN]  
jenie_Neutral.tar.gz [47M]   ()   Mirrors: [US]   [EU]   [CN]  
jenie_Sleepy.tar.gz [57M]   ()   Mirrors: [US]   [EU]   [CN]  
josh_Amused.tar.gz [36M]   ()   Mirrors: [US]   [EU]   [CN]  
josh_Neutral.tar.gz [31M]   ()   Mirrors: [US]   [EU]   [CN]  
josh_Sleepy.tar.gz [32M]   ()   Mirrors: [US]   [EU]   [CN]  
sam_Amused.tar.gz [113M]   ()   Mirrors: [US]   [EU]   [CN]  
sam_Angry.tar.gz [68M]   ()   Mirrors: [US]   [EU]   [CN]  
sam_Disgusted.tar.gz [91M]   ()   Mirrors: [US]   [EU]   [CN]  
sam_Neutral.tar.gz [55M]   ()   Mirrors: [US]   [EU]   [CN]  
sam_Sleepy.tar.gz [96M]   ()   Mirrors: [US]   [EU]   [CN]  

About this resource:

The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems

Landpage: https://github.com/numediart/EmoV-DB

A description of the database here: https://arxiv.org/pdf/1806.09514.pdf

This dataset is built for the purpose of emotional speech synthesis. The transcript were based on the CMU arctic database: http://www.festvox.org/cmu_arctic/cmuarctic.data.

It includes recordings for four speakers- two males and two females.

The emotional styles are neutral, sleepiness, anger, disgust and amused.

Each audio file is recorded in 16bits .wav format

Spk-Je (Female, English: Neutral(417 files), Amused(222 files), Angry(523 files), Sleepy(466 files), Disgust(189 files))

Spk-Bea (Female, English: Neutral(373 files), Amused(309 files), Angry(317 files), Sleepy(520 files), Disgust(347 files))

Spk-Sa (Male, English: Neutral(493 files), Amused(501 files), Angry(468 files), Sleepy(495 files), Disgust(497 files))

Spk-Jsh (Male, English: Neutral(302 files), Amused(298 files), Sleepy(263 files))

File naming (audio_folder): anger_1-28_0011.wav - 1) first word (emotion style), 1-28 - annotation doc file range, Last four digit is the sentence number.

File naming (annotation_folder): anger_1-28.TextGrid - 1) first word (emotional style), 1-28- annotation doc range

You can cite the data using the following BibTeX entry:


@article{adigwe2018emotional,
  title={The emotional voices database: Towards controlling the emotion dimension in voice generation systems},
  author={Adigwe, Adaeze and Tits, No{\'e} and Haddad, Kevin El and Ostadabbas, Sarah and Dutoit, Thierry},
  journal={arXiv preprint arXiv:1806.09514},
  year={2018}
}

External URL: https://mega.nz/#F!KBp32apT!gLIgyWf9iQ-yqnWFUFuUHg