I contribute my personal voice as a person believing in a world where all people are equal. No matter of gender, sexual orientation, religion, skin color and geocoordinates of birth location. A global world where everybody is warmly welcome on any place on this planet and open and free knowledge and education is available to everyone.
(Thorsten Müller, 2021)
tl;dr Please don't use for evil!
Emotional dataset details:
- recordings from male single-speaker native german speaker (Thorsten Müller)
- audio optimized (Dominik Kreutz)
- 300 identical phrases recorded in 8 DIFFERENT EMOTIONS = 2.400 recordings = 175 minutes audio material
- ljspeech-1.1 directory structure
- samplerate 22.050Hz
- mono
- normalized to -24dB
- phrase length 59 - 148 chars
- no silence at beginning/ending
Recording length (300 identical phrases) for each emotion
- "neutral" (19 minutes)
- "amused" (18 minutes)
- "angry" (20 minutes)
- "disgusted" (23 minutes)
- "sleepy" (30 minutes)
- "surprised" (18 minutes)
- "drunk", recorded sober without being drunk, just pronounced it that way :-) (25 minutes)
- "whispering" (22 minutes)
Additional links
Please keep in mind that a am no professional speaker or voice actor. I'm just a guy sharing his voice.