This corpus was recorded by cellphone in a quiet indoor environment. It has 855 speakers, and each speaker has 120 utterances. All utterances were carefully transcribed and checked by humans. Transcription accuracy is guaranteed; if you find any problems, we will correct them for you. The corpus contains:
	audio files;
	transcriptions;
	metadata.
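
The figures above imply 855 x 120 = 102,600 utterances in total. The following is a minimal sketch, not an official tool, for checking that a local copy is complete. It assumes the audio files use a ".wav" extension and sit somewhere under a single directory; the extension, the directory path, and the function names are illustrative assumptions and are not stated in this README.

```python
from pathlib import Path

SPEAKERS = 855                 # from the corpus description
UTTERANCES_PER_SPEAKER = 120   # from the corpus description
EXPECTED_TOTAL = SPEAKERS * UTTERANCES_PER_SPEAKER  # 102600


def count_audio_files(corpus_dir, extension=".wav"):
    """Count files under corpus_dir (recursively) that end with `extension`.

    The ".wav" default is an assumption; adjust it to match your copy.
    """
    return sum(
        1 for p in Path(corpus_dir).rglob("*" + extension) if p.is_file()
    )


def check_corpus(corpus_dir, extension=".wav"):
    """Return (found, expected, complete) for a local copy of the corpus."""
    found = count_audio_files(corpus_dir, extension)
    return found, EXPECTED_TOTAL, found == EXPECTED_TOTAL
```

For example, check_corpus("path/to/corpus") returns the number of audio files found, the expected total, and whether the two match. The path here is a placeholder for wherever the corpus was unpacked.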

Please cite the data as “ST-CMDS-20170001_1, Free ST Chinese Mandarin Corpus”.

This data set is a subset of a much larger data set that was recorded in the same environment. Please visit our website, www.surfing.ai, for details.