A 1000-Hour Spontaneous Dialogue Mandarin Speech Recognition Database


A 1000-hour Spontaneous Mandarin Dialogue Corpus for Recognition has been released. It is recorded in realistic indoor environments using 48 kHz sampling frequency and 16-bit quantization accuracy mono PCM format with desktop microphone.

1400 native Mandarin speakers (1:1 gender distribution) with their respected dialectal characteristics are carefully selected. 

Each session contains two speakers with one of the 70 predefined topics, such as Music, Movie, Game, Health, Shopping, Transportation, Education, and etc.

During the corpus design and collection process, we have formed an efficient team with high standard capabilities. This leads to the great success for our corpus to have below 5% sentence-wise error.

This corpus can be used for training and testing the speech recognition system, as well as speech analysis. It has been well-acknowledged by industry as a corpus with high speech quality and recognition accuracy.