A noisy speech corpus (NOIZEUS) was developed
to facilitate comparison of speech enhancement algorithms among research groups.
The noisy database contains 30 IEEE sentences (produced by three male and three
female speakers) corrupted by eight different real-world noises at different
SNRs. The noise was taken from the
The description of this corpus was published in the following paper, which we ask that you cite when using NOIZEUS:
Hu, Y. and Loizou, P. (2007). “Subjective evaluation and comparison of speech enhancement algorithms,” Speech Communication, 49, 588-601. [pdf]
The NOIZEUS corpus was also used by our lab to evaluate the correlations of common objective measures used in speech enhancement. This work was reported in the following papers:
Hu, Y. and Loizou, P. (2008). “Evaluation of objective quality measures for speech enhancement,” IEEE Transactions on Speech and Audio Processing, 16(1), 229-238. [Matlab code]
Ma, J., Hu, Y. and Loizou, P. (2009). "Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions", Journal of the Acoustical Society of America, 125(5), 3387-3405 [ pdf ]
Speech Material |
Thirty sentences from the
IEEE sentence database [1] were recorded in a sound-proof booth using Tucker
Davis Technologies (TDT) recording equipment. The sentences were produced by
three male and three female speakers. The IEEE database (720 sentences)
was used as it contains phonetically-balanced sentences with relatively low
word-context predictability. The thirty sentences were selected from the IEEE
database so as to include all phonemes in the American English language. The list of sentences
recorded for NOIZEUS are given in Table 1. The sentences were originally sampled
at 25 kHz and downsampled to 8 kHz.
Filtering |
Figure 1: Frequency response of IRS filter.
Adding Noise |
Noise signals were taken
from the
The long-term spectra of the
above noises are given in [4]. The noise signals were added to the speech
signals at SNRs of 0dB, 5dB, 10dB, and 15dB.
Download files |
Train noise
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
Babble noise
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
Car noise
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
Exhibition hall noise
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
Restaurant noise
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
Street noise
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
Airport noise
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
Train station
SNR (dB) |
0 |
5 |
10 |
15 |
Noisy files (.zip) |
References
[1] IEEE
Subcommittee (1969). IEEE Recommended Practice for Speech Quality Measurements.
IEEE Trans. Audio and Electroacoustics,
AU-17(3), 225-246.
[2] ITU
P.862 (2000). Perceptual evaluation of
speech quality (PESQ), and objective method for end-to-end speech quality
assessment of narrowband telephone networks and speech codecs. ITU-T Recommendation P. 862
[3] ITU-T P.56 (1993). Objective measurement of
active speech level.
[4] H.
Hirsch, and D. Pearce (2000). “The
Any inquiries or suggestions about NOIZEUS, please direct them to Dr. Philip Loizou (email: [email protected])