Improved a priori SNR estimation with application in Log-MMSE speech estimation
MetadataShow full item record
A speech enhancement method utilizing the harmonic structure of speech is presented. The method is an extension of the well known minimum mean square error log-spectral amplitude estimator(Log MMSE) method for speech enhancement. The improvement lies specifically on a priori SNR estimation by utilizing harmonic structure of speech. The method is based on a conditional averaging operation over adjacent frequency bands for each processed data block. The actual frequency bands used in the conditional averaging is determined by a pitch detector. Thus voiced segments are averaged over frequency according to the pitch and the corresponding harmonic structure of voiced speech. Non-voiced segments are averaged over frequency according to a random number depending on the pitch value. The result is overall better SNR and SNRSeg values in white noise over the standard Log MMSE reference method. In babble noise, the estimator rendered similar SNR and SNRSeg values as the Log-MMSE reference method. Subjectively the residue background noise sounded more natural when using the suggested method. ©2009 IEEE.
Showing items related by title, author, creator and subject.
Guo, S.; He, Z.; Jiang, W.; Ou, Y.; Niu, K.; Rong, Yue; Caley, Michael; Duncan, Alec (2013)The underwater acoustic (UA) channel is dispersive in both time and frequency with severe frequency-dependent signal attenuation. Efficient channel estimation and tracking are crucial to coherent high-rate UA communication. ...
Singh, M.; Low, S.; Nordholm, Sven; Zang, Z. (2018)© 2017 Modulation domain has been reported to be a better alternative to time-frequency domain for speech enhancement, as speech intelligibility is closely linked with the modulation spectrum. Motivated by that, this paper ...
Development Of An Evaluation Tool For Use At The Design Stage Of Auditoria With Respect To Unassisted Speech ReinforcementMcMinn, Terrance (1996)This dissertation describes the development of an evaluation tool that can be used by an acoustican during the design stage of enclosures used for unassisted speech. Enclosures include lecture theatres, lecture halls and ...