Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement
Date
2013
Abstract
In this paper, an a priori signal-to-noise ratio (SNR) estimator with a modified sigmoid gain function is proposed for real-time speech enhancement. The proposed sigmoid gain function has three parameters, which can be optimized so that it matches conventional gain functions. In addition, the joint temporal dynamics between the SNR estimate and the spectral gain function are investigated to improve the performance of the speech enhancement scheme. Because the widely used decision-directed (DD) a priori SNR estimate has a well-known one-frame delay that degrades speech quality, a modified a priori SNR estimator is proposed for the DD approach to overcome this delay. Evaluations are performed using an objective evaluation metric that measures the trade-off among noise reduction, speech distortion and musical noise in the enhanced signal. The results are compared using the PESQ and segmental SNR (SNRseg) measures as well as subjective listening tests. Simulation results show that the proposed gain function, which can flexibly model exponential distributions, is a potential alternative speech enhancement gain function.
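For orientation, the two ingredients named in the abstract can be sketched for a single frequency bin as follows. This is only an illustrative sketch, not the authors' method: `dd_a_priori_snr` implements the conventional decision-directed estimate (the one with the one-frame delay, not the modified estimator proposed in the paper), and `sigmoid_gain` uses a hypothetical three-parameter form (`slope`, `midpoint_db`, `floor`), since the abstract does not state the paper's parameterization. All function names, parameter names and default values are assumptions.

```python
import math


def dd_a_priori_snr(prev_gain, prev_noisy_power, noisy_power, noise_power,
                    alpha=0.98, xi_min=1e-3):
    """Conventional decision-directed a priori SNR estimate for one bin.

    Combines the clean-speech power estimated in the previous frame with
    the current maximum-likelihood term max(gamma - 1, 0). Relying on the
    previous frame is the source of the one-frame delay.
    """
    gamma = noisy_power / noise_power            # a posteriori SNR
    prev_clean_power = (prev_gain ** 2) * prev_noisy_power
    xi = (alpha * prev_clean_power / noise_power
          + (1.0 - alpha) * max(gamma - 1.0, 0.0))
    return max(xi, xi_min)                       # lower bound (assumed value)


def sigmoid_gain(xi, slope=0.5, midpoint_db=0.0, floor=0.1):
    """Hypothetical three-parameter sigmoid gain (not the paper's form).

    slope       steepness of the transition, per dB of a priori SNR
    midpoint_db a priori SNR (in dB) at the centre of the transition
    floor       minimum gain applied at very low SNR
    """
    xi_db = 10.0 * math.log10(xi)
    s = 1.0 / (1.0 + math.exp(-slope * (xi_db - midpoint_db)))
    return floor + (1.0 - floor) * s


def enhance_bin(noisy_powers, noise_power):
    """Run the estimator and gain over successive frames of one bin."""
    gains = []
    prev_gain, prev_noisy_power = 1.0, noisy_powers[0]
    for noisy_power in noisy_powers:
        xi = dd_a_priori_snr(prev_gain, prev_noisy_power,
                             noisy_power, noise_power)
        gain = sigmoid_gain(xi)
        gains.append(gain)
        prev_gain, prev_noisy_power = gain, noisy_power
    return gains
```

Calling `enhance_bin([4.0, 3.0, 0.5], noise_power=1.0)` returns one gain per frame, each between `floor` and 1. In a sketch like this, fitting the three parameters to a conventional gain curve would amount to minimizing the difference between `sigmoid_gain` and that curve over a range of a priori SNR values.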
Related items
Showing items related by title, author, creator and subject.
-
Yong, Pei; Nordholm, Sven; Dam, Hai Huyen Heidi (2012) In this paper, a modified a priori SNR estimator is proposed for speech enhancement. The well-known decision-directed (DD) approach is modified by matching each gain function with the noisy speech spectrum at current frame ...
-
Chan, Kit Yan; Yong, P.; Nordholm, Sven; Yiu, C.; Lam, H. (2014) Commercial speech recognizers have made possible many speech control applications such as wheelchair, tone-phone, multifunctional robotic arms and remote controls, for the disabled and paraplegic. However, they have a ...
-
Chan, Kit; Yong, Pei; Nordholm, Sven; Yiu, Ka Fai; Lam, H. (2014) Commercial speech recognizers have made possible many speech control applications such as wheelchair, tone-phone, multifunctional robotic arms and remote controls, for the disabled and paraplegic. However, they have a ...