Compressive speech enhancement
Access Status
Authors
Date
2013Type
Metadata
Show full item recordCitation
Source Title
ISSN
School
Collection
Abstract
This paper presents an alternative approach to speech enhancement by using compressed sensing (CS). CS is a new sampling theory, which states that sparse signals can be reconstructed from far fewer measurements than the Nyquist sampling. As such, CS can be exploited to reconstruct only the sparse components (e.g., speech) from the mixture of sparse and non-sparse components (e.g., noise). This is possible because in a time-frequency representation, speech signal is sparse whilst most noise is non-sparse. Derivation shows that on average the signal to noise ratio (SNR) in the compressed domain is greater or equal than the uncompressed domain. Experimental results concur with the derivation and the proposed CS scheme achieves better or similar perceptual evaluation of speech quality (PESQ) scores and segmental SNR compared to other conventional methods in a wide range of input SNR.
Related items
Showing items related by title, author, creator and subject.
-
Pathirage, C.; Li, Jun; Li, L.; Hao, Hong; Liu, Wan-Quan; Wang, R. (2019)© The Author(s) 2018. This article proposes a deep sparse autoencoder framework for structural damage identification. This framework can be employed to obtain the optimal solutions for some pattern recognition problems ...
-
Babakmehr, M.; Harirchi, F.; Al Durra, A.; Muyeen, S.M.; Simões, M. (2018)© 2018 IEEE Due to system complexity and structural variations, real time power line outage detection (POD) and localization is a critical and challenging monitoring goal for modern smart grid (SG). Online monitoring of ...
-
Pham, DucSon (2015)Following advances in compressed sensing and high-dimensional statistics, many pattern recognition methods have been developed with l1 regularization, which promotes sparse solutions. In this work, we instead advocate ...