Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
dc.contributor.author | Davis, A. | |
dc.contributor.author | Nordholm, Sven | |
dc.contributor.author | Togneri, R. | |
dc.date.accessioned | 2017-01-30T14:09:29Z | |
dc.date.available | 2017-01-30T14:09:29Z | |
dc.date.created | 2012-01-10T20:00:57Z | |
dc.date.issued | 2006 | |
dc.identifier.citation | Davis, Alan and Nordholm, Sven and Togneri, Roberto. 2006. Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold. IEEE Transactions on Audio, Speech, and Language Processing. 14 (2): pp. 412-423. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/37889 | |
dc.description.abstract |
Traditionally voice activity detection algorithms are based on any combination of general speech properties such as temporal energy variations, periodicity, and spectrum. This paper describes a novel statistical method for voice activity detection using a signal to noise ratio measure. The method employs a low-variance spectrum estimate and determines an optimal threshold based on the estimated noise statistics. A possible implementation is presented and evaluated over a large test set and compared to current modern standardized algorithms. The evaluations indicate promising results with the proposed scheme being comparable or favorable over the whole test set. | |
dc.publisher | IEEE Signal Processing Society | |
dc.relation.uri | http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1597247 | |
dc.subject | voice activity detector | |
dc.subject | - VAD | |
dc.subject | Voice activity detection | |
dc.subject | statistical decision | |
dc.subject | adaptive voice activity detection | |
dc.title | Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold | |
dc.type | Journal Article | |
dcterms.source.volume | 14 | |
dcterms.source.number | 2 | |
dcterms.source.startPage | 412 | |
dcterms.source.endPage | 423 | |
dcterms.source.issn | 1558-7916 | |
dcterms.source.title | IEEE Transactions on Audio, Speech, and Language Processing | |
curtin.department | Department of Electrical and Computer Engineering | |
curtin.accessStatus | Fulltext not available |