Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
Access Status
Fulltext not available
Authors
Davis, A.
Nordholm, Sven
Togneri, R.
Date
2006
Type
Journal Article
Citation
Davis, Alan and Nordholm, Sven and Togneri, Roberto. 2006. Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold. IEEE Transactions on Audio, Speech, and Language Processing. 14 (2): pp. 412-423.
Source Title
IEEE Transactions on Audio, Speech, and Language Processing
School
Department of Electrical and Computer Engineering
Abstract
Traditionally, voice activity detection algorithms have been based on some combination of general speech properties such as temporal energy variations, periodicity, and spectrum. This paper describes a novel statistical method for voice activity detection using a signal-to-noise ratio measure. The method employs a low-variance spectrum estimate and determines an optimal threshold based on the estimated noise statistics. A possible implementation is presented, evaluated over a large test set, and compared to current standardized algorithms. The evaluations indicate promising results, with the proposed scheme being comparable or favorable over the whole test set.
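The pipeline named in the abstract (a low-variance spectrum estimate, a signal-to-noise ratio measure, and a threshold set from noise statistics) can be illustrated with a minimal sketch. This is not the authors' implementation, which this record does not reproduce. Everything specific here is an assumption made for illustration: averaging Hann-windowed periodograms over overlapping segments as the variance-reduction step, a mean per-bin power ratio as the SNR measure, a threshold of mean plus `k` standard deviations of that measure on noise-only frames, and the hypothetical names `welch_spectrum`, `snr_measure`, and `detect`.

```python
import cmath
import math
import random
from statistics import mean, stdev


def periodogram(segment):
    """Power spectrum of one Hann-windowed segment, computed with a naive DFT."""
    n = len(segment)
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]
    x = [s * w for s, w in zip(segment, window)]
    power = []
    for k in range(n // 2 + 1):
        acc = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        power.append(abs(acc) ** 2 / n)
    return power


def welch_spectrum(frame, seg_len=32, hop=16):
    """Average periodograms of overlapping segments (assumed variance-reduction step)."""
    starts = range(0, len(frame) - seg_len + 1, hop)
    spectra = [periodogram(frame[i:i + seg_len]) for i in starts]
    return [mean(col) for col in zip(*spectra)]


def snr_measure(frame_psd, noise_psd):
    """Mean per-bin ratio of frame power to estimated noise power (assumed measure)."""
    return mean(p / max(n, 1e-12) for p, n in zip(frame_psd, noise_psd))


def detect(frames, noise_frames, k=3.0):
    """Flag frames whose SNR measure exceeds a threshold derived from noise-only frames."""
    noise_spectra = [welch_spectrum(f) for f in noise_frames]
    noise_psd = [mean(col) for col in zip(*noise_spectra)]
    noise_scores = [snr_measure(s, noise_psd) for s in noise_spectra]
    # Assumed rule: mean + k * standard deviation of the measure under noise.
    threshold = mean(noise_scores) + k * stdev(noise_scores)
    decisions = [snr_measure(welch_spectrum(f), noise_psd) > threshold for f in frames]
    return decisions, threshold


if __name__ == "__main__":
    random.seed(0)
    noise_frames = [[random.gauss(0, 0.1) for _ in range(128)] for _ in range(10)]
    tone = [math.sin(2 * math.pi * 0.1 * t) + random.gauss(0, 0.1) for t in range(128)]
    decisions, threshold = detect([tone], noise_frames)
    print(decisions, threshold)
```

In this sketch the threshold is computed once from a fixed set of noise-only frames. Making it adaptive would mean recomputing `noise_psd` and the threshold as new frames are classified as noise, and the update rule for that is a further design choice not specified in this record.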
Related items
Showing items related by title, author, creator and subject.
- Kühnapfel, Thorsten (2009). For humans, hearing is the second most important sense, after sight. Therefore, acoustic information greatly contributes to observing and analysing an area of interest. For this reason combining audio and video cues for ...
- Atee, Mustafa; Lloyd, R.V.; Morris, T.; Cunningham, C. (2021). BACKGROUND: Recognizing pain in people with advanced dementia who cannot effectively communicate is difficult. As such, pain is underdetected and undermanaged in this group and can lead to behaviors and psychological ...
- Scott, Donald E. (2009). This study was a 360 degree exploration of the effectiveness of online learning experiences facilitated via Voice-over-Internet-Protocol (VoIP) by incorporating the insights afforded by students, their lecturers, and the ...