Sound source localization for subband-based two speech separation in room environment
Access Status
Fulltext not available
Authors
Dam, H.
Nordholm, Sven
Date
2013Type
Conference Paper
Metadata
Show full item recordCitation
Dam, Hai Quang Hong and Nordholm, Sven. 2013. Sound source localization for subband-based two speech separation in room environment, in Proceedings of the International Conference on Control, Automation and Information Sciences (ICCAIS), Nov 25-28 2013, pp. 252-256. Nha Trang, Vietnam: IEEE.
Source Title
Proceedings of 2013 International Conference on Room Environment
Source Conference
2013 Internaitonal Conference on Room Environment
ISBN
Collection
Abstract
This paper investigates the problem of subband speech separation from a mixture of two speech signals in a room environment. Due to the lack of source information, a sound source localization is proposed for beamformer design in each subband to obtain a good interference suppression with negligible expense on each target signal integrity. Evaluations on real speech data show that the proposed speech separation method offers a good interference suppression level whilst maintaining a low distortion level of the target source.
Related items
Showing items related by title, author, creator and subject.
-
Kühnapfel, Thorsten (2009)For humans, hearing is the second most important sense, after sight. Therefore, acoustic information greatly contributes to observing and analysing an area of interest. For this reason combining audio and video cues for ...
-
Source separation employing beamforming and SRP-PHAT localization in three-speaker room environmentsNordholm, Sven (2017)This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation ...
-
Cocks, Naomi; Sautin, L.; Kita, S.; Morgan, G.; Zlotowitz, S. (2009)Background: In order to comprehend fully a speaker's intention in everyday communication, information is integrated from multiple sources, including gesture and speech. There are no published studies that have explored ...