Multi-speaker separation employing microphone array and vertex finding algorithm
Access Status
Authors
Date
2018Type
Metadata
Show full item recordCitation
Source Title
ISBN
School
Collection
Abstract
© 2018 IEEE. This paper proposes a new speaker detection and signal separation algorithm for multiple speakers using microphone array data recorded in a room environment. The algorithm utilizes the fact that in multi-speaker conversations not all speakers are speaking simultaneously there are time segments when only a single speaker is active. Based on that observation a speech activity detector for each speaker (MVAD) has been developed. It is based on SRP-PHAT estimates for different blocks of data. We have shown that these estimates form vertexes in a convex polygon which can be employed to obtain MVAD detections. Those detections are then used to form Minimum Variance Distortionless Response (MVDR) beamformers. Evaluations based on real recorded speech data with 4 speakers show that the algorithm provides good interference suppression and low speech distortion for this difficult scenario.
Related items
Showing items related by title, author, creator and subject.
-
Lin, Shoufeng (2019)The thesis investigates the challenges of speaker localization in presence of strong reverberation, multi-speaker tracking, and multi-feature multi-speaker state filtering, using sound recordings from microphones. Novel ...
-
Jafari, I.; Haque, S.; Togneri, R.; Nordholm, Sven (2011)Recently, the concept of time-frequency masking has developed as an important approach to the blind source separation problem, particularly when in the presence of reverberation. However, previous research has been limited ...
-
Chong, N.; Wong, S.; Vo, Ba Tuong; Nordholm, Sven; Murray, Iain (2014)The "cocktail party problem" has always been a challenging problem to solve and many blind source separation algorithms have been proposed as solutions. This problem has mainly been discussed for non-moving sound sources ...