Multi-speaker separation employing microphone array and vertex finding algorithm
© 2018 IEEE. This paper proposes a new speaker detection and signal separation algorithm for multiple speakers using microphone array data recorded in a room environment. The algorithm exploits the fact that in multi-speaker conversations the speakers do not all talk simultaneously: there are time segments in which only a single speaker is active. Based on that observation, a speech activity detector for each speaker (MVAD) has been developed. It is based on SRP-PHAT estimates for different blocks of data. We show that these estimates form the vertices of a convex polygon, which can be used to obtain the MVAD detections. Those detections are then used to form Minimum Variance Distortionless Response (MVDR) beamformers. Evaluations based on real recorded speech data with 4 speakers show that the algorithm provides good interference suppression and low speech distortion for this difficult scenario.
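As an illustration of the final beamforming step, the following is a minimal sketch of the textbook narrowband MVDR weight computation, w = R⁻¹d / (dᴴR⁻¹d). It is not the paper's implementation. The function name `mvdr_weights`, the diagonal-loading parameter, the half-wavelength uniform linear array, and the toy covariance matrix are all assumptions made for this example; in the setting described above, the covariance would presumably be estimated from the time segments selected by the MVAD detections.

```python
import numpy as np

def mvdr_weights(R, d, diag_load=1e-6):
    """Textbook MVDR weights w = R^{-1} d / (d^H R^{-1} d).

    R: (M, M) interference-plus-noise covariance (assumed Hermitian).
    d: (M,) steering vector of the target speaker.
    diag_load: hypothetical regularisation factor, scaled by mean channel power.
    """
    R = np.asarray(R, dtype=complex)
    d = np.asarray(d, dtype=complex).reshape(-1)
    m = d.shape[0]
    # Diagonal loading keeps the linear solve well conditioned.
    R_loaded = R + diag_load * (np.trace(R).real / m) * np.eye(m)
    Rinv_d = np.linalg.solve(R_loaded, d)
    # Normalise so that the response towards d is exactly 1 (distortionless).
    return Rinv_d / (d.conj() @ Rinv_d)

# Toy scenario (assumed): 4-microphone uniform linear array with
# half-wavelength spacing, target at 20 degrees, interferer at -40 degrees.
m = 4
k = np.arange(m)
d = np.exp(-1j * np.pi * k * np.sin(np.deg2rad(20.0)))   # target steering vector
v = np.exp(-1j * np.pi * k * np.sin(np.deg2rad(-40.0)))  # interferer steering vector
R = 10.0 * np.outer(v, v.conj()) + 0.1 * np.eye(m)       # interferer + sensor noise

w = mvdr_weights(R, d)
target_gain = abs(w.conj() @ d)      # constrained to 1 by construction
interferer_gain = abs(w.conj() @ v)  # strongly attenuated
```

The normalisation by dᴴR⁻¹d is what enforces the distortionless constraint, so the target gain is one regardless of the covariance used, while the R⁻¹ term steers attenuation towards the dominant interference.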
Showing items related by title, author, creator and subject.
Acoustic Speaker Localization with Strong Reverberation and Adaptive Feature Filtering with a Bayes RFS Framework. Lin, Shoufeng (2019). The thesis investigates the challenges of speaker localization in presence of strong reverberation, multi-speaker tracking, and multi-feature multi-speaker state filtering, using sound recordings from microphones. Novel ...
Jafari, I.; Haque, S.; Togneri, R.; Nordholm, Sven (2011). Recently, the concept of time-frequency masking has developed as an important approach to the blind source separation problem, particularly in the presence of reverberation. However, previous research has been limited ...
Multiple moving speaker tracking via degenerate unmixing estimation technique and Cardinality Balanced Multi-target Multi-Bernoulli Filter (DUET-CBMeMBer). Chong, N.; Wong, S.; Vo, Ba Tuong; Nordholm, Sven; Murray, Iain (2014). The "cocktail party problem" has always been a challenging problem to solve and many blind source separation algorithms have been proposed as solutions. This problem has mainly been discussed for non-moving sound sources ...