Multiple speaker tracking with the GLMB filter
MetadataShow full item record
Funding and Sponsorship
© 2017 IEEE. In this paper we propose a new solution to the problem of tracking multiple speakers from multiple microphone arrays in a reverberant acoustic environment. The acoustic environment with its complex reflection patterns with its underlying data association uncertainty pose the two most significant challenges in the multi-speaker tracking problem. We provide an approach that employs individual Time Difference of Arrival measurements collected by pairs of microphones in using multiple distributed pairs in conjunction with the Generalized Labeled Multi-Bernoulli (GLMB) tracker. The distributed measurements together with the GLMB tracking filter exploits the spatiotemporal correlation of the true sources from data frame to data frame, whereas the spurious measurements arising from reverberations exhibit no temporal consistency as the speakers move in the room.
Showing items related by title, author, creator and subject.
Multiple moving speaker tracking via degenerate unmixing estimation technique and Cardinality Balanced Multi-target Multi-Bernoulli Filter (DUET-CBMeMBer)Chong, N.; Wong, S.; Vo, Ba Tuong; Nordholm, Sven; Murray, Iain (2014)The "cocktail party problem" has always been a challenging problem to solve and many blind source separation algorithms have been proposed as solutions. This problem has mainly been discussed for non-moving sound sources ...
Acoustic Speaker Localization with Strong Reverberation and Adaptive Feature Filtering with a Bayes RFS FrameworkLin, Shoufeng (2019)The thesis investigates the challenges of speaker localization in presence of strong reverberation, multi-speaker tracking, and multi-feature multi-speaker state filtering, using sound recordings from microphones. Novel ...
Hong Dam, H.; Nordholm, Sven (2018)© 2018 IEEE. This paper proposes a new speaker detection and signal separation algorithm for multiple speakers using microphone array data recorded in a room environment. The algorithm utilizes the fact that in multi-speaker ...