Multiple sound source tracking and identification via degenerate unmixing estimation technique and cardinality balanced multi-target multi-bernoulli filter (DUET-CBMeMBer) with track management
Access Status
Authors
Date
2015Type
Metadata
Show full item recordCitation
Source Title
ISBN
School
Collection
Abstract
In Source Separation research, 'cocktail party problem' is a challenging problem that research into source separation aims to solve. Many attempts have been made to solve this complex problem. A logical approach would be to break down this complex problem into several smaller problems which are solved in different stages - each considering various aspects. In this paper, we are providing a robust solution to a part of the problem by localizing and tracking multiple moving speech sources in a room environment. Here we study the separation problem for unknown number of moving sources. The DUET-CBMeMBer method we outline is capable of estimating the number of sound sources as well as tracking and labelling them. This paper proposes a track management technique that identifies sound sources based on their trajectory as an extension to the DUET-CBMeMBer technique.
Related items
Showing items related by title, author, creator and subject.
-
Storer, Christine; Noonan, John; Murray-Prior, Roy; Batt, Peter (2011)Tracking and tracing systems are being demanded by customers such as the major Australian supermarket chains, superior food service chains and globally in export markets such as the European Union and Asia. This includes ...
-
Mallick, M.; Vo, Ba-Ngu; Kirubarajan, T.; Arulampalam, S. (2013)Multitarget tracking has a long history spanning over 50 years and it refers to the problem of jointly estimating the number of targets and their states from sensor data. Today, multitarget tracking has found applications ...
-
Chong, Nicholas; Nordholm, Sven; Vo, Ba Tuong; Murray, Iain (2017)In a 'conference room scenario', the number of speech sources are not known a priori and the number of speech sources which are active remains unknown as these speech sources appear and disappear throughout the measurement ...