Blind Separation for Multiple Moving Sources with Labeled Random Finite Sets
dc.contributor.author | Ong, Jonah | |
dc.contributor.author | Vo, Ba Tuong | |
dc.contributor.author | Nordholm, Sven | |
dc.date.accessioned | 2023-03-09T08:07:56Z | |
dc.date.available | 2023-03-09T08:07:56Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Ong, J. and Vo, B.T. and Nordholm, S. 2021. Blind Separation for Multiple Moving Sources with Labeled Random Finite Sets. IEEE/ACM Transactions on Audio Speech and Language Processing. 29: pp. 2137-2151. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/90800 | |
dc.identifier.doi | 10.1109/TASLP.2021.3087003 | |
dc.description.abstract |
This paper proposes a novel solution for separating an unknown and time-varying number of moving acoustic sources in a blind setting using multiple microphone arrays. A standard steered-response power phase transform method is applied to extract source position measurements, which inevitably contain noise, false detections, missed detections, and are not labeled with the source identities. The imperfect measurements lead to the space-time permutation problem, as there is no information on how the measurements are associated to the sources in space, nor how the measurements are connected across time, if at all. To solve this problem, a labeled random finite set tracking framework is adopted to jointly estimate the source positions and their labels or identities. Based on these trajectory estimates, a corresponding set of time-varying generalized side-lobe cancellers is constructed to perform source separation. The overall algorithm operates in a block-wise or an online fashion and is scalable with the number of microphone arrays. The quality of the measurements, tracking, and separation, are evaluated respectively, with the OSPA metric, OSPA(2) metric, and ITU-T P.835 based listening tests, on both real-world and simulated data. | |
dc.language | English | |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | |
dc.relation.sponsoredby | http://purl.org/au-research/grants/arc/DP170104854 | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Science & Technology | |
dc.subject | Technology | |
dc.subject | Acoustics | |
dc.subject | Engineering, Electrical & Electronic | |
dc.subject | Engineering | |
dc.subject | Time measurement | |
dc.subject | Position measurement | |
dc.subject | Microphone arrays | |
dc.subject | Noise measurement | |
dc.subject | Acoustic measurements | |
dc.subject | Trajectory | |
dc.subject | Blind source separation | |
dc.subject | multi-object tracking | |
dc.subject | labeled random finite sets | |
dc.subject | acoustic localization | |
dc.subject | spatial filtering | |
dc.subject | TIME-VARYING NUMBER | |
dc.subject | ACOUSTIC SOURCE | |
dc.subject | TRACKING | |
dc.subject | IMPLEMENTATION | |
dc.subject | ALGORITHMS | |
dc.subject | SPEAKERS | |
dc.title | Blind Separation for Multiple Moving Sources with Labeled Random Finite Sets | |
dc.type | Journal Article | |
dcterms.source.volume | 29 | |
dcterms.source.startPage | 2137 | |
dcterms.source.endPage | 2151 | |
dcterms.source.issn | 2329-9290 | |
dcterms.source.title | IEEE/ACM Transactions on Audio Speech and Language Processing | |
dc.date.updated | 2023-03-09T08:07:55Z | |
curtin.department | School of Elec Eng, Comp and Math Sci (EECMS) | |
curtin.accessStatus | Open access | |
curtin.faculty | Faculty of Science and Engineering | |
curtin.contributor.orcid | Vo, Ba Tuong [0000-0002-3954-238X] | |
curtin.contributor.orcid | Nordholm, Sven [0000-0001-8942-5328] | |
curtin.contributor.orcid | Ong, Jonah [0000-0002-8019-0099] | |
curtin.contributor.researcherid | Nordholm, Sven [J-5247-2014] | |
dcterms.source.eissn | 2329-9304 | |
curtin.contributor.scopusauthorid | Vo, Ba Tuong [9846846600] | |
curtin.contributor.scopusauthorid | Nordholm, Sven [7005690573] |