An Online Solution for Localisation, Tracking and Separation of Moving Speech Sources
Access Status
Open access
Authors
Chong, Nicholas Ewe Hai
Date
2015Supervisor
Assoc. Prof. Ba Tuong
Prof. Sven Nordholm
Dr Iain Murray
Type
Thesis
Award
PhD
Metadata
Show full item recordSchool
Department of Electrical and Computer Engineering
Collection
Abstract
The problem of separating a time varying number of speech sources in a room is difficult to solve. The challenge lies in estimating the number and the location of these speech sources. Furthermore, the tracked speech sources need to be separated. This thesis proposes a solution which utilises the Random Finite Set approach to estimate the number and location of these speech sources and subsequently separate the speech source mixture via time frequency masking.
Related items
Showing items related by title, author, creator and subject.
-
Source separation employing beamforming and SRP-PHAT localization in three-speaker room environmentsNordholm, Sven (2017)This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation ...
-
Chong, Nicholas; Nordholm, Sven; Vo, Ba Tuong; Murray, Iain (2017)In a 'conference room scenario', the number of speech sources are not known a priori and the number of speech sources which are active remains unknown as these speech sources appear and disappear throughout the measurement ...
-
Dam, H.; Nordholm, Sven (2013)This paper investigates the problem of subband speech separation from a mixture of two speech signals in a room environment. Due to the lack of source information, a sound source localization is proposed for beamformer ...