An Online Solution for Localisation, Tracking and Separation of Moving Speech Sources
MetadataShow full item record
The problem of separating a time varying number of speech sources in a room is difficult to solve. The challenge lies in estimating the number and the location of these speech sources. Furthermore, the tracked speech sources need to be separated. This thesis proposes a solution which utilises the Random Finite Set approach to estimate the number and location of these speech sources and subsequently separate the speech source mixture via time frequency masking.
Showing items related by title, author, creator and subject.
Source separation employing beamforming and SRP-PHAT localization in three-speaker room environmentsNordholm, Sven (2017)This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation ...
Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency maskingChong, Nicholas; Nordholm, Sven; Vo, Ba Tuong; Murray, Iain (2017)In a 'conference room scenario', the number of speech sources are not known a priori and the number of speech sources which are active remains unknown as these speech sources appear and disappear throughout the measurement ...
Dam, H.; Nordholm, Sven (2013)This paper investigates the problem of subband speech separation from a mixture of two speech signals in a room environment. Due to the lack of source information, a sound source localization is proposed for beamformer ...