Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments
MetadataShow full item record
This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation algorithm utilizes the steered response power phase transform for obtaining a localization estimate for each individual speech source in the frequency domain. Based on those estimates each desired speech signal is extracted from the speech mixture using an optimal beamforming technique. To solve the permutation problem, a permutation alignment algorithm based on the mutual output correlation is employed to group the output signals into the correct sources from each frequency bin. Evaluations using real speech recordings in a room environment show that the proposed blind speech separation algorithm offers high interference suppression level whilst maintaining low distortion level for each desired signal.
Showing items related by title, author, creator and subject.
Time-frequency clustering with weighted and contextual information for convolutive blind source separationJafari, I.; Atcheson, M.; Togneri, R.; Nordholm, Sven (2014)In this paper we investigate the use of observation weights and contextual time-frequency information for clustering-based blind source separation. Previous clustering-based approaches have successfully used clustering ...
A novel fuzzy clustering algorithm using observation weighting and context information for reverberant blind speech separationKuhne, M.; Togneri, R.; Nordholm, Sven (2009)Time-frequency masking has evolved as a powerful tool for tackling blind source separation problems. In previous work, mask estimation was performed with the help of well-known standard cluster algorithms. Spatial observation ...
Nordholm, Sven; Low, Siow Yong (2006)Speech signal extraction is becoming more and more important as evidently displayed by its numerous applications such as mobile phones, conference equipment and surveillance. This paper presents a blind method to enhance ...