Speech recognition enhancement using beamforming and a genetic algorithm
Access Status
Authors
Date
2009Type
Metadata
Show full item recordCitation
Source Title
Source Conference
Additional URLs
ISBN
Faculty
School
Remarks
Copyright © 2009 IEEE This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Collection
Abstract
This paper proposes a genetic algorithm (GA) based beamformer to optimize speech recognition accuracy for a pretrained speech recognizer. The proposed beamformer is designed to tackle the non-differentiable and non-linear natures of speech recognition by employing the GA algorithm to search for the optimal beamformer weights. Specifically, a population of beamformer weights is reproduced by crossover and mutation until the optimal beamformer weights are obtained. Results show that the speech recognition accuracies can be greatly improved even in noisy environments.
Related items
Showing items related by title, author, creator and subject.
-
Yiu, K.; Chan, Kit Yan; Grbić, N.; Nordholm, Sven (2012)In this paper, a new approach to designing beamformers for voice control device is proposed. It is well-known that under a strong near-field noise with low signal-to-noise ratios (SNR), the performance of speech recognition ...
-
Source separation employing beamforming and SRP-PHAT localization in three-speaker room environmentsNordholm, Sven (2017)This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation ...
-
Chan, Kit Yan; Nordholm, Sven; Yiu, Ka Fai; Togneri, R. (2013)Industrial automation with speech control functions is generally installed with a speech recognition sensor which is used as an interface for users to articulate speech commands. However, recognition errors are likely to ...