Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization

    Access Status
    Fulltext not available
    Authors
    Chan, Kit Yan
    Yiu, Cedric K.F.
    Dillon, Tharam S.
    Nordholm, Sven
    Ling, S.H.
    Date
    2012
    Type
    Journal Article
    
    Metadata
    Show full item record
    Citation
    Chan, Kit Yan and Yui, Cedric K.F. and Dillon, Tharam S. and Nordholm, S. and Ling, Sai Ho. 2012. Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization. IEEE Transactions on Industrial Informatics. PP (99): pp. 1-11.
    Source Title
    IEEE Transactions on Industrial Informatics
    DOI
    10.1109/TII.2012.2187910
    ISSN
    15513203
    School
    Department of Electrical and Computer Engineering
    URI
    http://hdl.handle.net/20.500.11937/16983
    Collection
    • Curtin Research Publications
    Abstract

    For over two decades, speech control mechanisms have been widely applied in manufacturing systems such as factory automation, warehouse automation and industrial robotic control for over two decades. To implement speech controls, a commercial speech recognizer is used as the interface between users and the automation system. However, users’ commands are often contaminated by environmental noise which degrades the performance of speech recognition for controlling automation systems. This paper presents a multichannel signal enhancement methodology to improve the performance of commercial speech recognizers. The proposed methodology aims to optimize speech recognition accuracy of a commercial speech recognizer in a noisy environment based on a beam former, which is developed by an intelligent particle swarm optimization. It overcomes the limitation of the existing signal enhancement approaches whereby the parameters inside commercial speech recognizers are required to be tuned, which is impossible in a real-world situation. Also, it overcomes the limitation of the existing optimization algorithm including gradient descent methods, genetic algorithms and classical particle swarm optimization that are unlikely to develop optimal beam formers for maximizing speech recognition accuracy. The performance of the proposed methodology was evaluated by developing beam formers for a commercial speech recognizer, which was implemented on warehouse automation. Results indicate a significant improvement regarding speech recognition accuracy.

    Related items

    Showing items related by title, author, creator and subject.

    • Multichannel filters for speech recognition using a particle swarm optimization
      Chan, Kit Yan; Yiu, Ka Fai; Nordholm, Sven (2012)
      Speech recognition has been used in various real-world applications such as automotive control, electronic toys, electronic appliances etc. In many applications involved speech control functions, a commercial speech ...
    • Speech Enhancement Strategy for Speech Recognition Microcontroller under Noisy Environments
      Chan, Kit Yan; Nordholm, Sven; Yiu, Ka Fai; Togneri, R. (2013)
      Industrial automation with speech control functions is generally installed with a speech recognition sensor which is used as an interface for users to articulate speech commands. However, recognition errors are likely to ...
    • A hybrid noise suppression filter for accuracy enhancement of commercial speech recognizers in varying noisy conditions
      Chan, Kit Yan; Yong, P.; Nordholm, Sven; Yiu, C.; Lam, H. (2014)
      Commercial speech recognizers have made possible many speech control applications such as wheelchair, tone-phone, multifunctional robotic arms and remote controls, for the disabled and paraplegic. However, they have a ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.