
    espace - Curtin’s institutional repository

    On the integration of time-frequency masking speech separation and recognition in underdetermined environments

    Access Status
    Fulltext not available
    Authors
    Jafari, I.
    Haque, S.
    Togneri, R.
    Nordholm, Sven
    Date
    2012
    Type
    Conference Paper
    
    Citation
    Jafari, I. and Haque, S. and Togneri, R. and Nordholm, S. 2012. On the integration of time-frequency masking speech separation and recognition in underdetermined environments, pp. 1613-1617.
    Source Title
    Conference Record - Asilomar Conference on Signals, Systems and Computers
    DOI
    10.1109/ACSSC.2012.6489303
    ISBN
    9781467350518
    School
    Department of Electrical and Computer Engineering
    URI
    http://hdl.handle.net/20.500.11937/5370
    Collection
    • Curtin Research Publications
    Abstract

    The successful application of automatic speech recognition systems in the real world is conditional on their ability to handle realistic environments with unfavorable conditions such as reverberation and multiple sources of interference. Previous research has identified time-frequency masking approaches to blind source separation as viable for separating multiple sources in reverberant conditions. It is proposed that using such separation techniques as a front end to speech recognition will improve recognition accuracy. Experimental evaluations confirmed the hypothesis, with an improvement in recognition accuracy of over 20% at a reverberation time of RT60 = 300 ms; this indicates the potential for future research in this field. © 2012 IEEE.

    Related items

    Showing items related by title, author, creator and subject.

    • A New Evidence Model for Missing Data Speech Recognition With Applications in Reverberant Multi-Source Environments
      Kuhne, M.; Togneri, R.; Nordholm, Sven (2011)
      Conventional hidden Markov model (HMM) decoders often experience severe performance degradations in practice due to their inability to cope with uncertain data in time-varying environments. In order to address this issue, ...
    • Face recognition based on Kinect
      Li, B.; Mian, A.; Liu, Wan-Quan; Krishna, Aneesh (2015)
      In this paper, we present a new algorithm that utilizes low-quality red, green, blue and depth (RGB-D) data from the Kinect sensor for face recognition under challenging conditions. This algorithm extracts multiple features ...
    • Using Kinect for face recognition under varying poses, expressions, illumination and disguise
      Li, Billy; Mian, A.; Liu, Wan-Quan; Krishna, Aneesh (2013)
      We present an algorithm that uses a low resolution 3D sensor for robust face recognition under challenging conditions. A preprocessing algorithm is proposed which exploits the facial symmetry at the 3D point cloud level ...
