Show simple item record

dc.contributor.authorWong, Y.W.
dc.contributor.authorCh’ng, S.I.
dc.contributor.authorSeng, K.P.
dc.contributor.authorAng, L.
dc.contributor.authorChin, S.W.
dc.contributor.authorChew, W.J.
dc.contributor.authorLim, Hann
dc.date.accessioned2017-01-30T11:38:43Z
dc.date.available2017-01-30T11:38:43Z
dc.date.created2014-11-19T01:13:25Z
dc.date.issued2011
dc.identifier.citationWong, Y.W. and Ch’ng, S.I. and Seng, K.P. and Ang, L. and Chin, S.W. and Chew, W.J. and Lim, H. 2011. A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities. Pattern Recognition Letters. 32 (13): pp. 1503-1510.
dc.identifier.urihttp://hdl.handle.net/20.500.11937/13681
dc.identifier.doi10.1016/j.patrec.2011.06.011
dc.description.abstract

Audio-visual recognition system is becoming popular because it overcomes certain problems of traditional audio-only recognition system. However, difficulties due to visual variations in video sequencecan significantly degrade the recognition performance of the system. This problem can be further complicated when more than one visual variation happen at the same time. Although several databases have been created in this area, none of them includes realistic visual variations in video sequence. With the aim to facilitate the development of robust audio-visual recognition systems, the new audio-visualUNMC-VIER database is created. This database contains various visual variations including illumination,facial expression, head pose, and image resolution variations. The most unique aspect of this database is that it includes more than one visual variation in the same video recording. For the audio part, the utterances are spoken in slow and normal speech pace to improve the learning process of audio-visual speech recognition system. Hence, this database is useful for the development of robust audio-visual person,speech recognition and face recognition systems.

dc.publisherElsevier BV, North-Holland
dc.subjectAudio-visual database
dc.subjectSpeech recognition
dc.subjectFace recognition
dc.subjectVisual variation
dc.titleA new multi-purpose audio-visual UNMC-VIER database with multiple variabilities
dc.typeJournal Article
dcterms.source.volume32
dcterms.source.number13
dcterms.source.startPage1503
dcterms.source.endPage1510
dcterms.source.issn0010-4469
dcterms.source.titlePattern Recognition Letters
curtin.accessStatusFulltext not available


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record