Model based methods for locating, enhancing and recognising low resolution objects in video

Kramer, Annika

dc.contributor.author	Kramer, Annika
dc.contributor.supervisor	Prof. Svetha Venkatesh
dc.contributor.supervisor	Assoc. Prof. Tele Tan
dc.date.accessioned	2017-01-30T09:51:16Z
dc.date.available	2017-01-30T09:51:16Z
dc.date.created	2010-08-26T02:14:23Z
dc.date.issued	2009
dc.identifier.uri	http://hdl.handle.net/20.500.11937/585
dc.description.abstract	Visual perception is our most important sense which enables us to detect and recognise objects even in low detail video scenes. While humans are able to perform such object detection and recognition tasks reliably, most computer vision algorithms struggle with wide angle surveillance videos that make automatic processing difficult due to low resolution and poor detail objects. Additional problems arise from varying pose and lighting conditions as well as non-cooperative subjects. All these constraints pose problems for automatic scene interpretation of surveillance video, including object detection, tracking and object recognition.Therefore, the aim of this thesis is to detect, enhance and recognise objects by incorporating a priori information and by using model based approaches. Motivated by the increasing demand for automatic methods for object detection, enhancement and recognition in video surveillance, different aspects of the video processing task are investigated with a focus on human faces. In particular, the challenge of fully automatic face pose and shape estimation by fitting a deformable 3D generic face model under varying pose and lighting conditions is tackled. Principal Component Analysis (PCA) is utilised to build an appearance model that is then used within a particle filter based approach to fit the 3D face mask to the image. This recovers face pose and person-specific shape information simultaneously. Experiments demonstrate the use in different resolution and under varying pose and lighting conditions. Following that, a combined tracking and super resolution approach enhances the quality of poor detail video objects. A 3D object mask is subdivided such that every mask triangle is smaller than a pixel when projected into the image and then used for model based tracking. The mask subdivision then allows for super resolution of the object by combining several video frames. This approach achieves better results than traditional super resolution methods without the use of interpolation or deblurring.Lastly, object recognition is performed in two different ways. The first recognition method is applied to characters and used for license plate recognition. A novel character model is proposed to create different appearances which are then matched with the image of unknown characters for recognition. This allows for simultaneous character segmentation and recognition and high recognition rates are achieved for low resolution characters down to only five pixels in size. While this approach is only feasible for objects with a limited number of different appearances, like characters, the second recognition method is applicable to any object, including human faces. Therefore, a generic 3D face model is automatically fitted to an image of a human face and recognition is performed on a mask level rather than image level. This approach does not require an initial pose estimation nor the selection of feature points, the face alignment is provided implicitly by the mask fitting process.
dc.language	en
dc.publisher	Curtin University
dc.subject	object recognition
dc.subject	video surveillance
dc.subject	visual perception
dc.subject	object detection
dc.subject	video processing
dc.subject	person-specific shape information
dc.subject	face pose
dc.subject	3D generic face model
dc.title	Model based methods for locating, enhancing and recognising low resolution objects in video
dc.type	Thesis
dcterms.educationLevel	PhD
curtin.department	Department of Computing
curtin.accessStatus	Open access

Files in this item

Name:: 145037_Kramer full.pdf
Size:: 35.08Mb
Format:: PDF

This item appears in the following Collection(s)

Curtin Theses

Show simple item record

Model based methods for locating, enhancing and recognising low resolution objects in video

Files in this item

This item appears in the following Collection(s)

Related items