Multi-modal Visual Features Based Video Shot Boundary Detection
Video shot boundary detection (SBD) is one of the essential pre-processing steps of semantic video analysis. It is the primary step in segmenting a sequence of video frames into shots. Many SBD systems based on supervised learning have been proposed over the years; however, the training process remains their principal limitation. This paper employs an SBD framework based on multi-modal visual features, which analyses the behaviour of the visual representation in terms of the discontinuity signal. Candidate segment selection requires no threshold calculation; instead, it uses the cumulative moving average of the discontinuity signal to identify the positions of shot boundaries and to discard non-boundary video frames. Transition detection is then performed in a structured manner to classify each candidate segment as either a cut transition or a gradual transition, the latter including fade in/out and logo occurrence. The framework is evaluated on golf video clips and on the TREC2001 documentary video dataset. Results show that the proposed SBD framework achieves good accuracy on both types of video dataset compared with other proposed SBD methods.
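The candidate selection idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not give the features, the distance measure, or the exact selection rule, so the L1 distance between consecutive frame feature vectors and the rule "keep positions where the signal exceeds its cumulative moving average" are assumptions made for illustration, and the function names are hypothetical.

```python
import numpy as np

def discontinuity_signal(features):
    """Distance between consecutive frame feature vectors.

    `features` has shape (n_frames, n_dims). The L1 distance is an
    assumption; the abstract does not specify the measure used.
    """
    features = np.asarray(features, dtype=float)
    return np.abs(np.diff(features, axis=0)).sum(axis=1)

def cumulative_moving_average(signal):
    """Running mean of the signal up to and including each position."""
    signal = np.asarray(signal, dtype=float)
    return np.cumsum(signal) / np.arange(1, len(signal) + 1)

def candidate_positions(signal):
    """Indices where the signal exceeds its cumulative moving average.

    Index i refers to the transition between frame i and frame i + 1.
    This comparison rule is an illustrative assumption, not the
    selection rule confirmed by the paper.
    """
    signal = np.asarray(signal, dtype=float)
    return np.flatnonzero(signal > cumulative_moving_average(signal))

if __name__ == "__main__":
    # Toy "video": five identical frames, then five different identical frames.
    frames = np.array([[1.0, 0.0]] * 5 + [[0.0, 1.0]] * 5)
    signal = discontinuity_signal(frames)
    print(candidate_positions(signal))
```

In this toy input the only non-zero discontinuity lies between the fifth and sixth frames, so that is the only position retained. No fixed threshold is supplied by the caller; the reference level adapts to the signal itself, which is the property the abstract highlights.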
Showing items related by title, author, creator and subject.
Tippaya, S.; Sitjongsataporn, S.; Tan, Tele; Chamnongthai, K.; Khan, M. (2015) © 2015 IEEE. Video shot boundary detection or shot segmentation is an integral part of semantic video analysis. The objective of this process is to automatically detect the boundary region in video that further segment the ...
Tippaya, Sawitchaya; Sitjongsataporn, S.; Tan, Tele; Chamnongthai, K. (2014) Video shot boundary detection is the process of automatically detecting the meaningful boundary in video data. It becomes an essential pre-processing step to video analysis, summarisation and other content-based retrieval. ...
Tippaya, S.; Tan, Tele; Khan, M.; Chamnongthai, K. (2015) © 2015 IEEE. Video shot boundary detection is the process of automatically detecting the meaningful boundary content in video. Most shot boundary categorisation techniques use features extracted from the video frames to ...