Topic transition detection using hierarchical hidden Markov and semi-Markov models

Phung, Dinh; Venkatesh, Svetha; Duong, Thi; Bui, Hung H.

Access Status

Fulltext not available

Authors

Phung, Dinh

Venkatesh, Svetha

Duong, Thi

Bui, Hung H.

Date

2005

Type

Conference Paper

Citation

Phung, Dinh and Venkatesh, Svetha and Duong, Thi and Bui, Hung. 2005. Topic transition detection using hierarchical hidden Markov and semi-Markov models, in ACM Press (ed), 13th ACM International Conference on Multimedia (ACM 2005), Nov 6 2005, pp. 11-20. Singapore: Association for Computing Machinery (ACM).

Source Title

Proceedings of the 13th ACM International Conference on Multimedia

Source Conference

13th ACM International Conference on Multimedia (ACM 2005)

Additional URLs

http://doi.acm.org/10.1145/1101149.1101153

ISBN

1595930442

Faculty

School of Electrical Engineering and Computing

Department of Computing

Faculty of Science and Engineering

Remarks

ACM Copyright notice: Copyright © 2005 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept., ACM, Inc., fax +1 (212) 869-0481, or permissions@acm.org

URI

http://hdl.handle.net/20.500.11937/18410

Collection

Curtin Research Publications

Abstract

In this paper we introduce a probabilistic framework that exploits hierarchy, structure sharing and duration information for topic transition detection in videos. The framework combines a shot classification step with a detection phase that uses hierarchical probabilistic models. We consider two models: the extended Hierarchical Hidden Markov Model (HHMM) and the Coxian Switching Hidden semi-Markov Model (S-HSMM). Both allow the natural decomposition of semantics in videos, including shared structures, to be modeled directly, thus enabling efficient inference and reducing the sample complexity in learning. Additionally, the S-HSMM allows duration information to be incorporated, so the modeling of long-term dependencies in videos is enriched through both hierarchical and duration modeling. Furthermore, the use of the Coxian distribution in the S-HSMM makes it tractable to deal with long sequences in video. Our experiments with the proposed framework on twelve educational and training videos show that both models outperform the baseline cases (flat HMM and HSMM) and the performance reported in earlier work on topic detection. The superior performance of the S-HSMM over the HHMM verifies our belief that duration information is an important factor in video content modeling.
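To illustrate why a Coxian duration model can stay tractable for long sequences, the following is a minimal sketch of one common discrete Coxian parameterization. It is an assumption here, not taken from this record: a duration is the sum of geometric phase lengths, entered at phase i with probability mu[i] and then run through every remaining phase. The names `coxian_pmf`, `mu` and `lam` are hypothetical. The point of the sketch is that M phases need only 2M parameters, however long the durations get, whereas an explicit duration table needs one entry per possible length.

```python
def geometric_pmf(lam, max_d):
    """P(X = d) for d = 1..max_d with X ~ Geom(lam) on {1, 2, ...}; index 0 stays 0."""
    return [0.0] + [lam * (1.0 - lam) ** (d - 1) for d in range(1, max_d + 1)]


def convolve(p, q, max_d):
    """PMF of the sum of two independent durations, truncated at max_d."""
    out = [0.0] * (max_d + 1)
    for a, pa in enumerate(p):
        if pa == 0.0:
            continue
        for b, qb in enumerate(q):
            if a + b > max_d:
                break
            out[a + b] += pa * qb
    return out


def coxian_pmf(mu, lam, max_d):
    """Truncated PMF of a discrete Coxian duration (assumed parameterization).

    mu[i]  : probability of entering at phase i (should sum to 1)
    lam[i] : geometric parameter of phase i
    """
    pmf = [0.0] * (max_d + 1)
    tail = None  # PMF of the summed lengths of phases i..M-1
    for i in reversed(range(len(lam))):
        g = geometric_pmf(lam[i], max_d)
        tail = g if tail is None else convolve(g, tail, max_d)
        for d in range(max_d + 1):
            pmf[d] += mu[i] * tail[d]
    return pmf


if __name__ == "__main__":
    # Hypothetical parameters: 3 phases, so 6 numbers describe the whole distribution.
    pmf = coxian_pmf(mu=[0.5, 0.3, 0.2], lam=[0.1, 0.2, 0.5], max_d=400)
    mean = sum(d * p for d, p in enumerate(pmf))
    print(f"total mass = {sum(pmf):.6f}, mean duration = {mean:.3f}")
```

With a single phase the model reduces to a plain geometric duration, which is the implicit duration model of an ordinary HMM state; adding phases allows more flexible duration shapes at a small parameter cost.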