Topic transition detection using hierarchical hidden Markov and semi-Markov models
Access Status
Authors
Date
2005Type
Metadata
Show full item recordCitation
Source Title
Source Conference
Additional URLs
ISBN
Faculty
Remarks
ACM Copyright notice: Copyright © 2005 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept., ACM, Inc., fax +1 (212) 869-0481, or permissions@acm.org
Collection
Abstract
In this paper we introduce a probabilistic framework to exploit hierarchy, structure sharing and duration information for topic transition detection in videos. Our probabilistic detection framework is a combination of a shot classification step and a detection phase using hierarchical probabilistic models. We consider two models in this paper: the extended Hierarchical Hidden Markov Model (HHMM) and the Coxian Switching Hidden semi-Markov Model (S-HSMM) because they allow the natural decomposition of semantics in videos, including shared structures, to be modeled directly, and thus enabling ecient inference and reducing the sample complexity in learning. Additionally, the S-HSMM allows the duration information to be incorporated, consequently the modeling of long-term dependencies in videos is enriched through both hierarchical and duration modeling. Furthermore, the use of the Coxian distribution in the S-HSMM makes it tractable to deal with long sequences in video. Our experimentation of the proposed framework on twelve educational and training videos shows that both models outperform the baseline cases (at HMM and HSMM) and performances reported in earlier work in topic detection. The superior performance of the S-HSMM over theHHMM veries our belief that duration information is an important factor in video content modeling.
Related items
Showing items related by title, author, creator and subject.
-
Duong, Thi V. T. (2008)Modeling patterns in temporal data has arisen as an important problem in engineering and science. This has led to the popularity of several dynamic models, in particular the renowned hidden Markov model (HMM) [Rabiner, ...
-
Duong, Thi; Bui, Hung H.; Phung, Dinh; Venkatesh, Svetha (2005)This paper addresses the problem of learning and recognizing human activities of daily living (ADL), which isan important research issue in building a pervasive and smart environment. In dealing with ADL, we argue that ...
-
Duong, Thi; Phung, Dinh; Bui, Hung H.; Venkatesh, Svetha (2009)A challenge in building pervasive and smart spaces is to learn and recognize human activities of daily living (ADLs). In this paper, we address this problem and argue that in dealing with ADLs, it is beneficial to exploit ...