    espace - Curtin’s institutional repository

    MB3-Miner: Efficient mining eMBedded subTREEs using tree model guided candidate generation

    20277_downloaded_stream_265.pdf (435.5Kb)
    Access Status
    Open access
    Authors
    Chang, Elizabeth
    Tan, H.
    Dillon, Tharam S.
    Hadzic, Fedja
    Feng, L.
    Date
    2005
    Type
    Conference Paper
    
    Citation
    Chang, Elizabeth and Tan, Henry and Dillon, Tharam S. and Hadzic, Fedja and Feng, Ling. 2005. MB3-Miner: Efficient mining eMBedded subTREEs using tree model guided candidate generation, in Ras, Z.W. and Tsumoto, S. and Zighed, D.A. (eds), First International Workshop on Mining Complex Data (MCD) in conjunction with ICDM'05, Nov 27 2005, pp. 103-110. Houston, Texas, USA: IEEE.
    Source Title
    Proceedings of the First International Workshop on Mining Complex Data (MCD)
    Source Conference
    First International Workshop on Mining Complex Data (MCD) in conjunction with ICDM'05
    Faculty
    Curtin Business School
    School of Information Systems
    School
    Centre for Extended Enterprises and Business Intelligence
    Remarks

    Copyright 2005 IEEE

    This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

    URI
    http://hdl.handle.net/20.500.11937/33568
    Collection
    • Curtin Research Publications
    Abstract

    Tree mining has many useful applications in areas such as bioinformatics, XML mining, and Web mining. Most of the formally represented information in these domains is in a tree-structured form. In this paper we focus on mining frequent embedded subtrees from databases of rooted labelled ordered subtrees. We propose a novel embedding list representation that is suitable for describing embedded subtrees. This representation differs from the string-like or conventional adjacency list representations previously used for trees. We present the mathematical model of a breadth-first-search Tree Model Guided (TMG) candidate generation approach previously introduced in [8]. The key characteristic of the TMG approach is that, unlike the join approach, it enumerates fewer candidates by generating only valid candidates that conform to the structural aspects of the data. Our experiments with both synthetic and real-life datasets compare the technique against one of the state-of-the-art algorithms, TreeMiner [15], and demonstrate its effectiveness and efficiency.
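
To make the term "embedded subtree" in the abstract concrete, the following is a minimal sketch, assuming the commonly used definition for ordered trees: a parent-child edge in the pattern may correspond to any ancestor-descendant pair in the data tree, and the left-to-right order of siblings is preserved. It is a brute-force membership check written for illustration only. It is not the embedding list representation or the TMG candidate generation described in the paper, and the names `flatten` and `is_embedded_subtree` are hypothetical.

```python
def flatten(tree):
    """Pre-order encoding of a (label, [children]) tree.

    Returns three parallel lists: node labels, parent indices (-1 for the
    root), and for each node the pre-order index of its last descendant.
    """
    labels, parents, ends = [], [], []

    def visit(node, parent):
        label, children = node
        idx = len(labels)
        labels.append(label)
        parents.append(parent)
        ends.append(idx)
        for child in children:
            visit(child, idx)
        ends[idx] = len(labels) - 1  # scope: idx..ends[idx] is this subtree

    visit(tree, -1)
    return labels, parents, ends


def is_embedded_subtree(pattern, data):
    """Assumed definition: labels match, pattern edges map to
    ancestor-descendant pairs, and sibling order is preserved."""
    p_labels, p_parents, _ = flatten(pattern)
    d_labels, _, d_ends = flatten(data)

    # Previous sibling of each pattern node (-1 if it is a first child).
    prev_sib = [-1] * len(p_labels)
    last_child = {}
    for j, pa in enumerate(p_parents):
        if pa in last_child:
            prev_sib[j] = last_child[pa]
        last_child[pa] = j

    image = [0] * len(p_labels)  # image[j] = data node matched to pattern node j

    def place(j):
        if j == len(p_labels):
            return True
        if j == 0:
            lo, hi = 0, len(d_labels) - 1
        else:
            pa = image[p_parents[j]]
            lo, hi = pa + 1, d_ends[pa]  # must be a descendant of the parent's image
            if prev_sib[j] != -1:
                # must lie to the right of the previous sibling's whole subtree
                lo = d_ends[image[prev_sib[j]]] + 1
        for i in range(lo, hi + 1):
            if d_labels[i] == p_labels[j]:
                image[j] = i
                if place(j + 1):
                    return True
        return False

    return place(0)


# Data tree: a has children b and e; b has children c and d.
data = ("a", [("b", [("c", []), ("d", [])]), ("e", [])])
skips_b = ("a", [("c", []), ("e", [])])   # c is a grandchild of a in the data
wrong_order = ("a", [("e", []), ("c", [])])
```

Under these assumptions, `is_embedded_subtree(skips_b, data)` holds even though `c` is not a direct child of `a`, which is what distinguishes an embedded subtree from an induced one, while `wrong_order` is rejected because it reverses the sibling order.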

    Related items

    Showing items related by title, author, creator and subject.

    • Mining Induced/Embedded Subtrees using the Level of Embedding Constraint
      Tan, H.; Hadzic, Fedja; Dillon, T. (2012)
      The increasing need for representing information through more complex structures where semantics and relationships among data objects can be more easily expressed has resulted in many semi-structured data sources. Structure ...
    • Razor: Mining distance-constrained embedded subtrees
      Tan, H.; Dillon, Tharam S.; Hadzic, Fedja; Chang, Elizabeth (2006)
      Our work is focused on the task of mining frequent subtrees from a database of rooted ordered labelled subtrees. Previously we have developed an efficient algorithm, MB3 [12], for mining frequent embedded subtrees from a ...
    • Tree model guided candidate generation for mining frequent subtrees from XML
      Tan, Henry; Hadzic, Fedja; Dillon, Tharam S.; Chang, Elizabeth; Feng, Ling (2008)
      Due to the inherent flexibilities in both structure and semantics, XML association rules mining faces few challenges, such as: a more complicated hierarchical data structure and ordered data context. Mining frequent ...