MB3-Miner: Efficient mining eMBedded subTREEs using tree model guided candidate generation
dc.contributor.author | Chang, Elizabeth | |
dc.contributor.author | Tan, H. | |
dc.contributor.author | Dillon, Tharam S. | |
dc.contributor.author | Hadzic, Fedja | |
dc.contributor.author | Feng, L. | |
dc.date.accessioned | 2017-01-30T13:37:55Z | |
dc.date.available | 2017-01-30T13:37:55Z | |
dc.date.created | 2008-11-12T23:32:36Z | |
dc.date.issued | 2005 | |
dc.identifier.citation | Chang, Elizabeth and Tan, Henry and Dillon, Tharam S. and Hadzic, Fedja and Feng, Ling. 2005. MB3-Miner: Efficient mining eMBedded subTREEs using tree model guided candidate generation, in Ras, Z.W. and Tsumoto, S. and Zighed, D.A. (ed), First International Workshop on Mining Complex Data (MCD) in conjunction with ICDM'05, Nov 27 2005, pp. 103-110. Houston, Texas, USA: IEEE. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/33568 | |
dc.description.abstract |
Tree mining has many useful applications in areas such as bioinformatics, XML mining, and Web mining. Most of the formally represented information in these domains is in tree-structured form. In this paper we focus on mining frequent embedded subtrees from databases of rooted labelled ordered subtrees. We propose a novel embedding list representation that is suitable for describing embedded subtrees. This representation differs completely from the string-like or conventional adjacency list representations previously used for trees. We present the mathematical model of the breadth-first-search Tree Model Guided (TMG) candidate generation approach previously introduced in [8]. The key characteristic of the TMG approach is that, in contrast to the join approach, it enumerates fewer candidates by ensuring that only valid candidates conforming to the structural aspects of the data are generated. Our experiments with both synthetic and real-life datasets compare the technique against one of the state-of-the-art algorithms, TreeMiner [15], and demonstrate its effectiveness and efficiency. | |
dc.publisher | IEEE | |
dc.subject | embedded subtree | |
dc.subject | tree model guided | |
dc.subject | information systems | |
dc.subject | TMG | |
dc.subject | frequent tree mining | |
dc.subject | treeminer | |
dc.subject | tree mining | |
dc.title | MB3-Miner: Efficient mining eMBedded subTREEs using tree model guided candidate generation | |
dc.type | Conference Paper | |
dcterms.source.startPage | 103 | |
dcterms.source.endPage | 110 | |
dcterms.source.title | Proceedings of the First International Workshop on Mining Complex Data (MCD) | |
dcterms.source.series | Proceedings of the First International Workshop on Mining Complex Data (MCD) | |
dcterms.source.conference | First International Workshop on Mining Complex Data (MCD) in conjunction with ICDM'05 | |
dcterms.source.conference-start-date | Nov 27 2005 | |
dcterms.source.conferencelocation | Houston, Texas, USA | |
dcterms.source.place | USA | |
curtin.note |
Copyright 2005 IEEE | |
curtin.note |
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. | |
curtin.department | Centre for Extended Enterprises and Business Intelligence | |
curtin.identifier | EPR-656 | |
curtin.accessStatus | Open access | |
curtin.faculty | Curtin Business School | |
curtin.faculty | School of Information Systems |