Model guided algorithm for mining unordered embedded subtrees
dc.contributor.author | Hadzic, Fedja | |
dc.contributor.author | Tan, H. | |
dc.contributor.author | Dillon, Tharam S. | |
dc.date.accessioned | 2017-01-30T14:07:41Z | |
dc.date.available | 2017-01-30T14:07:41Z | |
dc.date.created | 2011-03-20T20:01:52Z | |
dc.date.issued | 2010 | |
dc.identifier.citation | Hadzic, Fedja and Tan, Henry and Dillon, Tharam S. 2010. Model guided algorithm for mining unordered embedded subtrees. Web Intelligence and Agent Systems. 8 (4): pp. 413-430. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/37772 | |
dc.identifier.doi | 10.3233/WIA-2010-0200 | |
dc.description.abstract |
Large amount of online information is or can be represented using semi-structured documents, such as XML. The information contained in an XML document can be effectively represented using a rooted ordered labeled tree. This has made the frequent pattern mining problem recast as the frequent subtree mining problem, which is a pre-requisite for association rule mining form tree-structured documents. Driven by different application needs a number of algorithms have been developed for mining of different subtree types under different support definitions. In this paper we present an algorithm for mining unordered embedded subtrees. It is an extension of our general tree model guided (TMG) candidate generation framework and the proposed U3 algorithm considers all support definitions, namely, transaction-based, occurrence-match and hybrid support. A number of experiments are presented on synthetic and real world data sets. The results demonstrate the flexibility of our general TMG framework as well as its efficiency when compared to the existing state-of-the-art approach. | |
dc.publisher | IOS Press | |
dc.subject | data mining | |
dc.subject | Tree mining | |
dc.subject | unordered embedded subtrees | |
dc.subject | canonical form | |
dc.subject | algorithm | |
dc.title | Model guided algorithm for mining unordered embedded subtrees | |
dc.type | Journal Article | |
dcterms.source.volume | 8 | |
dcterms.source.number | 4 | |
dcterms.source.startPage | 413 | |
dcterms.source.endPage | 430 | |
dcterms.source.issn | 15701263 | |
dcterms.source.title | Web Intelligence and Agent Systems | |
curtin.department | Digital Ecosystems and Business Intelligence Institute (DEBII) | |
curtin.accessStatus | Fulltext not available |