A Statistical Interestingness Measures for XML based Association Rules

Mohd Shaharanee, Izwan; Hadzic, Fedja; Dillon, Tharam S.

dc.contributor.author	Mohd Shaharanee, Izwan
dc.contributor.author	Hadzic, Fedja
dc.contributor.author	Dillon, Tharam S.
dc.contributor.editor	Byoung Tak Zhang
dc.contributor.editor	Mehmet A Orgun
dc.date.accessioned	2017-01-30T12:06:26Z
dc.date.available	2017-01-30T12:06:26Z
dc.date.created	2011-03-21T20:01:27Z
dc.date.issued	2010
dc.identifier.citation	Mohd Shaharanee, Izwan Nizal and Hadzic, Fedja and Dillon, Tharam S. 2010. A Statistical Interestingness Measures for XML based Association Rules, in Zhang, B.T. and Orgun, M.A. (ed), Lecture Notes in Computer Science, Volume 6230: Trends in Artificial Intelligence (PRICAI 2010). pp. 194-205. Germany: Springer.
dc.identifier.uri	http://hdl.handle.net/20.500.11937/18190
dc.description.abstract	Recently, mining frequent substructures from XML data has gained a considerable amount of interest. Different methods have been proposed and examined for mining frequent patterns from XML documents efficiently and effectively. While many of the frequent XML patterns generated are useful and interesting, it is common for a large portion of them not to be considered interesting or significant for the application at hand. In this paper, we present a systematic approach to ascertain whether the discovered XML patterns are significant and not just coincidental associations, and provide a precise statistical approach to support this framework. The proposed strategy combines data mining and statistical measurement techniques to discard the non-significant patterns. In this paper, we consider the “Prions” database that describes the protein instances stored for Human Prions Protein. The proposed unified framework is applied to this dataset to demonstrate its effectiveness in assessing the interestingness of discovered XML patterns by statistical means. When the dataset is used for classification/prediction purposes, the proposed approach will discard non-significant XML patterns without the cost of a reduction in the accuracy of the pattern set as a whole.
dc.publisher	Springer
dc.subject	data mining
dc.subject	semi-structured data
dc.subject	statistical analysis
dc.subject	interesting rules
dc.title	A Statistical Interestingness Measures for XML based Association Rules
dc.type	Book Chapter
dcterms.source.startPage	194
dcterms.source.endPage	205
dcterms.source.title	Lecture notes in computer science, volume 6230: trends in artificial intelligence (PRICAI 2010)
dcterms.source.isbn	9783642152450
dcterms.source.place	Heidelberg
dcterms.source.chapter	73
curtin.department	Digital Ecosystems and Business Intelligence Institute (DEBII)
curtin.accessStatus	Fulltext not available

Files in this item

Name: 154697_20951_PUB-CBS-EEB-MC-56 ...
Size: 169.9Kb
Format: PDF

This item appears in the following Collection(s)

Curtin Research Publications
