A Statistical Interestingness Measures for XML based Association Rules
MetadataShow full item record
Recently mining frequent substructures from XML data has gained a considerable amount of interest. Different methods have been proposed and examined for mining frequent patterns from XML documents efficiently and effectively. While many frequent XML patterns generated are useful and interesting, it is common that a large portion of them is not considered as interesting or significant for the application at hand. In this paper, we present a systematic approach to ascertain whether the discovered XML patterns are significant and not just coincidental associations, and provide a precise statistical approach to support this framework. The proposed strategy combines data mining and statistical measurement techniques to discard the non significant patterns. In this paper we considered the “Prions” database that describes the protein instances stored for Human Prions Protein. The proposed unified framework is applied on this dataset to demonstrate its effectiveness in assessing interestingness of discovered XML patterns by statistical means. When the dataset is used for classification/prediction purposes, the proposed approach will discard non significant XML patterns, without the cost of a reduction in the accuracy of the pattern set as a whole.
Showing items related by title, author, creator and subject.
Quality and interestingness of association rules derived from data mining of relational and semi-structured dataMohd Shaharanee, Izwan Nizal (2012)Deriving useful and interesting rules from a data mining system are essential and important tasks. Problems such as the discovery of random and coincidental patterns or patterns with no significant values, and the generation ...
Mohd Shaharanee, I.; Dillon, Tharam S; Hadzic, Fedja (2009)Knowledge acquisition techniques have been well researched in the data mining community. Such techniques, especially when used for unsupervised learning, often generate a large quantity of rules and patterns. While many ...
Modelling the co-occurence of Streptococcus pneumoniae with other bacterial and viral pathogens in the upper respiratory tractJacoby, P.; Watson, K.; Bowman, J.; Taylor, A.; Riley, T.; Smith, D.; Lehmann, Deborah (2007)Go to ScienceDirect® Home Skip Main Navigation Links Brought to you by: The University of Western Australia Library Login: + Register Athens/Institution Login Not Registered? - User Name: Password: ...