Ascertaining data mining rules using statistical approaches
dc.contributor.author | Mohd Shaharanee, I. | |
dc.contributor.author | Dillon, Tharam S | |
dc.contributor.author | Hadzic, Fedja | |
dc.contributor.editor | Parvinder S. Sandhu | |
dc.date.accessioned | 2017-01-30T15:09:17Z | |
dc.date.available | 2017-01-30T15:09:17Z | |
dc.date.created | 2010-03-01T20:02:05Z | |
dc.date.issued | 2009 | |
dc.identifier.citation | Mohd Shaharanee, Izwan Nizal and Dillon, Tharam S. and Hadzic, Fedja. 2009. Ascertaining data mining rules using statistical approaches, in Parvinder S. Sandhu (ed), International Symposium on Computing, Communication and Control (ISCCC 2009), Oct 9 2009, pp. 180-188. Singapore: International Association of Computer Science and Information Technology (IACSIT). | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/43700 | |
dc.description.abstract |
Knowledge acquisition techniques have been well researched in the data mining community. Such techniques, especially when used for unsupervised learning, often generate a large quantity of rules and patterns. While many rules generated are useful and interesting, some information is not captured by those rules, such as already known patterns, coincidental patterns and patterns with no significant value for the real world applications. Sustaining the interestingness of rules generated by data mining algorithm is an active and important area of data mining research. Different methods have been proposed and have been well examined for discovering interestingness in rules. These measures often only reflect the interestingness with respect to the database being observed, and as such the rules will satisfy the constrains with respect to the sample data only, but not with respect to the whole data distribution. Therefore, one can still argue the usefulness of the rules and patterns with respect to their use in practical problems. As the data mining techniques are naturally data driven, it would benefit to affirm the generated hypothesis with a statistical methodology. In our research, we investigate how to combine data mining and statistical measurement techniques to arrive at more reliable and interesting set of rules. Such a combination is greatly essential to conquer the data overload in practical problems. A real world data set is used to explore the ways in which one can measure and verify the usefulness of rules from data mining techniques using statistical analysis. | |
dc.publisher | International Association of Computer Science and Information Technology (IACSIT) | |
dc.subject | Data mining | |
dc.subject | significant rules | |
dc.subject | statistical analysis | |
dc.title | Ascertaining data mining rules using statistical approaches | |
dc.type | Conference Paper | |
dcterms.source.startPage | 180 | |
dcterms.source.endPage | 188 | |
dcterms.source.title | Proceedings of the international symposium on computing, communication and control (ISCCC 2009) | |
dcterms.source.series | Proceedings of the international symposium on computing, communication and control (ISCCC 2009) | |
dcterms.source.isbn | 9789810838157 | |
dcterms.source.conference | International Symposium on Computing, Communication and Control (ISCCC 2009) | |
dcterms.source.conference-start-date | Oct 9 2009 | |
dcterms.source.conferencelocation | Singapore | |
dcterms.source.place | Singapore | |
curtin.department | Digital Ecosystems and Business Intelligence Institute (DEBII) | |
curtin.accessStatus | Fulltext not available | |
curtin.faculty | Curtin Business School | |
curtin.faculty | The Digital Ecosystems and Business Intelligence Institute (DEBII) |