Show simple item record

dc.contributor.authorHadzic, Fedja
dc.contributor.authorHecker, Michael
dc.contributor.editorMohand-Saïd Hacid
dc.identifier.citationHadzic, F. and Hecker, M. 2011. Alternative approach to tree-structured web log representation and mining, in Mohand-Saïd Hacid (ed), IEEE/WIC/ACM International Conference on Web Intelligence (WI 2011), Aug 22 2011. Lyon, France: IEEE Computer Society

More recent approaches to web log data representation aim to capture the user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files which is the focus of this work. Most existing methods for analyzing such data are based on the use of frequent subtree mining techniques to extract frequent user activity and navigational paths. In this paper we evaluate the use of other standard data mining techniques enabled by a recently proposed structure preserving flat data representation for tree-structured data. The initially proposed framework was adjusted to better suit the web log mining task. Experimental evaluation is performed on two real world web log datasets and comparisons are made with an existing state-of-the art classifier for tree-structured data. The results show the great potential of the method in enabling the application of a wider range of data mining/analysis techniques to tree-structured web log data.

dc.publisherIEEE Computer Society
dc.subjecttree-structured web logs
dc.subjectweb usage mining
dc.titleAlternative approach to tree-structured web log representation and mining
dc.typeConference Paper
dcterms.source.titleProceedings of the IEEE/WIC/ACM international conference on web intelligence (WI 2011)
dcterms.source.seriesProceedings of the IEEE/WIC/ACM international conference on web intelligence (WI 2011)
dcterms.source.conferenceIEEE/WIC/ACM International Conference on Web Intelligence (WI 2011)
dcterms.source.conference-start-dateAug 22 2011
dcterms.source.conferencelocationLyon, France

© 2011 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

curtin.departmentDigital Ecosystems and Business Intelligence Institute (DEBII)
curtin.accessStatusOpen access

Files in this item


This item appears in the following Collection(s)

Show simple item record