Characterisation of web spambots using self organising maps
dc.contributor.author | Hayati, Pedram | |
dc.contributor.author | Potdar, Vidyasagar | |
dc.contributor.author | Talevski, Alex | |
dc.contributor.author | Chai, Kevin | |
dc.date.accessioned | 2017-01-30T15:14:35Z | |
dc.date.available | 2017-01-30T15:14:35Z | |
dc.date.created | 2012-03-12T20:01:05Z | |
dc.date.issued | 2011 | |
dc.identifier.citation | Hayati, Pedram and Potdar, Vidyasagar and Talevski, Alex and Chai, Kevin. 2011. Characterisation of web spambots using self organising maps. Computer Systems Science and Engineering. 26 (2): pp. 87-96. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/44532 | |
dc.description.abstract |
The growth of spam in Web 2.0 environments not only reduces the quality and trust of the content but it also degrades the quality of search engine results. By means of web spambots, spammers are able to distribute spam content more efficiently to more targeted websites. Current anti-spam filtering solutions have not studied web spambots thoroughly and the characterisation of spambots remains an open area of research. In order to fill this research gap, this paper utilises Kohonen’s Self-Organising Map (SOM) to characterise web spambots. We analyse web usage data to profile web spambots based on three novel set of features i.e. action set, action frequency and action time. Our experimental results uncovered important characteristics of web spambots that 1) they focus on specific and limited actions compared with humans 2) they use multiple user accounts to spread spam content, hide their identity and bypass restrictions, 3) they bypass filling in submission forms and directly submit the content to the Web server in order to efficiently spread spam, 4) they can be categorise into 4 different categories based on their actions – content submitters, profile editors, content viewers and mixed behaviour, 5) they change their IP address based on different action to hide their tracks. Our results are promising and they suggest that our technique is capable of identifying spam in Web 2.0 applications. | |
dc.publisher | C R L publishing Ltd | |
dc.subject | spam 2.0 | |
dc.subject | behaviour analysis | |
dc.subject | web spambots | |
dc.subject | web usage mining | |
dc.subject | spam detection | |
dc.title | Characterisation of web spambots using self organising maps | |
dc.type | Journal Article | |
dcterms.source.volume | 26 | |
dcterms.source.number | 2 | |
dcterms.source.startPage | 87 | |
dcterms.source.endPage | 96 | |
dcterms.source.issn | 02676192 | |
dcterms.source.title | Computer Systems Science and Engineering | |
curtin.department | Digital Ecosystems and Business Intelligence Institute (DEBII) | |
curtin.accessStatus | Fulltext not available |