Storage cost of spam 2.0 in a web discussion forum
MetadataShow full item record
This paper presents an empirical research that identifies cost of Spam 2.0. This experiment is a part of ongoing research for identifying the cost of Spam 2.0 and focuses only on storage cost. The data is collected via a honeypot setup using a discussion forum for a period of 13 months. Forum provides a good place for the spammers to continue their spamming activities. Spamming give both direct and indirect cost towards forum owner and forum users. In this paper, we present a method to measure direct cost focusing only on storage cost. The main observation of the experiment is done towards 450,772 posts, 141 personal messages and 62,798 profiles. It uses 2.69 GB storage space. We first define our cost formula. We then set up a web based discussion forum and collect the information posted on the forum. This data is pre-processed to discover information that can be used in our formula. In order to identify the storage used for spam, we define related attributes based on maximum storage and impact factor features named as spam unit, and measure the storage taken by all these spam units. We evaluate the cost of storage based on three sources which are our real self-hosted server, commercial web hosting package and cloud hosting package. The experiment resulted that the storage cost for our research forum are AUD 23.66 based on self-hosted server, AUD133.90 for commercial web hosting, and AUD11.53 for cloud hosting. The highest storage cost for 10,000 spam posts, profiles and personal messages is AUD2.963, AUD0.068 and AUD0.056.
Showing items related by title, author, creator and subject.
Potdar, Vidyasagar; Firoozeh, N.; Ridzuan, Farida; Like, Y.; Mukhopadhyay, D.; Tejani, D. (2012)Spam 2.0 (or Web 2.0 Spam) is referred to as spam content that is hosted on Web 2.0 applications (blogs, forums, social networks etc.). Such spam differs from traditional spam as this is targeted at Web 2.0 applications ...
Goh, Kwang Leng (2013)Web spamming has tremendously subverted the ranking mechanism of information retrieval in Web search engines. It manipulates data source maliciously either by contents or links with the intention of contributing negative ...
Hayati, Pedram (2011)New Internet collaborative media introduce new ways of communicating that are not immune to abuse. A fake eye-catching profile in social networking websites, a promotional review, a response to a thread in online forums ...