Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    Link-based web spam detection using weight properties

    Access Status
    Fulltext not available
    Authors
    Goh, K.
    Patchmuthu, Ravi Kumar
    Singh, Ashutosh Kumar
    Date
    2014
    Type
    Journal Article
    
    Metadata
    Show full item record
    Citation
    Goh, K. and Patchmuthu, R.K. and Singh, A.K. 2014. Link-based web spam detection using weight properties. Journal of Intelligent Information Systems. 43 (1): pp. 129-145.
    Source Title
    Journal of Intelligent Information Systems
    DOI
    10.1007/s10844-014-0310-y
    ISSN
    0925-9902
    School
    Curtin Sarawak
    URI
    http://hdl.handle.net/20.500.11937/28483
    Collection
    • Curtin Research Publications
    Abstract

    Link spam is created with the intention of boosting one target’s rank in exchange of business profit. This unethical way of deceiving Web search engines is known as Web spam. Since then many anti-link spam detection techniques have constantly being proposed. Web spam detection is a crucial task due to its devastation towards Web search engines and global cost of billion dollars annually. In this paper, we proposed a novel technique by incorporating weight properties to enhance the Web spam detection algorithms. Weight properties can be defined as the influences of one Web node towards another Web node. We modified existing Web spam detection algorithms with our novel technique to evaluate the performances on a large public Web spam dataset – WEBSPAM-UK2007. The overall performance have shown that the modified algorithms outperform the benchmark algorithms up to 30.5 % improvement at host level and 6.11 % improvement at page level.

    Related items

    Showing items related by title, author, creator and subject.

    • Methods for demoting and detecting Web spam
      Goh, Kwang Leng (2013)
      Web spamming has tremendously subverted the ranking mechanism of information retrieval in Web search engines. It manipulates data source maliciously either by contents or links with the intention of contributing negative ...
    • Addressing the new generation of spam (Spam 2.0) through Web usage models
      Hayati, Pedram (2011)
      New Internet collaborative media introduce new ways of communicating that are not immune to abuse. A fake eye-catching profile in social networking websites, a promotional review, a response to a thread in online forums ...
    • The changing nature of spam 2.0
      Potdar, Vidyasagar; Firoozeh, N.; Ridzuan, Farida; Like, Y.; Mukhopadhyay, D.; Tejani, D. (2012)
      Spam 2.0 (or Web 2.0 Spam) is referred to as spam content that is hosted on Web 2.0 applications (blogs, forums, social networks etc.). Such spam differs from traditional spam as this is targeted at Web 2.0 applications ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.