Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Theses
    • View Item
    • espace Home
    • espace
    • Curtin Theses
    • View Item

    A machine learning-based approach for automated quality assessment of user generated content in web forums

    169169_ChaiKevin2011_PhD_Thesis.pdf (2.957Mb)
    Access Status
    Open access
    Authors
    Chai, Kevin Eng Kwong
    Date
    2011
    Supervisor
    Dr. Vidyasagar Potdar
    Type
    Thesis
    Award
    PhD
    
    Metadata
    Show full item record
    School
    Digital Ecosystems and Business Intelligence Institute
    URI
    http://hdl.handle.net/20.500.11937/107
    Collection
    • Curtin Theses
    Abstract

    Web 2.0 platforms such as forums, blogs and wikis allow users from its community to contribute content. However, users often received little if any professional training in content creation and content is commonly published without peer review. Excessive low quality user contributions can lead to information overload, which describes the situation when a user feels overwhelmed with unwanted information. Information overload can cause users to withdraw from using a website therefore decreasing a website's overall sustainability through the loss of users from its community.Many Web 2.0 websites have relied on its users to manually rate the quality of User Generated Content (UGC) to deal with this problem. However, the major problems with this approach is that rating is voluntary so a large percentage of content often receives a lack of rating and UGC is often created at a faster rate than which it can be sufficiently rated. Therefore, automated content quality assessment models are required to address the problems caused by manual user rating.A number of automated models have been proposed in recent years for Web 2.0 platforms. However, we identified many limitations with these existing models in our literature review. For example, the majority of models are only suitable for a specific language such as English and have not effectively considered how content is used by the user community in the assessment process. Therefore, we propose a novel and language independent model that evaluates content, usage, reputation, temporal and structural dimensions of UGC for quality assessment to address these limitations..We developed our model using Web technologies and a supervised machine learning approach. More specifically, we employed a rule learner, a fuzzy logic classifier and Support Vector Machines. We validated our model on three operational Web forums and outperformed existing models in the literature in our experiments. We used the Friedman Test and Nemenyi test to verify our results and discovered that the performance improvements generated by our model are statistically significant over the existing models.

    Related items

    Showing items related by title, author, creator and subject.

    • Assessing Post Usage for Measuring the Quality of Forum Posts
      Chai, Kevin; Hayati, Pedram; Potdar, Vidyasagar; Wu, Chen; Talevski, Alex (2010)
      It has become difficult to discover quality content within forums websites due to the increasing amount of UserGenerated Content (UGC) on the Web. Many existing websites have relied on their users to explicitly rate content ...
    • Automatically measuring the quality of user generated content in forums
      Chai, Kevin; Wu, Chen; Potdar, Vidyasagar; Hayati, Pedram (2011)
      The amount of user generated content on the Web is growing and identifying high quality content in a timely manner has become a problem. Many forums rely on its users to manually rate content quality but this often results ...
    • Chemical machining of advanced ceramics
      Ting, Huey Tze (2013)
      Not until recently did we see an enormous surge of interest in the study of machining of advanced ceramics. This has resulted in significant advances lately in their development and usage. Machinable glass ceramics, ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.