    espace - Curtin’s institutional repository

    A survey in semantic web technologies-inspired focused crawlers

    116133_A%20survey%20in%20semantic%20Web04746736.pdf (125.9Kb)
    Access Status
    Open access
    Authors
    Dong, Hai
    Hussain, Farookh Khadeer
    Chang, Elizabeth
    Date
    2008
    Type
    Conference Paper
    
    Citation
    Dong, Hai and Hussain, Farookh Khadeer and Chang, Elizabeth. 2008. A survey in semantic web technologies-inspired focused crawlers, in Shoniregun, C.A. (ed), Third IEEE International Conference on Digital Information Management, Nov 13 2008, pp. 934-936. London, UK: Institute of Electrical and Electronics Engineers (IEEE).
    Source Title
    Proceedings of the 3rd IEEE international conference on digital information management (ICDIM 2008)
    Source Conference
    3rd IEEE International Conference on Digital Information Management (ICDIM 2008)
    DOI
    10.1109/ICDIM.2008.4746736
    ISBN
    9781424429165
    Faculty
    Curtin Business School
    School of Information Systems
    School
    Centre for Extended Enterprises and Business Intelligence
    Remarks

    Copyright © 2008 IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

    URI
    http://hdl.handle.net/20.500.11937/7518
    Collection
    • Curtin Research Publications
    Abstract

    Crawlers are software programs that traverse the internet and retrieve webpages by following hyperlinks. In the face of the inundation of spam websites, traditional web crawlers cannot function well. Semantic focused crawlers utilize semantic web technologies to analyze the semantics of hyperlinks and web documents. This paper briefly reviews recent studies on one category of semantic focused crawlers: ontology-based focused crawlers, a series of crawlers that utilize ontologies to link the fetched web documents with ontological concepts (topics). The purpose of this linking is to organize and categorize web documents, or to filter out webpages that are irrelevant to the topics. A brief comparison is made among these crawlers from six perspectives: domain, working environment, special functions, technologies utilized, evaluation metrics and evaluation results. The conclusion drawn from this comparison is given in the final section.
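
The general idea described in the abstract (follow hyperlinks, link each fetched document to ontological concepts, and filter out irrelevant pages) can be illustrated with a minimal sketch. This is not taken from any of the surveyed crawlers: the toy `ONTOLOGY`, the in-memory `WEB`, the helper names `match_concepts` and `focused_crawl`, and the keyword-overlap threshold `min_hits` are all assumptions made for illustration. A real system would fetch pages over HTTP and use an actual ontology language rather than plain term sets.

```python
from collections import deque

# Hypothetical toy ontology: concept (topic) -> terms that indicate it.
ONTOLOGY = {
    "crawler": {"crawler", "spider", "hyperlink"},
    "ontology": {"ontology", "concept", "semantic"},
}

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
WEB = {
    "a": ("a semantic crawler follows each hyperlink", ["b", "c"]),
    "b": ("buy cheap watches now", ["d"]),
    "c": ("an ontology defines each concept", ["d"]),
    "d": ("semantic ontology concept list", []),
}


def match_concepts(text, ontology, min_hits=2):
    """Return the concepts whose term set shares at least min_hits words with text."""
    words = set(text.lower().split())
    return {c for c, terms in ontology.items() if len(words & terms) >= min_hits}


def focused_crawl(seed, web, ontology):
    """Breadth-first crawl that only indexes and expands pages matching a concept."""
    seen, queue, index = {seed}, deque([seed]), {}
    while queue:
        url = queue.popleft()
        text, links = web[url]
        concepts = match_concepts(text, ontology)
        if not concepts:
            continue  # irrelevant page: not indexed, its links are not followed
        index[url] = concepts
        for link in links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return index


if __name__ == "__main__":
    for url, concepts in focused_crawl("a", WEB, ONTOLOGY).items():
        print(url, sorted(concepts))
```

In this toy run, page `b` matches no concept, so it is dropped and its links are never expanded, while `d` is still reached through the relevant page `c`. Simple word overlap stands in here for the semantic analysis the abstract refers to.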

    Related items

    Showing items related by title, author, creator and subject.

    • State of the art in semantic focused crawlers
      Dong, Hai; Hussain, Farookh Khadeer; Chang, Elizabeth (2009)
      Nowadays, the research of focused crawler approaches the field of semantic web, along with the appearance of increasing semantic web documents and the rapid development of ontology mark-up languages. Semantic focused ...
    • SOF: a semi-supervised ontology - learning - based focused crawler
      Dong, Hai; Hussain, Farookh (2013)
      The rapid increase in the volume of data available on the Internet makes it increasingly impractical for a crawler to index the whole Web. Instead, many intelligent crawlers, known as ontology-based semantic focused ...
    • State of the art in metadata abstraction crawlers
      Dong, Hai; Hussain, Farookh Khadeer; Chang, Elizabeth (2008)
      Nowadays, the research of crawlers moves closer to the semantic web, along with the appearance of increasing XML/RDF/OWL files and the rapid development of ontology mark-up languages. As an emerging concept, metadata ...
