Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    State of the art in metadata abstraction crawlers

    116038_State%20of%20the%20art%2004608573.pdf (215.7Kb)
    Access Status
    Open access
    Authors
    Dong, Hai
    Hussain, Farookh Khadeer
    Chang, Elizabeth
    Date
    2008
    Type
    Conference Paper
    
    Metadata
    Show full item record
    Citation
    Dong, Hai and Hussain, Farookh and Chang, Elizabeth. 2008. State of the art in metadata abstraction crawlers, in Wo, H. and Xie, H. (ed), IEEE International Conference on Industrial Technologies, Apr 21 2008, pp. 1-6, Chengdu, China: Institute of Electrical and Electronics Engineers (IEEE).
    Source Title
    Proceedings of the IEEE international conference on industrial technologies (ICIT 2008)
    Source Conference
    IEEE International Conference on Industrial Technologies (ICIT 2008)
    DOI
    10.1109/ICIT.2008.4608573
    ISBN
    9781424417056
    Faculty
    Curtin Business School
    School of Economics and Finance
    School
    Centre for Extended Enterprises and Business Intelligence
    Remarks

    Copyright © 2008 IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

    URI
    http://hdl.handle.net/20.500.11937/48258
    Collection
    • Curtin Research Publications
    Abstract

    Nowadays, the research of crawlers moves closer to the semantic web, along with the appearance of increasing XML/RDF/OWL files and the rapid development of ontology mark-up languages. As an emerging concept, metadata abstraction crawlers are a series of crawlers that aim to abstract metadata from normal HTML documents, based on various semantic web technologies. In this paper, we make a general survey of the current situation of metadata abstraction crawlers. Fourteen cases in this field are chosen as typical examples, and classified in five clusters. From seven perspectives we horizontally compare and contrast the semantic web crawlers in each cluster, and draw our conclusion in the final section.

    Related items

    Showing items related by title, author, creator and subject.

    • State of the art in semantic focused crawlers
      Dong, Hai; Hussain, Farookh Khadeer; Chang, Elizabeth (2009)
      Nowadays, the research of focused crawler approaches the field of semantic web, along with the appearance of increasing semantic web documents and the rapid development of ontology mark-up languages. Semantic focused ...
    • A semantic crawler based on an extended CBR algorithm
      Dong, Hai; Hussain, Farookh Khadeer; Chang, Elizabeth (2008)
      A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster ...
    • Self-adaptive semantic focused crawler for mining services information discovery
      Dong, Hai; Hussain, F. (2014)
      It is well recognized that the Internet has become the largest marketplace in the world, and online advertising is very popular with numerous industries, including the traditional mining service industry where mining ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.