A semantic crawler based on an extended CBR algorithm

Dong, Hai; Hussain, Farookh Khadeer; Chang, Elizabeth

doi:10.1007/978-3-540-88875-8_135

116131_7956_PUB-CBS-EEB-MC-45228.pdf (3.569Mb)

Access Status

Open access

Authors

Dong, Hai

Hussain, Farookh Khadeer

Chang, Elizabeth

Date

2008

Type

Conference Paper

Metadata

Show full item record

Citation

Dong, Hai and Hussain, Farookh Khadeer and Chang, Elizabeth. 2008. A semantic crawler based on an extended CBR algorithm, in Meersman, R. and Tari, Z. and Herrero, P. (ed), 4th IFIP Workshop on Semantic Web & Web Semantics, Nov 10 2008, pp. 1076-1085. Monterrey, Mexico: Springer.

Source Title

Lecture notes in computer science LNCS 5333

Source Conference

4th IFIP Workshop on Semantic Web & Web Semantics (SWSS 2008)

DOI

10.1007/978-3-540-88875-8_135

ISBN

9783540888741

Faculty

Curtin Business School

School of Information Systems

School

Centre for Extended Enterprises and Business Intelligence

Remarks

The original publication is available at http://www.springerlink.com

URI

http://hdl.handle.net/20.500.11937/36321

Collection

Curtin Research Publications

Abstract

A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster the metadata by associating them with ontological concepts. The clustering is based on a CBR algorithm which is adopted in the field of problem solving. We reveal the technical details with regard to ontological concept and metadata format, and the extended CBR algorithm. In addition, the system implementation and evaluation details are provided in detail, finalized by our conclusion and further works.