Design and Construction of Semantic Document Networks Using Concept Extraction

Boese, S.; Reiners, Torsten; Wood, Lincoln

187321_65599_wi2011paper.pdf (265.3Kb)

Access Status

Open access

Authors

Boese, S.

Reiners, Torsten

Wood, Lincoln

Date

2012

Type

Working Paper

Metadata

Show full item record

Citation

Boese, Simon and Reiners, Torsten and Wood, Lincoln C. 2012. Design and Construction of Semantic Document Networks Using Concept Extraction, School of Information Systems Working Paper Series, Curtin University of Technology, School of Information Systems.

Remarks

URI

http://hdl.handle.net/20.500.11937/38005

Collection

Curtin Research Publications

Abstract

Processing of unstructured documents according to their content is required in many disciplines; e.g., machine translation, text analysis and mining, and information extraction and retrieval. Whilst research in fields like text analysis, conceptualisation, or design of semantic networks progressed crucially over the last years, we still observe gaps between state-of-the-art algorithms to extract concepts from documents and how these concepts are linked effective and efficiently. This paper proposes a framework to store processed documents in a specialised semantic network database to enhance retrieval and analysis of common concepts in documents. We apply natural language reduction to calculate semantic cores for the concept-based indexing of stored documents. The developed prototype demonstrates an advanced document storage as well as a fast (semantical) retrieval of documents based on given key concepts.