Design and Construction of Semantic Document Networks Using Concept Extraction
Access Status
Authors
Date
2012Type
Metadata
Show full item recordCitation
Remarks
Copyright © 2012 Curtin University of Technology
Collection
Abstract
Processing of unstructured documents according to their content is required in many disciplines; e.g., machine translation, text analysis and mining, and information extraction and retrieval. Whilst research in fields like text analysis, conceptualisation, or design of semantic networks progressed crucially over the last years, we still observe gaps between state-of-the-art algorithms to extract concepts from documents and how these concepts are linked effective and efficiently. This paper proposes a framework to store processed documents in a specialised semantic network database to enhance retrieval and analysis of common concepts in documents. We apply natural language reduction to calculate semantic cores for the concept-based indexing of stored documents. The developed prototype demonstrates an advanced document storage as well as a fast (semantical) retrieval of documents based on given key concepts.
Related items
Showing items related by title, author, creator and subject.
-
Boese, S.; Reiners, Torsten; Wood, Lincoln (2014)There are many unstructured documents created in many disciplines which need to be (pre-) processed in one way or another for further integration and use in IT systems. The predominance of the Internet and large corporate ...
-
Zhu, Dengya (2007)With the exponential growth of the Web and the inherent polysemy and synonymy problems of the natural languages, search engines are facing many challenges such as information overload, mismatch of search results, missing ...
-
Dong, Hai; Hussain, Farookh Khadeer; Chang, Elizabeth (2008)Crawlers are software which can traverse the internet and retrieve webpages by hyperlinks. In theface of the inundant spam websites, traditional web crawlers cannot function well to solve this problem.Semantic focused ...