Improving the relevance of search results: search-term disambiguation and ontological filtering
Date
2009
Abstract
The synonymy and polysemy of natural languages, together with information overload, are two main factors that affect the relevance of Web hits. When users submit a query, search engines usually return a long list of hits that match the query only syntactically. Users are then left to find a needle in a haystack: the few relevant items in a long list of hits. This book proposes an improved strategy for increasing the relevance of Web search results via search-term disambiguation and ontological filtering. The semantic characteristics of each ontology category are represented by a category-document, and the similarity between each category-document and each search result is evaluated using a Vector Space Model. Users then choose a category to obtain only the search results classified under it. Experimental data show that the approach improves the precision of Web hits by more than 20%. The book should help shed some light on Web searching and word sense disambiguation, and should be useful to students and researchers in the fields of information retrieval, text classification, and data mining, or to anyone else interested in Web searching.
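The filtering step described in the abstract can be illustrated with a minimal sketch. It is not the book's implementation: the abstract does not say how terms are weighted or how similarity is scored, so the sketch assumes TF-IDF weights with cosine similarity, a common Vector Space Model setup. The category names, category-documents, search results, and function names are all hypothetical examples for the ambiguous query "jaguar".

```python
import math
from collections import Counter

def tokenize(text):
    # Lowercase, split on whitespace, keep purely alphabetic tokens.
    return [t for t in text.lower().split() if t.isalpha()]

def tfidf_vectors(docs):
    # One sparse TF-IDF vector (dict) per tokenized document.
    # Assumed weighting: raw term count times a smoothed IDF.
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return [
        {t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in Counter(doc).items()}
        for doc in docs
    ]

def cosine(u, v):
    # Cosine similarity between two sparse vectors; 0.0 if either is empty.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def classify(results, category_docs):
    # Label each search result with its most similar category-document.
    names = list(category_docs)
    docs = [tokenize(category_docs[n]) for n in names] + [tokenize(r) for r in results]
    vecs = tfidf_vectors(docs)
    cat_vecs, res_vecs = vecs[:len(names)], vecs[len(names):]
    labels = []
    for rv in res_vecs:
        scores = [cosine(rv, cv) for cv in cat_vecs]
        labels.append(names[max(range(len(names)), key=scores.__getitem__)])
    return labels

def filter_by_category(results, category_docs, chosen):
    # Keep only the results classified under the category the user chose.
    labels = classify(results, category_docs)
    return [r for r, label in zip(results, labels) if label == chosen]

# Hypothetical category-documents and search results (not from the book).
category_docs = {
    "Animals": "jaguar cat wildlife predator rainforest habitat species",
    "Automobiles": "jaguar car vehicle engine sedan dealer luxury",
}
results = [
    "jaguar luxury sedan with a new engine",
    "the jaguar is a predator of the rainforest",
    "find a jaguar car dealer near you",
]

print(filter_by_category(results, category_docs, "Animals"))
# → ['the jaguar is a predator of the rainforest']
```

In this toy example the shared term "jaguar" carries little weight because it appears in every document, so the classification is driven by the terms that distinguish one sense of the query from the other.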
Related items
Showing items related by title, author, creator and subject.
- Zhu, Dengya (2007) With the exponential growth of the Web and the inherent polysemy and synonymy problems of the natural languages, search engines are facing many challenges such as information overload, mismatch of search results, missing ...
- Zhu, Dengya (2010) Web search results are far from perfect due to the polysemous and synonymous characteristics of natural languages, information overload as a result of the information explosion on the Web, and the flat list, “one size fits ...
- Zhu, Dengya; Dreher, Heinz (2012) Short-term queries preferred by most users often result in a list of Web search results with low precision from a user perspective. The purpose of this research is to improve the relevance of Web search results via ...