Abstract:
Typical semantic-based search systems resolve semantic heterogeneity by augmenting keywords
through domain ontology. They consider individual keywords i.e. either concepts or relationships
of ontology, but ignore the semantic relationships that exist between keywords. Therefore, to
answer complex queries accurately is not possible even augmenting the query’s keyword with
different semantic relationships. To find the right document is only possible, if a system knows
the meanings of the concepts and relationships that exist among the concepts. The proposed system
takes concepts as well as the relationship that exists among them for considering the context. The
system performed searching by matching RDF triples rather than individual keywords. The
documents are ranked according to their relevance score of triples.
To validate the proposed semantic similarity measure, a prototype system has been
implemented. The proposed semantic similarity measure uses both the structure of ontology and
statistical information content to compute the semantic similarity. By combining a taxonomic
structure with empirical probability estimates, it provides a way of adapting a static knowledge
structure to multiple contexts. Through RDF triple matching, we have computed context based
information retrieval. The proposed system has been evaluated by repeating Charles and Miller
experiment and by comparing the proposed measure with several other similarity measures.
Experimental results demonstrate better performance over up-to-date similarity measures. We
have also evaluated our measure using Pilot Short Text Semantic Similarity Benchmark Data Set
(STASIS) and we have obtained 85% correlation with STASIS. In future, we intend to consider
the most appropriate sense of a concept to further improve its accuracy.