Autores
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka
Fecha de publicación
2010/9/23
Revista
IEEE Transactions on knowledge and Data Engineering
Volumen
23
Número
7
Páginas
977-990
Editor
IEEE
Descripción
Measuring the semantic similarity between words is an important component in various tasks on the web such as relation extraction, community mining, document clustering, and automatic metadata extraction. Despite the usefulness of semantic similarity measures in these applications, accurately measuring semantic similarity between two words (or entities) remains a challenging task. We propose an empirical method to estimate semantic similarity using page counts and text snippets retrieved from a web search engine for two words. Specifically, we define various word co-occurrence measures using page counts and integrate those with lexical patterns extracted from text snippets. To identify the numerous semantic relations that exist between two given words, we propose a novel pattern extraction algorithm and a pattern clustering algorithm. The optimal combination of page counts-based co-occurrence …
Citas totales
201020112012201320142015201620172018201920202021131938333129191516139
Artículos de Google Académico
D Bollegala, Y Matsuo, M Ishizuka - IEEE Transactions on knowledge and Data …, 2010