2013 MiningEvidencesforNamedEntityDi

Jump to: navigation, search

Subject Headings:


Cited By


Author Keywords


Named entity disambiguation is the task of disambiguating named entity mentions in natural language text and link them to their corresponding entries in a knowledge base such as Wikipedia. Such disambiguation can help enhance readability and add semantics to plain text. It is also a central step in constructing high-quality information network or knowledge graph from unstructured text. Previous research has tackled this problem by making use of various textual and structural features from a knowledge base. Most of the proposed algorithms assume that a knowledge base can provide enough explicit and useful information to help disambiguate a mention to the right entity. However, the existing knowledge bases are rarely complete (likely will never be), thus leading to poor performance on short queries with not well-known contexts. In such cases, we need to collect additional evidences scattered in internal and external corpus to augment the knowledge bases and enhance their disambiguation power. In this work, we propose a generative model and an incremental algorithm to automatically mine useful evidences across documents. With a specific modeling of “background topic” and “unknown entities", our model is able to harvest useful evidences out of noisy information. Experimental results show that our proposed method outperforms the state-of-the-art approaches significantly: boosting the disambiguation accuracy from 43% (baseline) to 86% on short queries derived from tweets.



 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2013 MiningEvidencesforNamedEntityDiYang Li
Chi Wang
Fangqiu Han
Jiawei Han
Dan Roth
Xifeng Yan
Mining Evidences for Named Entity Disambiguation10.1145/2487575.24876812013