The SemTag system was an entity mention disambiguation system that used the Stanford TAP database.
- (Kulkarni et al., 2009) ⇒ Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, Soumen Chakrabarti. (2009). “Collective Annotation of Wikipedia Entities in Web Text.” In: Proceedings of ACM SIGKDD Conference (KDD-2009). doi:10.1145/1557019.1557073.
- The first Web-scale entity disambiguation system was SemTag . SemTag annotated about 250 million Web pages with IDs from the Stanford TAP entity catalog . The basic technique was to compare the surrounding context of a spot s with text metadata associated with candidate entity γ in TAP. SemTag preferred high precision over recall, proposing only about 450 million annotations, i.e., fewer than two annotations per page on average.
- (Dill et al., 2003a) ⇒ Stephen Dill, Nadav Eiron, David Gibson, Daniel Gruhl, Ramanathan V. Guha, Anant Jhingran, Tapas Kanungo, Sridhar Rajagopalan, Andrew Tomkins, John A. Tomlin, and Jason Y. Zien. (2003). “SemTag and Seeker: Bootstrapping the semantic web via automated semantic annotation.” In: Proceedings of the 12th International Conference on World Wide Web (WWW 2003). doi:10.1145/775152.775178