Entity Data Base
(Redirected from entity dataset)
		
		
		
		Jump to navigation
		Jump to search
		An Entity Data Base is a concept data base of entity records.
- AKA: Entity Catalog.
 - Context:
- It can be used by a Dictionary-based Algorithm to an Entity Mention Recognition Task.
 - It can be an input to an Entity Record Resolution Task.
 - …
 
 - Example(s):
- A Customer Database.
 - A Product Database.
 - The Gene Ontology Database.
 - The Rulers.org Database.
 - The NCBI Taxonomy Database.
 - an Audio Item Dataset.
 - a Video Item Dataset.
 
 - See: Ontology, Word Sense Inventory, Entity Record Set.
 
References
2009
- (Kulkarni et al., 2009) ⇒ Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, Soumen Chakrabarti. (2009). “Collective Annotation of Wikipedia Entities in Web Text.” In: Proceedings of ACM SIGKDD Conference (KDD-2009). doi:10.1145/1557019.1557073.
- QUOTE: To take the first step beyond keyword-based search toward entity-based search, suitable token spans ("spots") on documents must be identified as references to real-world entities from an entity catalog.
 
 
2004
- (Cohen & Sarawagi, 2004) ⇒ William W. Cohen, Sunita Sarawagi. (2004). “Exploiting Dictionaries in Named Entity Extraction: Combining semi-Markov extraction processes and data integration methods.” In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2004) doi:10.1145/1014052.1014065
- QUOTE: We consider the problem of improving named entity recognition (NER) systems by using external dictionaries --- more specifically, the problem of extending state-of-the-art NER systems by incorporating information about the similarity of extracted entities to entities in an external dictionary.