2003 SemanticAnnotationIndexingRetrieval

(Kiryakov et al., 2003) ⇒ Atanas Kiryakov, Borislav Popov, Ivan Terziev, Dimitar Manov, Damyan Ognyanoff. (2003). “Semantic Annotation, Indexing, and Retrieval.” In: Proceedings of the 2nd International Semantic Web Conference (ISWC 2003). doi:10.1016/j.websem.2004.07.005

Subject Headings: KIM System, Semantic Annotation Task, Semantic Metadata, OntoText Lab.

Notes

It becomes the Journal Paper (Kiryakov et al., 2004)

Cited By

~449 http://scholar.google.com/scholar?q=%22Semantic+Annotation%2C+Indexing%2C+and+Retrieval.%22+2003

2003

(Popov et al., 2003) ⇒ Borislav Popov, Atanas Kiryakov, Damyan Ognyanoff, Dimitar Manov, Angel Kirilov, and Damyan Goranov, (2003). “KIM - Semantic Annotation Platform.” In: Proceedings of the 2nd International Semantic Web Conference.

Quotes

Abstract

The Semantic Web realization depends on the availability of a critical mass of metadata for the web content, associated with the respective formal knowledge about the world. We claim that the Semantic Web, at its current stage of development, is in a state of a critical need of metadata generation and usage schemata that are specific, well-defined and easy to understand. This paper introduces our vision for a holistic architecture for semantic annotation, indexing, and retrieval of documents with regard to extensive semantic repositories. A system (called KIM), implementing this concept, is presented in brief and it is used for the purposes of evaluation and demonstration.

A particular schema for semantic annotation with respect to real-world entities is proposed. The underlying philosophy is that a practical semantic annotation is impossible without some particular knowledge modelling commitments. Our understanding is that a system for such semantic annotation should be based upon a simple model of real-world entity classes, complemented with extensive instance knowledge. To ensure the efficiency, ease of sharing, and reusability of the metadata, we introduce an upper-level ontology (of about 250 classes and 100 properties), which starts with some basic philosophical distinctions and then goes down to the most common entity types (people, companies, cities, etc.). Thus it encodes many of the domain-independent commonsense concepts and allows straightforward domain-specific extensions. On the basis of the ontology, a large-scale knowledge base of entity descriptions is bootstrapped, and further extended and maintained. Currently, the knowledge bases usually scales between 105 and 106 descriptions.

Finally, this paper presents a semantically enhanced information extraction system, which provides automatic semantic annotation with references to classes in the ontology and to instances. The system has been running over a continuously growing document collection (currently about 0.5 million news articles), so it has been under constant testing and evaluation for some time now. On the basis of these semantic annotations, we perform semantic based indexing and retrieval where users can mix traditional information retrieval (IR) queries and ontology-based ones. We argue that such large-scale, fully automatic methods are essential for the transformation of the current largely textual web into a Semantic Web.

References

,

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
2003 SemanticAnnotationIndexingRetrieval	Atanas Kiryakov Borislav Popov Ivan Terziev Dimitar Manov Damyan Ognyanoff			Semantic Annotation, Indexing, and Retrieval			http://www.ontotext.com/publications/SemAIR ISWC169.pdf	10.1016/j.websem.2004.07.005