Hostname: page-component-7c8c6479df-995ml Total loading time: 0 Render date: 2024-03-27T13:51:16.943Z Has data issue: false hasContentIssue false

KIM – a semantic platform for information extraction and retrieval

Published online by Cambridge University Press:  11 October 2004

BORISLAV POPOV
Affiliation:
Ontotext Lab., Sirma AI EOOD, 135 Tsarigradsko Shose, Sofia 1784, Bulgaria e-mail: borislav@ontotext.comnaso@ontotext.comdamyan@ontotext.commitac@ontotext.comangel@ontotext.com
ATANAS KIRYAKOV
Affiliation:
Ontotext Lab., Sirma AI EOOD, 135 Tsarigradsko Shose, Sofia 1784, Bulgaria e-mail: borislav@ontotext.comnaso@ontotext.comdamyan@ontotext.commitac@ontotext.comangel@ontotext.com
DAMYAN OGNYANOFF
Affiliation:
Ontotext Lab., Sirma AI EOOD, 135 Tsarigradsko Shose, Sofia 1784, Bulgaria e-mail: borislav@ontotext.comnaso@ontotext.comdamyan@ontotext.commitac@ontotext.comangel@ontotext.com
DIMITAR MANOV
Affiliation:
Ontotext Lab., Sirma AI EOOD, 135 Tsarigradsko Shose, Sofia 1784, Bulgaria e-mail: borislav@ontotext.comnaso@ontotext.comdamyan@ontotext.commitac@ontotext.comangel@ontotext.com
ANGEL KIRILOV
Affiliation:
Ontotext Lab., Sirma AI EOOD, 135 Tsarigradsko Shose, Sofia 1784, Bulgaria e-mail: borislav@ontotext.comnaso@ontotext.comdamyan@ontotext.commitac@ontotext.comangel@ontotext.com

Abstract

The KIM platform provides a novel Knowledge and Information Management framework and services for automatic semantic annotation, indexing, and retrieval of documents. It provides a mature and semantically enabled infrastructure for scalable and customizable information extraction (IE) as well as annotation and document management, based on GATE.General Architecture for Text Engineering (GATE) (http://gate.ac.uk), leading NLP and IE platform developed at the University of Sheffield. Our understanding is that a system for semantic annotation should be based upon a simple model of real-world entity concepts, complemented with quasi-exhaustive instance knowledge. To ensure efficiency, easy sharing, and reusability of the metadata we introduce an upper-level ontology. Based on the ontology, a large-scale instance base of entity descriptions is maintained. The knowledge resources involved are handled by use of state-of-the-art Semantic Web technology and standards, including RDF(S) repositories, ontology middleware and reasoning. From a technical point of view, the platform allows KIM-based applications to use it for automatic semantic annotation, for content retrieval based on semantic queries, and for semantic repository access. As a framework, KIM also allows various IE modules, semantic repositories and information retrieval engines to be plugged into it. This paper presents the KIM platform, with an emphasis on its architecture, interfaces, front-ends, and other technical issues.

Type
Papers
Copyright
© 2004 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)