Hostname: page-component-8448b6f56d-c47g7 Total loading time: 0 Render date: 2024-04-18T01:30:41.247Z Has data issue: false hasContentIssue false

A corpus-based approach for Korean nominal compound analysis based on linguistic and statistical information

Published online by Cambridge University Press:  29 August 2001

JUNTAE YOON
Affiliation:
Center for Artificial Intelligence Research, Department of Computer Science, Korea Advanced Institute of Science and Technology, Taejon, Korea; e-mail: jtyoon@world.kaist.ac.kr, kschoi@world.kaist.ac.kr
KEY-SUN CHOI
Affiliation:
Center for Artificial Intelligence Research, Department of Computer Science, Korea Advanced Institute of Science and Technology, Taejon, Korea; e-mail: jtyoon@world.kaist.ac.kr, kschoi@world.kaist.ac.kr
MANSUK SONG
Affiliation:
Department of Computer Science, Engineering College, Yonsei University Seoul, Korea; e-mail: mssong@december.yonsei.ac.kr

Abstract

The syntactic structure of a nominal compound must be analyzed first for its semantic interpretation. In addition, the syntactic analysis of nominal compounds is very useful for NLP application such as information extraction, since a nominal compound often has a similar linguistic structure with a simple sentence, as well as representing concrete and compound meaning of an object with several nouns combined. In this paper, we present a novel model for structural analysis of nominal compounds using linguistic and statistical knowledge which is coupled based on lexical information. That is, the syntactic relations defined between nouns (complement-predicate and modifier-head relation) are obtained from large corpora and again used to analyze the structures of nominal compounds and identify the underlying relations between nouns. Experiments show that the model gives good results, and can be effectively used for application systems which do not require deep semantic information.

Type
Research Article
Copyright
© 2001 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)