Optimization of word alignment clues

JÖRG TIEDEMANN

doi:10.1017/S1351324905003864

Published online by Cambridge University Press: 21 September 2005

Affiliation: Alfa-Informatica, University of Groningen, Groningen, The Netherlands
E-mail: tiedeman@let.rug.nl

Abstract

Statistical, linguistic, and heuristic clues can be used for the alignment of words and multi-word units in parallel texts. This article describes the clue alignment approach and the optimization of its parameters using a genetic algorithm. Word alignment clues can come from various sources such as statistical alignment models, co-occurrence tests, string similarity scores and static dictionaries. A genetic algorithm implementing an evolutionary procedure can be used to optimize the parameters necessary for combining available clues. Experiments on English/Swedish bitext show a significant improvement of about 6% in F-scores compared to the baseline produced by statistical word alignment.

Most of the work described in this paper was carried out at the Department of Linguistics and Philology at Uppsala University. I would like to acknowledge technical and scientific support by people at the department in Uppsala.
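To make the idea of optimizing clue-combination parameters concrete, the following is a minimal sketch, not the article's implementation. It assumes that each clue assigns a score in [0, 1] to a candidate word pair, that weighted clues are combined with a probabilistic OR, and that links are accepted above a fixed threshold. The clue names, scores, gold links, and the functions `combine`, `f_score`, and `optimize` are all hypothetical and chosen only for illustration.

```python
import random

# Toy clue scores for candidate links (English word, Swedish word).
# All clue names and numbers are invented for illustration.
CLUES = {
    "dice": {("house", "hus"): 0.8, ("house", "bil"): 0.3,
             ("car", "bil"): 0.7, ("car", "hus"): 0.2},
    "string": {("house", "hus"): 0.6, ("house", "bil"): 0.0,
               ("car", "bil"): 0.1, ("car", "hus"): 0.1},
}
GOLD = {("house", "hus"), ("car", "bil")}  # hypothetical gold-standard links
CANDIDATES = sorted(GOLD | {("house", "bil"), ("car", "hus")})


def combine(weights, pair):
    """Probabilistic OR of weighted clue scores (an assumed combination rule)."""
    miss = 1.0
    for name, weight in weights.items():
        miss *= 1.0 - weight * CLUES[name].get(pair, 0.0)
    return 1.0 - miss


def f_score(weights, threshold=0.5):
    """Balanced F-score of the links whose combined score reaches the threshold."""
    predicted = {p for p in CANDIDATES if combine(weights, p) >= threshold}
    true_pos = len(predicted & GOLD)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(predicted)
    recall = true_pos / len(GOLD)
    return 2 * precision * recall / (precision + recall)


def optimize(generations=30, pop_size=12, seed=0):
    """Tiny genetic algorithm: elitist selection, uniform crossover, mutation."""
    rng = random.Random(seed)
    names = sorted(CLUES)
    population = [{n: rng.random() for n in names} for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=f_score, reverse=True)
        parents = population[: pop_size // 2]  # the fitter half survives
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            # Uniform crossover: take each weight from one of the two parents.
            child = {n: rng.choice((a[n], b[n])) for n in names}
            # Mutate one weight with Gaussian noise, clipped to [0, 1].
            m = rng.choice(names)
            child[m] = min(1.0, max(0.0, child[m] + rng.gauss(0.0, 0.1)))
            children.append(child)
        population = parents + children
    return max(population, key=f_score)


if __name__ == "__main__":
    best = optimize()
    print(best, f_score(best))
```

Because the fitter half of each generation is carried over unchanged, the best F-score in the population never decreases from one generation to the next. A realistic setup would evaluate fitness on a held-out gold standard rather than on four toy pairs.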

Type: Papers
Information: Natural Language Engineering, Volume 11, Issue 3, September 2005, pp. 279-293
