Constrained EM for parallel text alignment

DAVID TALBOT

doi:10.1017/S1351324905003852

Published online by Cambridge University Press: 21 September 2005

Affiliation: School of Informatics, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9LW, UK

Abstract

Standard parameter estimation schemes for statistical translation models can struggle to find reasonable settings on some parallel corpora. We show how auxiliary information can be used to constrain the procedure directly by restricting the set of alignments explored during parameter estimation. This enables the integration of bilingual and monolingual knowledge sources while retaining the flexibility of the underlying models. We demonstrate the effectiveness of this approach for incorporating linguistic and domain-specific constraints on various parallel corpora, and consider the importance of using the context of the parallel text to guide the application of such constraints.
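
As a rough illustration of what restricting the alignments explored during parameter estimation can look like, the sketch below runs EM for a simplified word-translation model in the style of IBM Model 1 and gives posterior mass only to links that a constraint permits. It is a minimal sketch under stated assumptions, not the method of the article: the function name `constrained_model1`, the predicate `allowed`, the omission of a NULL word, and the fallback to unconstrained links when a constraint rules out every candidate are all hypothetical choices made for this example.

```python
from collections import defaultdict


def constrained_model1(corpus, allowed, iterations=5):
    """EM for a Model-1-style lexicon t[(e, f)] ~ p(f | e), with constrained links.

    corpus  : list of (source_tokens, target_tokens) sentence pairs
    allowed : hypothetical predicate allowed(e, f) -> bool encoding auxiliary knowledge
    """
    # Start from a flat table: every word pair is equally likely.
    t = defaultdict(lambda: 1.0)
    for _ in range(iterations):
        count = defaultdict(float)  # expected count of each (e, f) link
        total = defaultdict(float)  # expected count of each source word e
        for src, tgt in corpus:
            for f in tgt:
                # E-step: only links permitted by the constraint receive mass.
                cands = [e for e in src if allowed(e, f)]
                if not cands:
                    # Assumption: if the constraint excludes everything, ignore it.
                    cands = src
                z = sum(t[(e, f)] for e in cands)
                for e in cands:
                    p = t[(e, f)] / z
                    count[(e, f)] += p
                    total[e] += p
        # M-step: renormalise expected counts per source word.
        t = defaultdict(float, {(e, f): c / total[e] for (e, f), c in count.items()})
    return t
```

Because forbidden links never accumulate expected counts, their probabilities are zero after the first iteration, while the model itself is otherwise unchanged. A constraint could come from a bilingual dictionary, cognate matching, or any other source that can be expressed as a yes/no test on a candidate link.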

Type: Papers
Information: Natural Language Engineering, Volume 11, Issue 3, September 2005, pp. 263-277

DOI: https://doi.org/10.1017/S1351324905003852
