Implementing Encoder-Decoder Methods

Mihai Surdeanu; Marco Antonio Valenzuela-Escárcega

doi:10.1017/9781009026222.016

15 - Implementing Encoder-Decoder Methods

Published online by Cambridge University Press: 01 February 2024


Mihai Surdeanu: University of Arizona
Marco Antonio Valenzuela-Escárcega: University of Arizona


Summary

In this chapter, we implement a machine translation application as an example of an encoder-decoder task. In particular, we build on pretrained encoder-decoder transformer models, which are available in the Hugging Face library for a wide variety of language pairs. We first show how to use one of these models out of the box to perform translation for one of the language pairs it was exposed to during pretraining: English to Romanian. Afterward, we fine-tune the model on a new language combination that it has not seen before: Romanian to English. In both use cases, we use the T5 encoder-decoder model, which has been pretrained for several tasks, including machine translation.
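
As a rough illustration of the out-of-the-box use case described above, the following minimal sketch translates an English sentence into Romanian with a pretrained T5 checkpoint through the Hugging Face transformers library. The checkpoint name "t5-small" and the helper names build_prompt and translate are assumptions made for this example and are not taken from the chapter. T5 selects its task through a plain-text prefix, so the sketch prepends "translate English to Romanian: " to the input.

```python
# Sketch only: the checkpoint "t5-small" and the helper names below are
# assumptions for illustration, not the chapter's own code.
TASK_PREFIX = "translate English to Romanian: "


def build_prompt(text: str) -> str:
    """Prepend the T5 task prefix that selects English-to-Romanian translation."""
    return TASK_PREFIX + text.strip()


def translate(text: str, model_name: str = "t5-small", max_new_tokens: int = 40) -> str:
    """Translate one English sentence with a pretrained T5 checkpoint."""
    # Imported lazily so build_prompt works even without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    # Tokenize the prefixed sentence into PyTorch tensors.
    inputs = tokenizer(build_prompt(text), return_tensors="pt")

    # Generate the target-language token ids, then decode them back to text.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling translate("The weather is nice today.") downloads the checkpoint on first use and returns the Romanian output as a string. Romanian to English is not among the prefixes T5 was pretrained with, which is why that direction calls for fine-tuning rather than a different prefix.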

Keywords

machine translation; encoder-decoder; Hugging Face

Type: Chapter
Information: Deep Learning for Natural Language Processing: A Gentle Introduction, pp. 229-245

Publisher: Cambridge University Press

Print publication year: 2024
