Contextualized Embeddings and Transformer Networks

Mihai Surdeanu; Marco Antonio Valenzuela-Escárcega

doi:10.1017/9781009026222.013

12 - Contextualized Embeddings and Transformer Networks

Published online by Cambridge University Press: 01 February 2024

Mihai Surdeanu and

Marco Antonio Valenzuela-Escárcega

Show author details

Mihai Surdeanu: Affiliation:
University of Arizona
Marco Antonio Valenzuela-Escárcega: Affiliation:
University of Arizona

Book contents

Get access

Summary

As mentioned in Chapter 8, the distributional similarity algorithms discussed there conflate all senses of a word into a single numerical representation (or embedding). For example, the word bank receives a single representation, regardless of its financial (e.g., as in the bank gives out loans) or geological (e.g., bank of the river) sense. This chapter introduces a solution for this limitation in the form of a new neural architecture called transformer networks, which learns contextualized embeddings of words, which, as the name indicates, change depending on the context in which the words appear. That is, the word bank receives a different numerical representation for each of its instances in the two texts above because the contexts in which they occur are different. We also discuss several architectural choices that enabled the tremendous success of transformer networks: self attention, multiple heads, stacking of multiple layers, and subword tokenization, as well as how transformers can be pretrained on large amounts of data through through masked language modeling and next-sentence prediction.

Keywords

transformer networks encoder self attention

Type: Chapter
Information: Deep Learning for Natural Language Processing
A Gentle Introduction
, pp. 178 - 193

DOI: https://doi.org/10.1017/9781009026222.013 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2024

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

12 - Contextualized Embeddings and Transformer Networks

Summary

Keywords

Access options

Book purchase

Temporarily unavailable

Book contents

12 - Contextualized Embeddings and Transformer Networks

Summary

Keywords

Access options

Book purchase

Temporarily unavailable

Save book to Kindle

Save book to Dropbox

Save book to Google Drive