Tensor Cores

Richard Ansorge

doi:10.1017/9781108855273.012

11 - Tensor Cores

Published online by Cambridge University Press: 04 May 2022

Affiliation: University of Cambridge

Summary

This chapter discusses the tensor core hardware available on newer GPUs. This hardware is designed to perform fast mixed-precision matrix multiplications and is intended for applications in AI. However, CUDA exposes its use to programmers through the warp matrix function library. These functions support tiled matrix multiplication using 16 × 16 tiles. We provide examples of their use to improve on the early matrix multiplication example in Chapter 2. We also show how reduction operations can be performed using tensor cores as a potential non-AI application.
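
As a rough illustration of the warp matrix functions mentioned above, the sketch below multiplies a single 16 × 16 tile with one warp, using half-precision inputs and a float accumulator. It is not taken from the chapter: the kernel name tile_matmul, the test data and the use of managed memory are assumptions made for this example. Filling A with ones makes every row of C hold the column sums of B, which is one simple way a matrix multiply can act as a reduction. The code assumes a GPU of compute capability 7.0 or later and a build along the lines of nvcc -arch=sm_70.

```cuda
#include <cstdio>
#include <cmath>
#include <cuda_fp16.h>
#include <mma.h>

using namespace nvcuda;

// One warp computes C = A * B for a single 16x16 tile.
// A and B are half precision; C is accumulated in float.
// (tile_matmul is a hypothetical name used only in this sketch.)
__global__ void tile_matmul(const half *a, const half *b, float *c)
{
    wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
    wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::row_major> b_frag;
    wmma::fragment<wmma::accumulator, 16, 16, 16, float> c_frag;

    wmma::fill_fragment(c_frag, 0.0f);              // C = 0
    wmma::load_matrix_sync(a_frag, a, 16);          // leading dimension 16
    wmma::load_matrix_sync(b_frag, b, 16);
    wmma::mma_sync(c_frag, a_frag, b_frag, c_frag); // C = A * B + C
    wmma::store_matrix_sync(c, c_frag, 16, wmma::mem_row_major);
}

int main()
{
    const int n = 16 * 16;
    half *a = nullptr, *b = nullptr;
    float *c = nullptr;
    cudaMallocManaged(&a, n * sizeof(half));
    cudaMallocManaged(&b, n * sizeof(half));
    cudaMallocManaged(&c, n * sizeof(float));

    // A is all ones, so C[i][j] is the sum of column j of B.
    for (int i = 0; i < n; i++) {
        a[i] = __float2half(1.0f);
        b[i] = __float2half(static_cast<float>(i % 16)); // B[k][j] = j
    }

    tile_matmul<<<1, 32>>>(a, b, c); // exactly one warp
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        printf("CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Compare against a plain CPU column sum of B.
    float max_err = 0.0f;
    for (int i = 0; i < 16; i++) {
        for (int j = 0; j < 16; j++) {
            float ref = 0.0f;
            for (int k = 0; k < 16; k++) ref += __half2float(b[k * 16 + j]);
            max_err = fmaxf(max_err, fabsf(c[i * 16 + j] - ref));
        }
    }
    printf("max abs error vs CPU reference: %g\n", max_err);

    cudaFree(a);
    cudaFree(b);
    cudaFree(c);
    return 0;
}
```

The whole warp must execute each wmma call together, which is why the kernel is launched with exactly 32 threads and contains no divergent branches. A full matrix multiplication would loop over tiles along the shared dimension and give each warp its own output tile.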

Keywords

tensor cores; warp matrix functions; reduction; AI; machine learning; Ampere architecture

Type: Chapter
Information: Programming in Parallel with CUDA: A Practical Guide, pp. 358-372

DOI: https://doi.org/10.1017/9781108855273.012

Publisher: Cambridge University Press

Print publication year: 2022
