Norming frequency counts

Douglas Biber; Susan Conrad; Randi Reppen

doi:10.1017/CBO9780511804489.017

6 - Norming frequency counts

Published online by Cambridge University Press: 05 June 2012

Douglas Biber ,

Susan Conrad and

Randi Reppen

Show author details

Douglas Biber: Affiliation:
Northern Arizona University
Susan Conrad: Affiliation:
Iowa State University
Randi Reppen: Affiliation:
Northern Arizona University

Book contents

Get access

Summary

When corpus-based studies examine the frequency of features across texts and registers, it is important to make sure that the counts are comparable. In particular, if the texts in a corpus are not all the same length, then frequency counts from those texts are not directly comparable. For example, imagine that you analyzed two texts and found that each one has 20 modal verbs. It might be tempting to conclude that modals are equally common in the texts. However, further imagine that the first text has a total length of 750 words, and the second text is 1,200 words long – in this case, your conclusion would be wrong. Because the second text is longer, there are more opportunities for modals to occur, and therefore simply comparing the raw counts does not give an accurate account of the relative frequencies of modals in the two texts.

“Normalization” is a way to adjust raw frequency counts from texts of different lengths so that they can be compared accurately. The total number of words in each text must be taken into consideration when norming frequency counts. Specifically, the raw frequency count should be divided by the number of words in the text, and then multiplied by whatever basis is chosen for norming.

Type: Chapter
Information: Corpus Linguistics
Investigating Language Structure and Use
, pp. 263 - 264

DOI: https://doi.org/10.1017/CBO9780511804489.017 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 1998

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

6 - Norming frequency counts

Summary

Access options

Book purchase

Temporarily unavailable

Book contents

6 - Norming frequency counts

Summary

Access options

Book purchase

Temporarily unavailable

Save book to Kindle

Save book to Dropbox

Save book to Google Drive