Issues in diachronic corpus design

Douglas Biber; Susan Conrad; Randi Reppen

doi:10.1017/CBO9780511804489.013

2 - Issues in diachronic corpus design

Published online by Cambridge University Press: 05 June 2012

Douglas Biber ,

Susan Conrad and

Randi Reppen

Show author details

Douglas Biber: Affiliation:
Northern Arizona University
Susan Conrad: Affiliation:
Iowa State University
Randi Reppen: Affiliation:
Northern Arizona University

Book contents

Get access

Summary

Designing a diachronic corpus can be even more complicated than a synchronic corpus (discussed in Methodology Box 1): in addition to concerns relating to size and register diversity, there is the added parameter of time that must be adequately represented. Further, the universe of available texts is much smaller for earlier historical periods, making it difficult to even assess when a representative sample has been achieved.

When designing either a synchronic or a diachronic corpus, the first step is to determine the intended research purposes. For historical research, those purposes might be as narrow as studying the style of a single author's novels. Designing a representative corpus for this purpose would be relatively straightforward – in fact, it might be reasonable to aim for an exhaustive sampling in this case. However, broader research goals quickly result in much more complicated corpus designs. For example, designing a corpus to study a period style (e.g., early eighteenth-century prose) or a single genre (e.g., the novel) raises serious questions about sampling methods.

At the far extreme of complexity is the multi-purpose diachronic corpus designed to represent a wide range of register diversity across historical periods. The Helsinki Corpus and the ARCHER Corpus were both designed for these purposes; the Helsinki Corpus covers the period from c. 750 to c. 1700; and the ARCHER Corpus covers the period from 1650 to the present.

Type: Chapter
Information: Corpus Linguistics
Investigating Language Structure and Use
, pp. 251 - 253

DOI: https://doi.org/10.1017/CBO9780511804489.013 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 1998

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

2 - Issues in diachronic corpus design

Summary

Access options

Book purchase

Temporarily unavailable

Book contents

2 - Issues in diachronic corpus design

Summary

Access options

Book purchase

Temporarily unavailable

Save book to Kindle

Save book to Dropbox

Save book to Google Drive