Corpus-based Dialectometry: Aggregate Morphosyntactic Variability in British English Dialects

16 - Corpus-based Dialectometry: Aggregate Morphosyntactic Variability in British English Dialects

Published online by Cambridge University Press: 12 September 2012

Benedikt Szmrecsanyi

Edited by

John Nerbonne ,

Charlotte Gooskens ,

Sebastian Kürschner and

Renée van Bezooijen

Show author details

Benedikt Szmrecsanyi: Affiliation:
University of Freiburg
John Nerbonne: Affiliation:
University of Groningen
Charlotte Gooskens: Affiliation:
University of Groningen
Sebastian Kürschner: Affiliation:
Friedrich-Alexander-Universität Erlangen-Nürnberg
Renée van Bezooijen: Affiliation:
University of Groningen

Book contents

Get access

Summary

Abstract The research reported in this paper departs from most previous work in dialectometry in several ways. Empirically, it draws on frequency vectors derived from naturalistic corpus data and not on discrete atlas classifications. Linguistically, it is concerned with morphosyntactic (as opposed to lexical or pronunciational) variability. Methodologically, it marries the careful analysis of dialect phenomena in authentic, naturalistic texts to aggregational-dialectometrical techniques. Two research questions guide the investigation: First, on methodological grounds, is corpus-based dialectometry viable at all? Second, to what extent is morphosyntactic variation in nonstandard British dialects patterned geographically? By way of validation, findings will be matched against previous work on the dialect geography of Great Britain.

INTRODUCTION

The overarching aim in this study is to provide a methodological sketch of how to blend philologically responsible corpus-based research with aggregational-dialectometrical analysis techniques. The bulk of previous research in dialectometry has focussed on phonology and lexis (however, for work on Dutch dialect syntax see Spruit 2005, 2006, 2008, Spruit et al. t.a.). Moreover, orthodox dialectometry draws on linguistic atlas classifications as its primary data source. The present study departs from these traditions in several ways. It endeavours, first, to measure aggregate morphosyntactic distances and similarities between traditional dialects in the British Isles. Second, the present study does not rely on atlas data but on frequency information deriving from a careful analysis of language use in authentic, naturalistic texts. This is another way of saying that the aggregate analysis in this paper is frequency-based, an approach that contrasts with atlas-based dialectometry, which essentially relies on categorical input data.

Type: Chapter
Information: Computing and Language Variation
International Journal of Humanities and Arts Computing Volume 2
, pp. 279 - 296

Publisher: Edinburgh University Press

Print publication year: 2009

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

16 - Corpus-based Dialectometry: Aggregate Morphosyntactic Variability in British English Dialects

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive