Hostname: page-component-cd9895bd7-gxg78 Total loading time: 0 Render date: 2024-12-27T07:21:54.518Z Has data issue: false hasContentIssue false

Integrating learner corpora and natural language processing: A crucial step towards reconciling technological sophistication and pedagogical effectiveness1

Published online by Cambridge University Press:  24 August 2007

Sylviane Granger
Affiliation:
Université catholique de Louvain, Centre for English Corpus Linguistics, Place Blaise Pascal 1, B-1348 Louvain-la-Neuve, Belgium (email: [email protected])
Olivier Kraif
Affiliation:
LIDILEM, Université Stendhal Grenoble3, BP-25 – 38040 Grenoble Cedex 9France (email: [email protected])
Claude Ponton
Affiliation:
LIDILEM, Université Stendhal Grenoble3, BP-25 – 38040 Grenoble Cedex 9France (email: [email protected])
Georges Antoniadis
Affiliation:
LIDILEM, Université Stendhal Grenoble3, BP-25 – 38040 Grenoble Cedex 9France (email: [email protected])
Virginie Zampa
Affiliation:
LIDILEM, Université Stendhal Grenoble3, BP-25 – 38040 Grenoble Cedex 9France (email: [email protected])

Abstract

Learner corpora, electronic collections of spoken or written data from foreign language learners, offer unparalleled access to many hitherto uncovered aspects of learner language, particularly in their error-tagged format. This article aims to demonstrate the role that the learner corpus can play in CALL, particularly when used in conjunction with web-based interfaces which provide flexible access to error-tagged corpora that have been enhanced with simple NLP techniques such as POS-tagging or lemmatization and linked to a wide range of learner and task variables such as mother tongue background or activity type. This new resource is of interest to three main types of users: teachers wishing to prepare pedagogical materials that target learners' attested difficulties; learners themselves for editing or language awareness purposes and NLP researchers, for whom it serves as a benchmark for testing automatic error detection systems.

Type
Research Article
Copyright
Copyright © Cambridge University Press 2007

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

1 The research reported in this article is part of a wider project on Integrated Digital Language Learning (IDILL) carried out within the framework of the EU-funded network of excellence Kaleidoscope dedicated to research in the field of technology-enhanced learning: http://www.noe-kaleidoscope.org/pub/