We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
Roland Schäfer & Felix Bildhauer, Web Corpus Construction (Synthesis Lectures on Human Language Technologies 22). Morgan & Claypool, 2013. Pp. xv + 129.
Published online by Cambridge University Press:
19 November 2014
An abstract is not available for this content so a preview has been provided. Please use the Get access link above for information on how to access this content.
Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)
Baroni, Marco, Bernardini, Silvia, Ferraresi, Adriano & Zanchetta, Eros. 2009. The WaCkyWide Web: A collection of very large linguistically processed webcrawled corpora. Language Resources & Evaluation43, 209–226.CrossRefGoogle Scholar
Biemann, Chris, Bildhauer, Felix, Evert, Stefan, Goldhahn, Dirk, Quasthoff, Uwe, Schäfer, Roland, Simon, Johannes, Swiezinski, Leonard & Zesch, Torsten. 2013. Scalable construction of high-quality web corpora. Journal for Language Technology and Computational Linguistics, 28(2), 23–59.CrossRefGoogle Scholar
Fletcher, William H.2013. Corpus analysis of the World Wide Web. In Chapelle, Carol A. (ed.), The Encyclopedia of Applied Linguistics, vol. 3, 1339–1347. Oxford: Wiley-Blackwell.Google Scholar
Hundt, Marianne, Nesselhauf, Nadja & Biewer, Carolin (eds.). 2007. Corpus Linguistics and the Web. Amsterdam: Rodopi.CrossRefGoogle Scholar
Kilgarriff, Adam. 2001. Comparing corpora. International Journal of Corpus Linguistics6(1), 97–133.CrossRefGoogle Scholar
Kilgarriff, Adam. 2007. Googleology is bad science. Computational Linguistics33(1), 147–151.Google Scholar
Kilgarriff, Adam & Grefenstette, Gregory. 2003. Introduction to the special issue on the web as corpus. Computational Linguistics29(3), 333–347.CrossRefGoogle Scholar
Loftsson, Hrafn & Östling, Robert. 2013. Tagging a morphologically complex language using an Averaged Perceptron Tagger: The case of Icelandic. 19th Nordic Conference of Computational Linguistics (NODALIDA), 105–119. Linköping: Linköping University Electronic Press.Google Scholar
Nivre, Joakim, Hall, Johan, Nilsson, Jens, Chanev, Atanas, Eryigit, Gülsen, Kübler, Sandra, Marinov, Svetoslav & Marsi, Erwin. 2007. MaltParser: A language-independent system for data-driven dependency parsing. Natural Language Engineering13, 95–135.CrossRefGoogle Scholar
Östling, Robert. 2013. Stagger: An open-source part of speech tagger for Swedish. Northern European Journal of Language Technology (NEJLT) 3, 1–18.Google Scholar
Renouf, Antoinette, Kehoe, Andrew & Banerjee, Jayeeta. 2007. WebCorp: An integrated system for web text search. In Hundt et al. (eds.), 2007, 47–67.Google Scholar
Schäfer, Roland & Bildhauer, Felix. 2012. Building large corpora from the web using a New Efficient Tool Chain. The Eighth International Conference on Language Resources and Evaluation (LREC), Istanbul, Turkey, 486–493.Google Scholar
Suchomel, Vít & Pomikálek, Jan. 2012. Efficient web crawling for large text corpora. The Seventh Web as Corpus Workshop (WAC), Lyon, France, 39–43.Google Scholar