Published online by Cambridge University Press: 24 March 2017
Abstract
The Big Data era creates many exciting opportunities for new developments in economics and econometrics. At the same time, however, the analysis of large datasets poses difficult methodological problems that must be addressed appropriately; these are the subject of the present chapter.
Introduction
‘Big Data’ has become a buzzword in academic as well as in business and policy circles. It is used to cover a variety of data-driven phenomena that have very different implications for empirical methods. This chapter discusses some of these methodological challenges.
In the simplest case, ‘Big Data’ means a large dataset that otherwise has a standard structure. For example, Chapter 13 describes how researchers are gaining increasing access to administrative datasets or business records covering entire populations rather than population samples. The size of these datasets allows for better controls and more precise estimates and is a bonus for researchers. This may raise challenges for data storage and handling, but does not raise any distinct methodological issues.
However, ‘Big Data’ often means much more than just large versions of standard datasets. First, large numbers of units of observation often come with large numbers of variables, that is, large numbers of possible covariates. To illustrate with the same example, the possibility of linking different administrative datasets increases the number of variables attached to each statistical unit. Likewise, business records typically contain all consumer interactions with the business. This can create a tension in estimation between the objective of ‘letting the data speak’ and obtaining accurate (in a way to be specified later) coefficient estimates. Second, Big Data sets often have a very different structure from those we are used to in economics; examples include web search queries, real-time geolocation data and social media. These data raise questions about how to structure and possibly re-aggregate them.
The chapter starts with a description of the ‘curse of dimensionality’, which arises from the fact that both the number of units of observation and the number of variables associated with each unit are large. This feature is present in many of the Big Data applications of interest to economists. One extreme example of this problem occurs when there are more parameters to estimate than observations.
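To make the extreme case concrete, the following Python sketch (illustrative only, not taken from the chapter; the data-generating process, the choice of the lasso and the penalty value are assumptions) shows why ordinary least squares breaks down when there are more covariates than observations, while a penalised estimator can still recover a sparse set of coefficients.

```python
# Minimal sketch of the p > n problem: OLS is not identified,
# a penalised estimator (here the lasso) still yields stable,
# sparse coefficient estimates.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n, p = 100, 500                              # more covariates than observations
X = rng.standard_normal((n, p))
beta = np.zeros(p)
beta[:5] = [3.0, -2.0, 1.5, 1.0, -1.0]       # only 5 covariates truly matter (assumed)
y = X @ beta + rng.standard_normal(n)

# X'X is 500 x 500 but has rank at most 100, so it is singular:
# the OLS coefficients are not uniquely determined.
rank = np.linalg.matrix_rank(X.T @ X)
print(f"rank of X'X: {rank} (dimension {p}) -> OLS not identified")

# The lasso adds an L1 penalty, which regularises the problem and sets
# most coefficients exactly to zero, selecting a small set of covariates.
lasso = Lasso(alpha=0.1).fit(X, y)
nonzero = np.flatnonzero(lasso.coef_)
print(f"lasso selects {nonzero.size} covariates, e.g. indices {nonzero[:10]}")
```

In this stylised setting the lasso recovers (approximately) the five relevant covariates out of 500, which is one way of reconciling ‘letting the data speak’ with accurate coefficient estimates when the number of candidate variables is large.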