Book contents
- Frontmatter
- Contents
- List of Figures
- List of Tables
- Foreword
- Contributors
- Introduction
- 1 Innovation and Growth: The Schumpeterian Perspective
- 2 European Union Dual Labour Markets: Consequences and Potential Reforms
- 3 Population, Migration, Ageing and Health: A Survey
- 4 Education Systems: Assignment, Admissions, Accountability and Autonomy
- 5 Competition and Regulation in Markets for Goods and Services: A Survey with Emphasis on Digital Markets
- 6 Winners and Losers of Globalization: Sixteen Challenges for Measurement and Theory
- 7 Economic Approaches to Energy, Environment and Sustainability
- 8 Regional Disparities and Efficient Transport Policies
- 9 Skilled Cities and Efficient Urban Transport
- 10 Fiscal and Monetary Policies after the Crises
- 11 Financial Regulation in Europe: Foundations and Challenges
- 12 Inequality and Welfare: Is Europe Special?
- 13 Developments in Data for Economic Research
- 14 Big Data in Economics: Evolution or Revolution?
- Index
14 - Big Data in Economics: Evolution or Revolution?
Published online by Cambridge University Press: 24 March 2017
- Frontmatter
- Contents
- List of Figures
- List of Tables
- Foreword
- Contributors
- Introduction
- 1 Innovation and Growth: The Schumpeterian Perspective
- 2 European Union Dual Labour Markets: Consequences and Potential Reforms
- 3 Population, Migration, Ageing and Health: A Survey
- 4 Education Systems: Assignment, Admissions, Accountability and Autonomy
- 5 Competition and Regulation in Markets for Goods and Services: A Survey with Emphasis on Digital Markets
- 6 Winners and Losers of Globalization: Sixteen Challenges for Measurement and Theory
- 7 Economic Approaches to Energy, Environment and Sustainability
- 8 Regional Disparities and Efficient Transport Policies
- 9 Skilled Cities and Efficient Urban Transport
- 10 Fiscal and Monetary Policies after the Crises
- 11 Financial Regulation in Europe: Foundations and Challenges
- 12 Inequality and Welfare: Is Europe Special?
- 13 Developments in Data for Economic Research
- 14 Big Data in Economics: Evolution or Revolution?
- Index
Summary
Abstract
The Big Data Era creates a lot of exciting opportunities for new developments in economics and econometrics. At the same time, however, the analysis of large datasets poses difficultmethodological problems that should be addressed appropriately and are the subject of the present chapter.
Introduction
‘Big Data’ has become a buzzword both in academic and in business and policy circles. It is used to cover a variety of data-driven phenomena that have very different implications for empirical methods. This chapter discusses some of these methodological challenges.
In the simplest case, ‘Big Data’ means a large dataset that otherwise has a standard structure. For example, Chapter 13 describes how researchers are gaining increasing access to administrative datasets or business records covering entire populations rather than population samples. The size of these datasets allows for better controls and more precise estimates and is a bonus for researchers. This may raise challenges for data storage and handling, but does not raise any distinct methodological issues.
However, ‘Big Data’ often means much more than just large versions of standard datasets. First, large numbers of units of observation often come with large numbers of variables, that is, large numbers of possible covariates. To illustrate with the same example, the possibility to link different administrative datasets increases the number of variables attached to each statistical unit. Likewise, business records typically contain all consumer interactions with the business. This can create a tension in the estimation between the objective of ‘letting the data speak’ and obtaining accurate (in a way to be specified later) coefficient estimates. Second, Big Data sets often have a very different structure from those we are used to in economics. This includes web search queries, real-time geolocational data or social media, to name a few. This type of data raises questions about how to structure and possibly re-aggregate them.
The chapter starts with a description of the ‘curse of dimensionality’, which arises from the fact that both the number of units of observation and the number of variables associated with each unit are large. This feature is present in many of the Big Data applications of interest to economists. One extreme example of this problem occurs when there are more parameters to estimate than observations.
- Type
- Chapter
- Information
- Economics without BordersEconomic Research for European Policy Challenges, pp. 612 - 632Publisher: Cambridge University PressPrint publication year: 2017
- Creative Commons
- This content is Open Access and distributed under the terms of the Creative Commons Attribution licence CC-BY-NC-ND 4.0 https://creativecommons.org/cclicenses/
- 4
- Cited by