
Number preferences in lotteries

Published online by Cambridge University Press:  01 January 2023

Tong V. Wang
Affiliation:
*Erasmus University Rotterdam
Rogier J.D. Potter van Loon
Affiliation:
*Erasmus University Rotterdam
Dennie van Dolder
Affiliation:
‡University of Nottingham

Abstract

We explore people’s preferences for numbers in large proprietary data sets from two different lottery games. We find that choice is far from uniform, and exhibits some familiar and some new tendencies and biases. Players favor personally meaningful and situationally available numbers, and are attracted towards numbers in the center of the choice form. Frequent players avoid winning numbers from recent draws, whereas infrequent players chase these. Combinations of numbers are formed with an eye for aesthetics, and players tend to spread their numbers relatively evenly across the possible range.

Type
Research Article
Creative Commons
The authors license this article under the terms of the Creative Commons Attribution 3.0 License.
Copyright
Copyright © The Authors [2016] This is an Open Access article, distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.

1 Introduction

Many choice situations involve numeric values. Numbers indicate quantities, prices, and rankings, and they serve as arbitrary labels or identification codes. A recent literature on Chinese culture shows that tastes and distastes for particular numbers can influence decisions and affect market prices. Vehicle license plates with the lucky number eight are auctioned at relatively high prices, and vehicle plates with the unlucky number four are auctioned at relatively low prices (Woo & Kwok, 1994; Woo, Horowitz, Luk & Lai, 2008; Chong & Du, 2008; Ng, Chong & Du, 2010). In housing markets, houses with a number ending in eight are traded at a premium, whereas houses with a number ending in four are traded at a discount (Bourassa & Peng, 1999; Chau, Ma & Ho, 2001; Agarwal, He, Liu, Png, Sing & Wong, 2014; Fortin, Hill & Huang, 2014; Shum, Sun & Ye, 2014). In financial markets, culture-inspired number preferences cause particular limit-order and transaction prices to be more frequent than others (Brown, Chua & Mitchell, 2002; He & Wu, 2006; Cai, Cai & Keasey, 2007; Brown & Mitchell, 2008; Bhattacharya, Kuo, Lin & Zhao, 2016). Moreover, the shares of newly listed firms with lucky listing codes seem to be overvalued and underperform those with unlucky listing codes (Hirshleifer, Jian & Zhang, 2014).

Tradition or cultural background is just one possible determinant of tastes and distastes for particular numbers. In the present paper, we map a variety of other determinants in the context of two different lottery games. The first is the Dutch Lotto, a nationwide six-number lottery. For 175 consecutive draws that span a two-and-a-half year period, we have five million choices of combinations of six different numbers between 1 and 45. The second is a lottery that was organized as a promotional event by a large casino company in the Netherlands in 2013 and 2014. We have the complete collection of entries for each of the two years, for an aggregate of more than five hundred thousand choices of combinations of four numbers between 0 and 36.

The question whether people in these lottery games exhibit a systematic preference for particular numbers is interesting from multiple perspectives. First, our data provide a real-life test-bed for various behavioral regularities. The orientation of the games towards chance and prediction, the use of particular choice forms, the fact that people choose numbers in combinations, and the availability of specific numbers in the decision context, allow for the testing of a variety of psychological phenomena. Second, the preferences that we document here may also play a role in areas outside that of lotteries. Numerical labels and indicators abound in the environments of, for example, consumers, investors, entrepreneurs, and experimental subjects. If people have number preferences, these labels and indicators could influence their choices. The studies cited in the first paragraph above illustrate that the economic impact of such preferences is potentially significant. Last, understanding how people behave in lotteries is interesting in its own right. Many countries have one or more large lotteries in which people can choose the numbers they play with. Worldwide, households spend a significant portion of their income on lotteries, with total expenditures amounting to hundreds of billions of dollars (Kearney, Tufano, Guryan & Hurst, 2011; Beckert & Lutter, 2013).

Our results are surprisingly similar across the two games. Players have a tendency to play with the personally meaningful numbers in their birthdate, age, and postal code. They also more frequently choose numbers that are situationally available: there is a preference for numbers (i) in the current date, (ii) in the date of the draw, (iii) forming the jackpot size, (iv) representing the remaining time until the draw shown on the screen, and (v) on a voucher that players need in order to participate.

We also find evidence that the spatial position of numbers matters. The two lottery games employ a different range of numbers and tabulate these numbers in a different way. In both lotteries, players are attracted towards numbers in the center of the choice form and avoid numbers at the edges. Our final result for individual numbers is that frequent players avoid the winning numbers from recent draws, whereas infrequent players chase these.

For combinations of numbers we find that players care about aesthetics. With only a few exceptions, the most popular combinations all represent numeric sequences or spatial patterns. These combinations are selected extremely often in comparison with what would be expected if people choose randomly. Furthermore, players spread their numbers relatively evenly across the range of possible numbers.

Our study is not the first to investigate number preferences in lottery games, but it is distinct in terms of data and scope. Many earlier studies rely on indirect or aggregated data, analyzing the number of winners given particular draw results (Chernoff, 1981; Cook & Clotfelter, 1993; Terrell, 1994; Finkelstein, 1995; Scoggins, 1995; Haigh, 1997; Cox, Daniell & Nicole, 1998; Papachristou & Karamanis, 1998; Farrell, Hartley, Lanot & Walker, 2000; Roger & Broihanne, 2007) or the overall popularity of individual numbers or combinations (Joe, 1987; Halpern & Devereaux, 1989; Stern & Cover, 1989; Clotfelter & Cook, 1993; Henze, 1997; Simon, 1999; Ding, 2011; Lien, Yuan & Zheng, 2015; Lien & Yuan, 2015). To the best of our knowledge, only Suetens and Tyran (2012) and Suetens, Galbo-Jørgensen & Tyran (2015) use detailed individual-level data on lottery players and number choices. All these studies focus on a subset of the behavioral regularities that we consider in the present paper.

2 Games and data

2.1 Lotto game

Generating €144 million in revenues in 2014, the Dutch Lotto is one of the largest nationwide lotteries in the Netherlands (annual report De Lotto, 2014). Draws take place every Saturday at 6pm CET. On the last Saturday of every month (“Super Saturday”) there are two draws. Players choose six numbers from the range of 1 to 45, and additionally one color from six. Bets cost €2 each, and prizes are awarded for matching at least two of the numbers drawn. The more numbers a player matches, the bigger the prize. During our sample period, the progressive jackpot had a minimum value of €7.5 million and increased by half a million each time it was not awarded. A player wins the jackpot if she matches all six numbers and the jackpot color. If there is more than one winner, the jackpot is shared. The chance of winning the jackpot or a share of it is roughly one-in-49-million. Table S1 in the Supplement displays the probabilities for the smaller prizes.
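The one-in-49-million figure can be checked with a short calculation. The sketch below assumes that the six numbers are drawn without replacement from 1 to 45 and that the jackpot color is drawn independently and uniformly from the six colors; the constant and function names are illustrative only.

```python
from math import comb

NUMBERS_IN_RANGE = 45   # numbers 1..45 on the Lotto form
NUMBERS_PER_BET = 6     # six different numbers per combination
JACKPOT_COLORS = 6      # one color chosen out of six


def jackpot_outcomes() -> int:
    """Count equally likely (combination, color) outcomes under the stated assumptions."""
    return comb(NUMBERS_IN_RANGE, NUMBERS_PER_BET) * JACKPOT_COLORS


print(jackpot_outcomes())  # 48870360, i.e. roughly one in 49 million
```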

Our data consist solely of online transactions. When making an online transaction, a player is first asked how many combinations she wishes to bet on. Next, she chooses the numbers and color of each combination, and decides how many draws she wants to participate in (maximum of twelve). Our analyses ignore the number of chosen repetitions, because there is only one decision process underlying a string of automatically repeated bets. Figure S1 in the Supplement shows the online Lotto form.

By default, the computer system generates a random combination for each bet. A player can choose whether to play with this combination, to generate another random combination, to adjust one or more numbers manually, or to choose a combination from scratch. Unfortunately, we do not know when default combinations were used.

In our standard approach we weight each chosen combination equally, regardless of how many other combinations the same player bets on. As a robustness check, we also conduct analyses in which we weight observations by the reciprocal of the total number of combinations chosen by the player in our sample period.
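The reciprocal weighting used in the robustness check can be sketched as follows. The function name and the toy player identifiers are hypothetical; the only ingredient taken from the text is that each combination is weighted by one over the number of combinations chosen by the same player, so that every player contributes a total weight of one.

```python
from collections import Counter


def reciprocal_weights(player_ids):
    """Return one weight per combination: 1 / (combinations by that player)."""
    counts = Counter(player_ids)
    return [1.0 / counts[player] for player in player_ids]


# Toy example: player "a" entered two combinations, player "b" one.
weights = reciprocal_weights(["a", "a", "b"])
print(weights)  # [0.5, 0.5, 1.0]
```

Under this scheme the weights sum to the number of distinct players, which is why the weighted results speak to the cross-section of players rather than to the most active ones.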

Our anonymized data set consists of 2,590,919 online transactions for the Dutch Lotto between April 19, 2010 and December 31, 2012. A total of 175 draws took place in this time period. For the 5,108,343 chosen combinations in our data set we know the date of the transaction and the date of the draw. For the 131,407 (anonymous) players we know their gender, birthdate, and the four digits of their postal code.Footnote 1 A majority of 73% of the players are male and 84% of the combinations are entered by males.

2.2 Casino game

Our data for the casino game derive from two identical promotional events organized by Holland Casino in 2013 and 2014. Anyone who visited a casino of this Dutch state-owned company between May 2 and June 9, 2013 or May 6 and June 9, 2014 received a voucher with a login code. Via a terminal inside the casino and via the Internet, this code granted access to a lottery where players had to predict the outcomes of four consecutive spins of a roulette wheel with pockets numbered from 0 to 36. Participants were competing for a guaranteed prize of €100,000, to be shared by those who predicted the correct numbers in the correct order. If nobody won according to this criterion, the prize would be shared by all players who predicted the correct numbers irrespective of order. If nobody won on the basis of all four numbers, the prize would be awarded on the basis of the first three numbers alone. Unlike in the Lotto game, players were not offered the possibility to use randomly generated numbers.

Our anonymized data consist of all 323,896 combinations of four numbers entered in 2013 and all 245,091 entered in 2014. For each combination we know the voucher code, the date of play, the player’s gender, and the player’s birthdate. The data set from 2014 also contains a unique number for each of the 112,473 players. For 2013 such a unique number is not available. The percentages of combinations entered by male players in 2013 and 2014 are 54.9 and 58.6, respectively.

If we analyze the two years separately, the results are strikingly similar. For example, as illustrated in Figure S2 in the Supplement, the correlation between the individual number frequencies is equal to 0.98 and the differences are small. In the subsequent sections we therefore present the results for the pooled data.

3 Number frequencies

If players in the Lotto game pick their numbers randomly, each number is expected to be chosen 13.3% of the time (6/45). Figure 1 depicts the actual frequencies. The most popular number in the Lotto data is 11, picked in 16.5% of the combinations. The number 7 follows closely (16.3%). The least popular numbers are 37 and 38 (10.3% and 10.5%, respectively). Overall, we observe that players have a tendency to pick small numbers. Figure 2 presents the frequencies in a heat map, where the numbers are displayed in a matrix as they appear on the Lotto website.

Figure 1: Number frequencies in the Lotto game.

Figure 2: Heat map for the Lotto game.

Similar results emerge in the casino game. Figure 3 shows the selection frequencies for the 37 numbers. Under random number selection each number would be chosen 2.70% of the time (1/37). Again, we observe a preference for small numbers. The most popular number is 7, chosen 4.19% of the time, closely followed by 8 (4.05%). The most frequently picked number in the Lotto data, 11, is the fourth most popular number in the casino data (3.46%). The least popular numbers are 34 (1.43%) and 35 (1.64%). Figure 4 presents the frequencies in a heat map, with the numbers displayed as they appear on the roulette table. This presentation was also used on the vouchers and on the screen when players entered their predictions.

Figure 3: Number frequencies in the casino game.

Figure 4: Heat map for the casino game.

These results are in line with past research. Other lottery studies have similarly found that players have a preference for small numbers (Stern & Cover, 1989; Finkelstein, 1995; Cox et al., 1998; Papachristou & Karamanis, 1998; Farrell et al., 2000; Roger & Broihanne, 2007; Oyeleke & Otekunrin, 2014; Suetens et al., 2015). A possible explanation is that smaller numbers are more present in everyday life and easier to recall, and thus more likely to be personally relevant and prominently available in memory (Milikowski, 1995). The popularity of 7 seems to be a general phenomenon. Without exception, lottery studies find that 7 is among the most popular numbers. Experimental studies similarly document a preference for this number (Simon, 1971; Simon & Primavera, 1972; Heywood, 1972; Kubovy & Psotka, 1976; Teigen, 1983; Silver et al., 1988). Footnote 2

Studies that looked at color preferences find that blue is the most frequently chosen color (Simon, 1971; Simon & Primavera, 1972; Trueman, 1979; Silver et al., 1988).Footnote 3, Footnote 4 Among our Lotto players, the most popular jackpot color is blue as well (22.2%), followed by red (18.9%), green (17.6%), yellow (14.6%), purple (13.4%), and orange (13.3%). In the game of roulette, half the numbers 1–36 are black and the other half are red (0 is green), and when the casino game players entered their predictions the numbers were displayed in these colors. The average selection frequency of red numbers is 2.75%, which is significantly higher than the average for black numbers of 2.68% (z-test; p < 0.001).

In both games, odd numbers are more popular than even numbers (Lotto: 13.5% vs. 13.1%; Casino: 2.77% vs. 2.63%); among the odd numbers, prime numbers are more popular than non-prime numbers (Lotto: 14.0% vs. 13.0%; Casino: 3.14% vs. 2.32%) and among the even numbers, non-round numbers are more popular than the “round” multiples of ten (Lotto: 13.2% vs. 12.7%; Casino: 2.68% vs. 2.48%). All these pairs of averages are significantly different (z-tests; all p < 0.001).

In other contexts, people tend to use round numbers more often than non-round numbers (Plug, 1977; Klesges, Debon & Ray, 1995; Bopp & Faeh, 2008; Pope & Simonsohn, 2011). One possible explanation for the difference is that lottery players may look for combinations that “look random”, and that non-round numbers appear more random than round numbers. Similarly, odd and prime numbers may appear more random than even and non-prime numbers, respectively.

4 Personally meaningful and situationally available numbers

People generally hold a favorable view towards the self (Greenwald & Banaji, 1995). This favorable view tends to spill over to things associated with the self (Beggan, 1992; Morewedge, Shu, Gilbert & Wilson, 2009; Nuttin, 1985, 1987). The resulting tendency of people to gravitate towards people, places, and things that resemble the self has been termed implicit egotism (Pelham, Carvallo & Jones, 2005). One example is the preference for the numbers in one’s own birthday (Kitayama & Karasawa, 1997; Jones, Pelham, Mirenberg & Hetts, 2002). In line with this, virtually all past Lotto studies show that the numbers in the range of 1–31 (days), and in particular 1–12 (days and months), are more popular than other numbers.

With our individual-level Lotto data we can directly investigate whether players have a preference for playing with the numbers of their day, month, and year of birth. We can also test whether they favor two other kinds of personally meaningful numbers, namely the number corresponding to their age and the numbers in their postal code.

For year of birth we consider the last two digits. Players need to be born between 1901 and 1945 to be able to use their birth year, which was true for 7.9% of the 5.1 million combinations. Selecting age as a number is only possible for people under the age of 46, which was true for 42.3% of the combinations. Dutch postal codes are alphanumeric, consisting of a number between 1000 and 9999 and two letters. We consider the first two digits and the last two digits. Players could select these numbers in 60.7 and 59.2% of the cases, respectively.
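The eligibility rules above can be illustrated with a small helper that lists which personally meaningful numbers a player is able to select on the Lotto form. This is a sketch under assumptions: the function and key names are hypothetical, age is computed in completed years on the day of play (the text does not say how age was measured), and the example player is invented.

```python
from datetime import date


def personal_numbers(birthdate, postal_digits, play_date, low=1, high=45):
    """Return the personally meaningful numbers that fall inside [low, high]."""
    # Completed years of age on the day of play (an assumption, see lead-in).
    had_birthday = (play_date.month, play_date.day) >= (birthdate.month, birthdate.day)
    age = play_date.year - birthdate.year - (0 if had_birthday else 1)
    candidates = {
        "birth_day": birthdate.day,
        "birth_month": birthdate.month,
        "birth_year": birthdate.year % 100,   # last two digits of the birth year
        "age": age,
        "postal_first": postal_digits // 100,  # first two digits of the postal code
        "postal_last": postal_digits % 100,    # last two digits of the postal code
    }
    return {name: value for name, value in candidates.items() if low <= value <= high}


# Hypothetical player born 15 March 1940 with postal digits 3062, playing on 1 June 2012.
print(personal_numbers(date(1940, 3, 15), 3062, date(2012, 6, 1)))
# {'birth_day': 15, 'birth_month': 3, 'birth_year': 40, 'postal_first': 30}
```

For the casino game the same helper could be called with `low=0, high=36`.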

Table 1, Panel A shows how frequently the personally meaningful numbers are chosen, conditional on the player being able to do so. Under the null hypothesis of random choice, numbers will be picked 13.3% of the time (6/45). This proportion is exceeded for all personally meaningful numbers (z-tests; all p < 0.001). Day of birth is the most popular one, followed by the year and month of birth, age, and the postal code numbers.

Table 1: Personally meaningful and situationally available numbers in the Lotto game

Notes: The number of combinations reflects how often players were able to choose the particular number. For 29,442 (18,758) combinations we have no birthdate (postal code) information. All frequencies are significantly higher than 13.33% at the 0.1% level.
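The comparisons with the 13.3% benchmark rest on z-tests. The text does not state the exact variant, so the following is only a sketch of one common form, the one-sample z-test for a proportion with a one-sided alternative. It ignores any dependence between choices by the same player, and the counts in the usage example are hypothetical rather than taken from the data.

```python
from math import erfc, sqrt


def one_sample_proportion_z(successes, trials, p0):
    """z statistic and upper-tail p-value for H0: true share equals p0."""
    p_hat = successes / trials
    standard_error = sqrt(p0 * (1.0 - p0) / trials)
    z = (p_hat - p0) / standard_error
    p_upper = 0.5 * erfc(z / sqrt(2.0))  # equals 1 - Phi(z) for a standard normal
    return z, p_upper


# Hypothetical counts: a number chosen in 150 of 1,000 eligible combinations.
z, p = one_sample_proportion_z(150, 1000, 6 / 45)
print(round(z, 2))  # 1.55
```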

Personally meaningful numbers may also be popular due to the mere fact that people are frequently exposed to them. Even a short exposure to a number can make that number more available in short-term memory and affect subsequent responses (Kubovy, 1977). In the context of the Lotto game, numbers that are especially available to players are the numbers in the current date, the numbers in the date of the upcoming draw, and the numbers prominently displayed on the website. In particular, when a player makes an online transaction, Lotto displays both the current jackpot size and the remaining time before the next scheduled draw.

For the jackpot size (expressed in millions of Euros), we consider the popularity of both the integer and the decimal number, where the latter could only take a value of zero or five during our sample period. The time until the draw is shown in days and hours (before the final 24 hours) or in hours and minutes (during the final 24 hours), and we examine whether a number is chosen more frequently when it appears on the screen as one of these elements. Selecting the numbers in the current date or draw date was always possible, as was selecting the integer of the jackpot size (range: 7–36). The decimal number, and the first and second element of the remaining time could be chosen in 46.8, 96.3, and 86.3% of the cases, respectively.

Table 1, Panel B shows the raw frequencies for these available numbers. All percentages significantly exceed 13.3 by approximately one or two percentage points (z-tests; all p < 0.001).

The raw percentages are, however, biased by a general preference for small numbers that may result from a preference for other (unobserved) meaningful or available numbers, or from other mechanisms. To control for differences in base rates and to also disentangle the effects of the different meaningful and available numbers we perform a logit regression. The dependent variable is the player’s decision to choose (1) or not choose (0) a given number. Hence, each chosen combination generates 45 observations, one for each number (1–45) that could be selected. As explanatory variables we use dummy variables that take the value of 1 for the number that corresponds to the personally meaningful or situationally available number (and 0 otherwise).Footnote 5 To allow for differences in base rates we include number fixed effects. We follow the common approach of reporting average marginal effects, and correct the standard errors for clustering at the player-number level and the combination level (Cameron, Gelbach & Miller, 2011; Thompson, 2011).
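The data layout described above, with 45 binary observations per chosen combination, can be sketched as follows. The function and column names are hypothetical, and only two explanatory dummies are shown. Fitting the logit model with number fixed effects and clustered standard errors would be left to a statistics package and is not shown here.

```python
def expand_combination(combination, special_numbers, n_max=45):
    """Turn one chosen combination into n_max rows, one per selectable number."""
    chosen = set(combination)
    rows = []
    for number in range(1, n_max + 1):
        # Dependent variable: 1 if the player picked this number, 0 otherwise.
        row = {"number": number, "chosen": int(number in chosen)}
        # One dummy per meaningful or available number, equal to 1 on the matching row.
        for name, value in special_numbers.items():
            row[name] = int(value == number)
        rows.append(row)
    return rows


# Hypothetical combination and player characteristics.
rows = expand_combination([3, 11, 15, 23, 30, 41], {"birth_day": 15, "birth_month": 3})
print(len(rows), sum(r["chosen"] for r in rows))  # 45 6
```

The `number` column is what the fixed effects would be built from, so that each dummy measures the extra popularity of a number beyond its general base rate.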

Table 2, Model 1 displays the average marginal effects (in percentage points). All personally meaningful numbers are significantly more likely to be selected. The marginal effect sizes of the day and year of birth are roughly equal: players are approximately 7 percentage points more likely to pick these numbers. The effects for month of birth and age are about half as strong. Postal code numbers are considerably less important, with marginal effect sizes of 0.30 and 0.24 percentage points for the first and last two digits, respectively. The effects of the current date, draw date, and jackpot size are also significant and comparable in size to those of the postal code. The second element of the remaining time has a small but significant effect, whereas the first element is insignificant.

Table 2: Logit regression results for the Lotto game

*** p < .001; ** p < .01; * p < .05.

Table 2, Model 2 shows the logit regression results when observations are weighted by the reciprocal of a player’s total number of combinations. When all chosen combinations are weighted equally, as we have done so far, the results may be more representative for frequent players than for the cross-section of players. After weighting, the effects of birthdate, age, current date, draw date, and jackpot numbers are stronger. This implies that infrequent players make more use of these personally meaningful and situationally available numbers, possibly because they use the random number generator less frequently.

Similar patterns emerge in the casino data. The last two digits of the year of birth can be selected by people born between 1900 and 1936. Players in this category entered 4.2% of the combinations. Age can only be chosen by people under 37. This condition is met for 33.1% of all entries. Table 3, Panel A shows how often players pick these personal numbers. All frequencies significantly exceed 2.70% (z-tests; all p < 0.001).Footnote 6 The results are especially pronounced for the day of birth; players select this number approximately three times as often as expected under random selection.

Table 3: Personally meaningful and situationally available numbers in the casino game

Notes: The number of observations reflects how often players were able to choose the particular number. All frequencies are significantly higher than 2.70% at the 0.1% level.

The situationally available numbers that we consider here are the day and month of play, and the numeric values that appear in a player’s voucher code. In 2013, the voucher code was composed of three sets of three symbols that could be either letters or numbers. We extract all numbers between 0 and 36 from each set. For example, from XVH-M51-36Z we extract 5, 1, 3, 6, and 36. On average there are 2.05 such numbers in a voucher code. In 2014, the voucher code was composed of letters alone. Table 3, Panel B shows that whenever players are able to pick a number from the date of play or from the voucher code, they do this significantly more often than 2.70% of the time (z-tests; all p < 0.001).
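The extraction of numbers from a 2013 voucher code can be sketched as below. The rule is inferred from the single example in the text (XVH-M51-36Z gives 5, 1, 3, 6, and 36): within each set of three symbols, every one- or two-digit string of adjacent digits is read as a number and kept if it does not exceed 36. The function name is hypothetical, and skipping two-digit strings with a leading zero is an assumption, since the text does not cover that case.

```python
import re


def voucher_numbers(code, high=36):
    """List the numbers between 0 and `high` found in each set of a voucher code."""
    found = []
    for block in code.split("-"):               # the three sets of three symbols
        for run in re.findall(r"\d+", block):   # maximal runs of adjacent digits
            for length in (1, 2):               # single digits first, then pairs
                for start in range(len(run) - length + 1):
                    piece = run[start:start + length]
                    if length > 1 and piece[0] == "0":
                        continue                # assumption: ignore forms like "05"
                    if int(piece) <= high:
                        found.append(int(piece))
    return found


print(voucher_numbers("XVH-M51-36Z"))  # [5, 1, 3, 6, 36]
```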

We perform regression analyses similar to those for the Lotto data. We correct standard errors for clustering at the player-number level and the level of individual predictions.Footnote 7

Table 4, Model 1 displays the average marginal effects (in percentage points). The effects of the personally meaningful and situationally available numbers are all significant. Players are 4.7 and 3.3 percentage points more likely to pick their day and year of birth, respectively. Month of birth and age are somewhat less important, with effect sizes of 1.5 and 1.2 percentage points. The average marginal effect for the numbers from the current date is 0.15 percentage points, corresponding to roughly 5.6% of the 2.70% probability under random selection. The numbers in the voucher codes also play a statistically significant role, but the effect size there is only 0.07 percentage points.

Table 4: Logit regression results for the casino game

*** p < .001.

Table 4, Model 2 shows the logit regression results when observations are weighted by the reciprocal of a player’s total number of entries. The effect sizes for birthdate numbers and age are stronger after weighting, suggesting that personally meaningful numbers are more popular among infrequent players. The effect sizes for current date and voucher code are hardly affected.

5 Spatial position

Players in the Lotto game select their numbers from a given 5 by 9 matrix (Figure 2). In the casino game the set of numbers is presented as on a roulette table, with the numbers 1 through 36 depicted in a 12 by 3 matrix and 0 on top (Figure 4). Multiple studies have shown that people have a tendency to select choice options presented in the middle of a display and avoid the edges. This behavior has been observed with laboratory and field data, for both individual choice and strategic interaction (Christenfeld, 1995; Rubinstein, Tversky & Heller, 1997; Shaw, Bergen, Brown & Gallagher, 2000; Attali & Bar-Hillel, 2003; Raghubir & Valenzuela, 2006; Chandon, Hutchinson, Bradlow & Young, 2009; Atalay, Bodur & Rasolofoarison, 2012; Valenzuela, Raghubir & Mitakakis, 2013; Bar-Hillel, 2015). Closely related to our analyses for lottery games, Bar-Hillel and Zultan (2012) examine the distribution of gamblers’ bets on a roulette table and observe that numbers in the center are more popular.

There are several ways to define the central part of the Lotto form. Figure 5 compares the raw frequencies for numbers in and out of the center for eight definitions. Under each definition, the difference is positive and statistically significant. In relative terms, numbers in the center are 5–13% more likely to be selected than numbers out of the center. The difference is largest if the center region is confined to the number 23 alone. This number in the exact center does not determine the effect in full, as positive and significant differences remain when we exclude it (z-tests; all p < 0.001).

Figure 5: Center effects in the Lotto game.

Notes: Center definitions are indicated with bold rectangles. Differences are expressed in percentage points. Results after excluding number 23 (highlighted in grey) are within parentheses.
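A center-versus-edge comparison of this kind can be sketched as follows. The layout is an assumption: five rows of nine numbers filled row by row, which is consistent with 23 being the exact center but is not confirmed by the text. The margin-based center definition and the function names are hypothetical and need not match the eight definitions in Figure 5, and the frequencies in the usage example are a uniform placeholder rather than the observed data.

```python
def lotto_grid(rows=5, cols=9):
    """Assumed Lotto form: numbers 1..45 filled row by row."""
    return [[r * cols + c + 1 for c in range(cols)] for r in range(rows)]


def center_region(grid, margin_rows, margin_cols):
    """Numbers left after trimming the given margins from every side."""
    inner_rows = grid[margin_rows:len(grid) - margin_rows]
    return {n for row in inner_rows for n in row[margin_cols:len(row) - margin_cols]}


def center_difference(freq, center):
    """Mean selection frequency inside the center minus the mean outside it."""
    inside = [f for n, f in freq.items() if n in center]
    outside = [f for n, f in freq.items() if n not in center]
    return sum(inside) / len(inside) - sum(outside) / len(outside)


grid = lotto_grid()
print(center_region(grid, 2, 4))  # {23}

# Placeholder frequencies: under uniform choice the difference is zero.
uniform = {n: 6 / 45 for n in range(1, 46)}
print(abs(center_difference(uniform, center_region(grid, 1, 1))) < 1e-12)  # True
```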

Figure 6 shows that the difference is also positive for all six possible definitions of the center region of the casino game (z-tests; all p < 0.001). In relative terms, numbers in the center are 22–40% more likely to be selected than numbers out of the center. As with Lotto, the center effect is strongest when the center is confined to the most centrally located number (17), but it is not solely driven by this single number.

Figure 6: Center effects in the casino game.

Notes: Center definitions are indicated with bold rectangles. Differences are expressed in percentage points. Results after excluding number 17 (highlighted in grey) are within parentheses.

Weighting observations by the reciprocal of a player’s total number of entries amplifies the center effects in the Lotto game (Figure S3 in the Supplement). In the casino game, however, the results hardly change (Figure S4 in the Supplement). A possible explanation for this difference is that frequent Lotto players are more likely to use the random number generator than infrequent Lotto players. In the casino game there is no such number generator available.Footnote 8

6 Recent draws

Various lottery studies find that players tend to avoid numbers that were recently drawn (Clotfelter & Cook, 1993; Terrell, 1994; Ding, 2011; Suetens & Tyran, 2012).Footnote 9 Suetens et al. (2015) document a similar response to the previous draw, but they also find that a number is popular if it appears in multiple recent draws.

The Lotto data comprise 175 draws. Figure 7A compares the average selection frequency of numbers that appeared in the previous draw with that of numbers that did not appear in the previous draw. This simple comparison shows that recent winning numbers are chosen less often than other numbers. Figure 7B displays the average selection frequency of a number conditional on whether it was drawn 0, 1, 2, 3, or 4 times in the preceding six draws. This figure suggests that numbers drawn only once over the past six draws are being avoided, while numbers drawn three or four times are relatively popular. The regression results in Table 2, Model 1 confirm these patterns. Note that the effect sizes are relatively small. This is not surprising because the numbers from previous draws are not readily available to players; players have to make a conscious effort to keep track of those numbers.

Figure 7: Recent draw effects in the Lotto game.

Notes: (A) displays the average selection frequency of a number that appeared in the previous draw and that of a number that did not appear in the previous draw. (B) displays the average selection frequency of a number conditional on whether it was drawn 0, 1, 2, 3, or 4 times in the preceding six draws. (C) and (D) display the results of similar analyses for the six jackpot colors.
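The conditioning variable behind panels (B) and (D) can be illustrated with a minimal Python sketch. This is not the authors' code: the function name `recent_draw_counts`, the toy draw history, and the treatment of the first few draws (which simply use however many earlier draws exist) are assumptions made for illustration.

```python
from collections import Counter

def recent_draw_counts(draws, window=6):
    """For each draw index t, count how often each number appeared in the
    `window` draws immediately preceding t (fewer at the start of the sample)."""
    counts_per_draw = []
    for t in range(len(draws)):
        preceding = draws[max(0, t - window):t]
        counts_per_draw.append(Counter(n for draw in preceding for n in draw))
    return counts_per_draw

# Hypothetical toy history of three draws (not the paper's data).
history = [{1, 2, 3}, {2, 3, 4}, {3, 5, 6}]
counts = recent_draw_counts(history, window=2)

# Before the third draw, number 3 had appeared twice and number 5 not at all.
print(counts[2][3], counts[2][5])  # 2 0
```

Selection frequencies can then be averaged within each count category (0, 1, 2, ...) to obtain a comparison of the kind shown in the figure.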

Weighting observations by the reciprocal of a player’s total number of combinations changes the effect of the past draw from negative to positive, and amplifies the effects of frequently drawn numbers (Table 2, Model 2). These changes suggest that frequent and infrequent players respond differently to prior draw results. To investigate this in more detail, we perform separate regressions for players who participated ten times or fewer throughout our sample period (Table 2, Model 3) and for players who participated a thousand times or more (Table 2, Model 4). The results show that infrequent players have a preference for “hot” numbers, whereas frequent players tend to avoid these.
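The reciprocal weighting can be sketched as follows. The example is hypothetical and deliberately simple: it computes weighted selection shares rather than the weighted logit regressions of Table 2, and the function name `weighted_number_shares` and the toy entries are assumptions, not part of the original analysis.

```python
from collections import Counter, defaultdict

def weighted_number_shares(entries):
    """entries: list of (player_id, combination) pairs.
    Each combination is weighted by 1 / (number of combinations of that player),
    so that every player contributes a total weight of one."""
    per_player = Counter(player for player, _ in entries)
    weight_sum = defaultdict(float)
    for player, combo in entries:
        weight = 1.0 / per_player[player]
        for number in combo:
            weight_sum[number] += weight
    total_weight = float(len(per_player))  # one unit of weight per player
    return {n: s / total_weight for n, s in weight_sum.items()}

# Hypothetical data: player "a" enters twice, player "b" once.
entries = [("a", (1, 2)), ("a", (1, 3)), ("b", (2, 3))]
shares = weighted_number_shares(entries)
print(shares[1], shares[2], shares[3])  # 0.5 0.75 0.75
```

Unweighted, each of the three numbers appears in two of the three combinations. With weighting, the choices of the single-entry player count as much as all choices of the two-entry player combined, which is why the weighted results put more emphasis on infrequent players.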

These results can be related to a large literature showing that people have difficulties understanding randomness. In their early work, Tversky and Kahneman (1971) speak of a “belief in the law of small numbers” to describe the misconception that a short sequence of events generated by a random process will have characteristics that closely resemble those of the data generating process (DGP). This false belief leads to the gambler’s fallacy when people know the DGP and to the hot-hand fallacy when people do not know it (Kahneman & Tversky, 1972; Tversky & Kahneman, 1974; Rabin, 2002). When people are asked to produce random sequences for a given DGP, they typically predict too many reversals (O’Neill, 1987; Rapoport & Budescu, 1992, 1997; Bar-Hillel & Wagenaar, 1991). When a random sequence is given for an unknown DGP, people tend to exaggerate the degree to which the DGP will resemble the given sequence of signals, leading to a belief in non-existent variation over time (Gilovich, Vallone & Tversky, 1985; Camerer, 1989; Tversky & Gilovich, 1989). The different behavior of frequent and infrequent Lotto players is in line with the different theoretical underpinnings of the two biases, assuming that frequent players are more familiar with the game and the underlying DGP than infrequent players.

Surprisingly, the results for the jackpot colors are different. Color choices are consistent with the gambler’s fallacy only. Figure 7C shows that the winning color in the previous draw is chosen less often than other colors. Figure 7D shows that the more frequently a color has been drawn in the last six draws, the less frequently players bet on that color.

7 Combinations

In the Lotto game there are 8,145,060 possible combinations of numbers that players can choose. Table 5 lists the thirty most frequently selected combinations, ranked by the number of players who selected them. If players were picking their 5,108,343 combinations at random, the likelihood of one or more combinations appearing more than ten times in our data would be 0.1%. The fact that many combinations appear hundreds of times can thus be seen as an extreme deviation from random choice.
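A rough check of this benchmark is possible with a Poisson approximation. The sketch below is an assumption-laden illustration, not the calculation used in the paper: it treats the number of times each combination is picked as an independent Poisson variable, and it infers the form size of 45 numbers from the stated count of 8,145,060 six-number combinations. The function name `prob_any_combo_exceeds` is hypothetical.

```python
from math import comb, exp, expm1, factorial

def prob_any_combo_exceeds(n_entries, n_combos, threshold=10, terms=40):
    """Poisson approximation to P(at least one combination is picked more than
    `threshold` times) when n_entries picks are uniform over n_combos."""
    lam = n_entries / n_combos  # expected number of picks per combination
    # Upper tail P(X > threshold) for X ~ Poisson(lam), summed term by term.
    tail = sum(exp(-lam) * lam ** k / factorial(k)
               for k in range(threshold + 1, threshold + 1 + terms))
    # Treating combinations as independent: P(none exceeds) ~ exp(-n_combos * tail).
    return -expm1(-n_combos * tail)

lotto_combos = comb(45, 6)  # assumption: six numbers out of 45
p_lotto = prob_any_combo_exceeds(5_108_343, lotto_combos)
print(lotto_combos, f"{p_lotto:.4%}")
```

Under these assumptions the resulting probability is a small fraction of one percent, so even a single combination observed more than ten times would be a rare event under random choice.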

Table 5: Thirty most popular combinations in the Lotto game

Many of the thirty most popular combinations form a numeric sequence or spatial pattern. The majority are composed of a vertical or diagonal line of five numbers, plus a sixth number that connects with one of the endpoints or is located at one of the corners of the form (Figure S5 in the Supplement). Overall, 0.9% of the combinations in our sample can be classified as a diagonal or vertical pattern, which is a significantly greater portion than the 0.009% expected under randomness.

In the casino game, players can choose a number more than once, and the order of the chosen numbers matters. The total number of unique combinations thus equals 37^4 = 1,874,161. Table 6 shows the thirty most popular ones, ranked by the total number of times they appear in the data. If our total of 568,987 combinations were picked completely at random, the likelihood of one or more combinations occurring more than ten times would be virtually zero. In sharp contrast, we observe that many combinations appear hundreds of times. Again, most of the popular combinations form a numeric sequence or spatial pattern. The exceptions in the top thirty represent neighboring numbers on the roulette wheel. Note that the numbers in all thirty combinations are in ascending order. This turns out to reflect a general phenomenon: 33.6% (33.1%) of all combinations are entered in ascending (strictly ascending) order, while only 4.88% (3.52%) would be expected to have that property under randomness.
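The random-choice benchmarks for ascending order follow from a short counting argument, sketched here in Python. The sketch assumes that "ascending" allows ties (a non-decreasing sequence) and that "strictly ascending" does not; the variable names are illustrative.

```python
from math import comb

N_NUMBERS, N_PICKS = 37, 4

# Ordered picks with repetition allowed.
total = N_NUMBERS ** N_PICKS
# Non-decreasing sequences correspond one-to-one to multisets of size 4.
weakly_ascending = comb(N_NUMBERS + N_PICKS - 1, N_PICKS)
# Strictly increasing sequences correspond one-to-one to subsets of size 4.
strictly_ascending = comb(N_NUMBERS, N_PICKS)

share_weak = weakly_ascending / total
share_strict = strictly_ascending / total
print(f"{share_weak:.2%} ({share_strict:.2%})")  # 4.88% (3.52%)
```

Each multiset or subset of four numbers can be written in ascending order in exactly one way, which is why counting them gives the number of ascending sequences.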

Table 6: Thirty most popular combinations in the casino game

Henze (1997) similarly reports that many of the most popular Lotto combinations represent a numeric sequence. In line with the many occurrences of spatial patterns that we observe, Falk, Falk and Ayton (2009) find that aesthetics play an important role in the choices of laboratory subjects.

8 Spacing

Boland and Pawitan (1999) find that the students in their classroom experiment tended to spread out their selections when asked to randomly generate a Lotto draw. Lien and Yuan (2015) find similar results in data from a Chinese six-number lottery. These results may reflect a form of representativeness bias (Tversky & Kahneman, 1971): if people believe that six draws from a uniform distribution should closely resemble the uniform distribution, they will expect the six numbers to be evenly spread across the possible range and deem clusters unlikely.

To investigate the degree to which Lotto players spread their numbers across the possible range, we compute the five spaces between the six (ordered) numbers for each combination. Next, we compare the empirical distribution of these spaces with the distribution that can be expected under random number choice (Footnote 10). If people indeed have a tendency to evenly spread their numbers, small and large spaces will be underrepresented.
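The spacing computation and a uniform-choice benchmark can be sketched as below. This is a hedged illustration rather than the authors' procedure: the function names are hypothetical, the form size of 45 is an assumption, and the benchmark uses the standard result that the spaces of a uniformly random k-subset of {1, ..., n} are exchangeable, so that each space equals d with probability C(n - d, k - 1) / C(n, k).

```python
from math import comb

def spaces(combination):
    """Differences between consecutive numbers after sorting in ascending order."""
    ordered = sorted(combination)
    return [b - a for a, b in zip(ordered, ordered[1:])]

def theoretical_space_pmf(n_numbers=45, n_picks=6):
    """P(space = d) for a uniformly random n_picks-subset of 1..n_numbers."""
    total = comb(n_numbers, n_picks)
    max_space = n_numbers - n_picks + 1
    return {d: comb(n_numbers - d, n_picks - 1) / total
            for d in range(1, max_space + 1)}

print(spaces((3, 17, 8, 44, 21, 30)))  # [5, 9, 4, 9, 14]

pmf = theoretical_space_pmf()
# Probability mass decreases in d, so the smallest space is the most likely one.
print(max(pmf, key=pmf.get))  # 1
```

Comparing the empirical frequency of each space with `pmf[d]` gives differences of the kind described in the text; because the benchmark is decreasing in d, an overrepresentation of medium-sized spaces has to come at the expense of small spaces, large spaces, or both.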

The bars in Figure 8A reflect the absolute differences between the empirical and theoretical frequencies. In line with a tendency to spread numbers evenly, we observe more medium-sized spaces and fewer small and large spaces than expected by chance. Figure 8B displays the differences as a percentage of the theoretical frequencies (with the vertical axis truncated at 70%). These relative differences follow a similar pattern but are more pronounced for larger spaces due to their smaller theoretical likelihood. Extremely large spaces are highly unlikely in theory, but relatively popular among the players in our sample.

Figure 8: Difference between the empirical and theoretical spacing distribution in the Lotto game.

Henze’s (1997) analyses of the most popular combinations in a German number lottery also point out that spacing patterns are not in accordance with randomness, but he cites this as evidence for the popularity of numeric sequences. Indeed, the abnormal spacing patterns that we find in our data could result from a preference for specific numeric sequences or spatial patterns. To rule out that the patterns are caused by specific, popular combinations, we redo the analysis after excluding combinations that occur more than once in our data. The lines in Figure 8 reflect the absolute and relative differences between the empirical and theoretical distribution for the unique combinations only. Albeit somewhat weaker, the resulting patterns have a similar shape.

In the casino game, the three distances between the four numbers can be positive, negative, and zero. Because of the tendency to pick numbers in ascending order, positive spaces are strongly overrepresented (Figure S8 in the Supplement). To analyze spacing effects in isolation from ordering effects, we therefore measure the three distances in each combination after sorting the numbers in ascending order (Footnote 11).

Figure 9 shows the absolute and relative differences between the empirical and theoretical frequencies after sorting. In line with a tendency to spread numbers evenly, and similar to what we found for Lotto, medium-sized spaces are overrepresented. Similar patterns emerge when we reduce the samples to unique combinations only, indicating that the abnormal spacing patterns do not result from specific, popular combinations alone.

Figure 9: Difference between the empirical and theoretical spacing distribution in the casino game.

Weighting observations by the reciprocal of a player’s total number of entries amplifies the spacing effects in the Lotto game (Figure S10 in the Supplement), but leaves the casino results virtually unaffected (Figure S11 in the Supplement). This again suggests that frequent Lotto players are more likely to use the random number generator than occasional players.

9 Summary and concluding remarks

We have documented a variety of empirical patterns in number choices in lottery games, using data sets that together comprise a total of approximately 33 million selected numbers. The patterns in the two different lottery games are qualitatively very similar. In a quantitative sense the effects are somewhat more pronounced in the casino game than in the Lotto game. This difference can probably be ascribed to the availability of default, computer-generated sets of numbers in the Lotto game, as there is strong evidence that people tend to stick with defaults (Camerer et al., 2003).

In line with earlier findings in the literature, the number 7 is highly popular in both games. Other numbers that consistently rank among the favorites include 3, 5, 8, and 11. More generally, numbers from the lower end of the possible ranges are more popular than numbers from the higher end. Also, in both games players prefer odd numbers over even numbers, prime numbers over non-prime numbers, and non-round numbers over round numbers.

Reinforcing earlier findings in different contexts, players are attracted towards numbers in the center of the choice form. Within each game, the relative location of the numbers on the entry screen is fixed, but between the two games the ordering is different. Regardless of the exact definitions of the center, numbers in the middle are more popular than numbers on the edges.

Using the data we have about individual players’ birthdates and postal codes, we find that players like to pick numbers that have a special meaning to them. Similarly, our analyses with data on dates of play, dates of draw, numbers on entry screens, and numbers in entry codes confirm that players more frequently choose numbers that are situationally available.

Our analyses of the combinations of numbers yield evidence that players care about aesthetics. Combinations that form a numeric sequence or spatial pattern are extremely popular, despite the fact that the parimutuel aspect of both lottery games creates an incentive to strategically attempt to select unique combinations. This suggests that many players do not see or understand the strategic aspect, or that the joy of playing with aesthetically pleasing combinations more than offsets the negative effect on expected payoff (Goodman & Irwin, 2006).

Last, we find that frequent players avoid numbers that appeared in the latest draws, that infrequent players chase these numbers, and that both spread their numbers relatively evenly across the possible range. These results may reflect that players misjudge the likelihood of winning with these numbers or combinations, and fit into a large body of literature that shows that people have difficulties understanding randomness. Moreover, the different responses of frequent and infrequent players to prior draw results accord with a literature arguing that knowing the data generating process leads to a gambler’s-fallacy type of behavior and not knowing it leads to a hot-hand type of behavior.

Footnotes

*

We thank Stichting de Nationale Sporttotalisator and Holland Casino for providing the data used in this paper. In accordance with the Dutch Personal Data Protection Act, the data was provided under non-disclosure agreements, in anonymous form, and for scientific purposes only. We thank Maya Bar-Hillel and the two anonymous reviewers for their constructive comments. The paper also benefited from discussions with seminar participants at the Erasmus University of Rotterdam and Carnegie Mellon University, and with participants of the Risk, Uncertainty and Ambiguity Workshop 2014 Ein Bokek, FUR 2014 Rotterdam, SPUDM 2015 Budapest, TIBER 2015 Tilburg, and the Rotterdam-Tilburg JDM Camp 2015 Tilburg. We gratefully acknowledge support from the Tinbergen Institute and from the Economic and Social Research Council via the Network for Integrated Behavioural Sciences (ES/K002201/1).

1 For 462 (503) players we do not have birthdate (postal code) information. These players selected an aggregate of 29,442 (18,758) combinations and are excluded from the relevant analyses.

2 Among roulette players in a casino the number 7 is somewhat less popular, most likely because it is relatively difficult to reach due to the position of the wheel and the croupier (Sundali & Croson, 2006; Bar-Hillel & Zultan, 2012).

4 There is evidence that Dutch subjects most frequently cite red when asked to spontaneously produce a color; when asked to produce their favorite color, however, they show a preference for blue (Wiegersma & de Klerck, 1984; Wiegersma & van der Elst, 1988).

5 Because the month and year numbers in the current date are highly correlated with those in the draw date (ρ=0.92 and ρ=0.99, respectively), we only include the former.

6 As players in the casino game predict the outcomes of four independent, consecutive roulette spins, they can choose the same number more than once. We therefore look at the likelihood that a number is chosen for a particular roulette spin (and not, as we did with the Lotto game, at the likelihood that a number is included in a combination).

7 For 2013, we are missing the information to discriminate between unique players, and use a surrogate player identifier constructed on the basis of gender and birthdate information. This solution underestimates the true number of clusters, as there are only 43,096 unique gender-birthday combinations in the 2013 data (compared to 112,473 players in 2014). Assuming that each combination was entered by a unique player leads to similar results.

8 Note that we cannot include the center effects in our regression models. Because the locations of the numbers on the form are fixed (every player faces the exact same form), it is not possible to disentangle center effects and number fixed effects.

9 At the same time, there is evidence of a “lucky store” effect, where retail stores sell more tickets after selling a large prize winning ticket (Guryan & Kearney, 2008; Lien et al., 2015).

10 Figure S6 in the Supplement shows this theoretical distribution. Deriving the theoretical distribution on the basis of the actual individual number frequencies (instead of the uniform distribution) leads to similar results.

11 Figures S7 and S9 in the Supplement show the spacing distributions that can be expected under random number choice without (Figure S7) and with sorting in ascending order (Figure S9). Deriving the theoretical distributions on the basis of the actual individual number frequencies leads to similarly shaped benchmarks and similar abnormal spacing patterns.

References

Agarwal, S., He, J., Liu, H., Png, I. P. L., Sing, T. F., & Wong, W. (2014). Superstition and asset markets: Evidence from Singapore housing. Available at SSRN: ssrn.com/abstract=2416832.
Atalay, A. S., Bodur, H. O., & Rasolofoarison, D. (2012). Shining in the center: Central gaze cascade effect on product choice. Journal of Consumer Research, 39(4), 848–866.
Attali, Y., & Bar-Hillel, M. (2003). Guess where: The position of correct answers in multiple-choice test items as a psychometric variable. Journal of Educational Measurement, 40(2), 109–128.
Bar-Hillel, M. (2015). Position effects in choice from simultaneous displays: A conundrum solved. Perspectives on Psychological Science, 10(4), 419–433.
Bar-Hillel, M., & Wagenaar, W. A. (1991). The perception of randomness. Advances in Applied Mathematics, 12(4), 428–454.
Bar-Hillel, M., & Zultan, R. (2012). We sing the praise of good displays: How gamblers bet in casino roulette. Chance, 25(2), 27–30.
Beckert, J., & Lutter, M. (2013). Why the poor play the lottery: Sociological approaches to explaining class-based lottery play. Sociology, 47(6), 1152–1170.
Beggan, J. K. (1992). On the social nature of nonsocial perception: The mere ownership effect. Journal of Personality and Social Psychology, 62(2), 229–237.
Bhattacharya, U., Kuo, W., Lin, T., & Zhao, J. (2016). Do superstitious traders lose money? Available at SSRN: ssrn.com/abstract=2478124.
Boland, P. J., & Pawitan, Y. (1999). Trying to be random in selecting numbers for Lotto. Journal of Statistics Education, 7(3).
Bopp, M., & Faeh, D. (2008). End-digits preference for self-reported height depends on language. BMC Public Health, 8(342).
Bourassa, S. C., & Peng, V. S. (1999). Hedonic prices and house numbers: The influence of feng shui. International Real Estate Review, 2(1), 79–93.
Brown, P., Chua, A., & Mitchell, J. (2002). The influence of cultural factors on price clustering: Evidence from Asia–Pacific stock markets. Pacific–Basin Finance Journal, 10(3), 307–332.
Brown, P., & Mitchell, J. (2008). Culture and stock price clustering: Evidence from the Peoples’ Republic of China. Pacific–Basin Finance Journal, 16(1), 95–120.
Cai, B. M., Cai, C. X., & Keasey, K. (2007). Influence of cultural factors on price clustering and price resistance in China’s stock markets. Accounting and Finance, 47(4), 623–641.
Camerer, C. F. (1989). Does the basketball market believe in the “hot hand”? American Economic Review, 79(5), 1257–1261.
Camerer, C. F., Issacharoff, S., Loewenstein, G., O’Donoghue, T., & Rabin, M. (2003). Regulation for conservatives: Behavioral economics and the case for “asymmetric paternalism”. University of Pennsylvania Law Review, 151(3), 1211–1254.
Cameron, A. C., Gelbach, J. B., & Miller, D. L. (2011). Robust inference with multiway clustering. Journal of Business & Economic Statistics, 29(2), 238–249.
Chandon, P., Hutchinson, J. W., Bradlow, E. T., & Young, S. H. (2009). Does in-store marketing work? Effects of the number and position of shelf facings on brand attention and evaluation at the point of purchase. Journal of Marketing, 73(6), 1–17.
Chau, K., Ma, V. S. M., & Ho, D. C. W. (2001). The pricing of “luckiness” in the apartment market. Journal of Real Estate Literature, 9(1), 29–40.
Chernoff, H. (1981). How to beat the Massachusetts numbers game. Mathematical Intelligencer, 3(4), 166–172.
Chong, T. T., & Du, X. (2008). Hedonic pricing models for vehicle registration marks. Pacific Economic Review, 13(2), 259–276.
Christenfeld, N. (1995). Choices from identical options. Psychological Science, 6(1), 50–55.
Clotfelter, C. T., & Cook, P. J. (1993). The gambler’s fallacy in lottery play. Management Science, 39(12), 1521–1525.
Cook, P. J., & Clotfelter, C. T. (1993). The peculiar scale economies of lotto. American Economic Review, 83(3), 634–643.
Cox, S. J., Daniell, G. J., & Nicole, D. A. (1998). Using maximum entropy to double one’s expected winnings in the UK national lottery. Journal of the Royal Statistical Society: Series D (the Statistician), 47(4), 629–641.
De Lotto. (2014). De Lotto jaarverslag (annual report) 2014. Retrieved from: www.delotto.nl/files/De%20Lotto/DeLottoJaarverslag2014-webversie.pdf.
D’Hondt, W., & Vandewiele, M. (1983). Colors and figures in Senegal. Perceptual and Motor Skills, 56(3), 971–978.
Ding, J. (2011). What numbers to choose for my lottery ticket? Behavior anomalies in the Chinese online lottery market. Available at SSRN: ssrn.com/abstract=1926526.
Falk, R., Falk, R., & Ayton, P. (2009). Subjective patterns of randomness and choice: Some consequences of collective responses. Journal of Experimental Psychology: Human Perception and Performance, 35(1), 203–224.
Farrell, L., Hartley, R., Lanot, G., & Walker, I. (2000). The demand for lotto: The role of conscious selection. Journal of Business & Economic Statistics, 18(2), 228–241.
Finkelstein, M. (1995). Estimating the frequency distribution of the numbers bet on the California lottery. Applied Mathematics and Computation, 69(2), 195–207.
Fortin, N. M., Hill, A. J., & Huang, J. (2014). Superstition in the housing market. Economic Inquiry, 52(3), 974–993.
Gilovich, T., Vallone, R., & Tversky, A. (1985). The hot hand in basketball: On the misperception of random sequences. Cognitive Psychology, 17(3), 295–314.
Goodman, J. K., & Irwin, J. R. (2006). Special random numbers: Beyond the illusion of control. Organizational Behavior and Human Decision Processes, 99(2), 161–174.
Greenwald, A. G., & Banaji, M. R. (1995). Implicit social cognition: Attitudes, self-esteem, and stereotypes. Psychological Review, 102(1), 4–27.
Guryan, J., & Kearney, M. S. (2008). Gambling at lucky stores: Empirical evidence from state lottery sales. American Economic Review, 98(1), 458–473.
Haigh, J. (1997). The statistics of the national lottery. Journal of the Royal Statistical Society: Series A (Statistics in Society), 160(2), 187–206.
Halpern, A. R., & Devereaux, S. D. (1989). Lucky numbers: Choice strategies in the Pennsylvania daily number game. Bulletin of the Psychonomic Society, 27(2), 167–170.
He, Y., & Wu, C. (2006). Is stock price rounded for economic reasons in the Chinese markets? Global Finance Journal, 17(1), 119–135.
Henze, N. (1997). A statistical and probabilistic analysis of popular lottery tickets. Statistica Neerlandica, 51(2), 155–163.
Heywood, S. (1972). The popular number seven or number preference. Perceptual and Motor Skills, 34(2), 357–358.
Hirshleifer, D. A., Jian, M., & Zhang, H. (2014). Superstition and financial decision making. Available at SSRN: ssrn.com/abstract=1460522.
Joe, H. (1987). An ordering of dependence for distribution of k-tuples, with applications to lotto games. Canadian Journal of Statistics, 15(3), 227–238.
Jones, J. T., Pelham, B. W., Mirenberg, M. C., & Hetts, J. J. (2002). Name letter preferences are not merely mere exposure: Implicit egotism as self-regulation. Journal of Experimental Social Psychology, 38(2), 170–177.
Kahneman, D., & Tversky, A. (1972). Subjective probability: A judgment of representativeness. Cognitive Psychology, 3(3), 430–454.
Kearney, M. S., Tufano, P., Guryan, J., & Hurst, E. (2011). Making savers winners: An overview of prize-linked saving products. In O. S. Mitchell, & A. Lusardi (Eds.), Financial literacy: Implications for retirement security and the financial marketplace (pp. 218–240). Oxford University Press.
Kitayama, S., & Karasawa, M. (1997). Implicit self-esteem in Japan: Name letters and birthday numbers. Personality and Social Psychology Bulletin, 23(7), 736–742.
Klesges, R. C., Debon, M., & Ray, J. W. (1995). Are self-reports of smoking rate biased? Evidence from the Second National Health and Nutrition Examination Survey. Journal of Clinical Epidemiology, 48(10), 1225–1233.
Kubovy, M. (1977). Response availability and the apparent spontaneity of numerical choices. Journal of Experimental Psychology: Human Perception and Performance, 3(2), 359–364.
Kubovy, M., & Psotka, J. (1976). The predominance of seven and the apparent spontaneity of numerical choices. Journal of Experimental Psychology: Human Perception and Performance, 2(2), 291–294.
Kuloğlu, M., Atmaca, M., Tezcan, A. E., Unal, A., & Gecici, O. (2002). Color and number preferences of patients with psychiatric disorders in eastern Turkey. Perceptual and Motor Skills, 94(1), 207–213.
Lien, J. W., & Yuan, J. (2015). The cross-sectional “gambler’s fallacy”: Set representativeness in lottery number choices. Journal of Economic Behavior & Organization, 109, 163–172.
Lien, J. W., Yuan, J., & Zheng, J. (2015). Representativeness biases and lucky store effects. Available at SSRN: ssrn.com/abstract=2635427.
Milikowski, M. (1995). Knowledge of numbers (Unpublished doctoral thesis). University of Amsterdam.
Morewedge, C. K., Shu, L. L., Gilbert, D. T., & Wilson, T. D. (2009). Bad riddance or good rubbish? Ownership and not loss aversion causes the endowment effect. Journal of Experimental Social Psychology, 45(4), 947–951.
Ng, T., Chong, T. T., & Du, X. (2010). The value of superstitions. Journal of Economic Psychology, 31(3), 293–309.
Nuttin, J. M. (1985). Narcissism beyond gestalt and awareness: The name letter effect. European Journal of Social Psychology, 15(3), 353–361.
Nuttin, J. M. (1987). Affective consequences of mere ownership: The name letter effect in twelve European languages. European Journal of Social Psychology, 17(4), 381–402.
O’Neill, B. (1987). Nonmetric test of the minimax theory of two-person zerosum games. Proceedings of the National Academy of Sciences of the United States of America, 84(7), 2106–2109.
Oyeleke, O. B., & Otekunrin, O. A. (2014). On the performance of lottery winning strategies: A case study of Oyo State Lottery, Nigeria. British Journal of Mathematics & Computer Science, 4(17), 2557–2569.
Papachristou, G., & Karamanis, D. (1998). Investigating efficiency in betting markets: Evidence from the Greek 6/49 Lotto. Journal of Banking & Finance, 22(12), 1597–1615.
Pelham, B. W., Carvallo, M., & Jones, J. T. (2005). Implicit egotism. Current Directions in Psychological Science, 14(2), 106–110.
Philbrick, J. L. (1976). Blue seven in East Africa: Preliminary report. Perceptual and Motor Skills, 42(2), 484.
Plug, C. (1977). Number preferences in ratio estimation and constant-sum scaling. American Journal of Psychology, 90(4), 699–704.
Pope, D., & Simonsohn, U. (2011). Round numbers as goals: Evidence from baseball, SAT takers, and the lab. Psychological Science, 22(1), 71–79.
Rabin, M. (2002). Inference by believers in the law of small numbers. Quarterly Journal of Economics, 117(3), 775–816.
Raghubir, P., & Valenzuela, A. (2006). Center-of-inattention: Position biases in decision-making. Organizational Behavior and Human Decision Processes, 99(1), 66–80.
Rapoport, A., & Budescu, D. V. (1992). Generation of random series in two-person strictly competitive games. Journal of Experimental Psychology: General, 121(3), 352–363.
Rapoport, A., & Budescu, D. V. (1997). Randomization in individual choice behavior. Psychological Review, 104(3), 603–617.
Roger, P., & Broihanne, M. (2007). Efficiency of betting markets and rationality of players: Evidence from the French 6/49 Lotto. Journal of Applied Statistics, 34(6), 645–662.
Rubinstein, A., Tversky, A., & Heller, D. (1997). Naive strategies in competitive games. In W. Albers, W. Güth, P. Hammerstein, B. Moldovanu & E. van Damme (Eds.), Understanding strategic interaction: Essays in honor of Reinhard Selten (pp. 394–402). Berlin: Springer-Verlag.
Saito, M. (1999). “Blue and seven phenomena” among Japanese students. Perceptual and Motor Skills, 89(2), 532536.CrossRefGoogle ScholarPubMed
Scoggins, J. F. (1995). The lotto and expected net revenue. National Tax Journal, 48(1), 6170.CrossRefGoogle Scholar
Shaw, J. I., Bergen, J. E., Brown, C. A., & Gallagher, M. E. (2000). Centrality preferences in choices among similar options. Journal of General Psychology, 127(2), 157164.CrossRefGoogle ScholarPubMed
Shum, M., Sun, W., & Ye, G. (2014). Superstition and “lucky” apartments: Evidence from transaction-level data. Journal of Comparative Economics, 42(1), 109117.CrossRefGoogle Scholar
Silver, N. C., McCulley, W. L., Chambliss, L. N., Charles, C. M., Smith, A. A., Waddell, W. M., & Winfield, E. B. (1988). Sex and racial differences in color and number preferences. Perceptual and Motor Skills, 66(1), 295299.CrossRefGoogle Scholar
Simon, J. (1999). An analysis of the distribution of combinations chosen by UK national lottery players. Journal of Risk and Uncertainty, 17(3), 243276.CrossRefGoogle Scholar
Simon, W. E. (1971). Number and color responses of some college students: Preliminary evidence for a “blue seven phenomenon”. Perceptual and Motor Skills, 33(2), 373374.CrossRefGoogle Scholar
Simon, W. E., & Primavera, L. H. (1972). Investigation of the “blue seven phenomenon” in elementary and junior high school children. Psychological Reports, 31(1), 128130.CrossRefGoogle Scholar
Stern, H., & Cover, T. M. (1989). Maximum entropy and the lottery. Journal of the American Statistical Association, 84(408), 980985.CrossRefGoogle Scholar
Suetens, S., Galbo-Jørgensen, C. B., & Tyran, J. K. (2015). Predicting lotto numbers: A natural experiment on the gambler’s fallacy and the hot hand fallacy. Journal of the European Economic Association. doi: 10.1111/jeea.12147.Google Scholar
Suetens, S., & Tyran, J. K. (2012). The gambler’s fallacy and gender. Journal of Economic Behavior & Organization, 83(1), 118124.CrossRefGoogle Scholar
Sundali, J. & Croson, R. (2006). Biases in casino betting: The hot hand and the gambler’s fallacy. Judgment and Decision Making, 1(1): 112.CrossRefGoogle Scholar
Teigen, K. H. (1983). Studies in subjective probability l: Prediction of random events. Scandinavian Journal of Psychology, 24(1), 1325.CrossRefGoogle Scholar
Terrell, D. (1994). A test of the gambler’s fallacy: Evidence from pari-mutuel games. Journal of Risk and Uncertainty, 8(3), 309317.CrossRefGoogle Scholar
Thompson, S. B. (2011). Simple formulas for standard errors that cluster by both firm and time. Journal of Financial Economics, 99(1), 110.CrossRefGoogle Scholar
Trueman, J. (1979). Existence and robustness of the blue and seven phenomena. Journal of General Psychology, 101(1), 2326.CrossRefGoogle Scholar
Tversky, A., & Gilovich, T. (1989). The “hot hand”: Statistical reality or cognitive illusion? Chance, 2(4), 31–34.
Tversky, A., & Kahneman, D. (1971). Belief in the law of small numbers. Psychological Bulletin, 76(2), 105–110.
Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: Heuristics and biases. Science, 185(4157), 1124–1131.
Valenzuela, A., Raghubir, P., & Mitakakis, C. (2013). Shelf space schemas: Myth or reality? Journal of Business Research, 66(7), 881–888.
Vandewiele, M., D’Hondt, W., Didillon, H., Iwawaki, S., & Mwamwenda, T. (1986). Number and color preferences in four countries. Perceptual and Motor Skills, 63(2), 945–946.
Wiegersma, S., & de Klerck, I. (1984). The “blue phenomenon” is red in the Netherlands. Perceptual and Motor Skills, 59(3), 790.
Wiegersma, S., & van der Elst, G. (1988). “Blue phenomenon”: Spontaneity or preference? Perceptual and Motor Skills, 66(1), 308–310.
Woo, C., Horowitz, I., Luk, S., & Lai, A. (2008). Willingness to pay and nuanced cultural cues: Evidence from Hong Kong’s license-plate auction market. Journal of Economic Psychology, 29(1), 35–53.
Woo, C., & Kwok, R. H. F. (1994). Vanity, superstition and auction price. Economics Letters, 44(4), 389–395.
Figure 1: Number frequencies in the Lotto game.

Figure 2: Heat map for the Lotto game.

Figure 3: Number frequencies in the casino game.

Figure 4: Heat map for the casino game.

Table 1: Personally meaningful and situationally available numbers in the Lotto game

Table 2: Logit regression results for the Lotto game

Table 3: Personally meaningful and situationally available numbers in the casino game

Table 4: Logit regression results for the casino game

Figure 5: Center effects in the Lotto game.

Figure 6: Center effects in the casino game.

Figure 7: Recent draw effects in the Lotto game.

Table 5: Thirty most popular combinations in the Lotto game

Table 6: Thirty most popular combinations in the casino game

Figure 8: Difference between the empirical and theoretical spacing distribution in the Lotto game.

Figure 9: Difference between the empirical and theoretical spacing distribution in the casino game.

Supplementary material: File

Wang et al. supplementary material
