Introduction
Behavioural science interventions for modifying human behaviour, in the following called (behaviour) interventions, have been put forward to mitigate large-scale environmental problems, such as climate change or the ongoing loss of biodiversity (Amel et al., Reference Amel, Manning, Scott and Koger2017; Reddy et al., Reference Reddy2017; Battista et al., Reference Battista, Romero-Canyas, Smith, Fraire, Effron, Larson-Konar and Fujita2018; Klotz et al., Reference Klotz2018; Travers et al., Reference Travers2021). These interventions typically rely on human decision heuristics or social influence (Byerly et al., Reference Byerly, Balmford, Ferraro, Hammond Wagner, Palchak, Polasky, Ricketts, Schwartz and Fisher2018; Cinner, Reference Cinner2018). For example, default nudging exploits the heuristic of simply going with the default option instead of pondering which option is best (Thaler & Sunstein, Reference Thaler and Sunstein2008). As an application, providers have set sustainably produced electricity instead of conventionally produced electricity as a default (Liebe et al., Reference Liebe, Gewinner and Diekmann2021). Interventions based on social influence typically stress behaviours that are socially expected or chosen by a majority. For example, to encourage the re-usage of towels in hotels, guests have been made aware that a majority is engaging in this behaviour (Goldstein et al., Reference Goldstein, Cialdini and Griskevicius2008).
A number of quantitative meta-analyses show that sometimes these kinds of interventions can have a tremendous impact. Yet, the typical effect size is moderate at best (Osbaldiston & Schott, Reference Osbaldiston and Schott2012; Abrahamse & Steg, Reference Abrahamse and Steg2013; Lokhorst et al., Reference Lokhorst2013; Bergquist et al., Reference Bergquist, Nilsson and Schultz2019; Nisa et al., Reference Nisa2019), with a tendency for methodologically more rigorous studies to report smaller effects (Delmas et al., Reference Delmas, Fischlein and Asensio2013; Nisa et al., Reference Nisa2019).
Here, we discuss a number of reasons why interventions have mixed effects, which will, in turn, point toward specific strategies for improvement. Our starting point is the observation that most interventions target individuals to trigger behaviour change (Bicchieri & Dimant, Reference Bicchieri and Dimant2019). Models of cultural evolution suggest targeting groups instead because a strategy of this kind can reinforce the impact of a given intervention (Waring et al., Reference Waring2015; Waring et al., Reference Waring, Goff and Smaldino2017; Brooks et al., Reference Brooks, Waring, Borgerhoff and Richerson2018).
Pioneering theoretical contributions have provided fascinating insights into the mechanisms that translate behaviour change of a few into large-scale societal change. These contributions centre on the notion of ‘social tipping’. Tipping is the phenomenon that a small subgroup of a population, adopting a new behaviour, can trigger a self-reinforcing diffusion process that finally establishes this behaviour as a social norm – provided the size of the corresponding subgroup crosses a critical value called the ‘tipping point’ (Milkoreit et al., Reference Milkoreit, Hodbod, Baggio, Benessaiah, Calderón-Contreras, Donges, Mathias, Rocha, Schoon and Werners2018; Nyborg, Reference Nyborg2020; Scheffer, Reference Scheffer2020). Here, we define a social norm in a descriptive sense, as a behavioural pattern that individuals prefer to conform to when they believe that a sufficient number of others is also conforming to this pattern (Bicchieri, Reference Bicchieri2005).
Recent research empirically corroborates the power of tipping. Subgroups of 25–35% of the population can establish a new behaviour as a social norm in the laboratory (Centola et al., Reference Centola, Becker, Brackbill and Baronchelli2018; Andreoni et al., Reference Andreoni, Nikiforakis and Siegenthaler2021). Concerning social norms regulating behaviours with environmental impact (in the following called pro-environmental norms and pro-environmental behaviours), simulation models come to similar results. Behavior change of a minority can establish, but also erode, pro-environmental norms (Walker & Meyers, Reference Walker and Meyers2004; Castilla-Rho et al., Reference Castilla-Rho, Rojas, Andersen, Holley and Mariethoz2017; Sigdel et al., Reference Sigdel, Anand and Bauch2017; Mathias et al., Reference Mathias, Anderies, Baggio, Hodbod, Huet, Janssen, Milkoreit and Schoon2020) – a theoretical result in line with the findings of a recent field study (Berger, Reference Berger2021).
Tipping has been applied in attempts to resolve various social issues (Mackie & Lejeune, Reference Mackie and Lejeune2009; World Health Organization, 2009; Shell-Duncan et al., Reference Shell-Duncan2011; Dolan et al., Reference Dolan, Hallsworth, Halpern, King, Metcalfe and Vlaev2012; Lee-Rife et al., Reference Lee-Rife2012; World Bank Group, 2015; Nyborg et al., Reference Nyborg2016; Bicchieri, Reference Bicchieri2017; 34.95 paper’, O'Brien, Reference O'Brien2017; Platteau & Auriol, Reference Platteau, Auriol, Anderson, Beaman and Plateau2018). The idea of recruiting the endogenous dynamics of norm diffusion with an exogenous intervention, delimited in size and time, is tempting and has recently been put forward as a tool for environmental policy (Westley et al., Reference Westley2011; Nyborg, Reference Nyborg2018, Reference Nyborg2020; Otto et al., Reference Otto2020). Yet, there is no foolproof way of activating tipping, and attempts to do so have produced a variety of results (Bicchieri, Reference Bicchieri2017; Efferson et al., Reference Efferson, Vogt and Fehr2020).
Tipping dynamics have so far been described with coordination game models (Schelling, Reference Schelling1973; Bicchieri, Reference Bicchieri2005; Nyborg, Reference Nyborg2020). We first outline how these models apply to large-scale pro-environmental behaviour change. Then, we show that a simplifying assumption of these models, the assumption of homogenous preferences, may evoke overly optimistic expectations about the potential of tipping. More to the point, recent research in a related field, the abandonment of harmful traditions (Efferson et al., Reference Efferson, Vogt and Fehr2020; Efferson, Reference Efferson2021), shows that preference heterogeneity can constrain tipping.
Here, we discuss the consequences of preference heterogeneity for the potential of tipping pro-environmental norm diffusion. In particular, analysing a threshold model of collective behaviour (Granovetter, Reference Granovetter1978; Schelling, Reference Schelling1978; Macy, Reference Macy1991), we address the following questions. Which structures of preference heterogeneity enable or constrain tipping? And, given a specific structure, which intervention strategy is the most promising? Our research holds clear implications for environmental policy.
Tipping pro-environmental norm diffusion
Coordination incentives and social tipping
Many environmental problems can theoretically be viewed as a social dilemma, for example, a public goods game (Milinski et al., Reference Milinski, Semmann and Krambeck2002; Irwin, Reference Irwin2009; Boyd et al., Reference Boyd, Richerson, Meinzen-Dick, De Moor, Jackson, Gjerde and Harden-Davies2012; Hauser et al., Reference Hauser2016; Goeschl et al., Reference Goeschl, Kettner, Lohse and Schwieren2020). In public goods games, public and private benefits are in conflict. Everyone profits from environmental public goods such as a stable climate. At the same time, everyone has an individual incentive to withhold their contribution to environmental public goods, for example, by taking convenient short-haul flights instead of time-consuming train rides (Gössling et al., Reference Gössling, Humpe and Bausch2020). Under the traditional assumption of self-interested preferences, an implication of this model is the tragedy of the commons (Hardin, Reference Hardin1968).
Yet, a common only ends tragically when it is not governed properly. Numerous environmental dilemmas around the world have been solved successfully – with the help of institutions (Ostrom, Reference Ostrom2015). Implementing a formal institution, however, is not always feasible. Informal institutions, in turn, typically rely on the interaction of familiar individuals and are therefore not well suited for anonymous large-scale societies (Guala, Reference Guala2012).
Social norms are an exception. Norms regulate behaviour not only in small groups but also at scale (Mackie, Reference Mackie1996; Nyborg & Rege, Reference Nyborg and Rege2003; Young, Reference Young2015) – a general finding that extends to pro-environmental behaviour (Walker & Meyers, Reference Walker and Meyers2004; Nyborg et al., Reference Nyborg2016; Castilla-Rho et al., Reference Castilla-Rho, Rojas, Andersen, Holley and Mariethoz2017; Sigdel et al., Reference Sigdel, Anand and Bauch2017). For this reason, the diffusion of norms has been suggested for regulating behaviours with global environmental impact (Boyd et al., Reference Boyd, Richerson, Meinzen-Dick, De Moor, Jackson, Gjerde and Harden-Davies2012; Nyborg et al., Reference Nyborg2016; Nyborg, Reference Nyborg2018, Reference Nyborg2020; Otto et al., Reference Otto2020).
Since Hardin, a large body of literature has shown that many people are not only motivated by self-interest but also by social preferences. Social preferences, as we will show, enable the diffusion of pro-environmental norms. In particular, a specific form of social preference, conditional cooperation, is a major driver of cooperation in experimental public goods games (Henrich et al., Reference Henrich2001; Fehr & Fischbacher, Reference Fehr and Fischbacher2003; Gintis et al., Reference Gintis, Bowles, Boyd and Fehr2003). While self-interested individuals free-ride in these games, conditional cooperators are willing to cooperate to the extent that others are cooperating, too (Fischbacher et al., Reference Fischbacher, Gächter and Fehr2001; Fehr & Gächter, Reference Fehr and Gächter2002; Biel & Thøgersen, Reference Biel and Thøgersen2007; Chaudhuri, Reference Chaudhuri2011) – a finding that extends to real-world environmental dilemmas (Rustagi et al., Reference Rustagi, Stefanie and Kosfeld2010; Fehr & Leibbrandt, Reference Fehr and Leibbrandt2011; Bluffstone et al., Reference Bluffstone, Dannenberg, Martinsson, Jha and Bista2020; Goeschl et al., Reference Goeschl, Kettner, Lohse and Schwieren2020).
In a social dilemma, mutual defection is the only Nash equilibrium. No matter how many others are travelling by train, as a strategy of cooperation (C), instead of taking more convenient short-haul flights, as a strategy of defection (D), a purely self-interested actor will always defect (Goeschl et al., Reference Goeschl, Kettner, Lohse and Schwieren2020). It is important to note that beliefs about the probability for others to choose C are therefore irrelevant. Cooperation is unconditionally disfavoured (for more details refer to Figure a1, panels a and b, in the Appendix).
What changes under the empirically more accurate assumption that some people are conditionally cooperative? We elaborate on this question using Fehr & Schmidt's (Reference Fehr and Schmidt1999) influential theory of social preferences. This theory extends the standard model with the assumption that some people dislike inequity. More specifically, they dislike being worse off than others – a preference called disadvantageous inequity aversion, α. And they dislike being better off than others, a preference called advantageous inequity aversion, β. Assume two conditional cooperators, ‘focal’ and ‘other’, with identical Fehr–Schmidt preferences, playing a nominal social dilemma game with two options, cooperation and defection (Figure A1c in the Appendix). Importantly, for an actor with associated preferences, nominal payoffs are not exclusively relevant. Disadvantageous and advantageous inequality also matter. For this reason, unlike a self-interested actor, a conditional cooperator will not simply defect. Instead, she prefers to cooperate, provided other is cooperating, too. A conditional cooperator is willing to endure a time-consuming train ride when many others are also doing so.
More technically, the greater a conditionally cooperative focal's expected probability, q, that other is taking the train, the greater her expected utility from cooperation, E(C). Equating E(C) with her expected utility of defection, E(D), yields focal's indifference point, q*. At this point, she is indifferent between C and D (Figure A1d in the Appendix). After crossing q*, focal prefers C over D which means the nominal social dilemma turns into a coordination game at this point (Bicchieri, Reference Bicchieri2005; Bowles, Reference Bowles2009; Centola, Reference Centola2013).
In a coordination game, mutual defection is no longer the only Nash equilibrium. Instead, there are two equilibria: mutual cooperation and mutual defection. When the game is played repeatedly in a group of conditional cooperators, the population dynamics can converge to either of these equilibria. There is a third equilibrium, namely, one in mixed strategies. That is, in absence of the best strategy, people could take a random choice – they could choose C with probability q* and D with probability 1 − q* (Sugden et al., Reference Sugden, Harsanyi and Selten1989). However, the equilibrium in mixed strategies, q*, is unstable. Even a small deviation of the population dynamics from q* towards either mutual cooperation or mutual defection activates a self-reinforcing shift towards one of the pure strategy equilibria. For this reason, q* is also called a ‘tipping point’ (Nyborg, Reference Nyborg2020).
As an illustration, conditional cooperators choose 8-h train rides over 2-h flights when they believe that many others are doing so, too. That is, each time before travelling, they update their beliefs about the current proportion of train users, q t. When q t < q*, they opt for flights and the proportion of train users decreases over time, converging towards zero. When q t > q*, they opt for train travel, and the proportion of train users increases, converging towards the cooperative equilibrium.
Let us now assume a policymaker intending to push the population dynamics towards the cooperative equilibrium to curb greenhouse gas emissions. Most policymakers have limited resources, and they are therefore unable to target the entire population with an intervention. Luckily, in the presence of coordination incentives, there is a more efficient option. Namely, the option of treating only a delimited sample, the target, in an attempt to push the population over the tipping point. When successful, this would then trigger an endogenous process of norm diffusion. More specifically, such an intervention has two hypothetical effects: a direct effect, enhancing climate-friendly choices in the target; and an indirect effect in the untreated segment of the population, emulating the behaviour of the treated. We call this indirect effect ‘social tipping’, or simply ‘tipping’. In our example, the policymaker could deliberately try to create tipping by pushing the proportion of train users in the population across the critical value, q*, with a delimited intervention.
When the proportion of train users, q t, does not undercut q* too strongly, pre-intervention, the policymaker might well succeed. As soon as q* is crossed, more and more individuals switch from plane to train – an endogenous process of change then successively establishes train travel as a social norm, followed by everyone. In contrast, when the proportion of train users substantially undercuts q*, even a strong intervention might fail to push train travel across the tipping point, q* (Nyborg, Reference Nyborg2020). In other words, for a policymaker to succeed, it is crucial to know q*. The higher q*, the harder the cooperative equilibrium is to reach.
It can be shown that q* depends on two parameters: the incentive structure of the nominal game and the strength of social preferences relative to self-interest in the (homogenous) population (see the Appendix, note on Figure A1d.).
Concerning incentives, the higher the tension between public and private benefits, the higher the threshold q*. An example of a high-tension situation is the choice between air travel and train travel. An individual, enduring an 8-h train ride instead of taking a 2-h flight, bears relatively high costs in the form of effort, time, and money, while the reduction in emissions achieved is very limited at the global scale. An example of a low-tension situation is switching off the lights when leaving a public bathroom, which also has a limited impact, but requires little effort.
This model implication fits well the following results of recent meta-analyses: changing high-impact behaviours, such as avoiding flights or animal products (Wynes & Nicholas, Reference Wynes and Nicholas2017; Schiermeier, Reference Schiermeier2019), seems difficult to achieve with behaviour interventions, compared to behaviours with relatively low impact (Bergquist et al., Reference Bergquist, Nilsson and Schultz2019; Nisa et al., Reference Nisa2019, see Figure 1). As a likely explanation, changing high-impact behaviours typically requires individually costly lifestyle changes. Put differently, for these behaviours, the tension between private and public benefits in the underlying environmental dilemma is quite strong.
Granted, Figure 1 provides only suggestive evidence for the claim that tension is a key factor of intervention effectiveness. One potential confounder is the type of intervention strategy. Specific research paradigms, favouring strategies that vary with respect to their strength or impact, tend to address specific behaviours (Nisa et al., Reference Nisa2019), which could at least partly explain the association between tension and effectiveness.
In defence of our claim that tension largely controls effectiveness, consider a robust result from environmental sociology. Namely, the result that pro-environmental attitudes predict quite well pro-environmental behaviours that require little individual cost or effort, compared to a non-environmentally friendly standard. In contrast, attitudes are much less predictive of costly pro-environmental behaviours (Diekmann & Preisendörfer, Reference Diekmann and Preisendörfer2003; Steg et al., Reference Steg2014; Moser & Kleinhückelkotten, Reference Moser and Kleinhückelkotten2018; Farjam et al., Reference Farjam, Nikolaychuk and Bravo2019).
Concerning social preferences, one can show that for a given environmental dilemma, in a homogenous population with identical Fehr–Schmidt preferences, the following holds. The stronger advantageous inequity aversion β relative to disadvantageous inequity aversion α and self-interest, the lower the perceived tension between public and private interest, and therefore, the lower q*. This implies that two environmental dilemmas with the same incentive structure could be resolved in one population, where advantageous inequity aversion dominates is relatively strong, and remain unsolved in another population, where the opposite holds. This finding is crucial, as social preferences vary substantially between societies (Henrich et al., Reference Henrich2006, Reference Henrich2010).
Introducing heterogeneity – a threshold model of social tipping
Not only do social preferences vary across societies. Within society, they also vary across individuals (Engelmann & Strobel, Reference Engelmann and Strobel2004; Blanco et al., Reference Blanco, Engelmann and Normann2011). In any human population, we will not observe α and β values common to everyone, but distributions of α and β values. As a consequence, we will not observe a common threshold, q*, but a distribution of individual thresholds, $q_i^\ast$, as suggested by threshold models of collective behaviour (Granovetter, Reference Granovetter1978; Schelling, Reference Schelling1978; Macy, Reference Macy1991).
These models propose that once the proportion of cooperators in the population reaches an individual's threshold, $q_i^\ast$, this specific individual switches from defection to cooperation. But individuals vary concerning the value that makes them switch. It is a crucial model implication that the structure of threshold heterogeneity in a population determines the potential for tipping. Efferson et al. (Reference Efferson, Vogt and Fehr2020) have shown for abandoning harmful traditions that threshold heterogeneity still plays the dominant role when adding a number of complexities, for example, different network structures or intervention strategies.
In what follows, based on a threshold model, we discuss how preference heterogeneity and the tension inherent to a specific environmental dilemma control the potential for tipping the diffusion of pro-environmental norms. Consider section 1 of Supplementary Material for a more technical discussion of the model.
Let F be a cumulative distribution function of individual thresholds, $q_i^\ast$, and q t the proportion of cooperators at time t. When each individual in a population regularly evaluates the cooperative behaviour of others and updates his or her beliefs accordingly, then cooperation changes over time according to q t+1 = F(q t). This is a well-discussed property of the threshold model (Granovetter, Reference Granovetter1978; Schelling, Reference Schelling1978). As a central implication, depending on the shape of F, a social tipping point may exist, but not necessarily. In other words, the presence of coordination incentives does not necessarily imply a strong potential for tipping. Depending on the shape of F, the potential for tipping can vary tremendously.
More specifically, the tension inherent to a social dilemma and the distribution of social preferences in the population together control the shape of a threshold distribution, with strong implications for tipping.
Under right-skewed threshold distributions, the potential for tipping is considerable. In this scenario, a majority has low thresholds. Some may even be willing to cooperate unconditionally, and only a minority has high thresholds. This kind of distribution results from an environmental dilemma with low inherent tension, such as switching off the lights when leaving a public bathroom – especially, when in combination with widespread advantageous inequity aversion. Figure 2a shows an illustrative right-skewed threshold distribution, and Figure 2d shows the corresponding cumulative distribution. Points at which F intersects the diagonal from above are stable equilibria, whereas points at which F intersects from below are unstable equilibria. Here, F intersects from above at F(0) = 0 and F(1) = 1, implying two stable equilibria, zero cooperation and total cooperation. And it intersects from below, at F(0.1) = 0.1, which suggests that a tipping point exists and that this tipping point is low. Put differently, should a policymaker convince slightly more than 10% of the population to switch off the lights regularly, this behaviour would then spread without further ado until adopted by 100% of the population.
Under left-skewed distributions, the potential for tipping is the smallest. In this scenario, a majority has high thresholds, some may even defect unconditionally, and only a small minority of very prosocial individuals has low thresholds. An example is avoiding flights. As can be seen from the corresponding cumulative distribution in Figure 2f, a tipping point still exists in this example, but with F(0.9) = 0.9, it may be difficult to reach, depending on the current level of cooperation in the population. Moreover, the potential for tipping only amounts to 10% of the population.
Symmetrical unimodal distributions are an intermediate case. They can result from environmental dilemmas with intermediate inherent tension, such as bringing one's empty glass bottles to public recycling stations, in combination with moderate levels of social preferences. In Figure 2b, a tipping point lies at F(0.5) = 0.5, and the potential for tipping amounts to 50 percentage points (Figure 2e). Obviously, the potential can still be substantial under symmetric distributions, and, depending on the resources invested, a tipping process could still be activated.
In brief, any intervention is most likely to activate tipping under right-skewed distributions, and least likely under left-skewed distributions, with symmetrical distributions as an intermediate case.
It is worth noting, however, that, unlike in the three examples discussed, a unimodal tipping point might not even exist. For example, a threshold distribution could be even more left-skewed than the one shown in Figure 2c, with substantial population shares defecting unconditionally. In this case, the corresponding cumulative distribution stays below the 45°-line entirely, pointing to the absence of a tipping point. As an additional possibility, distinct groups in a threshold distribution may constrain tipping: picture two opposite groups in a population, one amenable and one opposed to climate change mitigation, as currently the case in the USA (Mccright & Dunlap, Reference Mccright and Dunlap2011; Kahan, Reference Kahan2012, Reference Kahan2017; Rinscheid et al., Reference Rinscheid, Pianta and Weber2020). Tipping the diffusion of climate-protecting behaviours among Democrats may be quite feasible, but these behaviours are unlikely to spill over from Democrats to Republicans under pronounced political polarization. In this case, cutting the link between climate change mitigation and Republican identity is a precondition for tipping (Doell et al., Reference Doell, Pärnamets, Harris, Hackel and Van Bavel2021).
That said, under the threshold model, a policymaker intending to apply tipping to environmental policy has three options. She can treat the beliefs regarding the current cooperation level in the population, leaving the thresholds unchanged (1). She can target the thresholds by changing the monetary incentives of the nominal game in favour of cooperation (2). Or she can change the thresholds by changing the underlying preferences – a strategy that strengthens the intrinsic value of cooperation (3).
Changing beliefs
The first strategy, changing beliefs about the prevalence of the target behaviour, has frequently been used to promote pro-environmental behaviour, under the label ‘descriptive norms intervention’ (e.g., Schultz et al., Reference Schultz2007; Goldstein et al., Reference Goldstein, Cialdini and Griskevicius2008). The key idea is feeding back the current cooperation level, called descriptive information, to the population, which then leads people to adopt accurate beliefs. The behavioural consequence of the intervention, which makes the distribution of choices at any point in time public knowledge, though, is not straightforward. Belief-based interventions can produce a variety of outcomes, including strong beneficial effects, nil results, and backlash (Schultz et al., Reference Schultz2007; Allcott & Rogers, Reference Allcott and Rogers2014; Farrow et al., Reference Farrow, Grolleau and Ibanez2017; Rinscheid et al., Reference Rinscheid, Pianta and Weber2020). For example, feeding back the average energy consumption of a neighbourhood to the households of this neighbourhood promoted energy-saving among households with above-average consumption, but increased consumption among households below the average (Schultz et al., Reference Schultz2007). In principle, due to the no-control group design of that specific study, regression to the mean could explain this result. That is, the ‘backlash’ among households with below-average energy consumption might have occurred even in absence of the intervention (Verkooijen et al., Reference Verkooijen, Stok and Mollen2015). Yet, regression to the mean is an unlikely explanation for the backlash observed in many other studies. For example, providing feedback about the share of coffee sold in reusable mugs instead of one-way paper cups increased the share of coffee bought in mugs at one cafeteria, with a relatively high initial level of mug usage, compared to a control group. At the same time, the intervention discouraged mug usage at another cafeteria, with a relatively low initial level of mug usage, compared to the same control group (Berger, Reference Berger2021). In fact, evidence from controlled laboratory experiments suggests that an individual's beliefs about the prevalence of cooperation in a group are a strong determinant of that individual's cooperativeness (Fischbacher et al., Reference Fischbacher, Gächter and Fehr2001; Bicchieri & Xiao, Reference Bicchieri and Xiao2009; Ackermann & Murphy, Reference Ackermann and Murphy2019). Based on this result, the threshold model has a clear answer to the question of when belief-based interventions have beneficial outcomes, and when harmful outcomes are more likely.
To illustrate this, picture a policymaker intending to tip the diffusion of a pro-environmental norm, with the provision of regular feedback about the current cooperation level to the population. The standard version of the model assumes actors that regularly update their beliefs regarding the prevalence of cooperation, and it assumes these beliefs to be accurate. In the real world, the latter assumption may be violated. In fact, in the absence of clear information about the prevalence of a specific pro-environmental behaviour, people tend to underestimate the engagement of others, relative to their own engagement – a bias that dampens their willingness to act environmentally friendly (Pieters et al., Reference Pieters1998; Bergquist, Reference Bergquist2020; Leviston & Uren, Reference Leviston and Uren2020). In the following discussion, we therefore explicitly assume beliefs that are distorted and that can also be coupled to preferences, pre-intervention. We further assume that the individuals targeted by an intervention adjust their beliefs to the descriptive information provided. Thus, belief formation after intervention is based on accurate information, but it is not forward-looking. Instead, it is myopic, and people best respond given these beliefs, as in the original threshold model (Granovetter, Reference Granovetter1978). We discuss an intervention that provides descriptive feedback regularly. Regular feedback provision promotes the regular updating of beliefs, as assumed by the model. For now, we also assume that the entire population is treated by the intervention and discuss the strategy of treating only a delimited sample later. Section 2 of Supplementary Material outlines a threshold model with distorted beliefs and shows how our key findings derive from the model.
How does the joint distribution of thresholds and beliefs shape the net effect of a belief-based intervention, under these assumptions? As a starting point, consider Figure 3a, showing 500 thresholds, $q_i^\ast$, and beliefs, $\hat{q}_{it}, \;$ pre-intervention. Individuals above the 45°-line hold beliefs matching or exceeding their thresholds; therefore, they cooperate. Individuals below the 45°-line hold beliefs smaller than their thresholds; therefore, they defect.
As we will demonstrate, the long-term effect of an intervention depends on the relative size of four subgroups, pre-intervention. First, cooperators with accurate beliefs in the sense that they believe cooperation to match or exceed their thresholds, which is true (x 1 in region A of Figure 3a). It is worth noting that actual cooperation may overshoot or undershoot their beliefs to some degree, which is, however, inconsequential. What matters is that these agents presume cooperation to match or exceed their thresholds and that this is actually the case. Cooperators holding beliefs that are accurate in this sense cooperate before and after the first round of feedback. Second, cooperators with false beliefs, in the sense that they assume cooperation to match or exceed their thresholds, while cooperation, in fact, undershoots their thresholds (x 2 in region B of Figure 3a). They cooperate before the intervention, but, disappointed from the first feedback, stop cooperating in response. Third, defectors holding the false belief of cooperation undershooting their thresholds, pre-intervention (x 3 in region C of Figure 3a). They defect before the intervention, but, positively surprised from the first feedback, switch to cooperation. Fourth, defectors holding the accurate belief of cooperation undershooting their thresholds, defecting before and after the first feedback (x 4 in region D of Figure 3a).
In short, a belief-based intervention has the following immediate effect. Disappointed cooperators (x 2) switch to defection, and positively surprised defectors (x 3) switch to cooperation. Individuals holding the accurate beliefs of cooperation exceeding their thresholds (cooperators, x 1) or undershooting their thresholds (defectors, x 4) are unaffected by the first round of the intervention. We designate t = 1 as the point in time when individuals who were cooperating or defecting specifically because of distorted beliefs have changed their behaviours as an immediate response to the intervention, but no additional cultural evolutionary dynamics have yet occurred. This means, in effect, that q 1 = q 0 − x 2 + x 3. The immediate effect of ensuring correct beliefs can thus be positive, negative, or neutral. If x 2 ≤ x 3 and q 1 ≥ q 0, cooperation (weakly) rises. If x 2 > x 3 and q 1 < q 0, cooperation declines. Which of these scenarios holds will depend on the distribution of individuals in regions B and C in Figure 3a. The distribution of individuals in regions A and D is irrelevant in terms of the immediate effect of the intervention.
What happens after the immediate effect? To answer this question, note that the intervention represents a fundamental change in the informational setting. It eliminates the possibility of distorted beliefs by making the current rate of cooperation public knowledge at all points in time. As a result, beliefs become accurate, subject to the assumption of myopic updating, and they are always the same for everyone. Once this happens, the preference heterogeneity represented by the distribution of thresholds is the mechanism driving the evolution of cooperation.
We now illustrate the potential long-term outcomes of a belief-based intervention with more depth, discussing four scenarios. In Scenario 1 (Figure 3a–c), the threshold distribution is right-skewed, with beliefs and thresholds uncorrelated. In this case, cooperators with accurate beliefs (x 1), and defectors with false beliefs (x 3), together outnumber cooperators with false beliefs (x 2), plus defectors with accurate beliefs (x 4). Consequently, the intervention has a positive net effect, with 100% cooperation as a potential long-term outcome, after several rounds of feedback.
In Scenarios 2, the threshold distribution is symmetrical, with thresholds and beliefs correlated positively (Figure 3d–f). Here, the outcome depends on the relative size of those with thresholds undershooting cooperation, pre-intervention (x 1 + x 3), compared to those with thresholds exceeding cooperation, pre-intervention (x 2 + x 4). More specifically, when x 1 + x 3 < x 2 + x 4, pre-intervention, cooperation cannot exceed x 1 + x 3 in the long term. In contrast, when x 1 + x 3 > x 2 + x 4, pre-intervention, cooperation can increase, with the final result depending on the exact threshold distribution.
Scenario 3 also assumes a symmetrical distribution, but a negative correlation between thresholds and beliefs (Figure 3g–i). In this case, individuals that cooperate or defect independently of the intervention dominate (x 1 + x 4 > x 2 + x 3), implying a negligible net effect.
Finally, Scenario 4 assumes a left-skewed distribution, that is, most thresholds are of a rather high value (Figure 3j–l). Furthermore, thresholds and beliefs are uncorrelated. In this scenario, the dominance of high thresholds renders the disappointment of cooperators with false beliefs more likely than the positive surprise of defectors with false beliefs, with the consequence of decreased cooperation, post-intervention. This sparks a negative dynamic with zero cooperation as a likely long-term outcome.
In short, the outcome of a belief-based intervention depends on the joint distribution of thresholds and beliefs. The intervention eliminates the possibility of distorted beliefs. Once this happens, the preference heterogeneity represented by the distribution of thresholds is the mechanism driving the evolution of cooperation. Under right-skewed threshold distributions, this intervention strategy bears considerable potential, while it tends to provoke backlash under left-skewed threshold distributions. Under symmetric distributions, the outcome strongly depends on the structure of correlation between thresholds and beliefs, and the relative frequency of the four subgroups, x 1, x 2, x 3, and x 4. A negative correlation implies a negligible effect. A positive correlation implies a positive effect when the following holds additionally. Cooperators with accurate beliefs and defectors with false beliefs, therefore switching to cooperation in response to the intervention, form a majority over defectors with accurate beliefs and cooperators with false beliefs, therefore switching to defection.
The variety of potential outcomes points to the necessity of information about the joint distribution of thresholds and beliefs for designing effective belief-based interventions. When feasible, this strategy is quite attractive, as providing feedback is likely cost-efficient, compared to other strategies.
Changing the thresholds with monetary incentives
The second strategy is treating the thresholds with monetary incentives, either with a subsidy, like a train voucher, or a tax, like a carbon tax on air travel. Both options enhance the benefit of cooperation relative to the benefit of defection, decreasing the thresholds. A possible drawback of monetary incentives is that the thresholds might return to their pre-intervention levels, once the incentives are removed. Even worse, monetary incentives could crowd out intrinsic motivation, with the result of increased thresholds, compared to pre-intervention levels (Frey & Oberholzer-Gee, Reference Frey and Oberholzer-Gee1997; Reeson & Tisdell, Reference Reeson and Tisdell2008; Rode et al., Reference Rode, Gómez-Baggethun and Krause2015; Lapinski et al., Reference Lapinski2017). In a worst-case scenario, increased thresholds and lower consequent cooperation among the treated could then trigger the decay of cooperation in the population.
Changing the thresholds by changing preferences
In contrast, preference-based interventions, the third strategy, target individual preferences to decrease the thresholds. One variant is increasing the psychological value of cooperation. For example, the target individuals could learn about the environmental advantage of train travel compared to air travel. Another variant would promote advantageous inequity aversion.
It is worth noting that the combination of preference-based intervention and financial incentives has proven astonishingly effective in promoting pro-environmental behaviour. Moreover, adding a psychological intervention to the mere provision of financial incentives seems to prevent motivation crowding-out (Kerr et al., Reference Kerr, Vardhan and Jindal2012; Kerr et al., Reference Kerr2017, Reference Kerr2019). This finding is crucial – only a bundle of intervention strategies might be able to achieve the strong effect needed to activate tipping under a left-skewed threshold distribution.
Treating a delimited sample to activate tipping
It may often be the case that a policymaker lacks the resources to treat an entire population. In this case, treating only a delimited sample of the population, to activate diffusion of the target behaviour in the untreated segment of the population, is an appealing approach. Yet, policymakers should be aware that the shape of the threshold distributions in the population and the target largely control the potential for tipping.
More specifically, four parameters control that potential. First, the shape of the threshold distribution in the population and the target, namely, right-skewed, symmetric, or left-skewed. For the following discussion, we assume a randomly selected target, and consequently, the threshold distribution in the target approximates the corresponding distribution in the population. Second, intervention effectiveness, d. We assume the intervention to decrease the thresholds in the target deterministically, by a value of d. An individual starts cooperating, when her post-intervention threshold, $q_i^{\ast \prime} = q_i^\ast{-}d, \;$ is smaller than, or equal to, the cooperation level in the population, pre-intervention, q 0. The stronger d, the larger the proportion of actors in the target switching to cooperation in response to the intervention. Third, the population share targeted. Given an effective intervention, the larger the target, the larger the share of cooperators in the population, post-intervention. Fourth, the level of cooperation in the population, pre-intervention, q 0. The higher q 0, the stronger the potential for tipping.
How does, under these assumptions, the shape of the threshold distribution in the population affect the potential for tipping? Any intervention has potentially two effects. First, a direct effect on the target. Second, an indirect effect, namely, the tipping process that is activated, once the size of the subgroup committed to the target behaviour crosses a critical value.
Concerning the direct effect, given an effective intervention, the share of those switching to cooperation in response to the intervention is the largest under right-skewed distributions, where low thresholds prevail, followed by symmetric distributions, where intermediate values prevail, and, finally, left-skewed distributions, where large values prevail. Put differently, after the intervention, the subsample of cooperative individuals in the target is smaller under symmetric distributions than under right-skewed distributions, and smaller still under left-skewed distributions than under symmetric distributions. The smaller the cooperative subsample, post-intervention, the weaker the consequent potential for tipping.
The shape of the threshold distribution in the population also exerts an indirect effect on the potential for tipping. Under left-skewed distributions, high thresholds prevail, and therefore, the share of cooperators in the population needs to be larger to activate tipping than under symmetric distributions. And under symmetric distributions, the share of cooperators in the population needs to be larger than under right-skewed distributions. Per implication, under symmetric distributions, compared to right-skewed distributions, a smaller start-up group of cooperators coincides with the need for a larger start-up group. And under left-skewed distributions, compared to symmetric distributions, an even smaller start-up group coincides with the need for an even larger start-up group.
In combination, the direct and indirect effects create favourable conditions for tipping under right-skewed distributions, and unfavourable conditions under left-skewed distributions, with symmetric distributions as an intermediate case.
Who makes a good target?
So far, our policymaker was selecting her target individuals at random. Changing her sampling strategy could further boost her success, as Efferson et al. (Reference Efferson, Vogt and Fehr2020) demonstrate. Given the policymaker has a very effective intervention at hand, an intervention that even changes the behaviour of individuals with high thresholds, selecting individuals with high thresholds as a target would further increase the potential for tipping. Simply put, the intervention then addresses the harder task of treating the less cooperative, leaving the easier task of treating the more cooperative to the endogenous process of tipping.
But selectively treating the less cooperative is also risky. Should the target individuals’ responsiveness to the intervention be negatively correlated to their threshold values, a substantial proportion of the target might be unwilling to adopt cooperation. A smaller start-up group of cooperators, in turn, implies a reduced potential for tipping.
In contrast, systematically sampling the more cooperative is counterproductive. First, some of the more cooperative might already be cooperating. Second, larger shares of relatively cooperative individuals in the target means smaller shares in the non-treated segment of the population, and, therefore, higher hurdles for tipping. The strategy of random sampling, selecting from among both, the more and the less cooperative, may therefore often be a reasonable compromise.
In the light of this finding, the widespread practice of using student samples, a subgroup typically more amenable to pro-environmental behaviour change than the population average, seems questionable (Nisa et al., Reference Nisa2019). Not only might that sampling strategy evoke an overly optimistic view of the intervention under study. In practical applications, targeting mainly amenable student targets may reduce the potential for tipping in the population.
It is worth noting that the practice of using biased samples has been challenged before in other fields. More specifically, research on the foundations of human behaviour clearly shows that results from so-called WEIRD samples, drawn from western, educated, industrialized, rich, and democratic societies, do not typically generalize to other societies (Henrich et al., Reference Henrich, Heine and Norenzayan2010a, Reference Henrich, Heine and Norenzayan2010b). This insight underlines the necessity of using diverse samples for drawing robust conclusions.
Discussion and conclusion
Social tipping has been put forward as a policy tool to create the rapid social change needed for mitigating urgent environmental issues at a global scale, for example, climate change (Westley et al., Reference Westley2011; Nyborg et al., Reference Nyborg2016; Otto et al., Reference Otto2020). It is the idea of social tipping to activate a process of norm diffusion at the population level, by convincing only a delimited sample of the population to adopt an environmentally beneficial target behaviour. Should the population share adopting the target behaviour in response to a policy intervention reach a critical value, the tipping point, a self-reinforcing process then establishes this behaviour as a social norm without further interference (Milkoreit et al., Reference Milkoreit, Hodbod, Baggio, Benessaiah, Calderón-Contreras, Donges, Mathias, Rocha, Schoon and Werners2018; Nyborg, Reference Nyborg2020; Scheffer, Reference Scheffer2020).
The idea of recruiting the endogenous dynamics of norm diffusion with an exogenous intervention, delimited in size and time, is tempting. Yet, tipping interventions have been applied in other fields than environmental policy with rather mixed results, ranging from tremendous success, over nil results, to harmful backlash (Dolan et al., Reference Dolan, Hallsworth, Halpern, King, Metcalfe and Vlaev2012; World Bank Group, 2015; Bicchieri, Reference Bicchieri2017). In response to these findings, some scholars have argued that the canonical model of tipping, the coordination game (Mackie, Reference Mackie1996; Nyborg, Reference Nyborg2020), may give rise to somewhat optimistic expectations regarding the potential of tipping (Efferson et al., Reference Efferson, Vogt and Fehr2020). More specifically, it is the assumption of homogenous preferences underlying the model that fuels optimism (Efferson et al., Reference Efferson, Vogt and Fehr2020; Berger et al., Reference Berger, Vogt and Efferson2021; Efferson, Reference Efferson2021). People vary with respect to their preferences, and this includes support for specific environmental policies.
Preference heterogeneity constrains tipping. This does not mean that tipping is a blunt sword under preference heterogeneity. Yet to devise a successful tipping intervention, understanding the implications of heterogeneity is key. Our research cultivates this kind of understanding, discussing the question: How does preference heterogeneity impact the potential of tipping in the field of pro-environmental policy? Applying a threshold model (Granovetter, Reference Granovetter1978; Schelling, Reference Schelling1978; Macy, Reference Macy1991) to the issue of pro-environmental norm diffusion, we discuss structures of preference heterogeneity more and less favourable for tipping and suggest intervention strategies suitable for these structures.
The model starts with the notion of individual thresholds. A threshold is the population share a given individual requires to have adopted a specific behaviour, to also adopt this behaviour. It can be shown that two factors control the value of an individual threshold in an environmental dilemma. First, the tension between the public and private benefits inherent to a dilemma (Bowles, Reference Bowles2009; Goeschl et al., Reference Goeschl, Kettner, Lohse and Schwieren2020). The adoption of environmentally beneficial behaviours that are individually costly in terms of money, effort, or time implies high thresholds. An example is avoiding flights. The adoption of less costly behaviours implies low thresholds. An example is switching off the lights when leaving a public bathroom (Diekmann & Preisendörfer, Reference Diekmann and Preisendörfer2003; Steg et al., Reference Steg2014; Moser & Kleinhückelkotten, Reference Moser and Kleinhückelkotten2018; Farjam et al., Reference Farjam, Nikolaychuk and Bravo2019). Second, social preferences. Particularly, the stronger the individual tendency for advantageous inequity aversion (Fehr & Schmidt, Reference Fehr and Schmidt1999), the weaker the perceived tension between public and private benefits, the lower the individual threshold (Bicchieri, Reference Bicchieri2005; Bowles, Reference Bowles2009; Centola, Reference Centola2013).
Tension and social preferences together shape a threshold distribution. Low tension and widespread advantageous inequity aversion imply the predominance of low thresholds, or, a threshold distribution that is skewed to the right. High tension and widespread self-interest imply the predominance of high thresholds, or, a threshold distribution that is skewed to the left. Depending on shape, the potential of tipping varies substantially.
We started our discussion with a policymaker, intending to tip the diffusion of a target behaviour in a population (also called cooperation), by treating only a delimited sample of that population, the target. The exogenous intervention can achieve two effects. First, a direct effect on the target. Second, an indirect effect in the untreated segment of the population. Namely, the endogenous spreading of cooperation through the population that is activated, once the size of the subgroup adhering to this behaviour crosses a critical value, the tipping point.
Concerning the direct effect, the more strongly right-skewed the threshold distribution in the target, pre-intervention, the larger the share of the target switching from environmentally harmful to pro-environmental. And, the more left-skewed the threshold distribution in the target, pre-intervention, the smaller the share of the target switching to pro-environmental. The reason for this pattern is the following. Assume the intervention to deterministically decrease the thresholds in the target by a specific, fixed value. When, post-intervention, a target individual's threshold undershoots the pre-intervention prevalence of the target behaviour in the population, this individual adopts the target behaviour. Holding the prevalence of the target behaviour in the population constant, interventions of identical effect size, therefore, induce adoption in larger shares of the target when low thresholds prevail (right-skewed distributions), compared to when intermediate thresholds prevail (symmetric distributions), and, even more so, compared to when high thresholds prevail (left-skewed distributions).
Concerning the indirect effect, or, tipping, again, right-skewed distributions provide favourable conditions, and left-skewed distributions provide conditions that are rather adverse, with symmetrical unimodal distributions as an intermediate case. Simply put, when most people have low thresholds (right-skewed distribution), the size of the cooperative start-up group needed to cross a tipping point is smaller, compared to a population where roughly equal parts have low or high thresholds (symmetric distribution), or even a population, where high thresholds dominate (left-skewed distribution). Moreover, as discussed, symmetric, and even more so, left-skewed, distributions constrain the effect of the exogenous intervention in the target, resulting in a smaller cooperative start-up group. In short, the larger the start-up group theoretically needed for activating tipping, the smaller the actual start-up group, resulting from the exogenous intervention.
What are strategies for activating tipping? A policymaker can either treat the thresholds. To do so, she could change the incentives of the nominal social dilemma underlying an environmental issue, for example, with a subsidy or a tax. As an alternative, she could treat the preferences of the individuals involved. One example is promoting advantageous inequity aversion, which would boost the psychological value of cooperation in the respective environmental dilemma.
Earlier research suggests that using financial incentives could crowd out intrinsic motivation (Frey & Oberholzer-Gee, Reference Frey and Oberholzer-Gee1997; Reeson & Tisdell, Reference Reeson and Tisdell2008; Rode et al., Reference Rode, Gómez-Baggethun and Krause2015; Lapinski et al., Reference Lapinski2017), which would speak in favour of treating the preferences. More current research finds that combining financial incentives with psychological interventions is not only more effective than using any of the two strategies in isolation, but also prevents motivation crowding-out (Kerr et al., Reference Kerr, Vardhan and Jindal2012; Kerr et al., Reference Kerr2017, Reference Kerr2019). Our findings encourage combining both strategies under left-skewed distributions, where strong effects are needed to activate tipping.
An alternative to treating the thresholds is treating the beliefs about the population prevalence of the target behaviour. Treating beliefs means simply feeding back descriptive information about this prevalence to the target. Information provision might often be more cost-efficient than offering financial incentives or implementing preference-based interventions at scale. While this is surely attractive, the outcome of a belief-based intervention, in particular, strongly depends on context (Schultz et al., Reference Schultz2007; Goldstein et al., Reference Goldstein, Cialdini and Griskevicius2008; Allcott & Rogers, Reference Allcott and Rogers2014; Farrow et al., Reference Farrow, Grolleau and Ibanez2017; Rinscheid et al., Reference Rinscheid, Pianta and Weber2020; Berger, Reference Berger2021). More specifically, belief-based interventions are most effective under right-skewed distributions, where low thresholds prevail, and most likely to produce backlash under left-skewed distributions, where high thresholds prevail. The reason is that backlash in belief-based interventions is driven by disappointed cooperators. Namely, by individuals that expected cooperation to exceed their thresholds, but learn that the opposite is the case, therefore switching back from cooperation to defection in response to the intervention. Under left-skewed distributions, high thresholds prevail and disappointment is, therefore, more likely than positive surprise. The opposite holds under right-skewed distributions. The positively surprised defectors, learning that cooperation is exceeding rather than undershooting their thresholds, likely outnumber the disappointed cooperators, which results in a beneficial net outcome.
Under symmetric distributions, the net effect of a belief-based intervention depends on the correlation between thresholds and beliefs. A negative correlation implies a negligible effect. In this case, those with overly optimistic beliefs have also low thresholds. They stick to the target behaviour, post-intervention, when they learn that cooperation is exceeding their thresholds only slightly, rather than substantially. And those with overly pessimistic beliefs have high thresholds. They keep defecting when they learn that cooperation is indeed undershooting their thresholds, although to a smaller than expected extent. Put differently, the beliefs of the majority are accurate in the sense that cooperation is smaller (defectors) or greater (cooperators) than their thresholds, as presumed. Descriptive feedback then lacks any behavioural consequence for the majority. In contrast, a positive correlation bears a strong potential for tipping, provided the following holds additionally. Cooperators with accurate beliefs, and defectors with false beliefs, switching to cooperation in response to the intervention, form a majority over defectors with accurate beliefs and cooperators with false beliefs, switching to defection. Therefore, it is encouraging that most people tend to underestimate others’ pro-environmental engagement relative to their own (Bergquist, Reference Bergquist2020; Leviston & Uren, Reference Leviston and Uren2020), as this finding suggests positive surprise might be a rather common reaction to descriptive feedback.
In short, a belief-based intervention only works in two kinds of situations. Either under left-skewed threshold distributions. Or under symmetric distributions, given a positive correlation between thresholds and beliefs, plus cooperators with accurate beliefs, and defectors with false beliefs, jointly forming the majority. Otherwise, belief-based interventions bring nil effects, if not backlash.
Finally, it is worth mentioning that large parts of a population can be completely unwilling to adopt certain pro-environmental behaviours – their thresholds are fixed at 100%, putting a drag on tipping. This is the case when a specific behaviour conflicts with social identities widely held in society (Mccright & Dunlap, Reference Mccright and Dunlap2011; Kahan, Reference Kahan2012, Reference Kahan2017; Rinscheid et al., Reference Rinscheid, Pianta and Weber2020). Cutting the link between the target behaviour and the social identities involved is then a precondition for a pro-environmental norm to spread through the entire population (Doell et al., Reference Doell, Pärnamets, Harris, Hackel and Van Bavel2021).
To sum our findings, right-skewed distributions imply strong potentials for tipping. In this case, small interventions, targeting beliefs, preferences, or incentives, can produce quick and comprehensive norm diffusions. The opposite holds for left-skewed distributions. Here, large interventions, preferably bundling different strategies, for example, preference-based interventions plus incentive provision, are necessary for activating tipping – if tipping can be activated at all. Notably, policymakers are well advised not to use belief-based interventions in this kind of situation, as they then more likely bring backlash than beneficial effects. Symmetric distributions are an intermediate case. Even though larger interventions are necessary to activate tipping, the potential can still be relatively strong. Also, a belief-based strategy could work, although the potential here strongly depends on the joint distribution of preferences and beliefs. Refer to Figure 4 for a visual summary of our main findings.
The main implication of our research is the following. A policymaker may wish to gather information about the joint distributions of thresholds and beliefs in the population, as about the population prevalence of the target behaviour, before implementing an intervention. This information will allow her to gauge the potential for tipping that is socially beneficial and the risk of a backlash. She could then tailor her intervention to the requirements of the context. Measures of pro-environmental attitudes and behaviours, nowadays part of many large-scale surveys, may provide the information necessary. Future research should address the challenge of measuring thresholds and gauging the potential for tipping pro-environmental norm diffusion in field settings.
Supplementary material
To view supplementary material for this article, please visit https://doi.org/10.1017/bpp.2021.36.
Acknowledgement
Charles Efferson and Sonja Vogt would like to thank the Swiss National Science Foundation (Grant No. 100018 185417/1).
Conflict of interest
The authors declare none.
Appendix