DIRTY AND CLEAN TECHNOLOGIES

SUPRATIM DAS GUPTA

doi:10.1017/aae.2014.1

DIRTY AND CLEAN TECHNOLOGIES

Published online by Cambridge University Press: 26 January 2015

SUPRATIM DAS GUPTA

Show author details

SUPRATIM DAS GUPTA*: Affiliation:
University of South Carolina, Columbia, South Carolina; and the University of Guanajuato, Guanajuato, Mexico
*: *Email: [email protected]

Article contents

Abstract
Introduction
Model
Model without Knowledge Accumulation
Model with Knowledge Accumulation
Concluding Remarks
Footnotes
References

Rights & Permissions

Abstract

Pollution from fossil fuel use is a global problem. Studies have shown that a worsening of environmental quality has adverse effects on worker productivity and health. In this study, there is an inexhaustible natural resource that deteriorates environmental quality and affects productivity. There also exists a perfect substitute clean backstop, which is initially too costly to operate and whose costs can be reduced through investments in knowledge. Depending on the endowment of environmental quality, the optimal solution shows that the planner should only use the resource or only the backstop until a constant steady state is reached in which the polluting resource and backstop are used in fixed proportions. We show that investments in alternative technologies from the very beginning can help an economy make the eventual switch to clean energy sources, thereby attaining better environmental quality.

Keywords

clean technologies dirty technologies environmental quality investment knowledge stock O32 O44 Q42 Q56 Q57

Type: Research Article
Information: Journal of Agricultural and Applied Economics , Volume 47 , Issue 1 , February 2015 , pp. 123 - 145

DOI: https://doi.org/10.1017/aae.2014.1 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: Copyright © The Author(s) 2015

1. Introduction

Efforts to use renewable alternative technologies (e.g., wind, solarFootnote ¹ ) at a greater scale have been adopted by countries in recent years. This has been the case as rising pollution concerns (global warming, climate change) from use of fossil fuels have gained relatively more importance than fears of running out of these essential resources. In a model of total energy production from a dirty resource and a perfect substitute clean technology (backstop), we show the optimal solution implies using either one energy source at first before finally converging to a steady state of using both the dirty and clean technologies in fixed proportions. The dirty technology deteriorates environmental quality, and the average cost of the initially costlier clean technology falls with investments in knowledge.Footnote ² In our model, we show the social planner uses either one of the two technologies (but always invests in the clean technology) depending on the relative magnitude of the pollution cost due to environmental degradation and the average cost of the clean backstop until reaching the steady state. We arrive at the interesting result that in a situation of a stock of low environmental quality (or high pollution cost), it is optimal to use only the expensive clean technology initially if the external pollution cost of the dirty technology is correctly taken into account. A case can be made for a developing country with a high pollution level where using a costlier backstop (but with continuous investments in knowledge to reduce its average cost as in Tsur and Zemel [Reference Tsur and Zemel2003]) would help improve environmental quality, followed by, at a high enough level of the environment, a switch to dual use of the dirty and the alternative technologies. Similarly, for nations with a better quality of the environment, using the dirty technology at first is optimal, followed by, with investments in knowledge to reduce the backstop cost, eventually converging to a steady state of using the two technologies.

The optimal solution in our model is a feasible and realistic one given falling unit costs for alternative technologies (which are still significantly higher relative to conventional energy sourcesFootnote ³ ), which have led to their wider use in some nations. We contrast the optimal solution with the equilibrium one when atomistic firms do not realize the external effects of their own actions on the aggregate stock of environmental quality. As a result, individual firms never invest in knowledge (or engage in R&D activities) to reduce backstop cost, and we get the extreme result that a firm in equilibrium always sticks to the dirty technology. We show that in the presence of external costs of pollution from using the dirty technology, it is optimal to invest in the clean technology very early to help an economy reach a steady state of using both the dirty technology and the backstop. Although the economy can completely switch to only using the clean resource with more investments over time, switching to these relatively less efficient technologies (e.g., wind and solar energies face the huge burden of lack of good storage solutions) may entail a temporary drop in consumption at the time of switch (Boucekkine et al., Reference Boucekkine, Saglam and Vallée2004, Reference Boucekkine, Krawczyk and Vallée2011, Reference Boucekkine, Pommeret and Prieur2013a).Footnote ⁴

Rising demand for coal by the growing economies of China and India, and a sustained increase in demand from the United States, has been met with rising coal production from China, India, and Australia.Footnote ⁵ With a possible increase in world oil prices and the use of enhanced oil recovery technologies, the EIA predicts the production of world petroleum and other liquid fuels (hydrocarbons) to increase by 28.3 million barrels per day between 2010 and 2040. Thus, given the relative inexhaustibility of essential fossil fuels such as coal and gasoline, switching to a steady state of using a constant mix of dirty and clean technologies for extended periods is feasible and would arrest harmful potential environmental damages to future generations. Dasgupta (Reference Dasgupta2008) states that CO₂ concentration in the atmosphere is 385 parts per million, which according to ice cores in Antarctica, is the highest level reached in the past 650,000 years. The author further states that given the nonlinearities governing the earth's ecological system, little action to avoid climate change may involve the planet crossing many irreversible tipping points. On the other hand, leaving some of the stock of dirty technologies in the ground translates to losses of millions of dollars of GDP and would lead to unemployment and significant political and social tensions. The problem of how soon to start investing in alternative technologies and the time of switch to using more of the backstop is no doubt an interesting problem (according to Stern [Reference Stern2007], substantial steps to reduce emissions must be taken right now to prevent global average temperatures increasing by more than the critical value of 5°C).

Our work can be placed in the literature of regime switching by Boucekkine et al. (Reference Boucekkine, Saglam and Vallée2004, Reference Boucekkine, Krawczyk and Vallée2011, Reference Boucekkine, Pommeret and Prieur2013a, Reference Boucekkine, Pommeret and Prieur2013b), Acemoglu et al. (Reference Acemoglu, Aghion, Bursztyn and Hemous2012), and other work by Tsur and Zemel (Reference Tsur and Zemel2003, Reference Tsur and Zemel2005) in which switching from a dirty to a clean resource may involve a temporary drop in consumption because of shifting to a new regime of less efficient technology.Footnote ⁶ Similar to the studies by Boucekkine et al. (Reference Boucekkine, Pommeret and Prieur2013a, Reference Boucekkine, Pommeret and Prieur2013b), in which the backstop is adopted in the optimal solution to prevent further environmental damages after an “ecological threshold” (which implies a lower regeneration capacity of the earth) is reached, adoption of the clean technology in our model takes place to maintain environmental quality at a constant level or at times to improve it for a low endowment of environmental quality. One of the main differences between our work and that of Boucekkine et al. (Reference Boucekkine, Saglam and Vallée2004, Reference Boucekkine, Krawczyk and Vallée2011, Reference Boucekkine, Pommeret and Prieur2013a, Reference Boucekkine, Pommeret and Prieur2013b) is that we contrast the optimal and equilibrium solutions. We get a similar result to Acemoglu et al. (Reference Acemoglu, Aghion, Bursztyn and Hemous2012) of the behavior of individual firms in equilibrium leading to a severe deterioration of environmental quality (“environmental disaster” in Acemoglu et al. [Reference Acemoglu, Aghion, Bursztyn and Hemous2012]). In Boucekkine et al. (Reference Boucekkine, Saglam and Vallée2004) and Acemoglu et al. (Reference Acemoglu, Aghion, Bursztyn and Hemous2012), a widening technological gap between the dirty and clean sectors causes environmental degradation; the former introduces a learning curve (where the economy learns about the new technology from the very beginning) to eliminate this efficiency gap, whereas the latter resorts to government intervention to help improve the productivity of the dirty sector. Although we attempt to characterize the difference between the optimal and equilibrium solutions and show how the optimal solution makes intuitive sense, efforts to bring the two solutions closer is something we leave for our future work. We discuss in the concluding remarks some practical correction mechanisms (for firms to internalize the externalities) and policy implications.

The paper is organized as follows: we provide our main model (the “Model” section); solve for special cases when we do not have knowledge accumulation and when knowledge accumulation is included to reduce the average cost of the backstop technology (the “Model without Knowledge Accumulation” and “Model with Knowledge Accumulation” sections, respectively); and discuss the key results and conclusions (the “Concluding Remarks” section).

2. Model

We consider a model consisting of a continuum of measure 1 of identical firms. Each firm is owned by an infinitely lived household. We abstract from population growth and for simplicity, and we normalize the total population in the economy to be unity.

A representative firm in the economy produces a “composite commodity” using a polluting natural resource and a backstop technology. The composite commodity is used for both consumption and investment. Energy, or the sum of the use of the polluting natural resource and the backstop, is regarded as the only input for a representative firm. We assume both the polluting resource and the backstop to be in nearly unlimited supply. We henceforth refer to the polluting resource as the “dirty technology” and the backstop as the “clean technology.”

The costs of production for a firm are divided into the costs of the dirty and clean technologies. There is no cost of extraction for the dirty technology. The only cost associated with its use is the cost of pollution, which adversely affects the profits for a representative firm. On the other hand, the firm faces a positive average cost for the clean technology each period. However, continuous investments in knowledge taken out of the composite commodity can help reduce this cost over time. Knowledge can be thought of as the technical know-how to operate alternative clean technologies like wind and solar. Investments through R&D may bring about more efficient techniques to use these technologies lowering their average costs. Alternative technologies may be thought of as having a higher embodied technical progress (Boucekkine et al., Reference Boucekkine, Saglam and Vallée2004) or to be more sophisticated. Increasing the knowledge base for an economy would thus help people operate them more efficiently. In this paper, we assume innovation in technical know-how to operate alternative technologies is developed in the lab (i.e., we assume that each firm has its own R&D sector). They do not come about through using the clean technologies themselves. The production function for the composite commodity is then given by

(1)

\begin{equation} y = e^\alpha\end{equation}

where y denotes the composite commodity, e denotes total energy use,Footnote ⁷ and 0 < α < 1. All firms are assumed price takers, and entry and exit are not permitted in the model. Finally, we take the price of the composite commodity y to be constant at unity. Total energy production is in turn given by

(2)

\begin{equation} e = b + r\end{equation}

where b ⩾ 0 and r ⩾ 0 represent use of the clean and the dirty technologies by a representative firm. Because of the clean and dirty technologies being perfect substitutes, energy use can come from either source. Although the dirty technology is assumed to be in unlimited supply, there is a feasible limit $\bar r$ that firms can extract in any time period given their production techniques. So we impose $r \le \bar r$ as a feasible technology constraint. The previous specification is similar to the study by Acemoglu et al. (Reference Acemoglu, Aghion, Bursztyn and Hemous2012), in which “dirty” and “clean” inputs are used to produce the final good. A representative firm can invest in knowledge out of total production. The knowledge accumulation function is given by

(3)

\begin{equation} \dot{n} = a\sqrt i\end{equation}

where n denotes the stock of knowledge for a representative firm. The initial endowment of knowledge stock, n ₀, is assumed to be positive. Each firm is endowed with the same initial stock of knowledge. The variable a > 0 represents the investment parameter, and i ⩾ 0 denotes the investment each period for a firm. A concave knowledge accumulation function implies that it pays to invest little amounts every period rather than a lot in any one period. We impose a ceiling on n after which investments in knowledge do not further reduce the average cost of the clean technology. We denote this maximal level of knowledge by $\bar n$ . This can be thought of as a stage where the productivity of knowledge accumulation reduces to zero. Investment, being worthless, falls to zero beyond this point.

In our setup, we capture pollution through adverse effects on environmental quality. Environmental quality is denoted by A in the model. We assume A > 0 for all t. Introducing the possibility of an environmental disaster when A < 0 (Acemoglu et al., Reference Acemoglu, Aghion, Bursztyn and Hemous2012) or an ecological threshold with two regimes for A (Boucekkine et al., Reference Boucekkine, Pommeret and Prieur2013b) could generate interesting results.

The initial stock of environmental quality is given byA ₀ > 0. Environmental quality worsens due to use of the dirty technology by all firms but regenerates at a constant rate γ > 0. We assume that there exists a pristine level of environmental quality denoted by Ā. We can write the relevant equations as

(4)

\begin{equation} \dot A = \left( {\bar A - A} \right)\gamma - R\end{equation}

and

(5)

\begin{equation} R = \mathop \smallint \limits_0^1 r_j dj\end{equation}

where r_j denotes resource use by the jth firm, and R stands for aggregate use of the dirty technology for the economy. We do not allow for heterogeneity in firm choices (larger firms using more of the dirty technology) and assume that each firm contributes equally to R. A representative firm in equilibrium treats R as constant. Being one among an infinite number of firms, it ignores the effect of its own action on the deterioration of environmental quality. Each firm, however, faces a damage cost (Ā - A)² each period from the deterioration of environmental quality. A damage cost can be thought of in terms of lower worker productivity or poor worker health affecting production for the firm. Because the dirty technology is in infinite supply and is free, firms would use the maximum feasible amount $\bar r$ each period. As the relatively expensive clean technology is never used, investment in knowledge would be zero in the equilibrium solution. The social planner would, however, fully internalize the negative external effect of aggregate resource use. Investment would be carried out in the backstop technology to make it cheaper and so as to make a switch to the clean technology. The average cost of the clean technology as a function of the aggregate knowledge level is given by

(6)

\begin{equation} C\left( N \right) = a_0 - a_1 N\end{equation}

such that

(7)

\begin{equation} N = \mathop \smallint \limits_0^1 n_j dj\end{equation}

where n_j stands for knowledge stock of the jth firm, and N denotes the aggregate stock of knowledge in the economy. In a similar fashion to equation (5), we assume away heterogeneity in firm choices such that each firm would invest an equal amount in R&D every period. N also denotes the average knowledge stock for the representative firm, and because only the planner invests in knowledge in our model, the aggregate consistency condition N = n is substituted in equation (6) for the optimal solution. The variables a ₀ > a ₁ > 0 are the cost parameters of the model.

3. Model without Knowledge Accumulation

3.1. Optimal Solution

The externality in environmental pollution leads to a divergence between the equilibrium and the socially optimal solution. Here, we explore a model with no knowledge accumulation or zero investment for the social planner. That is to say, a = 0 in equation (3). Because aggregate consistency implies N = n from equation (7), the aggregate knowledge stock N would not change as a result. So cost of the clean technology given by equation (6) would remain constant.

We begin with this case for the following reason: Because there is only one state variable, A, it helps us to solve the entire path of energy use with relative ease. Moreover, the method of solving the model and the solutions remain qualitatively the same even when knowledge accumulation is included. This model provides a good starting point for the more complex case of investments in knowledge reducing the average cost of the clean technology. Depending on the initial stock of environmental quality, we get a situation of only using ror b before switching to simultaneous use of the two energy sources from the time their marginal costs become equal. For the case when the planner uses only the dirty technology, we assume an interior solution for r or $r < \bar r$ . So imposing the aggregate consistency condition R = r and assuming N = n ₀ = 1, we can write the maximization problem for the social planner solution as

(8)

\begin{equation} max\mathop \smallint \limits_{t = 0}^\infty \left. {\left( {\left( {b + r} \right)^\alpha - \left( {a_0 - a_1 } \right)b - \left( {\bar A - A} \right)^2 } \right.} \right)e^{ - \rho t} dt\end{equation}

subject to equation (4) where ρ > 0 denotes the constant rate of discount, b, r ⩾ 0 and A ₀ given. The maximization problem for the social planner in our model is similar to the social welfare function employed by Boucekkine et al. (Reference Boucekkine, Pommeret and Prieur2013b); in contrast to a concave utility function of consumption, in our model production of the composite commodity is nonlinear in total energy use. We choose to maximize net profits in the previous problem as this may be more realistic as an objective function for individual firms (or a social planner) in a perfectly competitive economy. In many situations, fossil fuels (ignoring their exhaustibility) and alternative technologies are perfect substitutes, and this linearity in total energy use gives rise to corner solutions (only resource or only backstop use).Footnote ⁸

The current-valued Hamiltonian of the problem is then

\begin{eqnarray*} H &=& \left( {b + r} \right)^\alpha - \left( {a_0 - a_1 } \right)b - \left( {\bar A - A} \right)^2 + \lambda \left( {\left( {\bar A - A} \right)\gamma - r} \right) + \theta _1 b\\ && +\, \theta _2 r + \theta _3 \left( {\bar r - r} \right)\end{eqnarray*}

where λ is the shadow price of environmental quality, and the θ_i's denote the nonnegativity and the technology constraint on b and r, with λ, θ₁, θ₂, θ₃ ⩾ 0. The necessary conditions for an optimal solution are

(9)

\begin{equation} \frac{{\partial H}}{{\partial b}} = \alpha \left( {b + r} \right)^{\alpha - 1} - \left( {a_0 - a_1 } \right) + \theta _1 = 0,\theta _1 b = 0\end{equation}

(10)

\begin{equation} \frac{{\partial H}}{{\partial r}} = \alpha \left( {b + r} \right)^{\alpha - 1} - \lambda + \theta _2 - \theta _3 = 0,\theta _2 r = 0,\theta _3 \left( {\bar r - r} \right) = 0\end{equation}

(11)

\begin{equation} \dot \lambda = \rho \lambda - \frac{{\partial H}}{{\partial A}} = \left( {\rho + \gamma } \right)\lambda - 2\left( {\bar A - A} \right),\end{equation}

and the transversality condition is given by

(12)

\begin{equation} lim_{t \to \infty } e^{ - \rho t} \lambda \left( t \right)A\left( t \right) = 0.\end{equation}

Because the dirty and clean technologies are perfect substitutes in the previous model, they have equal marginal benefits. When only using the dirty technology ( $0 < r\left\langle {\bar r,b = 0 \Rightarrow \theta _2 = 0 = \theta _3 ,\theta _1 } \right\rangle 0)$ , its optimum path is given by

(13)

\begin{equation} e^* = r^* = \left( {\frac{\alpha }{\lambda }} \right)^{\frac{1}{{1 - \alpha }}},\end{equation}

and we have

(14)

\begin{equation} \lambda = \left( {a_0 - a_1 } \right) - \theta _1.\end{equation}

Similarly, when only using the clean technology (b > 0, r = 0⇒θ₁ = 0 = θ₃, θ₂ > 0), its optimum path is given by

(15)

\begin{equation} e^* = b^* = \left( {\frac{\alpha }{{a_0 - a_1 }}} \right)^{\frac{1}{{1 - \alpha }}},\end{equation}

and we have

(16)

\begin{equation} \lambda = \left( {a_0 - a_1 } \right) + \theta _2.\end{equation}

So when using both the energy sources $(b > 0,0 < r < \bar r \Rightarrow \theta _1 ,\theta _2 ,\theta _3 = 0)$ , we have the conditions

(17)

\begin{equation} e^* = \left( {\frac{\alpha }{\lambda }} \right)^{\frac{1}{{1 - \alpha }}} = \left( {\frac{\alpha }{{a_0 - a_1 }}} \right)^{\frac{1}{{1 - \alpha }}}\end{equation}

and

(18)

\begin{equation} \lambda = \left( {a_0 - a_1 } \right).\end{equation}

It should be noted from the previous equations that total energy used would be constant in the case of simultaneous use of r and b. However, their division is indeterminate.

Equation (18) is central to our analysis. Given a constant average cost of b and a free but polluting natural resource, simultaneous use of the two energy sources is only possible when the shadow price of environmental quality rises or falls to equal (a ₀ − a ₁). That is to say, if λ > 0, then we can get to a situation of simultaneous use of r and b from using any one of the energy sources at first. Equations (14) and (16) show that equation (18) can be satisfied starting from λ less than or greater than (a ₀ − a ₁).

Conventional and Modified Steady State: Solving for the Optimal Solution

This section deals with solving for the optimal solution using a phase diagram analysis. We find motion trajectories that are feasible and satisfy the necessary conditions for optimality. To check for the sufficient condition, we look at the transversality condition given by equation (12). If we find paths that satisfy both these conditions, then we say such paths are optimal. We first define a “conventional steady state” and a “modified steady state” in this regard.

Figure 1 shows a stylized version of the stationary loci, the stable arms and the possible levels of environmental quality. In Figure 1a, the point of intersection of the stationary loci for equations (4) and (11) is referred to as the conventional steady state. It is denoted by point S. Given equation (18), Figure 1 shows that only the clean technology is used above the line (a ₀ − a ₁), the dirty technology below it, and any combination of b and r along the line. A ₀ and A′₀ denote possible initial given levels of environmental quality (we demonstrate these two cases in one graph for ease of reading; both cases are shown separately in the Appendix). For a constant average cost of the clean technology above the shadow price of environmental quality implied at the conventional steady state, we show stable arms approaching point S from either its southeast or northwest. In this case, the transversality condition given by equation (12) is satisfied. When the initial stock of environmental quality is relatively high (at A ₀), only the dirty technology is used along the stable arm until we approach S in infinity. Because λ is rising in this region, this means decreasing use of the dirty technology over time according to equation (13). However, for a lower endowment of environmental quality A′₀, we may have two stable arms approaching the conventional steady state from its northwest. Depending on our choice of λ₀, we may have a sequence of using the clean and then the dirty technology or only the dirty technology until we approach S in infinity. Because A rises at a faster rate when r = 0, the stable arm in the former case will be relatively flat and then steep as we approach the conventional steady state.

Figure 1. Conventional and Modified Steady State

Intuitively, the magnitude of the shadow price of environmental quality relative to that of the constant average cost of the clean technology is what determines a planner's decision of whether to use only the dirty or only the clean technology. For a relatively pristine level of environmental quality at A ₀, the value of its additional unit on lifetime utility (λ) would be smaller relative to (a ₀ − a ₁). Hence, it would be economical for the planner to use only the dirty technology until λ rises to the level of the constant average cost of the clean technology. From then onward, the planner would use a mix of the clean and the dirty technologies. The economy can stay at this steady state forever as the dirty technology is inexhaustible. Conversely, when environmental quality is poor to begin with at A′₀, the present value of its marginal increment on lifetime utility may be greater than the average cost of the clean technology making it cheaper for the planner to use only the clean energy input. The planner may, however, switch to the dirty technology again if environmental quality improves much lowering its shadow price. The more interesting case involves when the line (a ₀ − a ₁) is below the conventional steady state. This situation is depicted in Figure 1b. We once again show two possible initial endowments of environmental quality, A ₀and A′₀, in one graph for an easier demonstration. Here the stationary locus for equation (4) is truncated at λ = (a ₀ − a ₁): because we only use the clean technology for any λ > (a ₀ − a ₁), the $\dot A = 0$ locus will coincide with A = Ā when R = r = 0 in equation (4). It can be shown that point E can be made a steady state at which $\dot \lambda = \dot A = 0$ . We call point E the modified steady state. Given $\dot \lambda = 0$ at E, the motion of A can be set equal to zero through an appropriate choice of r at any point along the open interval DP. Because equation (18) holds along this interval, we would also get simultaneous use of the two energy inputs. The coordinates at point E are denoted by $\left( {\hat A,\hat \lambda } \right)$ . The choice of r that makes $\dot A = 0$ along the interval DP can be obtained from equation (4) and imposing the aggregate consistency condition (R = r) as

(19)

\begin{equation} r^* = \gamma \left( {\bar A - \hat A} \right).\end{equation}

Note that $\hat \lambda = \left( {a_0 - a_1 } \right)$ at E. From equation (17), we can then obtain the path of the clean technology as a residual:

(20)

\begin{equation} b^* = e^* - r^*.\end{equation}

Although not proved yet, we will show that these are indeed the optimal paths of r and b. Figure 1b shows that the modified steady state E can be approached from either its northwest or southeast as indicated by the motion trajectories. Depending on the given initial stock of environmental quality, Figure 1b shows stable arms approaching E from either of these directions. It can be shown that both these stable arms are unique and that they approach E from either direction in finite time. The system would stay at E forever, and that this satisfies transversality would be explained later in the paper. If we, however, allow the stable arms to proceed further, they will either divert to the southwest or northeast as shown by the dashed arrows.

If the system starts at a relatively pristine level of environmental quality, that is at A ₀ near Ā, then there exists an optimum initial λ (or λ*₀) such that the motion trajectory originating from (A ₀, λ*₀) becomes perfectly horizontal at E and then proceeds toward its southwest. A falls and λ rises along this stable arm until it hits the modified steady state. For a choice of λ₀ < λ*₀ given A ₀, the system would follow a motion trajectory that would hit the $\dot \lambda = 0$ locus at a point below E and then proceed toward its southwest. The shadow price of environmental quality would then reach zero in finite time. If λ₀ > λ*₀, then the motion trajectory will hit a point on the open interval DP toward the right of E after which it proceeds toward its northeast. Similarly, if the system begins with poor environmental quality at A′₀, then we can also find a unique λ*₀ such that the stable arm originating from (A′₀, λ₀*) becomes perfectly horizontal at E and then proceeds toward its northeast. Any other choice of λ₀ would result in the motion trajectory intersecting the $\dot \lambda = 0$ locus either above Eor toward the left of it on the open interval DP. The path would then either proceed toward the northeast or southwest of the modified steady state E. Given a stock of environmental quality at A ₀, only r would be used along the stable arm until it gets infinitesimally close to E. The growth of λ along the stable arm implies decreasing use of the dirty technology by equation (13). However, at this point there is a jump in the control r such that its path changes to equation (19). The path of b is given by equation (20). Given that Â is constant at the modified steady state E, equations (19) and (20) imply that r* and b* are both constants. In addition, because of a constant $\hat \lambda $ , the transversality condition (equation 12) would be satisfied. Because such a plan of only using the dirty technology at first and then a mix of the clean and dirty technologies is feasible and satisfies the necessary and sufficient conditions for optimality, we say that such a plan is optimal. The system would finally settle at E and stay there forever. Similarly, given A′₀, only the clean technology b would be used along the stable arm at the constant rate given by equation (15) until it gets infinitesimally close to E. The path of b then jumps to equation (20), and the path of r jumps from r* = 0 to equation (19). We can then follow the previous analysis and show that the transversality condition (equation 12) would be satisfied. Such a plan would also be optimal.

Figure 2 shows the optimal paths for e, b, and r over a long period of time if we begin at A ₀. In Figure 2, the paths of e and r coincide when the planner only uses the dirty technology and gradually approaches the modified steady state. Because the shadow price of environmental quality λ rises during this period, the path of r follows a downward profile according to equation (13). As the clean technology is not used by the social planner, the path of b coincides with the horizontal axis. At the modified steady state, total energy use becomes constant by equation (17), and r jumps down to its new constant profile given by equation (19). So b jumps up from b* = 0 to make up the difference: this is given by the constant path (equation 20). We solve for the optimal solution using numerical methods given the values of the parameters. As will be defined again later, we denote by $\hat T$ the time it takes for the stable arm to reach E starting either from a relatively high or low initial stock of environmental quality. The Appendix provides the necessary numerical solutions.

Figure 2. Energy Use Given Constant Average Cost of b

Result 1: When a polluting resource and a clean alternative technology are perfect substitutes in producing a final good, an endowment of a relatively pristine environment implies that only the dirty technology is used by a social planner until a constant steady state is reached when the two technologies are used in fixed proportions. On the other hand, beginning from a relatively poor quality of the environment, the optimal solution involves only using the clean technology at first and then converging to the previously defined steady state.

Competitive Equilibrium Solution

Individual firms in the market equilibrium select the cheaper dirty technologies ignoring their impact on the stock of environmental quality. Each firm treats R from equation (4) as constant, and thus λ becomes irrelevant in this case. Because a firm uses a constant amount $\bar r$ of the resource every period, the equilibrium problem also reduces to a static one. Investment in R&D (to reduce average backstop cost) never occurs in the competitive equilibrium solution, and the problem would remain the same even when the model is extended to include knowledge accumulation.

The behavior of firms in equilibrium would, however, significantly reduce environmental quality. But a high enough constant rate of regeneration, γ, may stabilize environmental quality at some level Ã. Imposing $r = \bar r$ in equation (4), and setting $\dot A = 0$ , we get

(21)

\begin{equation} \tilde A = \bar A - \frac{{\bar r}}{\gamma }.\end{equation}

Figure 3 shows the dynamics of A for the equilibrium solution. A positive intercept in Figure 3 implies $\gamma > \frac{{\bar r}}{{\bar A}}$ . We assume an $\bar r$ such that this inequality holds and Ã > 0 (we rule out an $\bar r$ such that Ã < 0). The arrows indicate the direction of movement to the stationary state Ã if we start from a relatively high stock of environmental quality at A ₀. Similarly, for a low initial stock of A, environmental quality would move down along the $\dot A$ locus to the stationary state Ã. Given the parameter values in the Appendix, it can be seen that Ã < Â.

Figure 3. Transitional Dynamics of A in Market Equilibrium

4. Model with Knowledge Accumulation

Here we solve the full model where investments in knowledge reduce the average cost of the clean technology. We start out with analyzing the optimal solution as in the previous model. For the case when the planner only uses the dirty technology, we assume $r < \bar r$ . Furthermore, for simplicity, we impose a = 1 in equation (3). Interestingly, similar results are obtained here as in the case when knowledge accumulation was not included: based on the endowment of environmental quality, the social planner would use only one energy source followed by the simultaneous use after a certain point in time.

Using the aggregate consistency conditions (R = r) and (N = n), we can write the maximization problem for the social planner solution as

(22)

\begin{equation} max\mathop \smallint \limits_{t = 0}^\infty \left. {\left( {\left( {b + r} \right)^\alpha - \left( {a_0 - a_1 n} \right)b - i - \left( {\bar A - A} \right)^2 } \right.} \right)e^{ - \rho t} dt\end{equation}

subject to equations (3) and (4). Here ρ > 0 denotes the constant rate of discount, b, r, i ⩾ 0 and n ₀ and A ₀ given. The current-valued Hamiltonian is then given by

\begin{eqnarray*} \begin{array}{l} H = \left( {b + r} \right)^\alpha - \left( {a_0 - a_1 n} \right)b - i - \left( {\bar A - A} \right)^2 + \mu \sqrt i + \lambda \left( {\left( {\bar A - A} \right)\gamma - r} \right) + \theta _1 b\\ \quad + \theta _2 r + \theta _3 (\bar r - r) + \theta _4 i \\ \end{array}\end{eqnarray*}

where λ and μ denote the shadow prices of stocks of environmental quality and knowledge. The θ_i's have the same interpretation as before, with λ, μ, θ_i ⩾ 0. The first order conditions with respect to r and λ are given by equations (10) and (11). The other necessary conditions for an optimal solution are

(23)

\begin{equation} \frac{{\partial H}}{{\partial b}} = \alpha \left( {b + r} \right)^{\alpha - 1} - \left( {a_0 - a_1 n} \right) + \theta _1 = 0,\theta _1 b = 0\end{equation}

(24)

\begin{equation} \frac{{\partial H}}{{\partial i}} = - 1 + \frac{\mu }{{2\sqrt i }} + \theta _4 = 0,\theta _4 i = 0\end{equation}

(25)

\begin{equation} \dot \mu = \rho \mu - \frac{{\partial H}}{{\partial n}} = \rho \mu - a_1 b,\end{equation}

and the transversality conditions are given by equation (12) and

(26)

\begin{equation} lim_{t \to \infty } e^{ - \rho t} \mu \left( t \right)n\left( t \right) = 0.\end{equation}

The previous necessary conditions imply that investment is always positive in the model. With the introduction of knowledge accumulation, equation (24) implies that the marginal benefit of investment for i = 0 is infinity compared with a constant marginal cost of 1. From equation (24), we also see that θ₄ = −∞ when i = 0, which is not possible. So substituting θ₄ = 0 in equation (24), we get

(27)

\begin{equation} \dot n = \sqrt i = \frac{\mu }{2}.\end{equation}

The optimum path for use of the dirty technology $(0 < r\left\langle {\bar r,b = 0 \Rightarrow \theta _2 = 0 = \theta _3 ,\theta _1 } \right\rangle 0)$ is given by equation (13) as for the case without knowledge accumulation. However, equation (14) now changes to

(28)

\begin{equation} \lambda = \left( {a_0 - a_1 n} \right) - \theta _1.\end{equation}

When only using the clean technology (b > 0, r = 0, ⇒θ₁ = 0 = θ₃, θ₂ > 0), its optimal path is given by

(29)

\begin{equation} e^* = b^* = \left( {\frac{\alpha }{{a_0 - a_1 n}}} \right)^{\frac{1}{{1 - \alpha }}},\end{equation}

which implies

(30)

\begin{equation} \lambda = \left. {\left( {a_0 - a_1 n} \right.} \right) + \theta _2.\end{equation}

So, when using both r and b $(b > 0,0 < r < \bar r$ and θ₁, θ₂, θ₃ = 0), we have the following conditions

(31)

\begin{equation} e^* = \left( {\frac{\alpha }{\lambda }} \right)^{\frac{1}{{1 - \alpha }}} = \left( {\frac{\alpha }{{a_0 - a_1 n}}} \right)^{\frac{1}{{1 - \alpha }}}\end{equation}

and

(32)

\begin{equation} \lambda = \left( {a_0 - a_1 n} \right).\end{equation}

The previous two equations are similar to those obtained for the case without knowledge accumulation. However because both λ and n change over time in the current model, the relative magnitudes of these variables at each point in time would be critical for satisfying the condition for simultaneous use given by equation (32). This would determine the time of switch from only using the dirty technology or the clean one to periods of simultaneous use of the two energy inputs.

4.1. Solving for the Optimal Solution

The method of solving for the optimal solution in the current model would be analogous to that followed in the model without knowledge accumulation. We employ a phase diagram to solve the model. The relevant figure (Figure 4) shows a stylized version of the modified steady state when knowledge accumulation is included for the current model. The average cost of the clean technology depicted by the line (a ₀ − a ₁ n) will be henceforth referred to as the “cost line.” Because n rises with investments in knowledge, the cost line moves downward over time. However as n reaches its maximal level at $n = \bar n$ , the cost line ceases to fall after it equals $\left( {a_0 - a_{1} \bar n} \right)$ . Investment becomes zero (i = 0) beyond this point, and the shadow price of the stock of knowledge (μ) becomes moot in this case. The intersection of the $\left( {a_0 - a_{1} \bar n} \right)$ line with the $\dot \lambda = 0$ locus (point Z) gives a possible steady state for the social planner solution. Given a significantly large $\bar n$ where the planner can possibly use the clean technology due to its low average cost, we assume the previous point of intersection Z to be below point S in Figure 1a. Following our logic in the previous model of an appropriate choice of r, we can make this point of intersection the modified steady state. The path of rand b would follow equations (19) and (20), respectively, at the modified steady state. We will return to this point when we fully explain the dynamics of the model. Note that the cost line would take a finite time to reach $\left( {a_0 - a_{1} \bar n} \right)$ starting from its initial level at (a ₀ − a ₁ n ₀).

Figure 4. Transitional Dynamics of (A, λ) with Investment in Knowledge

Figure 4 shows stable arms (policy functions) approaching the modified steady state Z either from its southeast or northwest depending on the given initial stock of environmental quality. It is clear from the figure that the analysis with knowledge accumulation and with an upper bound on the stock of knowledge $\bar n$ is very similar to the previous model. However, the rate at which the cost line (a ₀ − a ₁ n) falls over time will be key to our analysis. From equations (25) and (27), this rate is given by μ. In Figure 4, we see that given a relatively high or low initial stock of environmental quality, there exists an optimal choice of λ₀ such that the stable arm hits the falling cost line only at point Z on the $\dot \lambda = 0$ locus. We denote this optimal choice of λ₀ by λ*₀. Note that at the modified steady state Z, $\hat \lambda $ , Â, and $\bar n$ (by definition) are constants and that μ = 0. Because investment in knowledge falls to zero at Z when $n = \bar n$ , the shadow price of the knowledge stock μ jumps to zero from equation (27). So the transversality conditions given by equations (12) and (26) are satisfied at the modified steady state. The economy can stay at this steady state forever as the dirty technology r is inexhaustible. We show that λ*₀ is indeed the optimum such that given A ₀, the stable arm originating from (A ₀, λ*₀) intersects $\dot \lambda = 0$ at the same time the cost line reaches $\left( {a_0 - a_1 \bar n} \right)$ . The uniqueness of this stable arm can be illustrated by the following two cases:

1. λ₀ > λ*₀: In this case, the motion trajectory would intersect the cost line toward the right of the stationary locus $\left( {\dot \lambda = 0} \right)$ . This would happen for $n < \bar n$ and in the special case of $n = \bar n$ . Let us represent the time it takes for the motion trajectory to intersect the cost line as t ₁. Because the cost line still keeps falling for $n < \bar n$ , this implies that the motion trajectory would be above the cost line from the very next instant. The motion trajectory would then proceed to the northeast. For the knife-edge case of $n = \bar n$ , any slight disturbance at t ₁ would also take the motion trajectory to the northeast so that λ becomes infinitely large.
2. λ₀ < λ*₀: For an initial value of the shadow price of environmental quality below its optimal level, the motion trajectory would hit the $\dot \lambda = 0$ locus at a point below Z before the cost line reaches the level $\left( {a_0 - a_1 \bar n} \right)$ . Let us call t ₂ the time it takes for the motion trajectory to hit the stationary locus $\dot \lambda = 0$ . Then, from the next instant t ₂ + ε, the motion trajectory would proceed to the southwest and intersect the horizontal axis in finite time.

For a relatively low initial stock of environmental quality at A′₀, λ*₀ has to be significantly above (a ₀ − a ₁ n ₀) or the initial cost of the clean technology. This point deserves special mention. In this case, the policy function “catches” the falling cost line from above only at the point Z. For any lower level of λ₀, the motion trajectory may cross the falling cost line soon enough at a level of $n < \bar n$ after which the point on the trajectory will find itself below the cost line. The trajectory would then head to the southwest. The only way the motion trajectory can turn around and proceed back to the southeast toward Z is if it intersects the falling cost line again and the cost line falls at a rate quicker than the point on the trajectory. In this case, the point on the trajectory would again be above the falling cost line. The uniqueness of λ*₀ in this case can be proved in a similar way to the case for a relatively high stock of environmental quality. For λ₀ > λ*₀ > (a ₀ − a ₁ n ₀), the motion trajectory would intersect the $\dot \lambda = 0$ locus sooner than the cost line reaches the level when $n = \bar n$ . The motion trajectory would then proceed to the northeast in such a way that the shadow price of environmental quality would tend to an infinitely large value. For (a ₀ − a ₁ n ₀) < λ₀ < λ*₀, the motion trajectory would hit a point to the left of the modified steady state Z on the cost line when it falls to its minimal level at $\left( {a_0 - a_1 \bar n} \right)$ . From the very next instant, the time path would proceed to the southwest and intersect the horizontal axis in finite time.

So, if we begin with a stock of environmental quality at A ₀, only the dirty technology is used according to equation (13) until the stable arm gets infinitesimally close to Z. This is evident from Figure 4. At Z, r jumps to its new profile given by equation (19). The path of the clean technology is then obtained from the residual of equations (31) and (19) with n replaced by $\bar n$ . Similarly, for a relatively low initial stock of environmental quality at A′₀, the planner would only use the clean technology along the stable arm as long as it is above the falling cost line. Because they only intersect at the modified steady state Z, the path of b would be given by equation (29) until the policy function gets infinitesimally close to Z. At this point, r jumps from r* = 0 to its new path given by equation (19). Once again, the path of the clean technology is obtained from the residual of equations (31) and (19).

We now deal with the optimal choice of μ₀ given the unique λ*₀ found previously for a high or low initial stock of environmental quality. Given (A ₀, λ*₀) or (A′₀, λ₀*) and using equations (4) and (11), we can find the time that it takes the stable arm to reach Z. We denote this time as $\hat T$ . Given $\hat T$ , there is a unique μ₀ that will drive the cost line down from (a ₀ − a ₁ n ₀) to $\left( {a_0 - a_1 \bar n} \right)$ in exactly $\hat T$ years.

We refer to this optimal choice of μ₀ as μ*₀. To verify, we can use μ*₀ to find the time $\bar T$ it takes for the cost line to reach $\left( {a_0 - a_1 \bar n} \right)$ starting from (a ₀ − a ₁ n ₀). Given Â, we can assume either A ₀ to be less or greater than Â (or A′₀ to be less or greater than Â). We can then find a $\hat \lambda _0 $ such that a motion trajectory originating from $\left( {A_0 ,\hat \lambda _0 } \right)$ or $\left( {A_0^\prime ,\hat \lambda _0 } \right)$ hits the cost line when $n = \bar n$ in exactly $\bar T$ years. If $\hat \lambda _0 $ equals λ*₀ found previously, we can say that there exists a unique μ*₀ given λ*₀. Numerical solutions for the optimal initial values of the costates and the time it takes for the stable arms to reach the modified steady state Z are provided in the Appendix.

A few comments should be noted here. The optimal solution in this more realistic case when knowledge accumulation reduces the average cost of the clean technology produces the result that the dirty and clean energy inputs are used in fixed proportions in the steady state (similar to the previous case without investment in knowledge). It is optimal for the planner to always invest in knowledge (investments in knowledge are always positive), a result that comes from the concavity of equation (3). Introducing learning by doing in the knowledge accumulation function would add convexity to equation (3), and the stock of knowledge at the point where returns to investment change from increasing to diminishing might correspond to some critical knowledge stock.Footnote ⁹ It would be interesting to see if this critical stock of knowledge coincides with the time of switch from only using the dirty or clean energy input to a constant mix of the two sources. The specification of equation (3) implying diminishing returns to knowledge accumulation also leads to one of the main conclusions of the paper: the dirty and clean technologies are used in fixed proportions in the steady state. The steady state corresponds to the stock of knowledge $\bar n$ (we assume this to be the maximum knowledge stock where returns to investment falls to zero for all practical purposes), and total energy use becomes constant from equation (31). It is intuitive that with diminishing returns to knowledge accumulation and a changing stock of environmental quality (because of either using r or b), the marginal costs of the two technologies would become equal at some point. Our model shows that the marginal costs become equal at the steady state when environmental quality settles at a constant level and the marginal cost of the clean technology falls to its minimum at $\left( {a_0 - a_1 \bar n} \right)$ . With a constant level of environmental quality, use of the dirty resource would be constant; use of the backstop obtained as a residual from total energy use is also contant as a result. There are no limits to how much knowledge the planner can accumulate at a point in time, and a faster than optimal knowledge accumulation may entail the cost line falling to $\left( {a_0 - a_1 \bar n} \right)$ before the policy function originating from a high or low endowment of environmental quality reaches that level. The policy function would then head to the northeast or southwest, and from our previous analysis, a modified steady state would not be attained as a result.

We next turn to the competitive equilibrium for the previous model with knowledge accumulation. The solution would look exactly like the one for the case without knowledge accumulation. A representative firm would always only use the dirty technology and extract the maximum feasible amount $\bar r$ every period. The relatively expensive clean technology would never be used, and because i = 0, the stock of knowledge would remain at its initial level of n ₀. The costate variables for the model, that is λ and μ, would both be irrelevant in this case. As in the model without knowledge accumulation, environmental quality may stabilize at some level Ã given a sufficiently high regeneration rate. The dynamics of A are then given in Figure 3. Once again, the economy can reach the stationary state starting from a high or low initial stock of environmental quality.

Result 2: The social planner solution remains qualitatively the same when knowledge accumulation (to reduce the average cost of the backstop) is included. The main difference from the previous model is that gradual investments in the backstop from the very beginning help the planner to attain the steady state of dual use of the two technologies when their marginal costs are equal. This steady state corresponds to a very high knowledge stock such that investments in the backstop are no longer worthwhile (i.e., the marginal benefit of investing an extra unit in the backstop falls to zero). Depending on the endowment of environmental quality, the steady state can be reached from either using the dirty or clean technology at first.

5. Concluding Remarks

There is a significant gap in our model between the optimal (planner) and equilibrium solutions to the basic problem of using the dirty technology. Individual agents (firms) in equilibrium cause a greater deterioration of environmental quality because they overuse the dirty resource. A representative firm treats environmental quality as constant and ignores its own impact in reducing the stock of environmental quality. We show that a representative firm always sticks to the dirty technology and never invests in R&D to reduce the average cost of the clean alternative technology (backstop). The social planner on the other hand internalizes the environmental externality (from use of the dirty technology), invests in R&D to reduce the backstop cost, and switches to a mix of using the clean and dirty technologies in the steady state. The model is simple enough that it arrives at a key conclusion of adopting the backstop in a steady state to prevent further environmental degradation but without introducing any “ecological threshold” (Boucekkine et al., Reference Boucekkine, Pommeret and Prieur2013b).Footnote ¹⁰ Secondly, a comparison of the optimal and equilibrium solutions is made (as in Acemoglu et al., Reference Acemoglu, Aghion, Bursztyn and Hemous2012) in an event where the market solution necessarily leads to a severe deterioration of environmental quality. Last but not the least, our paper introduces knowledge accumulation improving technical know-how in clean technologies (reducing their cost) into the literature of regime switching. Knowledge accumulation features diminishing returns in our model, which seems to be a realistic assumption.

In our model, we assume there is a separate R&D sector within each firm that is devoted to improving the technical know-how about alternative technologies (e.g., storage solutions for solar and wind and better fracking methods to reduce groundwater contamination when extracting natural gas) and not a sector where innovations come about through learning by doing. Introducing learning by doing would generate interesting possibilities in a way that knowledge accumulation such as in equation (3) would first be convex and then concave (when diminishing returns set in). In our current model, introducing learning by doing and adding firm heterogeneity (larger firms taking better advantage of learning by doing) may enable larger firms to switch sooner to the clean technology as compared with smaller firms. Further, a situation may arise when equality of the marginal costs of the two technologies occurs when the returns to investment in knowledge by the planner are still increasing; because of continually falling costs of the clean technology, a steady state in this case may correspond to only using the clean technology when returns to investment fall approximately to zero. Environmental quality would then settle at a constant level in the steady state.Footnote ¹¹

Earlier work by Bovenberg and Smulders (Reference Bovenberg and Smulders1996), Grimaud and Rougé (Reference Grimaud and Rougé2005), and Acemoglu et al. (Reference Acemoglu, Aghion, Bursztyn and Hemous2012) prescribes some correction mechanisms to bring the equilibrium solution closer to the optimal one when an environmental externality exists from using the dirty resource.Footnote ¹² In our paper, with a continuum of identical firms, a policy to make an individual firm internalize some of the pollution costs from using the dirty technology would be to impose a tax on its use. Another mechanism to bring the equilibrium solution closer to the planner one are giving research subsidies to firms (similar to Acemoglu et al. [Reference Acemoglu, Aghion, Bursztyn and Hemous2012]) to encourage investment in new knowledge. Patent rights’ could then be added preventing knowledge spillovers (maintaining our continuum of identical firms assumption). Finally, cap-and-trade policies (allocating each firm some carbon credits and allowing trade in these credits) is another possibility if firm heterogeneity is added to our model. In this case, a result may be that larger firms invest more in the backstop and switch sooner than smaller firms; we leave this for future work.

Appendix

We use the parameter values α = 0.33, a ₀ = 10, a ₁ = 0.002, Ā = 2, $\bar r = 0.017$ , A ₀ = 1.99, A′₀ = 0.8, n ₀ = 0.01, $\bar n = 1,000$ , γ = 0.01, and ρ = 0.04. We use the program Mathematica to compute numerical solutions for the model when knowledge accumulation is not included and when investments in knowledge reduce average backstop cost.

Based on the previous parameter values, we find the coordinates of S, or the conventional steady state in Figure 1a, are A_S = 1.64 and λ_S = 14.35. For the model without knowledge accumulation, Figure A1 shows the two possible cases when the stock of environmental quality begins at a relatively high or low level. The coordinates of the modified steady state E in Figure A1 are Â = 1.75 and $\hat \lambda = 9.998$ given the previous parameter values. We find the stable arms using numerical methods and by working our way backward. For the equilibrium solution, ~ = 0.3.

Figure A1. Transitional Dynamics of (A, λ) with No Knowledge Accumulation

A similar analysis would follow when investments in knowledge are included to reduce the average cost of the clean technology. Figure A2 shows the stable arms for the two cases when an economy is endowed with a relatively high or low stock of environmental quality. We employ numerical methods to find these stable arms. The coordinates of the modified steady state Z in Figure A2 are Â = 1.8 and $\hat \lambda = 8$ . The point of intersection of the stable arms with the initial stock of A gives us a unique λ*₀. We find μ*₀ using equations (25), (27), and (29) such that the cost line (a ₀ − a ₁ n) falls to $\left( {a_0 - a_1 \bar n} \right)$ at the same time the stable arms reach Z.

Figure A2. Transitional Dynamics of (A, λ) with Investment in Knowledge

Table A1 summarizes the numerical solutions of the two models when the average cost of the clean technology is constant and when it can be reduced through investments in the stock of knowledge. Given the previous parameter values, we notice some significant differences based on the initial stock of A for each of the two models. For both the models, when the planner only uses the dirty technology for a high initial level of A, it reaches the modified steady state relatively soon compared with only using the clean technology given A′₀. A high $\hat T$ implies greater investments from an early period or a high initial shadow price of knowledge μ*₀. So μ*₀ would be smaller for a relatively low endowment of environmental quality.

Table A1. Optimal Values When Only r or b Is Used for the Two Models

Footnotes

1 By “alternative” we mean cleaner technologies. Other examples for alternative renewable technologies are hydro and biomass. Examples for nonrenewable alternative technologies are nuclear energy and natural gas.

2 Our idea of a falling average backstop cost with investments in knowledge follows Tsur and Zemel (Reference Tsur and Zemel2003).

3 The U.S. Energy Information Administration (EIA, 2014) estimates U.S. average levelized costs for electricity generation for plants entering service in 2019 (in 2012 $/MWh) to be 95.6 for conventional coal, 102.6 for biomass, and 130.0 for solar.

4 Boucekkine et al. (Reference Boucekkine, Saglam and Vallée2003, Reference Boucekkine, Krawczyk and Vallée2011, Reference Boucekkine, Pommeret and Prieur2013a) model cleaner technologies as having a lower marginal productivity of capital, and switching from relatively efficient dirtier technologies leads to capital “obsolescence” and a temporary fall in consumption.

5 The EIA (2014) predicts China's coal production to increase from 3.5 billion short tons (bst) in 2010 to 4.1 bst in 2015 and to 5.7 bst in 2040. An estimate of world coal reserves provided by the German Federal Institute for Geosciences and Natural Resources (BGR, 2013) and the World Coal Association (2013) predicts approximately 1,038 billion tons (1,144.7 bst) of reserves left as of 2012, equivalent to 132 years of global coal output at the current rate.

6 Barbier (Reference Barbier1999) and Schou (Reference Schou2000) include exhaustability of the dirty resource and find that when it is essential in production, optimal consumption paths would be declining and approach zero in the long term. Nondeclining consumption profiles in the steady state can be maintained by gradual substitution of the exhaustible resource by investments in human capital (similar to investments in physical capital by Stiglitz [Reference Stiglitz1974a, Reference Stiglitz1974b] and Solow [Reference Solow1974]).

7 We implicitly assume that the composite commodity is produced using both energy and labor where 1 − α is the share of labor. Population (and labor) is assumed to be unity, and the wage rate equals 1 as well.

8 Maximizing a utility function as in Boucekkine et al. (Reference Boucekkine, Krawczyk and Vallée2011) or Acemoglu et al. (Reference Acemoglu, Aghion, Bursztyn and Hemous2012) may give rise to interior solutions.

9 Boucekkine et al. (Reference Boucekkine, Saglam and Vallée2004) call a critical knowledge stock A* when introducing a learning curve.

10 In Boucekkine et al. (Reference Boucekkine, Pommeret and Prieur2013b), the cleaner technology is adopted after crossing the ecological threshold.

11 We think the optimal solution would involve using only the dirty or clean energy input at first (depending on the endowment of environmental quality), then a mix of the two technologies at one point in time when their costs are equal, and then use of only the clean technology (similar to Hung and Quyen [Reference Hung and Quyen1993] who have resource exhaustibility but do not include pollution).

12 Bovenberg and Smulders (Reference Bovenberg and Smulders1996) introduce an exogenous pollution ceiling implemented through a pollution tax.

References

Acemoglu, D., Aghion, P., Bursztyn, L., and Hemous, D.. “The Environment and Directed Technical Change.” American Economic Review 102, 1(2012):131–66.CrossRef Google Scholar PubMed

Barbier, E.B. “Endogenous Growth and Natural Resource Scarcity.” Environmental and Resource Economics 14, 1(1999):51–74.CrossRef Google Scholar

Boucekkine, R., Krawczyk, J., and Vallée, T.. “Environmental Quality versus Economic Performance: A Dynamic Game Approach.” Optimal Control Applications and Methods 32, 1(2011):29–46.CrossRef Google Scholar

Boucekkine, R., Pommeret, A., and Prieur, F.. “Technological vs. Ecological Switch and the Environmental Kuznets Curve.” American Journal of Agricultural Economics 95, 2(2013a):252–60.CrossRef Google Scholar

Boucekkine, R., Pommeret, A., and Prieur, F.. “Optimal Regime Switching and Threshold Effects.” Journal of Economic Dynamics and Control 37, 12(2013b):2979–97.CrossRef Google Scholar

Boucekkine, R., Saglam, C., and Vallée, T.. “Technology Adoption under Embodiment: A Two-Stage Optimal Control Approach.” Macroeconomic Dynamics 8, 2(2004):250–71.CrossRef Google Scholar

Bovenberg, A.L., and Smulders, S.A.. “Transitional Impacts of Environmental Policy in an Endogenous Growth Model.” International Economic Review 37, 4(1996):861–93.CrossRef Google Scholar

Dasgupta, P. “Discounting Climate Change.” Journal of Risk and Uncertainty 37, 2–3(2008):147–69.CrossRef Google Scholar

Dasgupta, P., and Heal, G.M.. “The Optimal Depletion of Exhaustible Resources.” Review of Economic Studies 41(1974):3–28.CrossRef Google Scholar

German Federal Institute for Geosciences and Natural Resources (BGR). Internet site: http://www.bgr.bund.de (Accessed June 15, 2014).Google Scholar

Grimaud, A., and Rougé, L.. “Polluting Non-Renewable Resources, Innovation and Growth: Welfare and Environmental Policy.” Resource and Energy Economics 27(2005):109–29.CrossRef Google Scholar

Hung, N.M., and Quyen, N.V.. “On R&D Timing under Uncertainty: The Case of Exhaustible Resource Substitution.”Journal of Economic Dynamics & Control 17, 5–6(1993):971–91.CrossRef Google Scholar

Schou, P. “Polluting Non-Renewable Resources and Growth.” Environmental and Resource Economics 16, 2(2000):211–27.CrossRef Google Scholar

Solow, R.M. “Intergenerational Equity and Exhaustible Resources.” Review of Economic Studies 41(1974):29–45.CrossRef Google Scholar

Stern, N. The Economics of Climate Change: The Stern Review. New York: Cambridge University Press, 2007.CrossRef Google Scholar PubMed

Stiglitz, J. “Growth with Exhaustible Natural Resources: Efficient and Optimal Growth Paths.” Review of Economic Studies 41(1974a):123–37.CrossRef Google Scholar

Stiglitz, J.. “Growth with Exhaustible Natural Resources: The Competitive Economy.” Review of Economic Studies 41(1974b):139–52.CrossRef Google Scholar

Tsur, Y., and Zemel, A.. “Optimal Transition to Backstop Substitutes for Nonrenewable Resources.” Journal of Economic Dynamics & Control 27, 4(2003):551–72.CrossRef Google Scholar

Tsur, Y., and Zemel, A.. “Scarcity, Growth and R&D.” Journal of Environmental Economics and Management 49, 3(2005):484–99.CrossRef Google Scholar

U.S. Energy Information Administration (EIA). “Annual Energy Outlook.” Internet site: http://www.eia.gov/forecasts/aeo (Accessed June 15, 2014).Google Scholar

World Coal Association. Internet site: http://www.worldcoal.org/resources (Accessed June 15, 2014).Google Scholar