Hostname: page-component-78c5997874-s2hrs Total loading time: 0 Render date: 2024-11-05T03:21:47.918Z Has data issue: false hasContentIssue false

Heterogeneous biased expectations of young and old individuals: macroeconomic effects and policy implications

Published online by Cambridge University Press:  21 October 2024

Jaeho Lee*
Affiliation:
Research Department, Bank of Korea, Seoul, Republic of Korea
Rights & Permissions [Opens in a new window]

Abstract

This paper examines the effects of heterogeneous biased expectations between the young and old on business cycles and explores its policy implications. Empirical findings reveal that individuals, particularly the young, can have more optimistic or pessimistic views about the future state of the economy compared to the data-generating measure. This study relates these results to the learning-from-experience literature, which suggests that individuals, particularly the young, place greater weight on recent observations when forming their expectations. Incorporating household weighting schemes into a life-cycle learning model, I show that household sensitivity to recent observations amplifies the effects of economic shocks. However, the amplification effects become less extensive as the population ages due to the lower sensitivity of the old. My simulation results indicate that a 10 percentage point increase in the old population ratio leads to a 16 percent decrease in output volatility. Regarding policy implications, this paper suggests that the government spending multiplier declines by approximately 10 percent when the old population ratio rises by 10 percentage points due to weak amplification effects. Moreover, the weakened output effects deteriorate the welfare of the population, particularly that of the young.

Type
Articles
Copyright
© The Author(s), 2024. Published by Cambridge University Press

1. Introduction

Accelerated population aging has had significant economic consequences in the past decades. For instance, the aging population has been identified as a main factor for low economic growth or secular stagnation, a concept first described by Alvin Hansen in 1938. Another structural change that has captured the attention of scholars is the Great Moderation, which refers to the reduction in macroeconomic volatility since the 1980s. The literature identifies several factors that may have contributed to this trend, including monetary and fiscal policies, regulations on financial markets, and population aging. While this paper also acknowledges the role of the aging population in reducing macroeconomic volatility, it focuses on a novel channel through which heterogeneous biased expectations between young and older individuals play a crucial role.

Figure 1 supports the claim of this paper that population aging, characterized by a higher proportion of older individuals with lower sensitivity to economic conditions, is a key driver of the observed decline in macroeconomic fluctuations. Panel (a) shows the 10-year rolling window volatility of GDP growth rates excluding GFC and COVID-19 periods has decreased fast recently, but the old-age dependency ratio has surged during the same period. These facts imply that the Great Moderation can be linked to population aging. Panel (b) and (c) also plot the index of consumer expectations (ICE) during two economic crises and suggest that older individuals exhibit less sensitivity to recent observations in their expectations of the future economy.Footnote 1

Figure 1. GDP volatility and population aging, and index of consumer expectations by age group during crises. Notes: 10-Year S.D. of GDP Growth indicates the 10-year rolling window standard deviation of the quarterly GDP growth rates. The Old-Age Dependency Ratio is the ratio of people older than 64 to those ages 15–64. The y-axis in lower panels denotes the difference between ICE in a current period and the average ICE from 1978q1 to 2020q4. Source: FRED and Surveys of Consumers

Thus, I aim to answer the following questions: 1) “How do heterogeneous biased expectations between young and old individuals affect dynamics of macro-variables?”, and the related question is “How much does population aging contribute to the reduction in the volatility of business cycles in recent decades?”, and 2) “What are the policy implications of heterogeneous biased expectations between the young and old for an aging society?”

To answer these questions, this paper first investigates the properties of household expectations, with particular emphasis on the heterogeneity between the expectations of young and old individuals. In particular, a belief wedge, defined as the disparity between the mean unemployment forecast one year ahead from the Michigan survey and the vector autoregression (VAR) or survey of professional forecasters (SPF) unemployment forecast, is introduced as a measure of the deviation of household expectations from the data-generating measure. The belief wedge analyses show household expectations can diverge significantly from the VAR or SPF forecast, which means they can be overly optimistic or pessimistic than current overall economic conditions. Especially, younger individuals have a stronger tendency towards biased expectations compared to their older counterparts. As a result, the younger individuals’ biased expectations have a greater impact on our economy.

Although the literature offers various theories to explain this deviation, including sticky or noisy information and rational inattention, this paper adopts an adaptive learning mechanism from the learning-from-experience literature since it provides empirical evidence for heterogeneous biased expectations between young and older individuals. Malmendier and Nagel (Reference Malmendier and Nagel2016) and Malmendier and Shen (Reference Malmendier and Shen2024) show that both young and older individuals place more weight on recent experiences when forming their expectations, with young individuals placing relatively more weight than older individuals. In particular, the constant gain learning algorithm with different gain parameters for each age group can effectively capture these expectation formation rules.

In this study, a life-cycle (LC) learning model is constructed by combining the real business cycle (RBC) learning and overlapping generations (OLG) framework based on the works of Eusepi and Preston (Reference Eusepi and Preston2011) and Gertler (Reference Gertler1999). The model assumes bounded rationality, where households have an incomplete understanding of the economy and update their beliefs about the future market-clearing prices, a real wage and rental rate of capital, each period using a constant gain learning algorithm. Here, households act as if their beliefs will remain unchanged forever, as in Cogley and Sargent (Reference Cogley and Sargent2008)’s anticipated utility.Footnote 2 In particular, young households have larger gain parameters than old households, reflecting the young’s relatively heavier reliance on recent data. The gain parameters for young and old households are derived from Malmendier and Nagel (Reference Malmendier and Nagel2016).

The LC learning model incorporating heterogeneous household weighting schemes on past data presents the novel dynamics of macroeconomic variables and implications for business cycles and government policies.

At first, this study compares the dynamics of the macroeconomic variables in response to a technology shock in two different models: the LC learning model and the LC rational expectation (RE) model. By comparing these models, I shed light on how heterogeneous biased expectations arising from the learning mechanism differently affect economic fluctuations than rational expectations. Then, the LC learning model shows that household sensitivity to recent observations amplifies the effects of the technology shock. However, as the old population share increases, these amplification effects become less extensive, as older households have relatively lower sensitivity to recent shocks.

I also compare the LC learning model with the representative agent (RA) learning model and show the life-cycle assumption better explains the data in the real world. In other words, the RA learning model produces far greater amplification than the LC learning model, which does not match the data. This is because the RA does not consider the probability of death and losing a job in the future, and thus, they have a larger time discount factor. Therefore, the RA gives more importance to biased expectations about the future economy, which creates excessive amplification effects.

In addition, this paper examines the impact of heterogeneous biased expectations on the volatility of business cycles and finds household learning behavior creates fluctuations in economic activities that closely resemble those observed in the data. Furthermore, the lower sensitivity of old households to recent experiences leads to a decrease in business cycle volatility as the population ages. A 10 percentage point increase in the old population share (= the number of people aged 65 and over / 15 and over $\times$ 100) induces a 16 percent decrease in output volatility.Footnote 3 Notably, the United States has experienced a decline in GDP volatility of approximately 74 percent, coinciding with about a 6 percent point increase in the old population ratio from the 1980s to the 2010s, indicating the meaningful roles of heterogeneous biased expectations and population aging in reducing GDP volatility.

Lastly, the LC learning model provides important policy implications. First, household learning behavior amplifies the response of output to government spending shocks, although the magnitude of this response diminishes with an increase in the old household share due to their lower sensitivity. The government spending multiplier declines by approximately 10 percent when the old population ratio rises by 10 percentage points. I also conduct welfare analyses of government spending. The amplification of government spending effects under learning leads to an improvement in the welfare of the population compared to under rational expectations. However, as the old household share increases, the welfare of both young and old households deteriorates due to weakened output effects. Especially, the welfare of young households declines more drastically with an aging population due to their overreaction to the relatively poor performance of the economy in an aging society following a stimulus policy.

Related literature

This paper bridges the gap in three strands of literature: 1) biased expectations, 2) adaptive learning, and 3) the effects of population aging on business cycles and economic policies.

First, individuals often exhibit deviation from rational expectations, and I assume these biased expectations originate from heavy reliance on recent experiences. The learning-from-experience literature, including Malmendier and Nagel (Reference Malmendier and Nagel2011), Malmendier and Nagel (Reference Malmendier and Nagel2016), and Malmendier and Shen (Reference Malmendier and Shen2024), supports this claim and further shows that young individuals are more sensitive to recent observations than old individuals. Blanchard (Reference Blanchard2010) also suggests that the “deep scars” of the great financial crisis have a lasting impact on the economy. Moreover, there is growing literature on expectation-driven business cycles, as evidenced by Lorenzoni (Reference Lorenzoni2009), Angeletos and La’O (Reference Angeletos and La’O2013), Benhabib et al. (Reference Benhabib, Wang and Wen2015), Benhabib et al. (Reference Benhabib, Liu and Wang2016), etc. Similar to these papers, I study the effects of biased expectations on business cycles but the biased expectations are heterogeneous between the young and old, which has not been sufficiently dealt with in the literature.

This paper especially adopts adaptive learning to reflect the deviation from rational expectations. Adam et al. (Reference Adam, Matveev and Nagel2021) state that learning-from-experience could contribute to time-varying subjective expectation errors. Malmendier and Nagel (Reference Malmendier and Nagel2016) show the constant gain adaptive learning rule captures average survey expectations well. Eusepi and Preston (Reference Eusepi and Preston2011) introduce imperfect information and learning behavior in the RBC framework. In particular, households in their model employ the minimum state variable (MSV) constant gain learning algorithm like Mitra et al. (Reference Mitra, Evans and Honkapohja2013), and Evans and Honkapohja (Reference Evans and Honkapohja2001). I also adopt the MSV constant gain learning rule but add the heterogeneity between the young and old. Specifically, different gain parameters represent the heterogeneous learning rule between the young and old. Branch and McGough (Reference Branch and McGough2009) and Honkapohja and Mitra (Reference Honkapohja and Mitra2005) also employ the different gain parameters to model heterogeneity in household expectations.

This study also contributes to the literature on factors driving the stylized facts of an aging society, i.e., low business cycle volatility and less effective fiscal policy. Improved monetary policy (Stock and Watson, Reference Stock and Watson2003) and the low volatility of old individuals’ employment and hours worked (Jaimovich and Siu, Reference Jaimovich and Siu2009) are identified as the main drivers that lead to the reduction in the volatility of business cycles in recent decades, known as the Great Moderation. Basso and Rachedi (Reference Basso and Rachedi2021) and Honda and Miyamoto (Reference Honda and Miyamoto2020) show that the government spending multiplier declines as the population ages due to weak responses of private consumption and employment. However, I offer a novel perspective by examining the role of heterogeneous biased expectations between the young and old in driving economic changes in an aging society, which has not been thoroughly investigated in the literature.

The structure of this paper is as follows. Section 2 presents the empirical evidence for heterogeneous biased expectations and estimates their impacts on the economy. Then, I discuss mechanisms for heterogeneous biased expectations based on previous papers. Next, Section 3 builds the life-cycle learning model in which young and old households use a different adaptive learning rule when they form expectations. After that, Section 4 explores the macroeconomic effects of heterogeneous biased expectations, and Section 5 provides their fiscal policy implications for an aging society. Finally, Section 6 concludes.

2. Empirical evidence for heterogeneous biased expectations

This section analyzes biased expectations of individuals by computing belief wedges and investigates the differences between the young and older generations. Then, I examine the impacts of biased expectations by each age group on the economy and present the mechanisms through which heterogeneous biased expectations arise.

2.1 Biased expectations

The belief wedges, the measures of biased expectations, suggest that individuals can perceive the future economic situation more optimistically or pessimistically than current overall economic conditions.

Definition

The belief wedge is defined as the difference between the mean unemployment rate forecast one year ahead from the Michigan survey and the VAR unemployment rate forecast—henceforth, VAR Wedge. Here, the VAR forecasts are considered the data-generating measure. I also exploit the unemployment forecast from a SPF conducted by the Federal Reserve Bank of Philadelphia as an alternative to the VAR forecast—henceforth, SPF Wedge.Footnote 4

\begin{align*} Belief\ Wedge = Expected\ Unemployment\ of\ Michigan\ Survey\ - VAR\ (or\ SPF)\ Forecast \end{align*}

Methodology

The belief wedges are calculated based on Bhandari et al. (Reference Bhandari, Borovicka and Ho2024) and Mankiw et al. (Reference Mankiw, Reis and Wolfers2003). Specifically, the VAR forecasts are generated from a standard quarterly forecasting VAR model containing nine variables: CPI inflation, real GDP, unemployment rate, the relative price of investment goods, capital utilization rate, hours worked, consumption rate (=Consumption/GDP), investment rate (=Investment/GDP), and federal funds rate. The time lag for the VAR model is two, and the data for VAR estimation and SPF forecasts are obtained from FRED and FRB of Philadelphia. The sample period for SPF Wedge ranges from 1960Q1 to 2023Q3, but for VAR Wedge, it is only from 1960Q1 to 2019Q4 since the COVID-19 shock causes a structural break in the VAR model.

Results

Figure 2 illustrates the changes in two belief wedges over time, which are similar to the findings of Bhandari et al. (Reference Bhandari, Borovicka and Ho2024). Both VAR and SPF Wedges exhibit a noticeable trend of increasing sharply during economic recessions, suggesting over-pessimistic expectations as a result of negative economic shocks, and gradually decreasing post-recession. In particular, during the COVID-19 crisis, these biased expectations are even stronger than in any previous crisis. Overall, these patterns suggest that individuals hold time-varying biased expectations.

Figure 2. Belief wedges. Notes: The belief wedges are computed based on Bhandari et al. (Reference Bhandari, Borovicka and Ho2024) and Mankiw et al. (Reference Mankiw, Reis and Wolfers2003). This paper does not report VAR Wedge after 2020 as the COVID-19 shock causes a structural break in the VAR model.

2.2 Heterogeneous biased expectations

Belief wedges also suggest that biased expectations vary by age. Figure 3 shows the difference in the belief wedge between the age group “under 65” and “over 65,” especially using the SPF Wedge.

\begin{align*} Difference\ in\ Belief\ Wedge = Belief\ Wedge\ Under\ 65 - Belief\ Wedge\ Over\ 65. \end{align*}

Noticeably, the difference increases sharply during recessions and gradually decreases in the post-recession periods. Hence, it can be inferred that young individuals react more sensitively to recent macroeconomic shocks than older individuals, which causes the young to hold more biased pessimistic expectations than the older during and right after economic downturns. The detailed mechanism for the difference in biased expectations by age is discussed in Section 2.3.

Figure 3. Difference in belief wedge between “Under 65” and “Over 65.”

Now, this paper estimates Equation (1) to verify how biased expectations of each age group, i.e., “Over 65” or “Under 65,” differently affect the economy, especially consumption

(1) \begin{align} PCE_t^{detrend} = \alpha + \sum _{h=0}^{1}\beta _hBW^{over65}_{t-h} + \sum _{h=0}^{1}\gamma _hBW^{under65}_{t-h} + e_t \end{align}

where $PCE_t^{detrend}$ is the detrended personal consumption expenditure using Equation (2)

(2) \begin{align} PCE_t^{detrend} = \frac{PCE_t-PCE_t^{trend}}{PCE_t^{trend}} \times 100 \; (\%) \end{align}.

Here, $PCE_t^{trend}$ is the trend of personal consumption expenditure from HP-filtering. Also, $BW^{over65}$ and $BW^{under65}$ indicate the belief wedges of the age group “Over 65” and “Under 65,” respectively.

For data, personal consumption expenditure is from Bureau of Economic Analysis, and the sample period is from 1980Q1 to 2023Q3 for estimation using SPF Wedge and from 1980Q1 to 2019Q4 for estimation using VAR Wedge.

Table 1 provides estimation results and shows only the young individuals’ belief wedge is inversely related to private consumption. Specifically, the coefficient on $BW^{under65}$ has a statistically significant negative value, which means excessively negative expectations about future unemployment result in a reduction in consumption below the trend. However, the coefficients on $BW^{over65}$ are not statistically significant and have positive signs, making it difficult to derive economic meanings.

Table 1. Effects of biased expectations by age group on consumption

Notes: Standard errors in parentheses, and ***p $\lt$ 0.01, **p $\lt$ 0.05, *p $\lt$ 0.1.

Through these empirical findings, this paper argues that older individuals are less overly affected by recent experiences, resulting in less biased expectations and subsequently weaker impacts on economic activities. From this perspective, the impact of biased expectations is expected to decrease as the proportion of older individuals increases.

2.3 Mechanisms for heterogeneous biased expectations

This paper explains heterogeneous biased expectations with a mechanism devised in the learning-from-experience literature. Although previous papers suggest that the deviation from rational expectations may be attributed to sticky or noisy information, rational inattention, etc, these assumptions cannot fully support the empirical findings that the expectations of older individuals are less biased than those of younger individuals. However, the learning-from-experience literature provides clear evidence for that. In particular, the literature highlights that the biased expectation formation rule can be well-captured by the constant gain learning algorithm with a different gain parameter by age.

The following are more details about the learning-from-experience literature. According to Malmendier and Nagel (Reference Malmendier and Nagel2011), individuals tend to assign more weight to recent stock returns when forming their expectations, leading to more optimistic beliefs about future stock returns after experiencing higher returns. Similarly, Malmendier and Nagel (Reference Malmendier and Nagel2016) find that people overweight inflation realized during their lifetimes, especially more recent data when predicting future inflation. Especially, these beliefs are heterogeneous between the young and old since young individuals place more weight on recent data than older individuals. Malmendier and Shen (Reference Malmendier and Shen2024) also show that personal experiences of unemployment, particularly recent ones, have long-lasting effects on consumption decisions. However, this beliefs-based channel is weaker in an old cohort than in a younger cohort.

3. Model

The model, based on the RBC framework, includes two market-clearing prices: real wages for the labor market and rental rates of capital for the capital market. While competitive firms take these factor prices as given, households hold biased expectations about them. Specifically, following Eusepi and Preston (Reference Eusepi and Preston2011), households possess an incomplete understanding of the economy and update their beliefs regarding market-clearing prices by relying on historical patterns in observed data.Footnote 5 They especially tend to assign greater weight to recent observations when forming expectations, using an adaptive learning algorithm. I also incorporate a life-cycle assumption, i.e., young households or workers and old households or retirees, into the RBC learning model following Gertler (Reference Gertler1999), Blanchard (Reference Blanchard1985), and Gali (Reference Gali2021). As discussed in the previous sections, young individuals weigh relatively more on the recent data than old individuals in the learning model. Including heterogeneous agents with distinct weighting schemes leads to unique macroeconomic dynamics and implications for business cycles and government policies.

The adaptive learning rules relax the assumptions under rational expectations that agents optimally forecast future variables and solve dynamic optimization problems. In contrast, households in the learning model lack information about other agents’ behavior and, as a result, cannot directly infer the aggregate laws of motion as they would in the RE model. Consequently, they rely on econometric time-series models to form their forecasts. Evans (Reference Evans2019) and Evans et al. (Reference Evans, Honkapohja and Mitra2009) explain how the adaptive learning model operates: at time $t$ , agents make decisions based on the current state, realizations of exogenous shocks, and expectations of relevant variables. The aggregation of heterogeneous agents’ behavior, combined with market clearing, determines the temporary equilibrium outcomes for endogenous variables in the economy. At time $t+1$ , expectations are updated using the new data point provided by the temporary equilibrium, and the process repeats. Over time, this sequence of temporary equilibrium may yield parameter estimates for the forecasting models that converge to a fixed point, corresponding to a rational expectations equilibrium for the economy. For more detailed explanations see Evans and Honkapohja (Reference Evans and Honkapohja2001).

3.1 Firms

There are identical competitive firms of mass one. Each firm produces goods using capital $K_t$ and labor $H_t$ . The production function is

(3) \begin{align} Y_t = (K_t)^\alpha (X_t H_t)^{1-\alpha } \end{align}

where $0\lt \alpha \lt 1$ . Here, $X_t$ denotes the aggregate labor-augmenting technical progress which evolves via

(4) \begin{align} \ln \left(\frac{X_{t+1}}{X_t}\right) = \ln (\chi _{t+1}) = \ln (\overline{\chi }) + u_{\chi, t+1} \end{align}

where $u_{\chi, t}$ indicates an i.i.d. random variable with zero mean and standard deviation $\sigma _{u_{\chi }}$ . The stochastic process for the evolution of the technological shock is assumed known to the agents.

Firms maximize their profits with factor prices, the real wage $W_t$ and returns to capital $R^k_t$ , as given. The optimality conditions are

(5) \begin{align} W_t = (1-\alpha )K_t^{\alpha }X_t^{1-\alpha }H_t^{-\alpha } = (1-\alpha )\frac{Y_t}{H_t} \end{align}
(6) \begin{align} R^k_t = \alpha K_t^{\alpha -1}(X_t H_t)^{1-\alpha } = \alpha \frac{Y_t}{K_t}\end{align}.

From now on, lowercase letters denote variables normalized with the technology $X_t$ . Then, the normalized production function and optimality conditions are

(7) \begin{align} y_t = (k_t)^\alpha (\chi _t)^{-\alpha }(H_t)^{1-\alpha } \end{align}
(8) \begin{align} w_t = (1-\alpha )(k_t)^\alpha (\chi _t)^{-\alpha }H_t^{-\alpha } = (1-\alpha )\frac{y_t}{H_t} \end{align}
(9) \begin{align} R^k_t = \alpha k_t^{\alpha -1}(\chi _t)^{-\alpha }(H_t)^{1-\alpha } = \alpha \frac{y_t}{k_t} \end{align}.

3.2 Belief updating

Households are assumed to use an economic model to forecast future market-clearing prices. The model relates the wage and returns to capital to the aggregate capital and the distribution of the wealth between young and old households, which are two minimum state variables (MSV) in the model

(10) \begin{align} \hat{R}^k_t = \mu _r + \mu _{rk}\hat{k}_t + \mu _{r\lambda }\lambda _t + e^r_t \end{align}
(11) \begin{align} \hat{w}_t = \mu _w + \mu _{wk}\hat{k}_t + \mu _{w\lambda }\lambda _t + e^w_t \end{align}
(12) \begin{align} \hat{k}_{t+1} = \mu _k + \mu _{kk}\hat{k}_t + \mu _{k\lambda }\lambda _t + e^k_t \end{align}
(13) \begin{align} \lambda _{t+1} = \mu _{\lambda } + \mu _{\lambda k}\hat{k}_t + \mu _{\lambda \lambda }\lambda _t + e^{\lambda }_t \end{align}

where $\lambda _t=\frac{\hat{a}^o_t}{\hat{a}_t}$ represents the distribution of the wealth between young and old households.Footnote 6 Here, $a^y_t$ , $a^o_t$ , and $a_t$ are normalized $A^y_t$ , $A^o_t$ , and $A_t$ , which are the assets of aggregate young, old, and total households, respectively, and $\hat{x}$ indicates the log-linearization of the variable $x$ around a balanced growth path (BGP). $e_t$ denotes a regression error or households consider $e_t$ an idiosyncratic disturbance (i.e., a perceived white noise unobserved shock). As in Eusepi and Preston (Reference Eusepi and Preston2011), this paper excludes the technology shock from the household forecasting system since the technology shock is the only disturbance in the model, and thus, households learn quickly, which means they correctly expect market prices right away, if it is included.Footnote 7

Equation (10)-(13) can be rewritten in a matrix form as Equation (14)

\begin{equation*} z_t^{\prime} = \begin {pmatrix} \hat {R}^k_t \\ \hat {w}_t \\ \hat {k}_{t+1} \\ \lambda _{t+1} \end {pmatrix},\; x_{t-1} = \begin {pmatrix} 1 \\ \hat {k}_t \\ \lambda _t \end {pmatrix},\; \zeta _{t} = \begin {pmatrix} \mu _{r,t} & \mu _{w,t} & \mu _{k,t} & \mu _{\lambda, t} \\ \mu _{rk,t} & \mu _{wk,t} & \mu _{kk,t} & \mu _{\lambda k,t} \\ \mu _{r\lambda, t} & \mu _{w\lambda, t} & \mu _{k\lambda, t} & \mu _{\lambda \lambda, t} \end {pmatrix},\; e_t^{\prime} = \begin {pmatrix} e^r_t \\ e^w_t \\ e^k_t\\ e^{\lambda }_t \end {pmatrix} \end{equation*}
(14) \begin{align} z_{t} = x^{\prime}_{t-1}\zeta _t + e_t. \end{align}

Under rational expectations, $\mu _{r,t} = \mu _{w,t} = \mu _{k,t} = \mu _{\lambda, t} = 0$ , and the coefficients of the model are time-invariant (i.e., $\mu _{ij,t}=\overline{\mu }_{ij} \quad \text{where} \quad i \in \{r,w,k,\lambda \} \quad \text{and} \quad j \in \{k,\lambda \}$ ). Also, $e^r_t = \overline{\mu }_{r\chi }\hat{\chi }_t$ , $e^w_t = \overline{\mu }_{w\chi }\hat{\chi }_t$ , $e^k_t = \overline{\mu }_{k\chi }\hat{\chi }_t$ , and $e^{\lambda }_t = \overline{\mu }_{\lambda \chi }\hat{\chi }_t$ . In other words, agents in the RE model have a complete understanding of the model and know the individual preferences and technologies of other market participants. Thus, they are aware of the exact relationship among the state variables, technology shocks, and factor prices. Therefore, the technology shocks are included in the forecasting system and the coefficients in Equation (10)–(13) are true and time-invariant. However, under learning, households update the coefficients every period as they observe new data. I employ constant gain recursive least squares estimates as an updating algorithm

(15) \begin{align} \zeta _{t}^i = \zeta _{t-1}^i + g^i(Q^i_t)^{-1}x_{t-1}\underbrace{(z_t - x_{t-1}^{\prime }\zeta ^i_{t-1})}_{\text{Forecast Error}} \end{align}
(16) \begin{align} Q_t^i = Q^i_{t-1} + g^i(x_tx_t^{\prime } - Q^i_{t-1}) \end{align}

where $i \in \{y, o\}$ and $Q^i_t$ is the estimate of the second-moment matrix of regressors.

Equations (15) and (16) imply that the coefficients estimated in the previous period are updated according to the forecast errors produced for the current period. So, households use the previous period’s coefficient estimates when forming expectations, and then, update their coefficients at the end of each period.

The gain parameter $g$ allows the deviation from rational expectations. That is, the larger $g$ , the more weight is given to recent data.Footnote 8 Here, young and old households have different gain parameters, i.e., $g^y$ and $g^o$ , based on Malmendier and Nagel (Reference Malmendier and Nagel2016)’s empirical findings.Footnote 9 , Footnote 10 In particular, young individuals are assumed to put more weight on recent data than old individuals, resulting in a larger gain parameter for the former, i.e., $g^y$ is greater than $g^o$ . Collin-Dufresne et al. (Reference Collin-Dufresne, Johannes and Lochstoer2017) also assume that the gain parameter of the young agents is five times larger than that of the old agents.Footnote 11 Since young and old households have different weighting schemes on the past data, their adaptive expectations about the factor prices can be distinct from each other. These differences between age groups finally produce the novel dynamics of the macro-variables rather than assuming a representative agent.

There is one more assumption worth mentioning. In my model, agents do not know what other agents expect, and thus, their expectations are not affected by others. If young households are aware of old households’ expectations or vice versa, they can internalize the impacts of updating their beliefs on the economy. So, households quickly learn and find the true evolution of wages and interest rates since the household learning behavior does not create forecast errors.

3.3 Households

This paper assumes an economy with overlapping generations following Gertler (Reference Gertler1999), Gali (Reference Gali2021), and Blanchard (Reference Blanchard1985). As explained in Section 3.2, households have heterogeneous expectations depending on their age due to the different rules of forming expectations. Therefore, I index households according to their age, i.e., young (workers) or old (retirees) households. The size of the population is constant and normalized to one. Each individual has the constant probability $\gamma$ of surviving into the following period, independently of their age and economic status. Moreover, each worker faces the constant probability $1-v$ of becoming old and retired permanently. This probability is also independent of their age and economic status. Consequently, the size of young individuals or workers at any time is the constant $\phi = \frac{1-\gamma }{1-v\gamma } \in (0,1]$ and that of old individuals or retirees is $1-\phi = \frac{\gamma (1-v)}{1-v\gamma }$ .

This study adopts the perfect annuity market assumption introduced by Gertler (Reference Gertler1999) and Gali (Reference Gali2021), where agents are insured against the risk of death. More specifically, households have an annuity contract with a perfectly competitive insurance company that issues payments proportional to the household’s financial wealth. Upon death, the household’s wealth is transferred to the insurance company. The surviving households receive all returns in this market.Footnote 12 Furthermore, an insurance market is introduced to mitigate the risk of income loss during retirement. The complete market assumption makes the model more tractable, especially in terms of aggregating household consumption.

3.3.1 Old households (retirees)

The old household or retiree of cohort ‘ $a$ ’ is the agent who retired ‘ $a$ ’ quarters ago.Footnote 13 Each agent maximizes the following Bellman equationFootnote 14

(17) \begin{align} V^o(A_{a,t}^o, A^o_t, A_t, K_t) = \text{Max} \{ \ln C^o_{a,t} + \gamma \beta \tilde{E}^o_t V^o(A^o_{a+1,t+1}, A^o_{t+1}, A_{t+1}, K_{t+1})\} \end{align}

and the budget constraint for old households is

(18) \begin{align} C_{a,t}^o + \gamma A^o_{a+1,t+1} = R_t A^o_{a,t} + S_t \end{align}

where $\beta$ is a time discount factor, $\tilde{E_t}$ indicates subjective expectations for the future, and the superscript $o$ denotes old or retired households. Also, $C^o_{a,t}$ is consumption, and $A^o_{a,t}$ is the asset that old households hold at the beginning of time $t$ . As mentioned above, only survivors receive all the returns, and households who die are paid nothing. The real interest rate $R_t$ satisfies $R_t = R^k_t + 1 - \delta$ where $\delta$ is the depreciation rate of capital due to the absence of arbitrage between loans and capital. $S_t$ is the social security benefit that old households receive. The state variables { $A^o_t, A_t, K_t$ }—aggregate old household asset, total asset, and total capital—are used when households expect the future wages and returns to capital, which is discussed in Section 3.2 in detail. Households know only their own objectives, constraints, and beliefs as in Eusepi and Preston (Reference Eusepi and Preston2011).

Then, Euler equation is

(19) \begin{align} (C_{a,t}^o)^{-1} = \beta \tilde{E}^o_t [(C_{a+1,t+1}^o)^{-1}R_{t+1}] \end{align}

and the Euler equation and budget constraint are normalized with technology as follows

(20) \begin{align} (c_{a,t}^o)^{-1} = \beta \tilde{E}^o_t [(c_{a+1,t+1}^o)^{-1}\chi ^{-1}_{t+1}R_{t+1}] \end{align}
(21) \begin{align} c_{a,t}^o + \gamma a^o_{a+1,t+1} = R_t a^o_{a,t}\chi _t^{-1} + s_t \end{align}.

To get the aggregate consumption of old households, this paper derives the intertemporal budget constraint (IBC) from the one-period budget constraint, Equation (21). After that, the IBC is log-linearized around a BGP and then I aggregate each cohort’s consumption using Euler equation, Equation (20). Finally, we obtain log-linearized aggregate consumption of old households

(22) \begin{align} \hat{c_t}^o = \; &\eta _{ay}(1-v)\hat{a}_{t-1}^y + \eta _{ao}\hat{a}^o_t + \eta _{r\chi }(\hat{R}_t-\hat{\chi }_t) + \eta _s\hat{s}_t \\ \notag & -\eta ^e_{r}\underbrace{\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h\hat{R}_{t+1+h}}_{\text{EPV of Returns to Capital}} +\eta ^e_{s}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h\hat{s}_{t+1+h} \end{align}

where $\eta$ ’s consist of primitive model parameters.Footnote 15 Note that Equation (22) shows how current and expected variables, such as real interest rates, affect the consumption decisions of old households.

3.3.2 Young households (workers)

The young household or worker of cohort ‘ $b$ ’ is the agent who started to work (was born) ‘ $b$ ’ quarters ago. Each agent maximizes the following Bellman equation

(23) \begin{align} V^y(A^y_{b,t}, A^y_t, A_t, K_t) = & \,\text{Max} \{ \ln C^y_{b,t} + \theta \ln (1-H_{b,t}) + \\\notag & \gamma \beta \tilde{E}^y_t[v V^y(A^y_{b+1,t+1}, A^y_{t+1}, A_{t+1}, K_{t+1}) \\\notag & + (1-v) V^o(A^{yo}_{b+1,t+1}, A^y_{t+1}, A_{t+1}, K_{t+1})]\} \end{align}

and the budget constraint for young households is

(24) \begin{align} C^y_{b,t} + v\gamma A^y_{b+1,t+1} + (1-v)\gamma A^{yo}_{b+1,t+1} = R_tA^y_{b,t} + W_tH_{b,t} - T_t \end{align}

where the superscript $y$ denotes young households or workers, $C^y_{b,t}$ is consumption, and since there are two possible future states, young households save $A^y_{b+1,t+1}$ for staying young and $A^{yo}_{b+1,t+1}$ for retiring next period. Complete asset markets insure young households against the risk of retirement. $W_t$ is the real wage, $H_{b,t}$ is the labor supply, and $T_t$ is the lump-sum tax. The state variables { $A^y_t, A_t, K_t$ }—aggregate young household asset, total asset, and total capital—are used when young households expect the future wages and returns to capital, which is discussed in Section 3.2 in detail.

Young households solve the maximization problem, Equation (23), subject to the budget constraint, Equation (24). Then, the normalized Euler equations, labor supply condition, and budget constraint of young households are as follows

(25) \begin{align} (c_{b,t}^y)^{-1} = \beta \tilde{E}^y_t [(c_{b+1,t+1}^y)^{-1}\chi ^{-1}_{t+1}R_{t+1}] \end{align}
(26) \begin{align} (c_{b,t}^y)^{-1} = \beta \tilde{E}^y_t [(c_{b+1,t+1}^o)^{-1}\chi ^{-1}_{t+1}R_{t+1}] \end{align}
(27) \begin{align} \frac{\theta c^y_{b,t}}{1-H_{b,t}} = w_t \end{align}
(28) \begin{align} c^y_{b,t} + v\gamma a^y_{b+1,t+1} + (1-v)\gamma a^{yo}_{b+1,t+1} = R_ta^y_t\chi ^{-1}_t + w_tH_{b,t} - \tau _t \end{align}.

Finally, we obtain log-linearized aggregate consumption of young households using the same way to get aggregate consumption of old households

(29) \begin{align} \hat{c}^y_t =& \; \psi _{ay}\hat{a}^y_t + \psi _{r\chi }(\hat{R}_t-\hat{\chi }_t) + \psi _w\hat{w}_t-\psi _{\tau }\hat{\tau }_t - \psi ^e_{\tau } \tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{\tau }_{t+1+h} \\ \notag & + \psi ^e_w \underbrace{ \tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{w}_{t+1+h}}_{\text{EPV of Wages}} - \psi ^e_{r}\underbrace{\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{R}_{t+1+h}}_{\text{EPV of Returns to Capital}} \\ \notag & -(1-v)\gamma \psi ^e_{ro}\underbrace{ \tilde{E}^y_t\sum ^{\infty }_{h=1}(\frac{1-v^h}{1-v})(\gamma \beta )^h\hat{R}_{t+1+h}}_{\text{EPV of Returns to Capital after Retiring}} \\ \notag & +(1-v)\gamma \psi ^e_{s} \tilde{E}^y_t\sum ^{\infty }_{h=1}(\frac{1-v^h}{1-v})(\gamma \beta )^h\hat{s}_{t+h} \end{align}

where $\psi$ ’s consist of primitive model parameters.Footnote 16 Note that Equation (29) shows how current and expected variables such as the real interest rates and real wages affect the consumption decisions of young households. In particular, the last two terms in Equation (29) appear since young households or workers also consider they can lose their jobs and become retirees at any time in the future with the probability of $1-v$ .

3.4 Government

The government levies lump-sum taxes $T_t$ on young households or workers and consumes $G_t$ . Also, it pays retirees social security benefits $S_t$ each period. Thus, the government satisfies the following budget constraint

(30) \begin{align} \phi T_t= G_t + (1-\phi )S_t \end{align}.

The government maintains its policy variables, $T_t$ , $G_t$ , and $S_t$ , at steady-state levels, which is announced in advance. In particular, the policy announcement is credible, so agents believe there will be no policy changes in the future. These assumptions are based on Mitra et al. (Reference Mitra, Evans and Honkapohja2013).

3.5 Market clearing

Goods and asset market-clearing conditions are as follows:

(31) \begin{align} Y_t = C_t + I_t + G_t \end{align}
(32) \begin{align} C_t = C_t^y + C_t^o \end{align}
(33) \begin{align} K_{t+1} = (1-\delta )K_{t} + I_t \end{align}
(34) \begin{align} A_t = K_t \end{align}
(35) \begin{align} A_t = A_t^y + A_t^o \end{align}.

3.6 Model parameter calibration

Table 2 presents calibrated model parameter values. The RBC model parameters such as $\alpha$ , $\beta$ , and $\delta$ are set as commonly used values in literature like King and Rebelo (Reference King and Rebelo2000), etc. The capital share $\alpha$ is 1/3, and the time discount factor $\beta = 0.995$ corresponds to the annual discount rate of 2 percent. The depreciation rate $\delta = 0.025$ matches to the annual depreciation rate of 10 percent. The proportion of young households $\phi = 0.8$ is set based on the relevant data. Specifically, the average ratio of the population aged 15–64 to the population aged 15 and older from 2014 to 2023 is 80.3 percent. The probability of surviving in the next period $\gamma$ is 0.9959 following Gali (Reference Gali2021)’s calibration in which he uses lifetime expectancy.Footnote 17 Then, this paper assigns 0.9989 to the probability of staying workers in the following period $v$ to set the young households’ share to 80 percent ( $\phi =0.8$ ). The weight on utility from leisure $\theta$ is 3.39 which satisfies the labor supply condition of young households in the steady state. Most importantly, the gain parameters for old and young households, $g^o=0.010$ and $g^y=0.018$ , are derived from Malmendier and Nagel (Reference Malmendier and Nagel2016) in which they estimate the gain parameter by age using inflation expectation data. For robustness checks, I also employ various gain parameter values, which is discussed in Section 4.1. The standard deviation of the technology shock $\sigma _{u_{\chi }}$ is calibrated to 0.0078 to match the volatility of output in the model to that in data as seen in Table 4. The growth of productivity in the steady state $\bar{\chi }$ is 1.0053 based on Eusepi and Preston (Reference Eusepi and Preston2011), and the labor supply in the steady state $\bar{H}$ is 1/4.

Table 2. Model parameter calibration

4. Effects of heterogeneous biased expectations on business cycles

This section compares the LC model under learning with the LC model under RE to investigate how heterogeneous biased expectations affect the dynamics of macroeconomic variables compared to RE. Then, I also compare the LC learning model with the representative agent (RA) learning model to show the life-cycle assumption better explains the data in the real world. Lastly, this paper studies the impacts of heterogeneous biased expectations on the volatility of business cycles and presents the implications for an aging society.

4.1 LC model under learning vs LC model under RE

The simulation process for obtaining the impulse responses of macro-variables is as follows. First of all, the 2000-period simulation under RE provides initial steady-state coefficients for the household forecasting model, Equation (10) – (13), and the simulation data are discarded. Then, the coefficients are updated every period in the learning model but keep their initial values in the RE model. Next, an N-period impulse response is obtained with a 1 percent permanent technology shock in the period 2001 and no more shocks after that. Then, this simulation is repeated 2,000 times, and I report the median response of model variables to the technology shock for IRF. The simulation methods are based on the work of Eusepi and Preston (Reference Eusepi and Preston2011) and Mitra et al. (Reference Mitra, Evans and Honkapohja2013).

4.1.1 Effects on aggregate variables

Figure 4 illustrates the responses of output, consumption, investment, and hours to a 1 percent permanent technology shock. Then, the upper left panel reveals that the household sensitivity to recent observations in the learning model generates amplification effects on output but not in the RE model due to no biased expectations under RE. We can interpret this amplification as households become overly optimistic about the future economy after the positive economic shock. Particularly, the output response exhibits a hump-shaped profile as found in Eusepi and Preston (Reference Eusepi and Preston2011) and Cogley and Nason (Reference Cogley and Nason1995).Footnote 18 The upper right panel shows the consumption response in the learning model is less than that in the RE model immediately after the shock, as households have overly optimistic expectations about the future returns to capital, and thus, increase investment sharply, as depicted in the lower left panel. However, in the long run, households in the learning model consume more than those in the RE model by utilizing their over-savings. The lower right panel suggests an overshooting in labor supply under learning.

Figure 4. Impulse responses of macrovariables to 1 percent permanent technology shock. Notes: The solid and dotted lines denote the median, 25th, and 75th percentile impulse response under learning, respectively. The dashed line indicates the impulse responses under RE. Unshocked BGP signifies the initial balanced growth path prior to the permanent technology shock.

Mechanism

To explicate the mechanisms of the heterogeneous biased expectations in the learning model, I provide a graphical representation in Figure 5. This figure illustrates the young and old households’ expected present values (EPV) of returns to capital and labor after a positive technology shock under learning and RE. For example, the EPV of returns to capital and labor for young households under learning and RE are

\begin{equation*} \text {Under Learning:} \quad \tilde {E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h \hat {R}_{t+1+h} \quad \text {and} \quad \tilde {E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h \hat {w}_{t+1+h} \end{equation*}
\begin{equation*} \text {Under RE:} \quad E_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat {R}_{t+1+h} \quad \text {and} \quad E_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat {w}_{t+1+h} \end{equation*}

where each term appears in the young households’ aggregate consumption decision rule. Here, the only difference between learning and RE models is the way of forming expectations, i.e., $\tilde{E}$ and $E$ .

Then, the EPV of market prices in the learning model is significantly different from those in the RE model: i.e., the expected returns to capital are higher, while wages are lower under learning than under RE. This is because households in the learning model have biased expectations due to their overweight on recent observations, which causes forecast errors. In particular, young households have more optimistic expectations about the future returns to capital than old households, primarily because of their relatively higher sensitivity to recent shocks. However, in the RE framework, both young and old households have almost the same expectations since no different weighting schemes and heterogeneous biased expectations exist between the young and old.

Figure 5. Expected present value of returns to capital and labor.

Consumption, investment, and hours

The mechanism above gives further explanations about the dynamics of the macro-variables illustrated in Figure 4. The households’ overly positive expectations regarding future returns to capital result in lower consumption and higher investment right after the shock, followed by an increase in consumption in the future through savings. Thus, households smooth their consumption through investment. Also, young households seek to work more today to offset their overly pessimistic wage expectations and to boost their investment and future consumption.

Oscillations in expectations

As seen in Figure 5, the household learning behavior that places more weight on recent observations results in oscillations in expectations, which can lead to fluctuations in economic activities. The paper argues that these oscillations in expectations can create further volatility in business cycles, a topic that is discussed in Section 4.3 with more details.

Persistence

An important distinction between the learning and RE models is whether persistent effects occur. As shown in Figure 4, the responses in the learning model are more persistent than those in the RE model. This persistence arises from the household learning behavior illustrated in Figure 5. Since households assign more significance to recent data, it takes a considerable amount of time for expectations under learning to approach those under RE. In other words, the persistence in the learning model is caused by adjustments in beliefs. However, in the absence of additional technology shocks, household expectations will converge to rational expectations as forecasting errors gradually diminish.

4.1.2 Effects on variables by age group

Based on the impulse responses depicted in Figure 6, it can be inferred that the biased expectations of young households play a crucial role in the amplified response of macroeconomic variables. The upper panels display the response of consumption by age group in the learning and RE model. As shown in the left panel, young households in the learning model exhibit a greater reaction to the technology shock than old households, owing to their relatively higher sensitivity to the shock. Conversely, the right panel indicates no discernible difference between young and old households’ consumption in the RE model since rational expectations do not generate forecast errors or biased expectations. The lower panels also show that heterogeneous biased expectations cause young and old households to respond differently in terms of saving, particularly with young households engaging in more over-saving.

Figure 6. Impulse responses of consumption and saving to 1 percent permanent technology shock by age group.

Figure 7 denotes the response of output to a 1 percent permanent technology shock in the learning model by the proportion of young households in the economy $\phi$ : 0.7 (=70 percent), 0.8 (=80 percent), and 0.9 (=90 percent). Consistent with previous results, a decrease in the proportion of young households, which corresponds to an increase in the share of the old population, results in a smaller response of output. Therefore, it can be concluded that the amplification effects of biased expectations become weaker in an aging society since population aging leads to a higher proportion of old households who exhibit lower sensitivity to recent shocks.

Figure 7. Impulse response of output by proportion of young households ( $\phi$ ).

Gain parameters

To ensure the robustness of the results, this paper explores how the response of output to the positive technology shock varies as the gain parameters change, which is shown in Figure 8. When the gain parameters of young and old households, i.e., $g^y$ and $g^o$ , are zero, then the response under learning is identical to the one under RE. The zero gain parameters imply no biased expectations in the economy. Thus, the household expectations under learning match rational expectations. The left panel also illustrates that the response of output increases as the gain parameters rise. Larger gain parameters signify households react more sensitively to recent shocks, leading to more significant amplification effects. The right panel suggests an increase in the gain parameter of young households and a decrease in that of old households result in a larger response than the opposite case. This is because young households account for a larger share of the population and have biased expectations not only about future returns to capital but also future wages.

Figure 8. Impulse response of output under learning by gain parameter.

4.2 LC model under learning vs RA model under learning

I compare the LC learning model featuring heterogeneous beliefs and adaptive expectations with the RA learning model incorporating adaptive learning and homogeneous beliefs to offer a more comprehensive perspective. In particular, by comparing the RA and LC learning models, I show the life-cycle assumption better explains the data in the real world.

4.2.1 Representative agent learning model

I modify some parts of the LC learning model to obtain the RA learning model.

Firms

The firm sector in the RA learning model is assumed to be the same as in the LC learning model.

Households

An infinitely-lived representative agent maximizes the following Bellman equation

(36) \begin{align} V(A^{RA}_{t}, K^{RA}_{t}) = \text{Max} \{\ln C^{RA}_{t} + \theta \ln (1-H^{RA}_{t}) + \beta \tilde{E}^{RA}_t[V(A^{RA}_{t+1}, K^{RA}_{t+1})]\} \end{align}

and their budget constraint is

(37) \begin{align} C^{RA}_{t} + A^{RA}_{t+1} = R^{RA}_tA^{RA}_{t} + W^{RA}_tH^{RA}_{t} - T^{RA}_t \end{align}

where $\tilde{E}^{RA}_t$ is the subjective expectation or belief by the representative agent, $C^{RA}_{t}$ is consumption, $A^{RA}_{t}$ is the asset, $W^{RA}_t$ is the real wage, $H^{RA}_{t}$ is the labor supply, and $T^{RA}_t$ is the lump-sum tax.

Then, this paper obtains log-linearized aggregate consumption of the representative agent using the intertemporal budget constraint and Euler equation

(38) \begin{align} \bar{c}\hat{c}^{RA}_t = \; & \frac{(1-\beta )\bar{a}}{(1+\theta )\beta }\hat{a}^{RA}_t + \Big [\bar{c}-\frac{\bar{w}}{(1+\theta )}\Big ](\hat{R}^{RA}_t-\hat{\chi }^{RA}_t) \\\notag & + \frac{(1-\beta )\bar{w}}{(1+\theta )}\hat{w}^{RA}_t - \frac{(1-\beta )\bar{\tau }}{(1+\theta )}\hat{\tau }^{RA}_t \\\notag & - \frac{\beta (\bar{w}^-\bar{\tau })}{(1+\theta )}\tilde{E}^{RA}_t\sum ^{\infty }_{h=0}\beta ^h[\hat{R}^{RA}_{t+1+h}-\hat{\chi }^{RA}_{t+1+h}] \\\notag & + \frac{(1-\beta )\beta \bar{w}}{(1+\theta )}\tilde{E}^{RA}_t\sum ^{\infty }_{h=0}\beta ^h\hat{w}^{RA}_{t+1+h} \\\notag & - \frac{(1-\beta )\beta \bar{\tau }}{(1+\theta )}\tilde{E}^{RA}_t\sum ^{\infty }_{h=0}\beta ^h\hat{\tau }^{RA}_{t+1+h} \end{align}.

For the comparison, log-linearized aggregate consumption in the LC learning model, which is the summation of Equation (22) and (29), is as follows

(39) \begin{align} \bar{c}\hat{c}_t = \; & \Big [\frac{(1-\gamma \beta )(1-v)}{\beta }+\psi ^{^{\prime}}v\Big ]\bar{a}^y\hat{a}^y_t + \frac{(1-\gamma \beta )}{\beta }\bar{a}^o\hat{a}^o_t \\\notag & + \Big [\bar{c}-\frac{\psi ^{^{\prime}}\phi \beta \bar{w}}{(1-v\gamma \beta )}\Big ](\hat{R}_t-\hat{\chi }_t) + \psi ^{^{\prime}}\phi \beta \bar{w}\hat{w}_t - \psi ^{^{\prime}}\phi \beta \bar{\tau }\hat{\tau }_t \\\notag & - \Big [\frac{\psi ^{^{\prime}}\phi v \gamma \beta ^2(\bar{w}-\bar{\tau })}{(1-v\gamma \beta )}\Big ]\tilde{E}^y_t \sum ^{\infty }_{h=0}(v\gamma \beta )^h[\hat{R}_{t+1+h}-\hat{\chi }_{t+1+h}] \\\notag & + \psi ^{^{\prime}}\phi v \gamma \beta ^2\bar{w}\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{w}_{t+1+h} \\\notag & - \psi ^{^{\prime}}\phi v \gamma \beta ^2\bar{\tau }\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{\tau }_{t+1+h} \end{align}

where

\begin{equation*} \psi ^{^{\prime}}=\frac {(1-v\gamma \beta )(1-\gamma \beta )}{(1+\theta )\beta (1-\gamma \beta )+(1-v)\gamma \beta ^2} \end{equation*}.
Belief updating

The representative agent is assumed to use an economic model to forecast future wages and returns to capital. The model relates the wage and returns to capital to the aggregate capital which is the unique state variable in the RA model

(40) \begin{align} \hat{R}^{k,RA}_t = \mu ^{RA}_r + \mu ^{RA}_{rk}\hat{k}^{RA}_t + e^{r,RA}_t \end{align}
(41) \begin{align} \hat{w}^{RA}_t = \mu ^{RA}_w + \mu ^{RA}_{wk}\hat{k}^{RA}_t + e^{w,RA}_t \end{align}
(42) \begin{align} \hat{k}^{RA}_{t+1} = \mu ^{RA}_k + \mu ^{RA}_{kk}\hat{k}^{RA}_t + e^{k,RA}_t \end{align}

where $e_t$ denotes a regression error.

Gain parameter

The gain parameter for the representative agent is calibrated as $g^{RA}=0.0164$ [= $g^o\times (1-\phi )+g^y\times \phi =0.010\times 0.2+0.018\times 0.8$ ] based on the Malmendier and Nagel (Reference Malmendier and Nagel2016)’s estimation.

4.2.2 Main differences between LC and RA learning model

Differences in model

The LC and RA learning models have similar aggregate consumption equations, Equation (38) and (39), but with some notable differences. First, the LC learning model has one more state variable: other than aggregate capital ( $\hat{k}_t$ ), the distribution of wealth between young and old households ( $\hat{a}^y_t$ and $\hat{a}^o_t$ ) is also necessary to determine aggregate consumption. Thus, there is one more explanatory variable in the household forecasting system in the LC model. Second, Table 3 compares the coefficients and time discount factors on the expected terms from both LC and RA learning models and suggests that the differences in consumption behavior between the two models mostly stem from whether households consider the probability of surviving ( $\gamma$ ) and staying workers ( $v$ ). In the LC learning model, households know they may not survive into the next period or retire in the following period, which leads to greater discounting of the future and so smaller coefficients and time discount factors on the expected terms. On the other hand, the representative household in the RA learning model places relatively greater weight on the expectations about market prices as they do not consider the possibility of death or retirement in the future.

Table 3. Coefficients and time discount factors on expected terms in aggregate consumption

Results

Figure 9 displays the impulse responses in the LC and RA learning models. Despite their overall similarities, the responses in the RA learning model are found to be 1.5 to 2 times larger than those in the LC learning model. This is because the representative household places much more weight on their overly optimistic expectations after a positive shock, in comparison to households in the LC model. However, this poses a challenge as the excessive over-reactions of the representative household do not align with the actual data observed in the real world. Section 4.3 presents quantitative results about this issue.

Figure 9. Impulse responses of output, consumption, investment, and hours to 1 percent permanent technology shock in LC and RA learning model.

4.3 Effects on volatility of business cycles

This paper utilizes the LC learning model to examine the impact of population aging on the recent reduction in business cycle fluctuations.

Simulation method

The 2000-period simulation under RE presents initial steady-state coefficients for the household forecasting system, Equation (10)–(13), and then the simulated data are discarded. After that, the 300-period simulation is conducted to match the sample size for the U.S. data from 1947Q1 to 2019Q4. Here, I do not include the COVID-19 period data due to unprecedentedly high volatility. Finally, I calculate the volatility of business cycles using this 300-period simulated data.

Results

There are several findings from the relative standard deviations of the macroeconomic variables provided by the data and model simulations in Table 4. First, the amplification effects in the LC learning model produce macroeconomic variables’ variations that are almost identical to the data when the proportion of young households $\phi$ is at its baseline value of 0.8 (=80 percent).Footnote 19 However, the RA learning model generates more substantial fluctuations due to the extensive over-response of the representative household as seen in Section 4.2.Footnote 20 This is because the representative household does not consider the probability of death and losing a job, and so they discount over-expectation less than households in the LC model, which triggers extensive over-responses. Second, the LC learning model shows that the fluctuations in business cycles decrease as the population ages. For instance, when the proportion of young households $\phi$ decreases from 0.9 (=90 percent) to 0.7 (=70 percent), the relative standard deviation of output decreases from 1.77 to 1.49.Footnote 21 Lastly, the RE models, regardless of the RA or LC assumption, can only explain half of the volatility of the macro-variables seen in the data.

Table 4. Relative standard deviations of macro-variables from model simulations

Notes: All macro-variables are logged and detrended with the HP filter. $Pr$ stands for productivity.

Implications

The findings above imply a meaningful role of heterogeneous biased expectations and population aging in the recent moderation of business cycle fluctuations. The stylized facts suggest population aging is associated with a reduction in the fluctuations of the macro-variables. Table 5 indicates that the volatility of macroeconomic variables has decreased sharply in recent years. Specifically, the standard deviation of GDP, Consumption, and Investment has declined 74.2 percent, 59.6 percent, and 57.9 percent respectively from the 1980s to 2010s. Moreover, during the same period, the population has aged fast. The ratio of the population aged 65 and over to the population aged 15 and over in the U.S. has risen from 14.6 percent in 1999 to 20.2 percent in 2019 according to OECD.stat. In line with these facts, the LC learning model simulations also show a 10 percentage point increase in the old population ratio leads to about a 16 percent decrease in output volatility.Footnote 22

Table 5. Standard deviations of annual growth rates of macro-variables from U.S. Data

Notes: This table is the extended version of the table in Stock and Watson (Reference Stock and Watson2003). The parenthesis ( ) indicates the volatility excluding the data for the Great Recession from 2007Q4 to 2009Q2. I exclude the COVID-19 period data due to unprecedentedly high volatility.

It is a novel channel that the increase in the proportion of old individuals who are relatively less sensitive to recent observations contributes to the lower volatility of business cycles. Literature lists the improved monetary policy, regulatory changes, financial market innovation, etc. as the causes of the decline in macroeconomic volatility since the 1990s. Jaimovich and Siu (Reference Jaimovich and Siu2009) also claim that demographic changes account for 1/5 to 1/3 of the moderation of the US economy. However, they mainly point out the channel that the low volatility of old individuals’ employment and hours worked leads to the Great Moderation.

5. Policy implications of heterogeneous biased expectations

This section investigates how heterogeneous biased expectations affect the output effects and multipliers of government spending in an aging population. Then, I also examine the welfare effects of government spending under population aging.

5.1 Population aging and effects of government spending

Figure 10 illustrates the impact of a government spending shock on output, where the government spending is 5 percent of the steady-state output level and is financed by an equal amount of lump-sum tax increase.Footnote 23 Here, the government spending shock is a one-time surprise shock. Specifically, normalized government spending $g_t$ evolves via

\begin{equation*} g_t = \begin {cases} \bar {g} + 0.05\bar {y}, & \text {for} \; t=1 \\ \bar {g}, & \text {for} \; t=2,3,4,\cdot \cdot \cdot \\ \end {cases} \end{equation*}

where $\bar{y}$ and $\bar{g}$ are the steady-state values for output and government spending.

Then, the key takeaway is that the responsiveness of output to the government spending shock is inversely related to the proportion of old households in the economy. Since old households have relatively lower sensitivity to the recent government spending shock, the amplification effects become weak in an aging society, which is consistent with the findings from the technology shock analysis shown in Figure 7.

I also note that after government spending, the learning model generates an output response that is the opposite of the response from the RE model. The government spending shock decreases consumption and investment and increases hours worked, causing a rise in returns to capital and a decline in wages. In the learning model, households overreact to increased capital returns and raise investment dramatically. However, in the RE model, households know returns to capital will decrease immediately, so they reduce investment. Therefore, the output responses show opposite reactions.

Figure 10. Impulse response of output to government spending shock by proportion of young households ( $\phi )$ .

Figure 11 provides the cumulative government spending multipliers in the first, second, and third year after the shock using the impulse responses in Figure 10. The cumulative government spending multiplier ( $M$ ) is defined as

(43) \begin{align} M_T = \frac{\sum \limits ^T_{i=0} (\beta \gamma )^i(y_{t+i} - \bar{y})}{g_t - \bar{g}} \end{align}

where $T$ is the cumulation period. Overall, the government spending multiplier declines about 10 percent when the old population ratio rises 10 percentage points. This result is consistent with previous studies, which empirically and theoretically show that the government spending multiplier is low in an aging society [e.g., Basso and Rachedi (Reference Basso and Rachedi2021) and Honda and Miyamoto (Reference Honda and Miyamoto2020)]. However, my model offers a novel factor for the lower government spending multiplier: i.e., heterogeneous biased expectations between the young and old.

Figure 11. Cumulative government spending multiplier by proportion of young households ( $\phi )$ .

5.2 Welfare analyses of government spending

This paper investigates the welfare effects of government spending in the LC learning and LC RE model by conducting welfare experiments and finds the implications for an aging society.

5.2.1 Welfare experiments

This paper conducts welfare experiments under the following scenarios to study the welfare effects of government spending. Specifically, during 500 periods, a random technology shock with mean zero and standard deviation $\sigma ^u$ takes place every period in the LC learning and LC RE model.Footnote 24 Then, I assume two economies in each LC learning and LC RE model. In the first economy, the government counteracts negative technology shocks by increasing its spending.Footnote 25 However, in the second economy, the government does not respond to the technology shocks. By comparing the welfare of the population in these two economies, we can estimate the welfare effects of the government spending policy.

In this setting, government spending shocks can be beneficial to households by reducing the adverse effects of negative technology shocks. In particular, it can be mitigated more in the learning model due to the amplification effects of biased expectations.

Equivalent variations

Following literature such as Hunt (Reference Hunt2021), this paper estimates the welfare effects of government spending using equivalent variations (EV) which are calculated by:

(44) \begin{align} \text{RE:}\quad \sum ^{T}_{t=1}(\beta \gamma )^{t-1}U(C^{NonGov}_{RE,t}(1+\Delta ^{Gov}_{RE}))=\sum ^{T}_{t=1} (\beta \gamma )^{t-1}U(C^{Gov}_{RE,t}) \end{align}
(45) \begin{align} \text{Learning:}\quad \sum ^{T}_{t=1}(\beta \gamma )^{t-1}U(C^{NonGov}_{Learning, t}(1+\Delta ^{Gov}_{Learning}))=\sum ^{T}_{t=1}(\beta \gamma )^{t-1}U(C^{Gov}_{Learning, t}) \end{align}

where $C_t^{NonGov}$ and $C_t^{Gov}$ indicate consumption in the economy without and with the government spending policy, respectively. Also, the welfare experiment length $T$ is 500. The interpretation of Equation (44) and (45) is that the EV equates the utility of household in the first economy with the government spending policy to the utility of household in the second economy without government spending plus a fraction $\Delta$ . In other words, for example, the household in the second economy without government spending needs to consume $\Delta ^{Gov}\%$ more every period to have the same welfare as the household has in the first economy with government spending. As a result, $\Delta ^{Gov}_{RE}\%$ and $\Delta ^{Gov}_{Learning}\%$ represent the welfare effects of government spending in the LC RE and LC learning model.

5.2.2 Welfare effects of government spending

Table 6 provides details of the welfare effects of government spending in the LC learning model, denoted by ( $\Delta ^{Gov}_{Learning}\%$ ), across varying proportions of young households. As the proportion of old households increases, both young and old households experience a decline in welfare due to the weakened amplification effects. However, young households experience a more significant decline in their welfare compared to that of old households. With an increase in the proportion of old households, fewer individuals have a high degree of optimism triggered by government spending. Consequently, the economy exhibits relatively poor performance after the government spending shock. Then, young households respond more strongly to this weaker economic performance by reducing consumption and increasing labor supply, while old households are less responsive. Thus, the welfare of young households deteriorates more significantly as the population ages.

Table 6. Equivalent variations under learning by proportion of young households ( $\phi$ )

Table 7 presents the welfare effects of government spending in the LC learning and LC RE models ( $\Delta ^{Gov}_{Learning}\%$ and $\Delta ^{Gov}_{RE}\%$ , respectively). The welfare effects are more significant in the LC learning model compared to the LC RE model since the biased optimism induced by government spending amplifies the output effects, resulting in an overall improvement in households’ welfare. However, this learning behavior only benefits young households, while it reduces the welfare of the old households. Young households or workers in the LC learning model increase labor supply and investment, leading to higher future consumption. This excessive consumption, funded by savings, causes an improvement in young households’ welfare, despite the disutility caused by the increased labor supply. However, the young’s over-consumption crowds out the consumption of old households, causing it to fall below the RE level, resulting in a reduction in the old’s welfare.

Table 7. Equivalent variations under learning and rational expectation (RE)

Notes: In this experiment, $\phi$ is 0.8. The last column in bold shows the welfare effects of the household expectations under learning relative to under RE.

6. Conclusion

Economists have long considered the expectations of economic agents one of the key drivers of economic fluctuations. To that end, numerous studies have been conducted to deal with this topic. I contribute to this body of literature by examining the implications of the heterogeneous biased expectations between young and old individuals on business cycles and economic policies. The life-cycle learning model incorporating heterogeneous biased expectations reveals that the fluctuations of business cycles decline as the population ages due to the relatively lower sensitivity of older households to recent observations. Additionally, in an aging society, the output effects of government spending, also known as government spending multipliers, decrease, and these reduced effects cause a welfare loss for households, particularly the young.

Based on these findings, several policy recommendations can be made. First, economic policies should be devised and implemented from a long-term perspective in an aging society since population aging reduces the volatility of business cycles and hinders the short-term effects of government policies. Moreover, the government needs to consider how its policies affect young and old individuals differently, given their heterogeneous biased expectations. Finally, a faster and more extensive fiscal stimulus policy is necessary during recessions in an aging society to support a swift recovery, which also necessitates the government to raise fiscal space in advance for the aging population.

Appendix A. Log-linearized Aggregate Consumption of Young and Old Households in Life-Cycle Learning Model

This section derives log-linearized aggregate consumption of young and old households in the life-cycle learning model.

A.1 Old Households (Retirees)

This paper normalizes the one-period budget constraint of old households

(46) \begin{align} c^o_{a,t}+\gamma a^o_{a+1,t+1} = R_ta^o_{a,t}\chi ^{-1}_t + s_t \end{align}

and iterate forward Equation (46). Then, the intertemporal budget constraint of old households is

(47) \begin{align} \underbrace{\tilde{E}^o_t\sum ^{\infty }_{n=0}\prod ^n_{h=0}\gamma ^n\frac{\chi _{t+h}}{R_{t+h}} c^o_{a+n,t+n}}_{= (i)} = \underbrace{a^o_{a,t}}_{= (ii)}+\underbrace{\tilde{E}^o_t\sum ^{\infty }_{n=0}\prod ^n_{h=0}\gamma ^n\frac{\chi _{t+h}}{R_{t+h}}s_{t+n}}_{= (iii)} \end{align}.

After that, I log-linearize the intertemporal budget constraint using Euler equation

\begin{align*} (i) = \frac{\beta \bar{c}^o_a}{1-\gamma \beta }\hat{c}^o_{a,t}+\frac{\beta \bar{c}^o_a}{1-\gamma \beta }(\hat{\chi _t}-\hat{R}_t) \end{align*}
\begin{align*} (ii) = \bar{a}^o_a\hat{a}^o_{a,t} \end{align*}
\begin{align*} (iii) = & \frac{\beta \bar{s}}{1-\gamma \beta }(\hat{\chi }_t-\hat{R}_t)+\frac{\gamma \beta ^2\bar{s}}{1-\gamma \beta }\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ & +\beta \bar{s}\hat{s}_t + \gamma \beta ^2\bar{s}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h\hat{s}_{t+1+h} \end{align*}

and plug (i), (ii), and (iii) into Equation (47)

(48) \begin{align} \bar{c}^o_a\hat{c}^o_{a,t} = & \frac{(1-\gamma \beta )}{\beta }\bar{a}^o_a\hat{a}^o_{a,t} + (\bar{c}^o_a-\bar{s})(\hat{R}_t-\hat{\chi }_t)+(1-\gamma \beta )\bar{s}\hat{s}_t\\\notag & +\gamma \beta \bar{s}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ \notag & + \frac{(1-\gamma \beta )}{\gamma \beta }\bar{s}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h\hat{s}_{t+1+h} \end{align}.

To aggregate the consumption of old households, this paper multiplies both sides with $\sum \limits ^{\infty }_{a=0}N^o_{a,t}$ where $N^o_{a,t}$ is the number of old households who belong to cohort ‘ $a$ ’. Then, I get

(49) \begin{align} \bar{c}^o\hat{c}^o_t = & \frac{(1-\gamma \beta )}{\beta }[(1-v)\bar{a}^y\hat{a}^y_{t-1}+\bar{a}^o\hat{a}^o_t]+[\bar{c}^o-(1-\phi )\bar{s}](\hat{R}_t-\hat{\chi }_t) \\ \notag & + (1-\phi )(1-\gamma \beta )\bar{s}\hat{s}_t\\ \notag & + (1-\phi )\gamma \beta \bar{s}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ \notag & + (1-\phi )(1-\gamma \beta )\gamma \beta \bar{s}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h\hat{s}_{t+1+h} \end{align}.

In particular, obtaining Equation (49) is based on Baksa and Munkacsi (Reference Baksa and Munkacsi2019)’s following assumption

(50) \begin{align} \sum ^{\infty }_{a=o}N^o_{a,t}a^o_{a,t} = N^o_{0,t}a^o_{0,t} + \sum ^{\infty }_{a=1}N^o_{a,t}a^o_{a,t} = (1-v)a^y_{t-1} + a^o_t \end{align}
\begin{align*} \bigg (\because \; N^o_{0,t}a^o_{0,t} \approx (1-v)N^y_{t-1}\frac{a^y_{t-1}}{N^y_{t-1}} = (1-v)a^y_{t-1} \quad \text{and} \quad \sum ^{\infty }_{a=1}N^o_{a,t}a^o_{a,t} = \sum ^{\infty }_{a=1}\gamma N^o_{a-1,t-1}a^o_{a,t} = a^o_t\bigg ) \end{align*}

and therefore

(51) \begin{align} \sum ^{\infty }_{a=o}N^o_{a,t}\bar{a}^o_a\hat{a}^o_{a,t} = (1-v)\bar{a}^y\hat{a}^y_{t-1} + \bar{a}^o\hat{a}^o_t \end{align}.

Next, I rearrange Equation (49), and finally, the consumption decision rule of old households in the LC learning model is

(52) \begin{align} \hat{c}^o_t = & \frac{(1-\gamma \beta )(1-v)}{\beta }\frac{\bar{a}^y}{\bar{c}^o}\hat{a}^y_{t-1}+\frac{(1-\gamma \beta )}{\beta }\frac{\bar{a}^o}{\bar{c}^o}\hat{a}^o_t\\ \notag & + \bigg [1-(1-\phi )\frac{\bar{s}}{\bar{c}^o}\bigg ](\hat{R}_t-\hat{\chi }_t) + (1-\phi )(1-\gamma \beta )\frac{\bar{s}}{\bar{c}^o}\hat{s}_t\\ \notag & + (1-\phi )\gamma \beta \frac{\bar{s}}{\bar{c}^o}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ \notag & + (1-\phi )(1-\gamma \beta )(\gamma \beta )\frac{\bar{s}}{\bar{c}^o}\tilde{E}^o_t\sum ^{\infty }_{h=0}(\gamma \beta )^h\hat{s}_{t+1+h} \end{align}.

A.2 Young Households (Workers)

I rearrange the budget constraint of young households using the labor supply condition

(53) \begin{align} (1+\theta )c^y_{b,t}+v\gamma a^y_{b+1,t+1} + (1-v)\gamma a^{yo}_{b+1,t+1} = R_t\chi ^{-1}_ta^y_{b,t}+w_t-\tau _t \end{align}

and iterate forward Equation (53). Then, the intertemporal budget constraint of young households is

(54) \begin{align} \underbrace{(1+\theta )\tilde{E}^y_t\sum ^{\infty }_{n=0}\prod ^n_{h=0}(v\gamma )^n\frac{\chi _{t+h}}{R_{t+h}} c^y_{b+n,t+n}}_{= (i)} = & \underbrace{a^y_{b,t}}_{= (ii)} + \underbrace{\tilde{E}^y_t\sum ^{\infty }_{n=0}\prod ^n_{h=0}(v\gamma )^n\frac{\chi _{t+h}}{R_{t+h}}w_{t+n}}_{= (iii)} \\ \notag & - \underbrace{\tilde{E}^y_t\sum ^{\infty }_{n=0}\prod ^n_{h=0}(v\gamma )^n\frac{\chi _{t+h}}{R_{t+h}}\tau _{t+n}}_{= (iv)} \\ \notag & - \underbrace{(1-v)\gamma \tilde{E}^y_t\sum ^{\infty }_{n=0}\prod ^n_{h=0}(v\gamma )^n\frac{\chi _{t+h}}{R_{t+h}}a^{yo}_{b+1+n, t+1+n}}_{= (v)} \end{align}.

After that, this paper log-linearizes the intertemporal budget constraint of young households using Euler equations

\begin{align*} (i) = \frac{(1+\theta )\beta \bar{c}^y_b}{(1-v\gamma \beta )}(\hat{\chi }_t-\hat{R}_t+\hat{c}^y_{b,t}) \end{align*}
\begin{align*} (ii) = \bar{a}^y_b\hat{a}^y_{b,t} \end{align*}
\begin{align*} (iii) = & \frac{\beta \bar{w}}{1-v\gamma \beta }(\hat{\chi _t}-\hat{R_t}) + \beta \bar{w}\hat{w}_t + \frac{v\gamma \beta ^2\bar{w}}{1-v\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ & + v\gamma \beta ^2\bar{w}\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{w}_{t+1+h} \end{align*}
\begin{align*} (iv) = & \frac{\beta \bar{\tau }}{1-v\gamma \beta }(\hat{\chi }_t-\hat{R}_t) + \beta \bar{\tau }\hat{\tau }_t + \frac{v\gamma \beta ^2\bar{\tau }}{1-v\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ & + v\gamma \beta ^2\bar{\tau }\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{\tau }_{t+1+h} \end{align*}
\begin{align*} (v) = (1-v)\gamma & \bigg [\frac{\beta ^2\bar{c}^y_b}{(1-\gamma \beta )(1-v\gamma \beta )}(\hat{\chi }_t-\hat{R}_t+\hat{c}^y_{b,t}) - \frac{\beta ^2\bar{s}}{1-\gamma \beta }(\hat{\chi }_t-\hat{R}_t) \\ &-\frac{\beta ^2\bar{s}}{1-\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ &-\frac{\beta ^2\bar{s}}{1-\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}(\frac{1-v^h}{1-v})(\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ &-\frac{\beta ^2\bar{s}}{\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}(\frac{1-v^h}{1-v})(\gamma \beta )^h\hat{s}_{t+1+h}\bigg ] \end{align*}.

Next, I plug (i), (ii), (iii), (iv), and (v) into Equation (54) and rearranges it, and then, multiply both sides with $\sum \limits ^{\infty }_{b=0}N^y_{b,t}$ where $N^y_{b,t}$ is the number of young households who belong to cohort ‘ $b$ ’. Then, I get

(55) \begin{align} \bar{c}^y\hat{c}^y_t = & \bigg [\frac{(1-\gamma \beta )(1-v\gamma \beta )}{(1+\theta )\beta (1-\gamma \beta )+(1-v)\gamma \beta ^2}\bigg ] \bigg [v\bar{a}^y\hat{a}^y_t\\\notag & + \bigg (\big (\frac{(1+\theta )\beta }{1-v\gamma \beta }+\frac{(1-v)\gamma \beta ^2}{(1-\gamma \beta )(1-v\gamma \beta )}\big )\bar{c}^y - \phi (\frac{\beta (\bar{w}-\bar{\tau })}{1-v\gamma \beta }+\frac{(1-v)\gamma \beta ^2\bar{s}}{1-\gamma \beta })\bigg )(\hat{R}_t-\hat{\chi }_t) \\ \notag & + \phi \beta \bar{w}\hat{w}_t - \phi \beta \bar{\tau }\hat{\tau }_t + \phi \bar{w}v\gamma \beta ^2\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{w}_{t+1+h} - \phi \bar{\tau }v\gamma \beta ^2\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{\tau }_{t+1+h}\\ \notag & + \phi \bigg (\frac{(\bar{w}-\bar{\tau })v\gamma \beta ^2}{1-v\gamma \beta }+\frac{(1-v)\gamma \beta ^2\bar{s}}{1-\gamma \beta }\bigg )\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ \notag & + \phi (1-v)\frac{\gamma \beta ^2\bar{s}}{1-\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}\bigg (\frac{1-v^h}{1-v}\bigg )(\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ \notag &+ \phi (1-v)\frac{\gamma \beta ^2\bar{s}}{\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}\bigg (\frac{1-v^h}{1-v}\bigg )(\gamma \beta )^h\hat{s}_{t+h}\bigg ] \end{align}

and rearrange Equation (55). Finally, the consumption decision rule of young households in the LC learning model is

(56) \begin{align} \hat{c}^y_t = & \; \psi v\bar{a}^y\hat{a}^y_t + \psi \phi \beta \bar{w}\hat{w}_t - \psi \phi \beta \bar{\tau }\hat{\tau }_t\\ \notag & +\bigg [1- \psi \phi \bigg (\frac{\beta (\bar{w}-\bar{\tau })}{1-v\gamma \beta }+\frac{(1-v)\gamma \beta ^2\bar{s}}{1-\gamma \beta }\bigg )\bigg ](\hat{R}_t-\hat{\chi }_t)\\ \notag & + \psi \phi \bar{w}v\gamma \beta ^2\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{w}_{t+1+h} - \psi \phi \bar{\tau }v\gamma \beta ^2\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{\tau }_{t+1+h}\\ \notag & + \psi \phi \bigg [\frac{(\bar{w}-\bar{\tau })v\gamma \beta ^2}{1-v\gamma \beta }+\frac{(1-v)\gamma \beta ^2\bar{s}}{1-\gamma \beta }\bigg ]\tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ \notag & + (1-v)\gamma \psi \phi \frac{\beta ^2\bar{s}}{1-\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}\bigg (\frac{1-v^h}{1-v}\bigg )(\gamma \beta )^h(\hat{\chi }_{t+1+h}-\hat{R}_{t+1+h})\\ \notag &+ (1-v)\gamma \psi \phi \frac{\beta ^2\bar{s}}{\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}\bigg (\frac{1-v^h}{1-v}\bigg )(\gamma \beta )^h\hat{s}_{t+h} \end{align}

where

\begin{align*} \psi = \frac{(1-\gamma \beta )(1-v\gamma \beta )}{\bar{c}^y[(1+\theta )\beta (1-\gamma \beta )+(1-v)\gamma \beta ^2]} \end{align*}.

B. Solutions to Life-Cycle Learning Model

This section briefly shows the solutions to the LC learning model.Footnote 26 As in Equation (52) and (56), we need to calculate the expected present value of returns to capital and labor to have the solutions for the consumption of young and old households. Variables other than the market prices in the consumption decision rules are predetermined or perfectly foresighted.

From the household forecasting model, Equation (10) to (13), I calculate the expected evolution of the aggregate capital and wealth distribution between young and old households as follows

\begin{equation*} x_t^{\prime} = \begin {pmatrix} 1 \\ \hat {k}_t \\ \lambda _{t} \end {pmatrix},\; \tilde {B} = \begin {pmatrix} 1 & 0 & 0 \\ \mu _{k,t-1} & \mu _{kk,t-1} & \mu _{k\lambda, t-1} \\ \mu _{\lambda, t-1} & \mu _{\lambda k,t-1} & \mu _{\lambda \lambda, t-1} \end {pmatrix},\; \end{equation*}
(57) \begin{align} x_{t+1} = \tilde{B}x_t \end{align}

and iterating Equation (57) gives

\begin{equation*} x_{t+h} = \tilde {B}^hx_t \end{equation*}.

Then, the expected present value of the market prices by young households are

(58) \begin{align} \tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{w}_{t+1+h} & = \frac{1}{v\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}(v\gamma \beta )^h\hat{w}_{t+h}\\ \notag & = \frac{1}{v\gamma \beta }\tilde{E}^y_t\sum ^{\infty }_{h=1}(v\gamma \beta )^h(\mu _{w,t-1} \; \mu _{wk,t-1} \; \mu _{w\lambda, t-1})\tilde{B}^hx_t\\ \notag & = \frac{1}{v\gamma \beta }(\mu _{w,t-1} \; \mu _{wk,t-1} \; \mu _{w\lambda, t-1})(v\gamma \beta )\tilde{B}(I-v\gamma \beta \tilde{B})^{-1}x_t \\ \notag & = (\mu _{w,t-1} \; \mu _{wk,t-1} \; \mu _{w\lambda, t-1})\tilde{B}(I-v\gamma \beta \tilde{B})^{-1}x_t \notag \end{align}

and

(59) \begin{align} \tilde{E}^y_t\sum ^{\infty }_{h=0}(v\gamma \beta )^h\hat{R}_{t+1+h}& = (\mu _{r,t-1} \; \mu _{rk,t-1} \; \mu _{r\lambda, t-1})\tilde{B}(I-v\gamma \beta \tilde{B})^{-1}x_t \end{align}.

Finally, I can have the solution to young households’ consumption, and the solution to old households’ consumption also can be obtained in the same way above.

Footnotes

1 The ICE focuses on three areas: how consumers view prospects for their own financial situation, how they view prospects for the general economy over the near term, and their view of prospects for the economy over the long term.

2 Cogley and Sargent (Reference Cogley and Sargent2008) state “Agents are eager to learn at the beginning of each period, but their decisions reflect a pretense that this is the last time they will update beliefs, a pretense that is falsified at the beginning of every subsequent period.”

3 In this paper, the old population share or ratio indicates $\frac{\text{Number of people aged 65 and over}}{\text{Number of people aged 15 and over}} \times 100 (\%).$

4 Keane and Runkle (Reference Keane and Runkle1990), using a SPF, find that professional forecasters have rational expectations.

5 The anticipated returns to labor and capital are crucial elements influencing household consumption decisions, as noted by Milton Friedman’s Permanent Income Hypothesis, which posits that consumption expenditures are contingent on anticipated future income, rather than current income.

6 See Section 4.2 for the minimum state variables in the LC learning model.

7 I find that the results of the learning model are almost the same as those of the RE model if the technology shock is included in the household forecasting system.

8 Equation (15) and (16) are derived from the following constant gain least squares estimator. Derivations are in Carceles-Poveda and Giannitsarou (Reference Carceles-Poveda and Giannitsarou2007).

$$ \zeta _{k,t} = \big [ \sum ^t_{i=1}(1-g)^{i-1}x_{t-i}x_{t-i}^{\prime }\big ]^{-1}\big [ \sum ^t_{i=1}(1-g)^{i-1}x_{t-i}k_{t-i+1}\big ] $$

Also, the model nests rational expectations since the model converges to the model under RE as $g$ goes to zero.

9 According to Honkapohja and Mitra (Reference Honkapohja and Mitra2006), the agents’ different degree of responsiveness in the updating function, i.e., different gain parameters, can be a source of heterogeneity in the learning model.

10 Finance literature such as Nakov and Nuño (Reference Nakov and Nuño2015) and Malmendier et al. (Reference Malmendier, Pouzo and Vanasco2020) adopts the different gain parameter by age. However, individuals in each cohort have the same gain parameter in this paper to make the model more tractable in the period-by-period aging framework.

11 2.5 percent for the young and 0.5 percent for the old.

12 In this setting, financial intermediaries solve a linear profit maximization problem and end up with zero profit, which means their decisions are not affected by the degree of uncertainty. So, they are risk-neutral.

13 I borrow notations from Baksa and Munkacsi (Reference Baksa and Munkacsi2019).

14 Log utility for consumption is necessary for steady-state labor supply along a balanced growth path [see King et al. (Reference King, Plosser and Rebelo1988)].

15 The derivation of Equation (22) is explained in Appendix A.1.

16 The derivation of Equation (29) is explained in Appendix A.2.

17 In Gali (Reference Gali2021), “To calibrate $\gamma$ , I use the expected lifetime at age 16, which is 63.2 years in the United States, and thus set $\gamma = 1 - (1/(4\times 63.2)) \simeq 0.996$ .”

18 At time $t$ when the productivity shock occurs, there are no biased expectations in the learning model since households have the same expectations as in the RE model. However, when households observe at time $t+1$ that the productivity shock in period $t$ has occurred, they will have biased expectations due to forecast errors. These biased expectations then greatly impact the macroeconomic variables and the impulse responses become hump-shaped.

19 Eusepi and Preston (Reference Eusepi and Preston2011) also find that a learning model fits the data better.

20 Eusepi and Preston (Reference Eusepi and Preston2011) use the small gain parameter 0.002 (0.2 percent) that is below the normal range for it, 0.007 (0.7 percent) – 0.05 (5.0 percent), which is given by the literature. This small gain parameter helps reduce the extensive over-response of the representative agent in their learning model and match the data. See Section 4.2.

21 Instead of the “Perpetual Youth” assumption in this paper, we can also assume that young individuals have a lower probability of dying than old individuals. Then, young individuals give more weight to their biased expectations due to a higher time discount factor, which increases business cycle volatility more significantly. Thus, an aging population with fewer young individuals would reduce business cycle volatility more than when households have the same probability of death.

22 I calculate the output volatility from the model simulations using the growth rate of output for the comparison with Table 5.

23 Contrary to the assumptions of this paper, government spending could be financed by a reduction in social security payments. This would have different implications for the consumption and asset choices of young and old individuals. However, the impact of government spending on aggregate variables is not expected to change significantly.

24 Chari, Kehoe, and McGrattan (Reference Chari, Kehoe and McGrattan2007) show that input financing frictions causing inefficiency in the usage of input factors can be observationally equivalent to negative productivity shocks. So, Mitra et al. (Reference Mitra, Evans and Honkapohja2019) consider negative innovations to productivity as a convenient shortcut for modeling distortions related to the financial crisis.

25 For instance, the government increases its spending by 5 percent of the steady-state output level in response to a negative 1 percent technology shock.

26 More details are in Mitra et al. (Reference Mitra, Evans and Honkapohja2019).

References

Adam, K., Matveev, D. and Nagel, S. (2021) Do survey expectations of stock returns reflect risk adjustments? Journal of Monetary Economics. 117, 723740.CrossRefGoogle Scholar
Angeletos, G.-M. and La’O, J. (2013) Sentiments. Econometrica 81(2), 739779.Google Scholar
Baksa, D. and Munkacsi, Z.. (2019) More Gray, More Volatile? Aging and (Optimal) Monetary Policy, IMF Working Papers.Google Scholar
Basso, H. S. and Rachedi, O. (2021) The young, the old, and the government: demographics and fiscal multipliers. American Economic Journal: Macroeconomics 13(4), 110141.Google Scholar
Benhabib, J., Wang, P. and Wen, Y. (2015) Sentiments and aggregate demand fluctuations. Econometrica 83(2), 549585.Google Scholar
Benhabib, J., Liu, X. and Wang, P. (2016) Sentiments, financial markets, and macroeconomic fluctuations. Journal of Financial Economics 120(2), 420443.CrossRefGoogle Scholar
Bhandari, A., Borovicka, J. and Ho, P. (2024) Survey data and subjective beliefs in business cycle models. The Review of Economic Studies, rdae054.CrossRefGoogle Scholar
Blanchard, O. J. (1985) Debt, deficits, and finite horizons. Journal of Political Economy 93(2), 223247.CrossRefGoogle Scholar
Blanchard, O. J. (2010) Sustaining a global recovery. Journal of Policy Modeling 32(5), 604609.CrossRefGoogle Scholar
Branch, W. A. and McGough, B. (2009) A new Keynesian model with heterogeneous expectations. Journal of Economic Dynamics and Control 33(5), 10361051.CrossRefGoogle Scholar
Carceles-Poveda, E. and Giannitsarou, C. (2007) Adaptive learning in practice. Journal of Economic Dynamics and Control 31(8), 26592697.CrossRefGoogle Scholar
Chari, V. V., Kehoe, P. J. and McGrattan, E. R. (2007) Business cycle accounting. Econometrica 75(3), 781836.CrossRefGoogle Scholar
Cogley, T. and Nason, J. M. (1995) Output dynamics in real-business-cycle models. American Economic Review 85(3), 492511.Google Scholar
Cogley, T. and Sargent, T. J. (2008) Anticipated utility and rational expectations as approximations of Bayesian decision making. International Economic Review 49(1), 185221.CrossRefGoogle Scholar
Collin-Dufresne, P., Johannes, M. and Lochstoer, L. A. (2017) Asset pricing when this time is different. The Review of Financial Studies 30(2), 505535.CrossRefGoogle Scholar
Eusepi, S. and Preston, B. (2011) Expectations, learning, and business cycle fluctuations. American Economic Review 101(6), 28442872.CrossRefGoogle Scholar
Evans, G. W. (2019) Adaptive Learning in Macroeconomics, Second Behavioral Macroeconomics Workshop: Heterogeneity and Expectations in Macroeconomics and Finance.CrossRefGoogle Scholar
Evans, G. W. and Honkapohja, S. (2001) Learning and Expectations in Macroeconomics. Princeton, New Jersey: Princeton University Press.CrossRefGoogle Scholar
Evans, G. W., Honkapohja, S. and Mitra, K. (2009) Anticipated fiscal policy and adaptive learning. Journal of Monetary Economics 56(7), 930953.CrossRefGoogle Scholar
Gali, J. (2021) Monetary policy and bubbles in a new Keynesian model with over-lapping generations. American Economic Journal: Macroeconomics 13(2), 121167.Google Scholar
Gertler, M. (1999). Government debt and social security in a life-cycle economy. Carnegie-Rochester Conference Series on Public Policy 50, 61110.CrossRefGoogle Scholar
Honda, J. and Miyamoto, H. (2020) Would Population Aging Change the Output Effects of Fiscal Policy?. IMF Working Papers 2020/092.CrossRefGoogle Scholar
Honkapohja, S. and Mitra, K. (2005) Performance of monetary policy with internal central bank forecasting. Journal of Economic Dynamics and Control 29(4), 627658.Google Scholar
Honkapohja, S. and Mitra, K. (2006) Learning stability in economies with heterogeneous agents. Review of Economic Dynamics 9(2), 284309.CrossRefGoogle Scholar
Hunt, E. C. (2021) Adaptive learning, social security reform, and policy uncertainty. Journal of Money, Credit and Banking 53(4), 677714.Google Scholar
Jaimovich, N. and Siu, H. E. (2009) The young, the old, and the restless: demographics and business cycle volatility. American Economic Review 99(3), 804826.CrossRefGoogle Scholar
Keane, M. P. and Runkle, D. E. (1990) Testing the rationality of price forecasts: new evidence from panel data. American Economic Review 80(4), 714735.Google Scholar
King, R. G., Plosser, C. I. and Rebelo, S. T. (1988) Production, growth and business cycles. Journal of Monetary Economics 21, 195232.CrossRefGoogle Scholar
King, R. G. and Rebelo, S. T. (2000) Resuscitating Real Business Cycles. NBER Working Papers 7534.CrossRefGoogle Scholar
Lorenzoni, G. (2009) A theory of demand shocks. American Economic Review 99(5), 20502084.CrossRefGoogle Scholar
Malmendier, U., Pouzo, D. and Vanasco, V. (2020) Investor experiences and financial market dynamics. Journal of Financial Economics 136(3), 597622.CrossRefGoogle Scholar
Malmendier, U. and Shen, L. S. (2024) Scarred consumption. American Economic Journal: Macroeconomics 16(1), 322355.Google Scholar
Malmendier, U. and Nagel, S. (2011) Depression babies: do macroeconomic experiences affect risk taking? The Quarterly Journal of Economics 126(1), 373416.CrossRefGoogle Scholar
Malmendier, U. and Nagel, S. (2016) Learning from inflation experiences. The Quarterly Journal of Economics 131(1), 5387.CrossRefGoogle Scholar
Mankiw, N. G., Reis, R. and Wolfers, J. (2003) Disagreement about inflation expectations. NBER Macroeconomics Annual 18, 209248.CrossRefGoogle Scholar
Mitra, K., Evans, G. W. and Honkapohja, S. (2013) Policy change and learning in the RBC model. Journal of Economic Dynamics and Control 37(10), 19471971.CrossRefGoogle Scholar
Mitra, K., Evans, G. W. and Honkapohja, S. (2019) Fiscal policy multipliers in an RBC model with learning. Macroeconomic Dynamics 23(1), 240283.CrossRefGoogle Scholar
Nakov, A. and Nuño, G. (2015) Learning from experience in the stock market. Journal of Economic Dynamics and Control. 52, 224239.CrossRefGoogle Scholar
Stock, J. H. and Watson, M. W. (2003) Has the business cycle changed and why? NBER Macroeconomics Annual 2002 17, 159230,CrossRefGoogle Scholar
Figure 0

Figure 1. GDP volatility and population aging, and index of consumer expectations by age group during crises. Notes: 10-Year S.D. of GDP Growth indicates the 10-year rolling window standard deviation of the quarterly GDP growth rates. The Old-Age Dependency Ratio is the ratio of people older than 64 to those ages 15–64. The y-axis in lower panels denotes the difference between ICE in a current period and the average ICE from 1978q1 to 2020q4. Source: FRED and Surveys of Consumers

Figure 1

Figure 2. Belief wedges. Notes: The belief wedges are computed based on Bhandari et al. (2024) and Mankiw et al. (2003). This paper does not report VAR Wedge after 2020 as the COVID-19 shock causes a structural break in the VAR model.

Figure 2

Figure 3. Difference in belief wedge between “Under 65” and “Over 65.”

Figure 3

Table 1. Effects of biased expectations by age group on consumption

Figure 4

Table 2. Model parameter calibration

Figure 5

Figure 4. Impulse responses of macrovariables to 1 percent permanent technology shock. Notes: The solid and dotted lines denote the median, 25th, and 75th percentile impulse response under learning, respectively. The dashed line indicates the impulse responses under RE. Unshocked BGP signifies the initial balanced growth path prior to the permanent technology shock.

Figure 6

Figure 5. Expected present value of returns to capital and labor.

Figure 7

Figure 6. Impulse responses of consumption and saving to 1 percent permanent technology shock by age group.

Figure 8

Figure 7. Impulse response of output by proportion of young households ($\phi$).

Figure 9

Figure 8. Impulse response of output under learning by gain parameter.

Figure 10

Table 3. Coefficients and time discount factors on expected terms in aggregate consumption

Figure 11

Figure 9. Impulse responses of output, consumption, investment, and hours to 1 percent permanent technology shock in LC and RA learning model.

Figure 12

Table 4. Relative standard deviations of macro-variables from model simulations

Figure 13

Table 5. Standard deviations of annual growth rates of macro-variables from U.S. Data

Figure 14

Figure 10. Impulse response of output to government spending shock by proportion of young households ($\phi )$.

Figure 15

Figure 11. Cumulative government spending multiplier by proportion of young households ($\phi )$.

Figure 16

Table 6. Equivalent variations under learning by proportion of young households ($\phi$)

Figure 17

Table 7. Equivalent variations under learning and rational expectation (RE)