1. Introduction
Population aging has posed a major challenge for actuaries in pension management. Most pension funds can be classified into two schemes: defined-benefit (DB) and defined-contribution (DC) plans. In a DB plan, retirees receive a guaranteed retirement benefit based on their salary histories, years of service, and age. By contrast, a DC plan requires its members to contribute a predetermined amount during the accumulation phase, and their retirement benefit is based on the investment returns generated by those contributions. The prevalence of DC plans has been increasing globally, with less than 50% of pension assets being managed through DB schemes in 28 of the 33 reporting jurisdictions according to OECD (2020).
The long-term investment horizon is a key feature of DC plan management. As accumulation periods typically last for 20–40 years, managing time-varying interest and inflation rates is of great significance. The existing literature on DC plan management extensively explores these long-term risks. For instance, Boulier et al. (Reference Boulier, Huang and Taillard2001) study a DC pension plan with a retirement guarantee dependent on stochastic interest rates. They show that the optimal fund composition should include a loan linked to future contributions, a contingent claim delivering the guarantee, and a hedging portfolio. Battocchio and Menoncin (Reference Battocchio and Menoncin2004) investigate the utility maximization problem of a DC pension member under stochastic interest rates, salary rates, and inflation rates. They emphasize that the addition of inflation rates makes the previously riskless asset risky and the lack of tools to hedge against inflation risk results in heightened return risk. Other pertinent studies in this field include Han and Hung (Reference Han and Hung2012), Yao et al. (Reference Yao, Yang and Chen2013), Guan and Liang (Reference Guan and Liang2014), Chen and Delong (Reference Chen and Delong2015), Menoncin and Vigna (Reference Menoncin and Vigna2017), Tang et al. (Reference Tang, Chen, Lai and Wu2018), Dong and Zheng (Reference Dong and Zheng2020), Xu et al. (Reference Xu, Sherris and Shao2020), Forsyth et al. (Reference Forsyth, Vetzal and Westmacott2020), Wang et al. (Reference Wang, Shen, Zhang and Kang2021), Chen et al. (Reference Chen, Li and Zeng2023), and Wei and Yang (Reference Wei and Yang2023), among others.
In addition to financial risks, DC plan members also face the risk of pre-retirement death. To address this risk, most DC plans include death benefit clauses during the accumulation period. The current literature primarily focuses on two types of death benefits: return of premiums and return of account value. The return of premiums clause entails returning the contributed premiums to the beneficiary, with or without predetermined interest, during the accumulation phase (e.g., He and Liang, Reference He and Liang2013; Sun et al., Reference Sun, Li and Zeng2016; Li et al., Reference Li, Rong, Zhao and Yi2017; Bian et al., Reference Bian, Li and Yao2018). On the other hand, the return of account value clause involves returning the pension account value to the beneficiary (e.g., Blake et al., Reference Blake, Wright and Zhang2008; Yao et al., Reference Yao, Lai, Ma and Jian2014; Konicz and Mulvey, Reference Konicz and Mulvey2015; Wu and Zeng, Reference Wu and Zeng2015).The latter clause is more commonly observed in practice because the investment revenue is part of the estate that can be inherited by the designated beneficiary (IRS, 2020). However, the existing literature only considers exogenously determined death benefits, leaving the mortality risk unhedged.
This research presents a study on the DC pension plan management problem that encompasses both financial and mortality risks. The financial market is modeled using a two-factor model by Koijen et al. (Reference Koijen, Nijman and Werker2011), which incorporates time variations in real interest rates, inflation rates, and risk premia. A representative pension plan member can invest a part of the account balance in a stock index, nominal and inflation-linked bonds, and a cash account. Furthermore, the individual can utilize the other part of the account balance to purchase life insurance to manage the individual’s risk of pre-retirement death. Specifically, the individual continuously pays the insurance premium to the insurer while alive. In the event of the member’s pre-retirement death, the beneficiary is entitled to receive a death benefit comprising the account balance and the life insurance payment. We formulate this DC plan management problem as a utility maximization problem with a random horizon and derive the corresponding Hamilton–Jacobi–Bellman (HJB) equation. Specifically, for the standard constant relative risk aversion (CRRA) utility function, we obtain the optimal strategy explicitly up to the solution to a Hermitian matrix Riccati differential equation (HRDE). The global existence of the HRDE’s solution depends on the relative risk aversion parameter and model parameters, and we provide a detailed analysis of the HRDE along with sufficient conditions for its global existence. Additionally, we prove a verification theorem by demonstrating that the candidate strategy is indeed optimal to the stochastic control problem.
We calibrate our model to the U.S. data and numerically illustrate the optimal investment and insurance strategies of a representative male plan member. In particular, we observe that the expected life insurance premium exhibits a hump-shaped pattern with respect to age, reaching its peak at age 59. This pattern is influenced by four key factors: the force of mortality, the surplus process, the bequest-wealth ratio, and future contributions. During early ages, the increasing force of mortality and the accumulation of the surplus process contribute to a higher demand for life insurance. However, as the individual transitions into the mid-to-retirement periods, the decreasing bequest-wealth ratio and the depletion of future contributions lead to a decline in the demand for insurance. Furthermore, we discover that the individual’s demand for life insurance follows a U-shaped pattern in relation to the real short rate and expected inflation. Specifically, the individual tends to purchase more life insurance when the real short rate and expected inflation are either exceptionally high or extremely low. This U-shaped pattern can be attributed to the combined effects of the bequest-wealth ratio and future contributions.
This research contributes to the literature by providing a comprehensive analysis of the DC pension plan management problem with consideration of both financial and mortality risks. Unlike the existing literature that focuses only on investment risk, we take into account the risk of pre-retirement death and introduce life insurance to the pension account. Our model allows the DC plan member to purchase life insurance from the account balance to manage the individual’s risk of pre-retirement death. Our findings reveal that the demand for life insurance exhibits a hump-shaped pattern with age and a U-shaped pattern with real interest rates and expected inflation. These insights have important implications for pension plan management and product design. Specifically, our model suggests that the DC account of an individual resembles a variable annuity with a flexible death benefit, which differs from traditional variable annuities that have exogenously determined death benefits. In our model, the optimal death benefit within the DC account is endogenously determined and depends on various factors such as age, force of mortality, account balance, future contributions, preferences, and market conditions (including interest rates and inflation). Our results recommend the offering of variable annuities with flexible death benefits to better accommodate the bequest demands of DC plan members.
2. Economic setting
2.1. Financial market
We consider a financial market similar to that presented in Koijen et al. (Reference Koijen, Nijman and Werker2011), which accommodates time variations in real interest rates, inflation rates, and risk premia. Let $(\Omega, \mathcal{F}, \mathbb{P})$ be a filtered complete probability space. The financial risk is described by $Z_t$ , a four-dimensional vector of independent Brownian motions, which is adapted to the filtration $\mathbb{F}\;:\!=\;\{\mathcal{F}_t\}_{t\in[0,T]}$ .
In the financial market, the real short rate is driven by a single factor, $X_1$ ,
and expected inflation is affine in a second factor, $X_2$ ,
Following the literature, we assume the two factors satisfy the following Ornstein–Uhlenbeck (OU) process
where $X_t = (X_{1,t}, X_{2,t})^{\top}$ , $K_X = diag(\kappa_1, \kappa_2),\;\kappa_i>0,\;i=1,2$ , $\Sigma_X = (\sigma_1,\sigma_2)^{\top},\;\sigma_i \in \mathbb{R}^4,\;i=1,2$ . The use of the OU process captures mean reversion in the real short rate and expected inflation. It is worth noting that the OU process can lead to negative values of the real short rate and expected inflation, which are not uncommon in the U.S. Negative values of the real short rate occur when inflation exceeds the nominal interest rate, while negative expected inflation captures instances of deflation.
The realized inflation is then given by
where $\Pi_t$ denotes the level of the (consumer) price index at time t and $\sigma_{\Pi} \in \mathbb{R}^4$ .
The equity index $S_t$ satisfies the following dynamics
where $\mu_t = R_t + \mu_0 + \mu^{\top}_1 X_t$ and $R_t$ denotes the instantaneous nominal short rate that is derived in (2.3) below. For identification purposes, we assume the volatility matrix $(\sigma_1, \sigma_2, \sigma_{\Pi}, \sigma_S)^{\top}$ is lower triangular.
We assume the nominal state price density $\phi$ satisfies
in which the market prices of risk, $\Lambda_t$ , are affine in the term-structure variables, that is,
We follow Koijen et al. (Reference Koijen, Nijman and Werker2011) to impose restrictions on $\Lambda_0$ and $\Lambda_1$
with $\sigma^{\top}_S\Lambda_0 = \mu^{\top}_0$ and $\sigma^{\top}_S\Lambda_1 = \mu^{\top}_1$ . The real state price density $\phi^R_t = \phi_t\Pi_t$ then satisfies
which implies for the instantaneous nominal short rate
where $\delta_R = \delta_r + \delta_{\pi^e} - \sigma^{\top}_{\Pi}\Lambda_0$ and $\iota_2 = (1,1)^{\top}$ .
Finally, we present the prices of nominal and inflation-linked bonds. The derivation is standard in the literature (e.g., Duffie and Kan, Reference Duffie and Kan1996). The time-t price of a nominal bond with maturity s is
where $A_0$ and $A_1$ satisfy the following ODE system
In addition, the dynamics of $P(X_t,t,s)$ satisfy
Similarly, the time-t real price of an inflation-linked bond with maturity s is
where $A^R_0$ and $A^R_1$ satisfy the ODE system
in which $e_i$ represents the i-th unit vector in $\mathbb{R}^2$ . Then, the nominal price of the inflation-linked bond $\Pi_t P^R(X_t,t,s)$ satisfies
2.2. Mortality
In this subsection, we introduce mortality risk. Denote by $T_x$ the future lifetime of an individual aged $x$ , which is a nonnegative random variable independent of the financial market (i.e., $T_x$ is independent of the filtration $\mathbb{F}$ associated with the financial market). We can define the following probabilities
where $_{t}p_x$ is the probability that the individual alive at age x survives to at least age $x+t$ and $_{t}q_x$ is the probability that the individual dies before age $x+t$ . In actuarial science, it is common to work with the instantaneous force of mortality (or hazard rate)
and we have
The probability density function of $T_x$ is then given by $f_{T_x}(t) = {_{t}p_x}\mu_{x+t}, \;\text{for}\; t>0.$
2.3. Wealth process
The individual enters the DC pension plan at age $x$ at time 0 and retires at time $T$ (so the retirement age is $x+T$ ). The future lifetime of the individual is denoted by $T_x$ . Before retirement or death, the individual contributes a fixed percentage of labor income continuously to the fund. We assume that real labor income is deterministic and thus the real contribution rate, $C_t = C^{\$}_t \Pi^{-1}_t$ , satisfies
where $\wedge$ is the minimum of the two variables, $C^{\$}_t$ is the nominal contribution rate, and $g^R_t$ is the growth rate of the real contribution rate (which is also the growth rate of labour income).
During the accumulation period, the individual allocates his or her wealth dynamically to the stock index, two nominal bonds, and an inflation-linked bond. In particular, the individual uses the “rolling bond” strategy for purchasing bonds, as outlined in Boulier et al. (Reference Boulier, Huang and Taillard2001). Denote by $\alpha_t$ the proportions of wealth invested in these assets at time $t$ . The rest of the wealth is invested in the cash account. Additionally, the individual can purchase (term) life insurance that is available continuously to manage mortality risk. This assumption is unrealistic but necessary to obtain the closed-form solution. Suppose the individual pays the life insurance premium at a rate of $I^{\$}_t$ (in nominal terms) continuously to the insurer while alive. If the individual dies at time $T_x =t$ prior to retirement, the beneficiary receives the death benefit (the face value of life insurance) $I^{\$}_t/\mu_{x+t}$ in addition to the account balance. The individual’s DC account balance then evolves according to the following equation:
where $W_0=0$ and $\Sigma$ is the volatility matrix of tradable assets. We assume the two nominal bonds have maturities $T_1$ and $T_2$ , and the inflation-linked bond has maturity $T_3$ . Consequently,
We can then derive the dynamics of the real wealth $W^R_t = W_t/\Pi_t$
where $0\leq t < T \wedge T_x$ , $W^R_0 = 0$ , and $I_t = I^{\$}_t/\Pi_t$ is the real insurance premium rate.
If the individual dies before retirement, then the death benefit is added to the account balance
2.4. Preference
We assume the individual chooses investment and insurance strategies $(\alpha, I)$ to maximize the expected utility of account balance at retirement or death, whichever occurs first, that is,
Because $T_x$ is independent of financial risks, we can show
From a technical point of view, Equation (2.8) allows us to convert the random horizon optimization problem to a problem with a fixed terminal time.
3. Optimization problem
3.1. Dynamic programming
Following Deelstra et al. (Reference Deelstra, Grasselli and Koehl2003), we introduce the surplus process $W^{\widetilde{C}}_t$
where $\widetilde{C}(t,X_t)$ is the time-t value of (discounted) future contributions
Next, by Ito’s formula, we have
Assume that there exists a process $\xi_t$ such that
then we can obtain $\xi$ by comparing the relevant terms in (3.2) and (3.3)
Furthermore, adding (2.7) and (3.3), we derive the SDE for the surplus process
where $W^{\widetilde{C}}_t = W^R_t + \widetilde{C}(t,X_t)$ , $0\leq t < T \wedge T_x$ , and $\beta^{\top}_t = [W^R_t \alpha^{\top}_t + \widetilde{C}(t,X_t)\xi^{\top}_t]/W^{\widetilde{C}}_t$ . The SDE (3.4) models the investment in the financial market, and the purchase of life insurance with premium $ - \mu_{x+t} \widetilde{C}(t, X_t) + I_t$ . When the individual dies before retirement, the surplus process has the following jump
Then, by definition (3.1), and given that $W^{\widetilde{C}}_T=W^R_T$ at T, the objective function (2.8) can be transformed to
Define the value function
where $X = (x_1, x_2)^{\top}$ and $E_{t,w^{\widetilde{C}},X}[\cdot]$ is short for $E[\cdot|W^{\widetilde{C}}_t=w^{\widetilde{C}},X_t=X]$ . Then, by the dynamic programming principle, we derive the following HJB
where
$\frac{\partial}{\partial X}(\!\cdot\!)=(\frac{\partial}{\partial x_1}(\!\cdot\!),\frac{\partial}{\partial x_2}(\!\cdot\!))$ is the gradient operator with the factor vector $X = (x_1, x_2)^{\top}$ , and $\frac{\partial}{\partial X^{\top}}(\!\cdot\!)=$ $(\frac{\partial}{\partial x_1}(\!\cdot\!),\frac{\partial}{\partial x_2}(\!\cdot\!))^{\top}$ is the gradient operator with $X^{\top}=(x_1, x_2)$ . The first-order condition with respect to $\beta_t$ and $I_t$ yields that
Under the optimal strategy, the individual’s DC account evolves as a variable annuity with an endogenously determined death benefit. This is in contrast to traditional variable annuities that have a fixed death benefit, typically defined as the maximum of the account value and a guaranteed minimum. The death benefit in the DC account of our model is endogenously determined and adapts based on various factors such as age, mortality, account balance, future contributions, personal preferences, and market conditions like interest rates and inflation. In particular, numerical examples in Section 4 indicate that the optimal face value is hump-shaped and peaks at age 50. These insights suggest that pension plan sponsors should offer variable annuities with more flexible death benefits based on individual circumstances and market conditions, to meet the diverse needs of their members. For more details of the variable annuities, see SEC (2009).
3.2. Solution under the CRRA utility
We proceed by solving the optimization problem under the assumption of constant relative risk aversion (CRRA) utility.
where $\gamma>0$ and $\gamma \neq 1$ is the Arrow-Pratt coefficient of relative risk aversion.
Proposition 3.1. The candidate solution (value function) to the HJB Equation (3.5) is given by
where
Functions $\Gamma_0(\tau) \in \mathbb{R}$ , $\Gamma_1(\tau) \in \mathbb{R}^2$ and $\Gamma_2(\tau) \in \mathbb{R}^2 \times \mathbb{R}^2$ are given by the following ODE system
in which
The candidate strategies are given by
Next, we prove the candidate solution’s global existence and verify it is indeed optimal.
3.3. The global existence and verification theorem
Among the ODEs determining the candidate solution, (3.13) and (3.14) are linear ODEs. Their solutions are unique and exist globally (see Theorem 1.1.1. in Abou-Kandil et al., Reference Abou-Kandil, Freiling, Ionescu and Jank2012). However, the ODE (3.12) is a Hermitian matrix Riccati differential equation (HRDE), whose existence requires special treatment. The HRDE has the following matrix representation
where $\widetilde{I}_2$ is the 2nd-order identity matrix,
which is called the Hamiltonian matrix. The global existence of the HRDE (3.17) largely depends on the relative risk aversion coefficient $\gamma$ , which is also the case for the verification theorem. Inspired by Honda and Kamimura (Reference Honda and Kamimura2011), we divide the proofs in this subsection into two cases $\gamma>1$ and $0<\gamma<1$ .
Proposition 3.2. For $\gamma>1$ , define the admissible set as
If $\Sigma_X\Sigma^{\top}_X>0$ and $\Lambda^{\top}_1\Lambda_1>0$ , then the candidate solution $G(t,W^{\widetilde{C}}_t,X_t)$ exists in [0, T] and satisfies $G(t,W^{\widetilde{C}}_t,X_t)=V(t,W^{\widetilde{C}}_t,X_t)$ . The strategy $(\beta^*,I^*)$ given by (3.15) and (3.16) is the optimal portfolio and insurance strategy. For matrices, “ $>$ ” (“ $<$ ”) indicates positive (negative) definite.
For $0<\gamma<1$ , we can prove the existence of (3.12) by Radon’s Lemma with additional conditions. Denote $(Q,P)^{\top}$ as a solution to the linear system of differential equations
By Radon’s Lemma (see Theorem 3.1.1. in Abou-Kandil et al., Reference Abou-Kandil, Freiling, Ionescu and Jank2012), we can represent the solution to (3.12) as $\Gamma_2(\tau) = P(\tau)/Q(\tau)$ . Next, we only need $\Gamma_2(\tau)<0$ to guarantee the candidate solution’s global existence. For tractability, we follow Abou-Kandil et al. (Reference Abou-Kandil, Freiling, Ionescu and Jank2012) and assume H is diagonalizable, that is, there exists a 4-dimensional basis of eigenvectors
where $\mathbb{C}^4$ denotes the complex vector space of $4 \times 1$ complex vectors, and the corresponding eigenvalues are $\lambda_1, ..., \lambda_4$ sorted by their real parts
Denote $V = (v_1,..., v_4) \in \mathbb{C}^{4\times 4}$ , in which $\mathbb{C}^{4\times 4}$ denotes the complex vector space of $4 \times 4$ complex matrices, then we have that the solution to (3.18) satisfies
where $\Delta \;:\!=\; V^{-1}HV = \text{diag}(\lambda_1, ..., \lambda_4)$ .
Furthermore, define
we can finally prove the following proposition for global existence and verification.
Proposition 3.3. For $0<\gamma<1$ , define the admissible set as
If
then the candidate solution $G(t,W^{\widetilde{C}}_t,X_t)$ exists in $[0, T]$ and equals $V(t,W^{\widetilde{C}}_t,X_t)$ . Moreover, the strategy $(\beta^*,I^*)$ given by (3.15) and (3.16) is the optimal portfolio and insurance strategy. The expressions of $\widetilde{\Delta}$ , $q$ , and $s$ are given in Appendix C.
4. Numerical results
4.1. Model calibration
For the financial market, we use monthly U.S. data from June 1961 to December 2020 to estimate the parameters. We use zero-coupon nominal yields from Gürkaynak et al. (Reference Gürkaynak, Sack and Wright2007) with eight maturities: three months, six months, one year, two years, three years, five years, seven years, and ten years. The realized inflation index is obtained from CRSP’s Consumer Price Index for All Urban Consumers (CPI-U NSA index). The equity index is based on the CRSP’s value-weighted NYSE/Amex/Nasdaq index, which includes the dividend payments.
We estimate the parameters with the help of a Kalman filter (see Appendix D for details) and present the results in Table 1 and Figure 1. Similar to Koijen et al. (Reference Koijen, Nijman and Werker2011), we have $\kappa_1>\kappa_2$ , which means that expected inflation is more persistent than the real short rate. For the innovations, we capture the negative correlation between the real short rate and expected inflation ( $\sigma_{2(1)}<0$ ). For the equity index process, we find the risk premium is decreasing with the real short rate and expected inflation ( $\mu_{1(1)}, \mu_{1(2)}<0$ ). Moreover, the unconditional price of risk, $\Lambda_0$ , is negative for the real short rate and expected inflation but positive for the equity index. Finally, all the parameters in the conditional price of risk, $\Lambda_1$ , are negative, which means the price of risk is decreasing with two factors $X_t$ . Figure 1 shows the estimated short rates and expected inflation.
The parameters in the table are annualized. $\Lambda_0(1)$ , $\Lambda_0(2)$ , $\Lambda_1(4,1)$ , and $\Lambda_1(4,2)$ can be obtained by solving three equations: $\delta_R = \delta_r + \delta_{\pi^e} - \sigma^{\top}_{\Pi}\Lambda_0$ , $\sigma^{\top}_S\Lambda_0=\mu_0$ , $\sigma^{\top}_S\Lambda_1=\mu_1$ . So, there are 21 parameters in total to be estimated. More details can be found in Appendix D.
We assume that the pension member enters at age 22 and retires at age 66, which implies $T=44$ . The pension member allocates his or her wealth among 3-year nominal bonds, 10-year nominal bonds, 10-year inflation-linked bonds, the equity index, and cash ( $T_1=3, T_2=T_3=10$ ), and also purchases life insurance. Moreover, the risk-aversion coefficient $\gamma$ equals 5 for the pension member.
Similar to Koijen et al. (Reference Koijen, Nijman and Werker2011), we suppose that the growth rate $g^R_t$ in the real contribution rate (2.6) follows
which corresponds to an individual with a high school education in the estimates of Cocco et al. (Reference Cocco, Gomes and Maenhout2005) and Munk and Sorensen (2010). The initial real contribution rate $C_0$ is set to be $\$ 1$ kUSD (per annum).
For the individual mortality rate, we use the U.S. data of males in the “2017 Period Life Table for the Social Security area population.” Following Forfar et al. (Reference Forfar, McCutcheon and Wilkie1988), we assume the force of mortality $\mu_x$ follows a general form of the Gompertz-Makeham approach
where the change of location $x-22$ is used to improve the significance of parameters. After some trials, we find that a few parameters become insignificant when estimating $GM^{4,0}_a$ or $GM^{0,5}_a$ . Therefore, we test all the combinations of $s_1$ and $s_2$ in $3\times 4$ and pick up the $GM^{s_1,s_2}_{a}$ model with the lowest Bayesian information criterion (BIC) with all parameters significant. Table 2 shows the estimation results for the force of mortality.
4.2. Sensitivity of optimal strategies with respect to the age
In this subsection, we demonstrate how the individual’s optimal strategies change with age. We use the Monte-Carlo method with 10,000,000 simulations and a time-step of one year. We plot the expected annual optimal investment strategies $E[\beta^*_t]$ and insurance strategy $E[I^*_t]$ and the corresponding confidence intervals in Figures 2 and 3, respectively.
For the investment strategies, the first two graphs in Figure 2 illustrate the individual’s allocations to 3-year nominal bonds and 10-year nominal bonds. Specifically, the individual shorts 3-year nominal bonds and longs 10-year nominal bonds. He gradually reduces the absolute exposure to these bonds before reaching age 60 after which he increases the exposure to nominal bonds up to retirement. The individual holds long positions in the stock and 10-year inflation-linked bonds and the allocations are relatively insensitive to the age. Furthermore, the individual holds a smaller proportion of stocks compared to bonds. It should be noted that Figure 2 is based on the expected optimal investment strategy. The 50% confidence intervals are relatively wide, indicating significant uncertainty around the optimal investment strategy. We perform a sensitivity analysis of the individual’s strategy with respect to the two factors $X_t$ in the next subsection.
To gain some intuition, we decompose the optimal investment strategy in (3.15) into the following form
Among three components, the standard myopic demand (SMD) exploits the risk-return trade-off of the assets. The inflation hedging demand (IFHD) accounts for the individual’s desire to hedge against realized inflation $\Pi_t$ in (2.2) ( $\sigma_{\Pi}$ is the volatility term of $\Pi_t$ ). Lastly, the intertemporal hedging demand (ITHD) depends on the investment horizon and reflects the individual’s desire to hedge against changes in future investment opportunity sets.
We present the unconditional expectations of SMD and IFHD in Table 3. The expected SMD is a constant vector because $X_t$ in (2.1) follows a normal distribution $N(0,\Sigma_t)$ , where $\Sigma_t = \int_0^t e^{-K_x(t-s)}\Sigma_X \Sigma^{\top}_X e^{-K^{\top}_X(t-s)} ds$ . Moreover, $\text{IFHD}_4$ is zero as the fourth entry of $\sigma_{\Pi}$ is zero, which is implied by the assumption that $(\sigma_1, \sigma_2, \sigma_{\Pi}, \sigma_S)^{\top}$ is lower triangular.
We also plot the unconditional expectations of ITHD in Figure 2. The $\text{IFHD}_3$ and $\text{IFHD}_4$ are zero since the third and fourth rows of $\Sigma_X$ are zero. Compared with the optimal investment strategy in Figure 2, we observe that it is ITHD that mainly affects the evolution of the individual’s expected investment strategies.
For the insurance strategy, the first graph in Figure 3 depicts the expected insurance premium paid by the individual, which is hump-shaped and peaks at age 59. While our model does not restrict $I^*_t>0$ , the 95% confidence interval for $I^*_t$ is positive. The second graph illustrates that the expected face value $E[I^*_t]/\mu_{x+t}$ is also hump-shaped but peaks at age 50. Equation (3.16) shows that the optimal insurance premium depends on four components, the force of mortality $\mu_{x+t}$ , the optimal surplus process $(W^{\widetilde{C}}_t)^*$ , the bequest-wealth ratio $1/f_1(t,X_t)$ , and the future contributions $\widetilde{C}(t,X_t)$ . Specifically, rearranging terms in (3.16) leads to
which is the ratio of the bequest to the current account balance plus future contributions, and thus termed the bequest-wealth ratio.
The last four graphs of Figure 3 demonstrate that the expected surplus process $E[(W^{\widetilde{C}}_t)^*]$ and the force of mortality $\mu_{x+t}$ are increasing with age, while the bequest-wealth ratio $1/f_1(t, X_t)$ and the future contributions $\widetilde{C}(t,X_t)$ are decreasing with age. At early ages, the bequest-wealth ratio is high and thus the insurance demand is primarily driven by the increase in the surplus process. The bequest-wealth ratio decreases to 1 when the individual is near retirement, and as a result of this, the effect of the surplus process vanishes and the depletion of future contributions reduces the insurance demand. Therefore, the optimal face value is humped-shaped with a peak at age 60. Moreover, because the force of mortality grows at increasing rates (especially during mid-to-retirement ages), the expected insurance face value peaks earlier than the insurance premium.
4.3. Sensitivity of optimal strategies with respect to the two factors $X_t$
This section conducts the static analysis of optimal strategies with two factors $X_t$ . For all the figures in this section, we set the range of $X_1$ as $[\!-0.0736,0.0736]$ and the range of $X_2$ as $[\!-0.1032,0.1032]$ , which cover eight standard deviations of $X_{1,T}$ and $X_{2,T}$ , respectively.
$\text{SMD}_i$ is the ith entry of the standard myopic demand vector. $\text{IFHD}_i$ is the ith entry of the inflation hedging demand vector.
Figure 4 illustrates the investment strategy $\beta^*$ of the pension member with respect to two factors. Notably, the individual prioritizes 3-year nominal bonds among all three types of bonds. Moreover, an increase in the real short-rate factor $X_1$ prompts the individual to sell more 10-year nominal and 10-year inflation-linked bonds and buy more 3-year nominal bonds. On the other hand, when the inflation factor $X_2$ increases, the individual purchases more 3-year nominal bonds but fewer 10-year nominal and inflation-linked bonds. Finally, the individual shorts more equity when either the real short-rate factor $X_1$ or the inflation factor $X_2$ increases.
In addition to the investment strategies $\beta^*$ , we have decomposed them into three components: standard myopic demand (SMD), inflation hedging demand (IFHD), and intertemporal hedging demand (ITHD), based on (4.1). While the IFHD is a constant vector as shown in Table 2, SMD and ITHD are represented in Figure 4. Our analysis reveals that SMD plays a more prominent role in determining an individual’s investment strategies, as evidenced by the larger range of SMD as compared to ITHD. Specifically, SMD significantly influences an individual’s allocation to 3-year nominal bonds, 10-year inflation-linked bonds, and equity. Notably, the allocations to inflation-linked bonds and equity due to ITHD are both zero, as the third and fourth rows of $\Sigma^{\top}_X$ in ITHD from (4.1) are zero.
Figure 5 reveals that the individual’s demand for life insurance follows a U-shaped pattern. In other words, the individual purchases more life insurance when the real short rate and expected inflation are both extraordinarily high or both extremely low. This phenomenon is due to the combined effects of the two components in the optimal insurance strategy (3.16). One component is the bequest-wealth ratio $1/f_1(t,X_t)$ . Since $\Gamma_2(\tau)<0$ (guaranteed by Proposition 3.2 and 3.3), we know that $f(t,X_t)$ in $f_1(t,X_t)$ (see (3.10) and (3.11)) follows a quadratic form opening downwards for $X_t$ on the exponential. Therefore, the bequest-wealth ratio $1/f_1(t,X_t)$ exhibits a U-shaped pattern. The other component is the future contributions $\widetilde{C}(t,X_t)$ . It decreases with the real short-rate factor $X_1$ and is insensitive to the inflation factor $X_2$ . We plot the insurance premium with respect to these two components in three ages (22, 59, and 65) in Figure 5. At each age, the optimal surplus process in the insurance premium (see expression (3.16)) is set to be the expected surplus process $E[(W^{\widetilde{C}}_t)^*]$ , which is 48.42, 1,462.00, and 2,516.00, respectively. We observe that the bequest effect dominates the insurance demand throughout the individual’s lifetime when comparing $E[(W^{\widetilde{C}}_t)^*]/f_1(t,X_t)$ with $\widetilde{C}(t,X_t)$ .
4.4. Additional results
In this section, we present additional numerical results to complement our main findings. Details are available upon request.
4.4.1. Low risk aversion
While our numerical examples have assumed a risk aversion parameter of $\gamma=5$ , reasonable for pension management, we investigate the impact of varying risk aversion. Under our calibrated parameters (Table 1), we find that for $0<\gamma<1$ , the condition for the existence of a well-defined solution is violated. Specifically, the function $\Gamma_2(\tau) = P(\tau)/Q(\tau)$ given by (3.12) is positive definite throughout the time horizon, violating condition (3.22) in Proposition 3.3.
4.4.2. Ignoring the risk of pre-retirement death
We consider an individual who ignores the risk of pre-retirement death and thus opts out of life insurance. The “no insurance” individual solves the following optimization problem
which can be viewed as a limiting case of the original optimization problem (2.8), where the risk of pre-retirement death is ignored ( $\mu_{x+t} \equiv 0$ ). The optimal investment strategy in this scenario can be derived from (4.1) by setting $\mu_{x+t} \equiv 0$ , that is,
Compared with (4.1), the only difference is the intertemporal hedging demand, which is quantitatively small.
Figure 6(a) plots densities of wealth upon retirement, conditional on no pre-retirement death. Because the individual who recognizes the risk of pre-retirement death purchases life insurance, the accumulated wealth at retirement will be lower if the individual survives to retirement. Figure 6(a) shows the expected payoff upon pre-retirement death (account balance plus insurance payout), conditional on death at a given age. The protection provided by life insurance is advantageous (in terms of expected payoff) if the individual dies before age 59.
4.4.3. Mortality improvement
We have employed a relatively simple Gompertz-Makeham model and ignored the fact that mortality rates may change over time. We conduct additional analyses comparing the Gompertz-Makeham model with and without mortality improvement to see how mortality projection affects optimal investment and insurance strategies. We use mortality data from the Society of Actuaries’ Mortality Improvement Model (MIM-2021-v4) to incorporate mortality projections from 2017 to 2061. We design two experiments. The first one is “base year 2017” without mortality improvement. The second is “mortality projection 2017–2061” with mortality improvement.
We find that mortality improvement only marginally affects the optimal investment strategy, the optimal face value of the life insurance, and the expected surplus (accumulated account balance). Mortality improvement increases the optimal insurance premium in early ages but reduces it in mid-to-retirement ages, and the change is mostly due to the force of mortality, that is, how the life insurance is priced.
5. Conclusion
This research studies a DC pension plan management problem under both financial and mortality risks. The pension member can invest their account balance in a stock index, nominal and inflation-linked bonds, and a nominal cash account. Additionally, they can purchase life insurance to hedge against the risk of pre-retirement death. We formulate this pension management problem as a random horizon utility maximization problem and derive its explicit solution under the assumption of constant relative risk aversion (CRRA) utility. We calibrate our model to the U.S. data via a Kalman filter and demonstrate that the pension member’s demand for life insurance has a hump-shaped pattern with age and a U-shaped pattern with the real interest rate and expected inflation. The optimal pension account balance resembles a variable annuity with death benefits that are endogenously determined based on various factors, including age, mortality, account balance, future contributions, preferences, and market conditions. Our model offers insights into the design of new insurance products that variable annuities with more flexible death benefits should be offered in the DC account to cater to its member’s bequest demand.
The model presented in this research has several limitations that warrant further exploration. First, we assume the pension plan member can purchase (term) life insurance that is available continuously. This assumption is unrealistic but necessary to obtain the closed-form solution. It would be interesting to investigate other types of life insurance, such as single premium and annual renewable term life insurance. Second, our model focuses on term life insurance during the accumulation phase. It is an interesting direction to consider the decumulation phase and explore the role of whole life insurance. Third, we assume that the force of mortality is deterministic. We can consider stochastic mortality models that explicitly account for the uncertainty in future mortality rates. We leave these problems for future research.
Appendix A
A. Proof of Proposition 3.1
Proof. We substitute the candidate solution $G(t,W^{\widetilde{C}}_t,X_t)$ into the HJB equation (3.5) to verify the result. The derivatives of the candidate solution are given by
Substitute these derivatives into the HJB Equation (3.5), we can verify that the equality holds. Therefore, $G(t,W^R_t,X_t)$ is the candidate solution to the HJB equation (3.5). Finally, plug $G(t,W^R_t,X_t)$ into (3.7) and (3.8), we can derive the optimal strategies (3.15) and (3.16).
B. Proof of Proposition 3.2
Proof. We can apply Theorems 4.1.4. and 4.1.6 in Abou-Kandil et al. (Reference Abou-Kandil, Freiling, Ionescu and Jank2012) to prove the global existence of $\Gamma_2(\tau)$ . It is not difficult to restrict their comparison theorem from a semi-definite matrix case to a definite matrix case. Because $\Gamma_2(\tau)$ exists and $\Gamma_2(\tau)<0$ for $\forall \tau \in (0,T]$ , the existence of the candidate solution $G(t,W^{\widetilde{C}}_t,X_t)$ in (3.9) follows.
To prove the verification theorem, we define the value process for any $(\beta_t, I_t)\in \mathcal{A}_{\gamma}(0,T)$
where $s\in[t,T]$ . By Ito’s formula, we have
where $\mathcal{D}^{\beta,I}$ is the infinitesimal generator defined in (3.6) and $h^{\beta,I}$ satisfies
Next, fix $(t,w^R,X)\in[0,T]\times[0,\infty) \times \mathbb{R}^2$ and denote the conditional expectation of the value process as
where $E_{t,w^{\widetilde{C}},X}[\cdot]$ is short for $E[\cdot|W^{\widetilde{C}}_t=w^{\widetilde{C}},X_t=X]$ . Then, we have
Finally, we can prove the verification theorem by the following three steps:
Step 1: Verify the optimal strategy $(\beta^*,I^*)$ is in the admissible set $\mathcal{A}_{\gamma}(0,T)$ .
Recall from (3.15)
which satisfies a linear growth with $X_t$ due to the linear growth of $(\Lambda_t-\sigma_{\Pi})$ and $\frac{1}{f_1}\frac{\partial f_1}{\partial X^{\top}}$ . Moreover, substitute (3.15) and (3.16) into (3.4), we have
where $(\eta_t)^{\top} = \frac{1}{\gamma} (\Lambda^{\top}_t - \sigma^{\top}_{\Pi}) + \frac{1}{f_1} \frac{\partial f_1}{\partial X}\Sigma_X$ . Since the drift term and volatility term of SDE (B5) are almost surely sample continuous, then by Proposition 1.1 in Kraft (Reference Kraft2004), we can show that the SDE (3.4) has a unique strong solution under $(\beta^*,I^*)$ . Consequently, $(\beta^*_t, I^*_t)\in \mathcal{A}_{\gamma}(0,T)$ .
Step 2: Verify $J(t,W^{\widetilde{C}}_t,X_t) \leq G(t,W^{\widetilde{C}}_t,X_t)$ for any $(\beta,I)\in \mathcal{A}_{\gamma}(0,T)$ .
Before we proceed, we need the following useful lemma.
Lemma B.1. Assume a n-dimensional stochastic process $\widetilde{X}_t$ is driven by a m-dimensional Brownian motion $\widetilde{Z}$
where $\widetilde{x}_0$ is a constant n-dimensional vector, $\mu(t,\widetilde{X})\;:\; (0,\infty) \times \mathbb{R}^n \rightarrow \mathbb{R}^n$ is a borel function and $\sigma(t) :(0,\infty) \rightarrow \mathbb{R}^n\times \mathbb{R}^m$ a continuous function satisfying
where $||\cdot||_2$ is the Euclidean norm and $L^2(0,T;\;\mathbb{R})$ represents the set of Lebesgue measurable function $\psi : [0,T]\rightarrow \mathbb{R}$ , such that $\int_0^T |\psi(t)|^2 dt<\infty$ . If a stochastic process $\widetilde{g}(t,\widetilde{X}_t)$ , $\widetilde{g}\;:\;[0,T]\times \mathbb{R}^n \rightarrow \mathbb{R}^n$ , grows linearly with respect to $\widetilde{X}_t$ ( $||\widetilde{g}(t,\widetilde{X}_t)||_2\leq c_0 + c_1 ||\widetilde{X}_t||_2$ ), then we have
where
Proof. The proof is an extension of Lemma 4.1.1. in Bensoussan (Reference Bensoussan2004) to the case where $\widetilde{X}_t$ and $\mathcal{E}(t,\widetilde{g})$ share the same Brownian motion.
Next, following (3.5) and (B2), we have
Recall from (B3), for $s\in [t,T]$ ,
It is easy to prove $h^{\beta,I}(s,W^{\widetilde{C}}_s,X_s)$ is subject to a linear growth with respect to $X_t$ . By Lemma 7.1, $\mathcal{E}(t,h)$ is a martingale. Hence, for $\forall (\beta,I)\in \mathcal{A}_{\gamma}(0,T)$ , we have
Step 3: Verify $V(t,W^{\widetilde{C}}_t,X_t) = G(t,W^{\widetilde{C}}_t,X_t)$ under the optimal strategy $(\beta^*,I^*)$ .
Since $(\beta^*_t,I^*_t)$ maximizes the HJB (3.5) and $G(t,W^{\widetilde{C}}_t,X_t)$ is the solution to (3.5), the equality in (B6) holds
where
It is easy to verify that $h^{\beta^*,I^*}$ satisfies the linear growth condition. Therefore, by Lemma 7.1, $\mathcal{E}(t,h^{\beta^*,I^*})$ is a martingale. Then, we have the inequality for $G(t,w^{\widetilde{C}},X)$
Collecting (B7), (B8), and (B4), we show that $G(t,W^{\widetilde{C}}_t,X_t)=V(t,W^{\widetilde{C}}_t,X_t)$ , and $(\beta^*,I^*)$ given by (3.15) and (3.16) is the optimal portfolio and insurance strategy. The proof is complete.
C. Proof of Proposition 3.3
Proof. Substituting $y = \lambda - \frac{b}{4}$ into (3.19), we have
where $q = (8c - 3b^2)/8$ , $r = (b^3 - 4bc + 8d)/8$ , and $s =(\!-3b^4 + 256j - 64bd + 16 b^2c)/256$ . Moreover, the discriminant of $f_{y}(y)$ is given by
According to Rees (Reference Rees1922), once condition (3.21) is satisfied, (3.19) has four distinct real roots, that is, the Hamiltonian matrix H has four different real eigenvalues, which guarantees its diagonalizability and the full rank of its eigenvector matrix V. By Radon’s lemma (see Theorem 3.1.1. in Abou-Kandil et al., Reference Abou-Kandil, Freiling, Ionescu and Jank2012), we we have $\Gamma_2(\tau) = P(\tau)/Q(\tau)$ and the existence and negative definiteness of $\Gamma_2(\tau)$ from (3.22). Since $\Gamma_2(\tau)$ exists and $\Gamma_2(\tau)<0$ for $\tau \in (0,T]$ , the candidate solution $G(t,W^{\widetilde{C}}_t,X_t)$ in (3.9) exists globally.
Similar to Appendix B, we can prove the verification theorem in three steps. There are only two differences: first, in Step 1, we need to check $(W^{\widetilde{C}}_t)^*>0$ . It is easy to see the solution to (B5) satisfies
which is positive and satisfies the requirement of the admissible set (3.20). The argument for the strong solution is the same as Step 1 in Appendix B.
Second, in Step 2, we can use Fatou’s lemma rather than Lemma B.1 to prove the inequality (B7), as the value process (B1) is bounded below by zero when $0<\gamma<1$ . Define
and $\tau_n \;:\!=\; T\wedge \inf\{s\in[t,T]|\Psi(s)\geq n\}, n\in \mathbb{N}$ . For $s\in [t,\tau_n]$ , the stochastic integral
$\int_t^s g^{\beta,I}(u,W^{\widetilde{C}}_u,X_u)h^{\beta,I}(u,W^{\widetilde{C}}_u,X_u)dZ_u$ is a martingale. Then, following (3.5) and (B2), we have
Because $\lim \limits_{n\rightarrow \infty} \tau_n = T$ and $g^{\beta,I}(t,W^{\widetilde{C}}_{t},X_{t})\geq0$ for any $t\in[0,T]$ under $0<\gamma<1$ , we can show
for $\forall (\beta,I)\in \mathcal{A}_{\gamma}(0,T)$ , where the first inequality is by Fatou’s lemma and the second inequality can be obtained by taking conditional expectation on both sides of (C1). The proof is complete.
D. Estimation details for financial market
Denote $K_t = (X_{1,t},X_{2,t},\log \Pi_t, \log S_t)^{\top}$ , then the underlying states in the financial market are given by
where
$e_i$ represents the ith unit vector in $\mathbb{R}^2$ and $\iota_2 = (1,1)^{\top}$ . By Ito’s formula, the transition equation for states follows
where
For monthly data, we set $\Delta t = 1/12$ . Every month, there are 10 observations in the financial market: inflation index, equity index, and yield rate of nominal zero-coupon bonds with eight maturities. Following Koijen et al. (Reference Koijen, Nijman and Werker2011), we also assume that the yield rates are observed with independent errors. Let $R^{Y}(t,\tau_i), i=1,2,...,8$ denote the yield rates of nominal zero-coupon bonds at time t with maturity $\tau_i, i=1,2,...,8$ , then we have the measurement equation for the states
where $L_t = (R^{Y}(t,\tau_i)_{i=1,2,...,8}, \log \Pi_t, \log S_t)^{\top}$ is the observation vector. Moreover, the coefficients in (D2) are
where $A_0$ and $A_1$ are given by (2.4) and (2.5), respectively, $\widetilde{I}_2$ is the 2nd-order identity matrix, and $\chi_i, i=1,2,...,8$ are measurement errors in yields to be estimated.
Let $\widetilde{L}_t$ denote the set of the past observations $\{ L_1, L_2, ..., L_t \}$ for $t=1,2,..., n$ and define the conditional means and variances
We can then derive the Kalman filter for (D1) and (D2)
Denote the vector of all model parameters by $\psi$ , which contains 21 parameters in Table 1 and eight parameters in $\Sigma_{\eta}$ . Then, we can derive the log-likelihood function via “prediction error decomposition” (see Chapter 3.4 of Harvey, Reference Harvey1990).
Finally, we can maximize $\text{logL}(\widetilde{L}_n|\psi)$ to obtain the maximum likelihood estimate for the unknown parameters $\psi$ . Alternative estimation approaches include the expectation-maximization (EM) algorithm and Markov chain Monte Carlo (MCMC) algorithm (see Chapters 7.3.4 and 13.4 of Durbin and Koopman, Reference Durbin and Koopman2012).