1. Introduction
Multivariate normal distribution, the classical model for incorporating dependence, while able to accommodate any valid correlation matrix, by its very nature, imposes normal marginals. Use of copulas, see Sklar (Reference Sklar1959), Embrechts et al. (Reference Embrechts, Frey and McNeil2011), Jaworski et al. (Reference Jaworski, Durante, Hardle and Rychlik2010), Joe (Reference Joe2014), Nelsen (Reference Nelsen2006), and Salvadori et al. (Reference Salvadori, De Michele, Kottegoda and Rosso2007), was driven by the desire to decouple modeling of the marginals and dependence. An early article, likely the foremost in the actuarial literature, exploring the use of copulas in actuarial applications is that of Frees and Valdez (Reference Frees and Valdez1998), with the use of copulas now well established as the primary tool for dependence modeling – see Bassamboo et al. (Reference Bassamboo, Juneja and Zeevi2008) (credit risk), Chen et al. (Reference Chen, MacMinn and Sun2015) (mortality risks), Hua et al. (Reference Hua, Xia and Basu2017) (spatial dependence), Hsieh et al. (Reference Hsieh, Tsai and Wang2021) (mortality risks), Kularatne et al. (Reference Kularatne, Li and Pitt2021) (general and life insurance), Li et al. (Reference Li, Balasooriya and Liu2021) (mortality risks), and Oh et al. (Reference Oh, Ahn and Lee2021) (automobile insurance).
While Gaussian copula has found many successful uses, see Othus and Li (Reference Othus and Li2010), Renard and Lang (Reference Renard and Lang2007), Song et al. (Reference Song, Li and Yuan2009), and Van de Vyver and Van den Bergh (Reference Van de Vyver and Van den Bergh2018), a key feature of Gaussian dependence is that while it inherits the ability to model a wide range of central dependence from the multivariate normal distribution it also inherits tail independence. Li (Reference Li2000) is a pioneering paper that introduced the use of the Gaussian copula for modeling credit curves to incorporate default correlation between credits, with applications in the valuation of credit default swaps. While its simplicity made it wildly popular, its very popularity and simplicity, coupled with its inability to capture tail dependence, made its (ab)use far beyond its original intended purpose a contributor to the global financial crisis of 2008, see Donnelly and Embrechts (Reference Donnelly and Embrechts2010) and Salmon (Reference Salmon2009). The importance of incorporating tail dependence in modeling has been noticed in many applications, see Blanchet and Davison (Reference Blanchet and Davison2011), Brechmann et al. (Reference Brechmann, Czado and Aas2012), Davis and Mikosch (Reference Davis and Mikosch2009), and Da Silva et al.(Reference Da Silva, Rebelo and Afonso2014).
One requires a measure of tail dependence for modeling in the presence of tail dependence. The popular Pearson’s correlation coefficient, a measure of central dependence, fails to capture tail dependence see Figure 7.4 of McNeil et al. (Reference McNeil, Frey and Embrechts2015). Among the various bivariate tail dependence measures, the tail dependence coefficient is the most widely used measure in the literature. For a random vector $(X_1,X_2)$ , with copula $C(\cdot,\cdot)$ , the lower tail dependence coefficient, $\chi(X_1,X_2)$ , is defined as
provided that the limit exists, with the zero limit corresponding to tail independence. The tail dependence matrix (TDM), $(\chi(X_i,X_j))_{1\leq i, j \leq d}$ , is the array of tail dependence coefficients corresponding to a d-dimensional random vector $\mathbf{X}_d=(X_1,\ldots,X_d)$ . The TDM serves as a d-dimensional measure of tail dependence, akin to the (linear) correlation matrix for central dependence. We denote the set of all d-dimensional TDMs by $\mathcal{T}_d$ .
An extension of the Gaussian copula that allows for tail dependence is the t-copula, though imposing identical dependence in the lower and upper tails. The extension is in the same sense that the multivariate t-distribution is an extension of the multivariate normal distribution, with the latter lying at the $\nu=\infty$ boundary of the former, where $\nu$ is the degrees of freedom of a t-distribution. The multivariate t-distribution we refer to is the distribution of the following ratio, where X and Y are independent:
The parameters of a t-copula comprise the correlation matrix of the underlying multivariate normal and the degrees of freedom of the chi-square distribution. Since a multivariate normal copula exists corresponding to every valid correlation matrix, a natural question is if the t-copula is similarly exhaustive with respect to $\mathcal{T}_d$ . Interestingly, while the set of correlation matrices is neatly characterized as positive semidefinite matrices with a unit diagonal, the characterization of the set of TDMs is far more complex. As observed in Strokorb (Reference Strokorb2013) and Fiebig et al. (Reference Fiebig, Strokorb and Schlather2017), $\mathcal{T}_d$ is a ${ \binom{d}{2}}$ -dimensional polytope, with explicit facet representations known only for $d \leq 6$ . Moreover, even determining if a given matrix belongs to ${\mathcal{T}}_d$ is NP-hard – see Shyamalkumar and Tao (Reference Shyamalkumar and Tao2020) and Janßen et al. (Reference Janßen, Neblung and Stoev2023). So, given the closed-form expression for the tail dependence coefficient of a t-copula, see equation (7.38) in McNeil et al. (Reference McNeil, Frey and Embrechts2015) or equation (4) in Shyamalkumar and Tao (Reference Shyamalkumar and Tao2022), it essentially follows that the t-copula does not cover ${\mathcal{T}}_d$ . Surprisingly, in Shyamalkumar and Tao (Reference Shyamalkumar and Tao2022), it it is shown that for $d=6$ , the relative volume of TDMs supported by t-copula is less than 1% – in this sense, thus providing only a sparse coverage of ${\mathcal{T}}_d$ . Given that the t-copula is parameterized with $O(d^2)$ number of parameters, not to mention that it only represents a small part of ${\mathcal{T}}_d$ , there is interest in the construction and study of models allowing for tail dependence with a more parsimonious parameterization, for example, with the number of parameters being linear in d.
A classical idea from the broader dependence modeling literature is to employ a linear factor structure, $X=L_{d\times p}V_{p\times 1}+\epsilon$ , where V is a vector of factors with mean $\mathbf{0}$ and identity variance–covariance matrix, and $\epsilon$ , the idiosyncratic component, independent of V, has mean $\mathbf{0}$ and a diagonal variance-covariance matrix $\boldsymbol{\Lambda}$ . Note that this results in the variance–covariance matrix $\mathbf{\Sigma}_{d\times d}$ satisfying $\mathbf{\Sigma}=LL' + \boldsymbol{\Lambda}$ , and hence parametrized by $d(p+1)=O(d)$ number of parameters. The multivariate normal distribution with the variance-covariance matrix allowing such a decomposition admits such a linear factor structure with $V \sim \mathcal{N}(\mathbf{0}, \mathbf{I})$ and $\epsilon \sim \mathcal{N}(\mathbf{0}, \mathbf{\Lambda})$ . Similarly, the t-copula with such a correlation matrix admits a near-linear factor structure – the non-linearity is with respect to the chi-square random variable in the denominator.
Such models, even with $p=1$ , find applications in finance and insurance. For example, in portfolio credit risk modeling, the common factor represents the systematic risk, and the reciprocal of the denominator represents the common shock the obligors face. The generality is achieved by relaxing the normality of the systematic and idiosyncratic risk and allowing for a more general class of common shocks. Bassamboo et al. (Reference Bassamboo, Juneja and Zeevi2008) models economies where the dependence amongst obligor defaults is primarily due to common shocks with light-tailed systematic and idiosyncratic risks, whereas Tang et al. (Reference Tang, Tang and Yang2019), Chen et al. (Reference Chen, Tong and Yang2023) allow heavy-tailed systematic and idiosyncratic risks, in particular, to show that during periods of market stress – like recessions – the exogenous common shocks can be dominated by the systematic and idiosyncratic risks.
A generalization of the above near-linear factor structure is the factor copula of Krupskii and Joe (Reference Krupskii and Joe2013), which retains the number of parameters to be O(d). The factor copula model is a conditional independence model given p latent factors, which are taken, without loss of generality, to be standard uniform random variables. Specifically, such a factor copula is the joint distribution of marginally uniform random variable $(U_1,\ldots, U_d)$ , specified by conditional independence given the latent factors $V_i$ , $i=1,\ldots, p$ , and the conditional distributions of $(U_i, V_1,\ldots, V_p)$ , $i=1,\ldots, d$ , by d many $(p+1)$ -dimensional copulas. Clearly, this implies that the parametrization is of size O(d). In many applications, there are natural candidates for factors; for example, in finance and insurance, these could represent systemic risk, market risk, sector risk, and common shock, among others. In such cases, the use of a factor copula is rather natural. But importantly, even otherwise, the factor structure alleviates the paucity of high-dimensional parametric copulas by allowing construction of them using lower-dimensional parametric copulas.
The simplest of factor copula models are the one-factor copula models. Theoretically, a single latent variable model is equivalent to the general p latent variables case, but they are quite distinct from the practical parametric modeling point of view. In the d-dimensional one-factor copula model, $U_i$ , for $i=1,\ldots,d$ , are assumed to be conditionally independent given a latent random variable $V \sim U(0,1)$ . In particular, the one-factor copula can be expressed as
where $C_{i0}(u_i,v)$ , for $i=1,\ldots,d$ , are bivariate copulas, and $C_{i|0}(u_i|v)\;:\!=\;\partial C_{i0}(u_i,v)/\partial v $ , for $i=1,\ldots,d$ , are the conditional distribution functions. Since $C_{i0}(u_i,v)$ , for $i =1,\ldots,d$ , link the random variables $U_i$ and V, they are referred to as the linking copulas of the one-factor copula models. The popularity of one-factor copulas partly derives from the fact that the collection of parametric bivariate copula families is substantially rich in contrast to those catering to higher dimensions.
We note that, more generally, vine copulas allow for the construction of copula models in higher dimensions, building up by linking together bivariate copulas using the regular vine specification, see Bedford and Cooke (Reference Bedford and Cooke2001, Reference Bedford and Cooke2002). See Aas et al. (Reference Aas, Czado, Frigessi and Bakken2009) See Czado and Nagler (Reference Czado and Nagler2022), for a discussion of vine copulas in the insurance context, where an application to financial time series in the presence of tail dependence is discussed. One of the challenges of working with vine copula, see Morales-Napoles (Reference Morales-Napoles2010), is that even after fixing the bivariate linking copula family, the choice of the regular vine tree structure is hard as there are, see Morales-Napoles (Reference Morales-Napoles2010), as many as $d!2^{(d-2)(d-3)/2-1}$ of them. This is an issue that one-factor copula models avoid at the expense of generality. It is worth noting that the one-factor copula model is a particular case of a C-vine (canonical vine) copula model, where we have conditional independence in the trees below the first, see Figure 1; see Chapter 3 of Joe (Reference Joe2011).
One-factor copulas, and factor copulas in general, have gained importance in actuarial science, as is evident from their use across application areas. Hua et al. (Reference Hua, Xia and Basu2017) employs it for modeling thunderstorm loss data, with an innovative approach to model nonlinear high-dimensional spatial dependence while capturing micro-scale dependence and keeping model complexity low. In property insurance, modeling hail loss, Gao (Reference Gao2022) introduces a spatial factor copula model to capture spatial dependencies between real estate properties and aspatial dependence induced by the common shock of experiencing the same storm. Chen et al. (Reference Chen, MacMinn and Sun2015) concerns multi-population mortality modeling, where the first stage of the model caters to the temporal dependence in a population’s mortality index, using ARMA and GARCH, and the second stage caters to dependence between the standardized innovations across populations for a fixed time epoch. It is in this second stage that they employ a single-factor copula model. The copula model is that of Oh and Patton (Reference Oh and Patton2017), which offers a flexible dependence structure with the ability to model heterogeneity to exchangeability, and tail independence to symmetric or asymmetric tail dependence. In Hsieh et al. (Reference Hsieh, Tsai and Wang2021), the interest is in mortality risk management of insurance policy pools, and it explores the use of factor copulas to model the dependence among the future lifetimes of insured individuals in an insurance policy pool. In particular, this is used to compute the hedge ratio for hedging mortality risk in a block of whole-life policies using a life settlement policy block as a hedge. Beyond mortality modeling, one-factor copula models find application in credit risk management, as demonstrated by Kolman (Reference Kolman2014), who developed an alternative model for portfolio credit risk.
From the above, we see that the factor copula models have arisen naturally as parsimoniously parameterized models that accommodate nonlinear dependence and asymmetric tail dependence in high-dimensional data. Also, in a sense, even the simplest of them all, the one-factor copula, has found successful applications in insurance and finance. This prompted the current study to evaluate the range of tail dependence one-factor copula models can cater to with respect to the TDM as the measure of tail dependence. In other words, just the choice of a one-factor copula model implicitly determines a certain form of tail dependence?
Note that modeling with pair-copula constructions, in general, requires the choice of a parametric family for the linking copula. Since the analysis to answer the above-posed question is dependent on the linking copula family, we chose the family of copulas referred to as BB1 (Bivariate-Biparameter), see Joe and Hu (Reference Joe and Hu1996) and section 5.2 of Joe (Reference Joe1997). Our choice was driven by Joe (Reference Joe2014) which on page 220, after a comparative study of tail-dependent copulas, observes that Because of the above properties, for the modeling with vine and factor copulas, it might be sufficient to consider the BB1, reflected BB1, and BB7 for the bivariate copula families with asymmetric lower and upper tail dependence. Moreover, Joe (Reference Joe2011) on page 184 observes that Because of the known concordance ordering only for BB1, the BB1 family might be preferable over BB4 and BB7 for getting asymmetric tail dependence with vine copulas. Also, for actuarial interest, we note that Nikoloulopoulos et al. (Reference Nikoloulopoulos, Joe and Li2012) on page 15 states that Sensitivity analysis, for extreme value inference of a portfolio, that uses more lower tail dependence than likelihood-based estimates, can be performed with BB1/BB7 copulas (but not t copulas) in the vine, and the forecasting can be substantially improved [sic].
In the following, we briefly mention some studies where it has found use. Krupskii and Joe (Reference Krupskii and Joe2020) proposes a copula-based model for non-Gaussian data with time-varying dependence. Their model uses a static one-factor model with BB1 linking copula and incorporates time-varying dependence by modeling the conditional distribution given the factor using the Gaussian copula with a time-varying variance-covariance matrix. In financial applications, the market indices are natural choices for the latent factor. In a stress-testing scenario for bond yields and CDS spreads, they find that the use of a tail-dependent copula like BB1 generates more distressed scenarios. Krupskii and Joe (Reference Krupskii and Joe2013) investigates the efficacy of one-factor and two-factor models alongside truncated regular-vine models with various choices for the linking copula. Their one-factor model using the BB1 as the linking copula, and their two-factor model using the BB1 copula for the first factor and the Frank copula for the second, with independence between the two factors, are among the best performers according to the AIC. Righi and Ceretta (Reference Righi and Ceretta2013) considers pair copula construction for modeling multivariate dependence between sector indices in the Brazilian market surrounding the subprime crisis period. They recommend a D-vine over a C-vine structure, and for each of the pair copulas, they considered Student’s t, Frank, and many members of the BB and Joe families. They find that for some of the sector index pairs the BB1 was the preferred copula. De Luca et al. (Reference De Luca, Ruscone and Amati2023), again in a pair copula construction setting, find a use for the BB1 copula for studying the influence of economic sectors on European stock returns employing novel tail dependence-based measures of influence. Nonfinancial references using BB1 copula include Lazoglou and Anagnostopoulou (Reference Lazoglou and Anagnostopoulou2019) and Lan et al. (Reference Lan, Shao, Zhu, Lo and Ng2021).
In Section 2, for the one-factor model with the BB1 linking copula family, we derive the tail dependence coefficients as a function of the linking copula parameters. Also, we establish some basic properties of this function as it is only implicitly defined. In Section 3, we explore the representativeness of the class of TDMs generated by the one-factor BB1 copula model. Since even with $O(d^2)$ -parametrized t-copula leads to vanishing relative volume of the set of TDMs generated, with an O(d)-parameterization of our model, the relative volume is bound to be unsatisfactory. While we confirm this, we note that even with zero relative volume, the model could well cater to a wide variety of tail dependencies in an approximate sense. We present a novel manner of evaluating the representativeness of the set of TDMs supported by our model. While unsatisfactory in this new sense as well, it gives the modeler using a one-factor model a way to check the appropriateness of the model for his particular application. In Section 4, we formulate the problem of finding the best representative one-factor BB1 model given a target TDM and suggest an implementation along with a simulation study of its performance across dimensions. In Section 5, we model rainfall data using the one-factor BB1 model to illustrate the use of the results in our study. The analysis presented in this paper is centered around the lower tail of a one-factor copula model utilizing the BB1 family as the linking copula. While this focus is admittedly narrow, we explain in Section 6 that our findings have much broader implications.
Notation: Throughout this paper, all vectors and matrices are represented in boldface. The entry in the i-th row and j-th column of a matrix $\mathbf{A}$ is denoted interchangeably by $a_{ij}$ or $a_{i,j}$ . We use $\mathbf{I}$ to represent the identity matrix and $\mathbf{0}$ and $\mathbf{1}$ / $\mathbf{J}$ to represent vectors/matrices with all elements equal to 0 and 1, respectively. Since the TDMs are defined by their upper-diagonal elements, the vectorized form of a matrix in this paper refers to the vector consisting of its upper off-diagonal elements. The vectorized form of TDMs is used in the context of describing their geometry, for example, ${\mathcal{T}}_d$ as a convex polytope. We denote the indicator function of a set A by $I_A$ . When working with matrices, say $\mathbf{A}_{m \times n}=(a_{ij})$ , the Frobenius norm are denoted by $\Vert \cdot\Vert$ , and is defined as $||\mathbf{A}||\;:\!=\;\sqrt{\sum_{i=1}^m\sum_{j=1}^n a_{ij}^2}$ .
We focus on the set of TDMs generated by the one-factor model with BB1 family linking copulas. Let $\mathcal{T}_d'$ denote the set of all d-dimensional TDMs that can be supported by such copulas. We say a TDM is feasible if it is in $\mathcal{T}_d'$ ; otherwise, this TDM is infeasible.
2. Tail dependence: one-factor copula model with BB1 family
In the following, we derive the lower tail dependence coefficients for the one-factor copula model with the BB1 family for the linking copula. To this end, we define the BB1 family of copulas indexed by two parameters $\theta> 0$ and $\delta\geq 1$ as that given by
see Joe and Hu (Reference Joe and Hu1996) and section 5.2 of Joe (Reference Joe1997). This is the copula derived from the bivariate Weibull distribution of Lu and Bhattacharyya (Reference Lu and Bhattacharyya1990). Also, the BB1 copula is referred to as the generalized Clayton copula, see page 259 of McNeil et al. (Reference McNeil, Frey and Embrechts2015).
Remark 2.1. The limiting case of (2.1) as $\theta$ approaches zero is the Gumbel copula. It is noteworthy that as $\theta \to \infty$ , the limiting copula is the comonotonicity copula $u\wedge v$ , corresponding to $U=V$ .
In Joe and Hu (Reference Joe and Hu1996), it is shown that for an Archimedean copula of the form
the lower tail dependence coefficient equals $2 \lim_{x \to \infty } \left[\eta'(2x)/ \eta'(x) \right]$ . Since the expression in (2.1) can be written in the form (2.2) with $\eta(x)= \left(1+x^{1/\delta }\right)^{- 1/ \theta } \in (0,1]$ and noting that $\eta^{-1}(x)=\left(x^{-\theta }-1\right)^{\delta}$ , the tail dependence coefficient for a BB1 copula equals
The normalized contour plots in Figure 2 display the ability of the BB1 family to model a variety of tail dependence.
Henceforth, we consider the case of a one-factor copula with the linking copulas belonging to the BB1 family with the parameters of the i-th linking copula denoted by $\theta_i(>0),\delta_i(\geq 1)$ , for $i=1,\ldots,d$ . The contour plots in Figure 3 display the ability of the one-factor copula with BB1 family as the linking copula to model a variety of tail dependence. The tail dependence coefficients $\chi_{ij}$ , for $1 \leq i < j \leq d$ , are given by
where
Theorem 2.1. The tail dependence coefficients $\chi_{ij}$ , for $1 \leq i < j \leq d$ , of the one-factor copula with the linking copula being the Family BB1 are given by
In the case that $\theta_i=\infty$ , and $\theta_j\in(0,\infty)$ , we have
Finally, if $\theta_i=\theta_j=\infty$ , $\chi_{ij}=1$ .
Proof. The case when either or both $\theta$ ’s equal infinity, the result follows from the fact that the BB1 copula with $\theta=\infty$ is the comonotonic copula. Hence, in this case, the corresponding coordinate(s) equals the common factor, and $\chi_{ij}$ corresponds to that for the BB1 copula. For the case when $\theta_i,\theta_j\in(0,\infty)$ , note that the limits of the conditional distributions $C_{i|0}(u|uw)$ , for $i =1,\ldots,d$ , as u approaches 0, are given by
The result now follows by applying the dominated convergence theorem, as the integrand
is bounded for all $u\in(0,1)$ by the integrable $g_ig_j$ of Lemma A.1 in the appendix.
Note that the tail dependence coefficients $\chi_{ij}$ , can be expressed as $\chi_{ij}=\Psi (\theta_i \delta_i,\theta_j \delta_j)$ , for $1 \leq i < j \leq d$ , where for $0<x,y<\infty$ ,
Hence, the study of the geometric properties of the set of TDMs generated by the one-factor copula with the BB1 family for linking copulas, requires study of the symmetric function $\Psi$ . Toward this, we denote the integrand above by $\psi$ , that is
The proof of the following proposition is in the appendix.
Proposition 2.1. $\Psi$ is a symmetric strictly increasing function on $(0,\infty]^2$ with continuous partial derivatives.
The following corollary conveniently lists some properties of $\Psi$ that are needed for the results in the following sections, and the proof can be found in the appendix.
Corollary 2.1. We have the following properties for $\Psi$ :
-
(i) The interval (0,1] is the the range of $\Psi(\cdot,\cdot)$ .
-
(ii) If $\Psi(x,y)=\Psi(x',y')\in (0,1)$ , then $\textrm{sgn}(x- x')+\textrm{sgn}(y-y')=0$ .
3. Representativeness of $\mathcal{T}_d'$ as a subset of $\mathcal{T}_d$
To study the geometric properties of $\mathcal{T}_d'$ , we define the embedding function $\mathbf{\Psi}_d\;:\; \mathbb{R}_+^d \to \mathcal{T}_d' $ by
where $ \mathbf{x}=[x_1,x_2,\ldots,x_d] \in \mathbb{R}_+^d$ is the vector of the parameters defining the tail dependence coefficients, that is $x_i\;:\!=\;\theta_i\delta_i$ , for $i=1,\ldots,d$ . Note that $\mathcal{T}_d'$ is the range of $\mathbf{\Psi}_d(\mathbb{R}_+^d)$ .
For $d=2$ , Corollary 2.1 readily shows that $\mathcal{T}_2'=(0,1] \subset \mathcal{T}_2=[0,1]$ . On the other hand, when $d \geq 3$ , $\mathcal{T}_d'$ is a strict subset of $\mathcal{T}_d$ . To see this, we consider MA(1) matrices with unit diagonal, that is, $\mathbf{T}=(t_{ij})$ , where $t_{ij}=I_{i=j}+\alpha I_{|i-j|=1}$ , for $1 \leq i,j \leq d$ and $\alpha \in (0, 1/2]$ . Any such matrix is a member of $\mathcal{T}_d$ , for $d \geq 2$ (see Section 4.1 of Embrechts et al., Reference Embrechts, Hofert and Wang2016 or Proposition 6 of Shyamalkumar and Tao, Reference Shyamalkumar and Tao2020). However, its third-order leading principal sub-matrix of the form
is not feasible by Proposition 3.2 below. Therefore, $\mathcal{T}_d \setminus \mathcal{T}_d'$ in non-empty.
In the first subsection, we establish some topological and geometric characteristics of $\mathcal{T}_3'$ . The second subsection introduces an imposed ordering and elaborates on geometric properties for $\mathcal{T}_d'$ , $d \geq 4$ .
3.1. Volume of $\mathcal{T}_d'$ relative to $\mathcal{T}_d$
As a starting point, we aim to determine the precise volume of the TDMs generated by the one-factor copula, using the Family BB1 as the linking copula in a three-dimensional setting. Our analysis begins with an exploration of a key property concerning the three-dimensional tail dependence coefficients, which is presented in the following proposition.
Proposition 3.1. Let $\chi_{12},\chi_{13},\chi_{23} \in (0,1]$ . Then,
can have at most one solution. Furthermore, if $\{x,y,z\}$ is the solution, then $\chi_{12} \leq \chi_{13} \leq \chi_{23}$ if an only if $x\leq y \leq z$ .
Proof. Suppose that $\{x,y,z\}$ and $\{x',y',z'\}$ are both solutions to the system of equations. Without loss of generality, we can assume that $x \geq x'$ . By Corollary 2.1 (ii), we know that $y \leq y'$ and $z \leq z'$ from the first two equations. The third equation shows that $y=y'$ and $z=z'$ . Hence, $x=x'$ , and the two solutions are identical.
Now, let $\{x,y,z\}$ be the solution. The if part follows from monotonicity of $\Psi$ from Proposition 2.1. For the only if part, let $\chi_{12} \leq \chi_{13} \leq \chi_{23}$ . Again using monotonicity of $\Psi$ , the first two equations along with $\chi_{12} \leq \chi_{13}$ indicate that $y \leq z$ , and the last two equations similarly imply that $x \leq y$ . Therefore, we have $x\leq y \leq z$ .
Before delving into properties of ${\mathcal{T}}_d'$ at higher dimensions, we further examine the representativeness of ${\mathcal{T}}_3'$ . Proposition 3.1 establishes that $\mathcal{T}_3'$ displays invariance with respect to permutation of the coordinates. For expositional ease, we hence focus on the subset of the TDMs satisfying $\chi_{12} \leq \chi_{13} \leq \chi_{23}$ . This subset, which we denote as $ \mathcal{O}_3$ , takes the form of a tetrahedron with vertices at (0,0,0), (0,0,1), $(0, 1/2, 1/2)$ , and (1,1,1). We further denote the collection of feasible points within $\mathcal{O}_3$ by $\mathcal{O}'_{\!\!3}$ , that is $\mathcal{O}'_{\!\!3} \;:\!=\;\mathcal{T}_3' \cap \mathcal{O}_3$ . The volume of $\mathcal{O}_3$ (resp. $\mathcal{O}'_{\!\!3}$ ) is clearly one-sixth that of $\mathcal{T}_3$ (resp. $\mathcal{T}_3'$ ). Figure 4(a) displays the six cells of $\mathcal{T}_3$ belonging to its partition induced by the ordering of the coordinates. The following proposition gives a necessary and sufficient condition for membership in ${\mathcal{T}}_3'$ , and using it derives its volume.
Proposition 3.2. A matrix of the form
is in $\mathcal{T}'_{\!\!3}$ if and only if $a \geq \Psi \left(\Psi_1 ^{-1}(b),\Psi_1 ^{-1}(c) \right)$ . Furthermore, the $\mathcal{T}'_{\!\!3}$ has a volume of about $0.12882$ , which is about 26% of the volume of ${\mathcal{T}}_3$ .
Proof. For the only if part, by Proposition 2.1, we establish that for any given $b=\Psi(x,z)$ and $c=\Psi(y,z)$ , where $0 < b \leq c \leq 1$ , the lower bounds for x and y are determined as $\Psi_1 ^{-1}(b)$ and $\Psi_1 ^{-1}(c)$ , respectively. Consequently, the lower bound for $a=\Psi(x,y)$ is expressed as $\Psi \left(\Psi_1 ^{-1}(b),\Psi_1 ^{-1}(c) \right).$
For the if part, let a, b, and c satisfy $\Psi(\Psi_1^{-1}(b), \Psi_1^{-1}(c)) \leq a \leq b \leq c \leq 1$ . From Proposition 2.1, we have $\Psi$ is a symmetric, strictly increasing, and continuous function on $(0,\infty]^2$ . Since,
using continuity of $\Psi$ , we have the existence of a $z \geq \Psi_1^{-1}(c)$ satisfying $a = \Psi(\Psi_1^{-1}(b), z)$ . This establishes that the parameter $(\Psi_1^{-1}(b), z, \infty)$ corresponds to the TDM with off-diagonal elements of a, b, and $\Psi(z, \infty)$ . It is noteworthy that $\Psi(z, \infty) \geq \Psi(\Psi_1^{-1}(c), \infty) = c$ . If $\Psi(z, \infty) = c$ , our proof is complete; we now focus on the remaining case of $c < \Psi(z, \infty) \leq 1$ . Consider functions $\eta_{a,b}\;:\; (0,\infty) \to (\Psi_1^{-1}(a),z)$ and $\eta_{b,b}\;:\; (0,\infty) \to (\Psi_1 ^{-1}(b),\infty)$ , defined by
respectively. Given that $\Psi$ is strictly increasing and smooth, both $\eta_{a,b}$ and $\eta_{b,b}$ are continuous (see Lemma A.3 in the Appendix) and strictly decreasing. Defining $\phi(\!\cdot\!)$ by
we note that
The continuity of $\Psi(\cdot,\cdot)$ , $\eta_{a,b}$ and $\eta_{b,b}$ , imply the continuity of $\phi$ , and hence we have the existence of $z' > 0$ satisfying $\phi(z')=\Psi(\eta_{a,b}(z'),\eta_{b,b}(z'))=c$ . Consequently, the TDM with off-diagonal elements a, b, and c corresponds to the parameters $(\Psi_1 ^{-1}(b)+z', \eta_{a,b}(z'), \eta_{b,b}(z'))$ .
Recalling that $\mathcal{O}'_{\!\!3}$ is given by
the curved surface of the lower bound on $\chi_{12}$ is plotted in Figure 4(b). The region above this curved surface represents $\mathcal{O}'_{\!\!3}$ . Through integration across the curved surface and the $\chi_{13}-\chi_{23}$ plane, we can compute the volume below the surface as:
As a result, the volume of $\mathcal{O}'_{\!\!3}$ is approximately $0.021470$ , and this result, when multiplied by six, yields the volume of $\mathcal{T}'_{\!\!3}$ .
Remark 3.1. We observe that $\mathcal{O}'_{\!\!3}$ is not a convex set. This is so as the midpoint of the line segment between the points (0,0,1) and $(1/3,1/2,1/2)$ of $\mathcal{O}'_{\!\!3}$ (i.e. $(1/6,1/4,3/4)$ ) does not belong to $\mathcal{O}'_{\!\!3}$ . This is verified by referring to Proposition 3.2 and noting that:
Furthermore, note that the non-convexity of $\mathcal{O}'_{\!\!3}$ implies the non-convexity of $\mathcal{T}'_{\!\!3}$ .
Theorem 3.1. $\mathcal{T}'_{\!\!d}$ has a non-zero volume in $\mathbb{R}_+^{\binom{d}{2}}$ if and only if $d \leq 3$ .
Proof. Recall that the tail dependence coefficients of a d-dimensional one-factor copula with the linking copulas from a BB1 family are a function of the d-dimensional vector $(\theta_1\delta_1, \ldots, \theta_d\delta_d)\in(0,\infty)^d$ . Defining,
we see that the interior of $\mathcal{T}'_{\!\!d}$ is contained in the range of $\phi$ . Note that $d < d(d-1)/2$ , for $d \geq 4$ , hence the domain of $\phi$ is in a lower dimensional Euclidean space than its range. Moreover, Proposition 2.1 implies that $\phi$ is in $C^1$ . Hence, by using Sard’s theorem (see Theorem A.1 of the Appendix), we see that the range of $\phi$ has zero Lebesgue measure in $[0,1]^{\binom{d}{2}}$ .
Remark 3.2. It is important to note that choosing a different linking copula from the BB families listed in Joe (Reference Joe1997) would lead to a different set of TDMs generated by the one-factor copula for $d=3$ . In Table A1 in the Appendix, we provide additional information on related functions and the volume of the one-factor copula using two other BB families (BB4 and BB7) suggested in Section 8.6 of Joe (Reference Joe2011). To calculate the volume for the BB4 family, we employed a hit-and-run sampler, a method detailed in Kroese et al. (Reference Kroese, Taimre and Botev2013) and Shyamalkumar and Tao (Reference Shyamalkumar and Tao2022). This approach allowed us to uniformly generate a sample of 100,000 points from $\mathcal{T}_3$ , which we then assessed for their feasibility within the BB4 family. Importantly, it is worth noting that the tail dependence coefficient of the BB7 family adheres to the same mathematical form as that of the BB1 family. As a result, both families share identical volumes.
3.2. An imposed ordering
In the following, we show that the one-factor structure imposes a rather restrictive partial ordering on the tail dependence coefficients. While the discussion below is in the case of a BB1 linking copula, the development suggests that the phenomenon is more general. We begin by showing that, in general, on the contrary, all possible orderings of the tail dependence coefficients are possible. In the following ${\mathcal{I}}_d\;:\!=\;\{(1,2), (1,3),\ldots,(d-1,d)\}$ , the set of all ordered pairs of positive integers less than or equal to d.
Theorem 3.2. Given a linear ordering $\prec$ on ${\mathcal{I}}_d$ , there exists a TDM, $\mathbf{T}_d=(t_{ij})$ , such that the mapping $(i,j)\in{\mathcal{I}}_d\mapsto t_{ij}$ is strictly increasing in $\prec$ .
Proof. For $(i,j)\in{\mathcal{I}}_d$ , let $0\leq \varepsilon_{ij} \leq 1/d$ be such that $(i,j)\mapsto \varepsilon_{ij}$ is strictly decreasing in $\prec$ , and define $\varepsilon_{ji}\;:\!=\;\varepsilon_{ij}$ . Defining $t_{ij}\;:\!=\; 1/d-\varepsilon_{ij}$ , for $1 \leq i <j \leq d$ , we see that the mapping $(i,j)\mapsto t_{ij}$ is strictly increasing in $\prec$ . In the following, we will show that $\mathbf{T}_d=(t_{ij})$ is a TDM. By Theorem 1 of Shyamalkumar and Tao (Reference Shyamalkumar and Tao2020), it is equivalent to show that $\mathbf{T}_d/d$ is a Bernoulli Compatible Matrix (BCM). Also, by Corollary 3 therein, this is equivalent to showing the existence of events $A_i$ , $i=1,\ldots d$ , on a probability space with identical probabilities of $1/d$ such that $t_{ij}=\textrm{Pr}\left(A_i|A_j\right)$ . Toward this, we construct disjoint sets $A_{ij}$ , for $1 \leq i <j \leq d$ , and $B_i$ , $i=1,\ldots d$ , on a non-atomic probability space with probabilities
This can be done as the sum of the probabilities of the above-mentioned sets are
With $A_{ji}\;:\!=\;A_{ij}$ , for $1 \leq i <j \leq d$ , and
we see that $\textrm{Pr}\left(A_i\right)=1/d$ , $A_i\cap A_j=A_{ij}$ , and hence $t_{ij}=\textrm{Pr}\left(A_i| A_j\right)$ satisfies the sought ordering.
Remark 3.3. From the above, it is easily checked that for $\Vert \varepsilon\Vert_2\leq \sqrt{2}/d^2$ , by Cauchy-Schwartz inequality we have $\Vert \varepsilon\Vert_1\leq 1/d$ . The latter suffices for the validity of the above construction, and hence the above proof implies further that the TDM with off-diagonal elements equal to $1/d$ is in the interior of ${\mathcal{T}}_d$ , and moreover using the fact that ${\mathcal{T}}_d$ is a convex polytope we have that the line segment between the extreme points origin and $\mathbf{1}$ of ${\mathcal{T}}_d$ lies in its interior.
In the following, we assume that the linking copula is the BB1 copula and define $\eta_i\;:\!=\;\theta_i \delta_i$ , $i=1,\ldots,d$ . In the case $d=3$ , we have the ordering of $\eta_i$ ’s specifying the ordering of the tail dependence coefficients by the monotonicity of $\Psi(\cdot,\cdot)$ . In particular, if $\eta_1\leq \eta_2\leq \eta_3$ , then we have $\chi_{12}\leq \chi_{13}\leq \chi_{23}$ . For $d\geq 3$ , the lemma below shows that the ordering of $\eta_i$ s specifies a partial ordering on ordered pairs (i,j), for $1\leq i<j\leq d$ , denoted by $\prec_\eta$ , and that the mapping $(i,j)\mapsto \chi_{ij}$ is non-decreasing with respect to this partial order. Figure 5(a) contains the Hasse diagram for the partial ordering with non-decreasing $\eta_i$ s for $d=5$ and specifies the partial ordering for $d\leq5$ .
For the following development, we require an arbitrarily chosen strict total order on ${\mathcal{I}}_d$ . For simplicity, we use the reverse lexicographic ordering and denote it by $\prec_{\scriptstyle rl}$ . The notation used below does suppress the dependence on $\prec_{\scriptstyle rl}$ as we note below that it has little impact on the main conclusion.
Proposition 3.3. Given $\eta_i$ s, we define $(i,j)\prec_\eta(k,l)$ for $(i,j),(k,l)\in{\mathcal{I}}_d$ if $\min(\eta_i,\eta_j)\leq \min(\eta_k,\eta_l)$ and $\max(\eta_i,\eta_j)\leq \max(\eta_k,\eta_l)$ , with ties broken using $\prec_{\scriptstyle rl}$ . The binary relation $\prec_\eta$ is a partial order, and moreover, the mapping $(i,j)\mapsto \chi_{ij}$ is non-decreasing with respect to $\prec_\eta$ .
Proof. To show that $\prec_\eta$ is a partial order, we need to demonstrate the following properties for any (i,j), $(k,l),(u,v)\in {\mathcal{I}}_d$ :
-
i. Reflexivity: $(i,j)\prec_\eta(i,j)$ ;
-
ii. Antisymmetry: If $(i,j)\prec_\eta(k,l)$ and $(k,l)\prec_\eta(i,j)$ , then $(i,j)=(k,l)$ ;
-
iii. Transitivity: If $(i,j)\prec_\eta(k,l)$ and $(k,l)\prec_\eta(u,v)$ , then $(i,j)\prec_\eta(u,v)$ .
Part i. follows from the definition of $\prec_\eta$ and that $\prec_{\scriptstyle rl}$ is a partial order. For part ii., we have that
Hence, $\min(\eta_i,\eta_j)=\min(\eta_k,\eta_l)$ and $\max(\eta_i,\eta_j)=\max(\eta_k,\eta_l)$ . Since ties are broken (using $\prec_{\scriptstyle rl}$ ), we have $(i,j)=(k,l)$ . For part iii., the ordering implies that
Hence, since ties are broken using $\prec_{\scriptstyle rl}$ , we have $(i,j)\prec_\eta(u,v)$ .
The non-decreasing nature of the mapping $(i,j)\mapsto \chi_{ij}$ with respect to $\prec_\eta$ follows from Proposition 2.1 as for $(i,j)\prec_\eta(k,l)$ we have that
On the other hand, given a set of tail dependent coefficients corresponding to a TDM in ${\mathcal{T}}_d'$ , we can infer the ordering of the underlying $\eta_i$ s as demonstrated by the below proposition.
Proposition 3.4. Let $\bar{\chi}\;:\!=\;(\chi_{1,2},\ldots,\chi_{d-1,d})$ be tail dependence coefficients corresponding to a member of $\mathcal{T}_d'$ , with $d \geq 3$ . Also, let $n_{\bar{\chi}}$ be the function defined on $\{1,\ldots,d\}$ by
where $(i_m,j_m) \in \mathcal{I}_d$ , for $m=1,\ldots,d(d-1)/2$ , is defined by $\chi_{i_1,j_1} \leq \cdots \leq \chi_{i_{d(d-1)/2},j_{d(d-1)/2}}$ , with the ties broken using $\prec_{\scriptstyle rl}$ . Then $n_{\bar{\chi}}$ is injective, and $\eta_i$ s underlying $\bar{\chi}$ are non-decreasing with respect to $<_\chi$ , the strict total order on $\{1,\dots,d\}$ under which $n_{\bar{\chi}}$ increasing.
Proof. To show that $n_{\bar{\chi}}$ is injective, assume on the contrary that $n_{\bar{\chi}}(i) = n_{\bar{\chi}}(j)$ , for some $i<j$ . Then,
implying that $I_{(i_m=i \text{ or } j_m=i)} = I_{(i_m=j \text{ or } j_m=j)}$ , for $m=1,\ldots,d(d-1)/2$ . Since the latter implies that $i=j$ , we have a contradiction. Thus, $n_{\bar{\chi}}$ is injective.
Next, we prove that the $\eta_i$ s underlying $\bar{\chi}$ are non-decreasing with respect to the total order $<_\chi$ . For $i<j$ in the total order $<_\chi$ , we have $n_{\bar{\chi}}(i) < n_{\bar{\chi}}(j)$ , that is,
Hence, $\arg\max_m(i_m=i \text{ or } j_m=i) \leq \arg\max_m(i_m=j \text{ or } j_m=j)$ . Now, consider two cases. First, if $\arg \max_m(i_m=i \text{ or } j_m=i) < \arg\max_m(i_m=j \text{ or } j_m=j)$ , then $\chi_{ki} \leq \chi_{k'i}\leq \chi_{kj}$ , where $\chi_{kj}$ is the largest value in $\bar{\chi}$ that has j as an index, and $\chi_{k'i}$ is the largest value in $\bar{\chi}$ that has i as an index. By Proposition 2.1, this implies $\eta_i \leq \eta_j$ . Second, if $\arg\max_m(i_m=i \text{ or } j_m=i) = \arg \max_m(i_m=j \text{ or } j_m=j) \;:\!=\; l$ , then $\arg\max_{m\neq l}(i_m=i \text{ or } j_m=i) < \arg \max_{m\neq l}(i_m=j \text{ or } j_m=j)$ . This means that $\chi_{ki} \leq \chi_{k'i}\leq \chi_{kj}$ , where $\chi_{kj}$ is the second largest value in $\bar{\chi}$ that has j as an index, and $\chi_{k'i}$ is the second largest value in $\bar{\chi}$ that has i as an index. Hence, by Proposition 2.1, we again have $\eta_i \leq \eta_j$ . In both cases, we have shown that for $i<j$ in the total order $<_\chi$ , $\eta_i \leq \eta_j$ . Therefore, the $\eta_i$ s underlying $\bar{\chi}$ are non-decreasing with respect to the total order $<_\chi$ .
Proposition 3.4 implies that $\mathcal{T}_d'$ can be partitioned into $d!$ subsets with each corresponding to a permutation of $\{1,\ldots,d\}$ , consistent with the ordering of the underlying $\eta_i$ s. Note that the use of $\prec_{\scriptstyle rl}$ only is relevant for $\bar{\chi}$ with some tail dependence coefficients assuming the same value. By Proposition 3.3 we see that each of these subsets can be further partitioned corresponding to a permutation of $\mathcal{I}_d$ , consistent with the ordering of the tail dependence coefficients, and by symmetry we see that there are $(d(d-1)/2)!/d!$ such subsets. In the case $d=4$ , we have 30 such cells of $\mathcal{T}_4$ corresponding to the identity permutation on $\{1,\ldots,d\}$ , denoted by $\mathcal{O}_4$ , corresponding to an increasing $\eta_i$ s. Interestingly, only two of them are nonempty if we ignore $\bar{\chi}$ with some tail dependence coefficients assuming the same value. These correspond to the permutations,
consistent, respectively, with
This is so as these are the only permutations corresponding to linear orderings consistent with the partial order of Figure 5(a). In other words, of the 720 (non-empty interiors by Theorem 3.2 and Remark 3.3) cells of ${\mathcal{T}}_4$ determined by the ordering $<_\chi$ , only a mere $48\; (2\times 4!)$ have a non-empty intersection of their interiors with ${\mathcal{T}}_4'$ . The below theorem generalizes this observation for all $d\geq 3$ . In Table A2 of the Appendix, the volumes of these cells of $\mathcal{O}_4$ calculated using polymake (see Gawrilow and Joswig, Reference Gawrilow and Joswig2000) are given. It is worth noting that the nonempty cells 1 and 7 – polytopes – share a facet of dimension 5. The adjacency structure for all 30 polytopes of $\mathcal{O}_4$ is depicted in Figure A1 in the Appendix, where polytopes connected with lines indicate that they share a common facet of dimension 5.
Remark 3.4. For any point in $\mathcal{O}_4$ , we can find the lower bound of its Euclidean distance to $\mathcal{O}_4'$ , the feasible area under the strict total order $<_\chi$ , by projecting it onto cells 1 or 7 of the partition $\mathcal{O}_4$ . For example, the lower bound from the vertex $(1,0,0,0,0,1)\in \mathcal{O}_4$ to cells 1 and 7 is $\sqrt{0.8}$ . The lower bound to the feasible area can be considered as a lower bound to the closest feasible matrix, which we will discuss in the next section.
For the proof of the theorem, we need the concept of a Young tableaux. A Young tableaux, see Section 5.1.4 of Knuth (Reference Knuth1998), is an arrangement of $n_1\geq n_2\geq \ldots \geq n_{m-1}\geq n_m>0$ distinct integers in an array of left-justified rows with the i-th row containing $n_i$ elements, $i=1, \ldots m$ , such that the elements in each row are in an increasing order (left to right) and the elements in each column are increasing top to bottom. Below, find it convenient to refer to the underlying table structure as a Young graph, refer to the assignment of numbers to the graph as the labeling, and a labeling satisfying the above monotonicity property as a regular labeling. In Table 1a we depict a Young tableaux corresponding to the (4,2,1) partition of 7. In Table 1b, we depict a certain version of a shifted Young tableaux in which the i-th row begins at the i-th column, and in Table 1c, we depict the transpose of the shifted Young tableaux which we will refer to as a Thrall tableaux.
While there is an obvious equivalence between a regular labelings of a Thrall graph and its corresponding shifted Young graph, and such labelings yield a regular labeling of the corresponding Young graph, Table 2 shows that not all regular labelings of a Young graph yield a regular labeling of its corresponding Thrall graph. The following result from Thrall (Reference Thrall1952) gives the number of regular labelings of a Thrall graph.
Theorem 3.3 (Thrall Reference Thrall1952). The number of regular labelings of a Thrall graph associated with $\tilde{n}=(n_1,\ldots,n_m)$ , a partition of n, with label set $\{1,\ldots,n\}$ is given by
where $n=n_1+\cdots+n_m$ ,
The number of Young tableaux of shape $(n_1,\ldots,n_m)$ is given by, see (34) on page 58 of of Knuth (Reference Knuth1998),
Interestingly, the formula in (3.2) was first derived from a group theoretic perspective in Frobenius (Reference Frobenius1900), and later a combinatorial proof was given in MacMahon (Reference MacMahon1909).
Theorem 3.4. For $d\geq 3$ , of the $\binom{d}{2}!$ subsets of ${\mathcal{T}}_d$ determined by $<_\chi$ , the number with nonempty intersection of their interiors with ${\mathcal{T}}_d'$ is bounded above by
In particular, the corresponding proportion decays super-exponentially to zero as
Proof. Let us consider, without loss of generality, the interior of the subset of ${\mathcal{T}}_d$ , which corresponds via $<_\chi$ to the identity permutation on $\{1,\ldots,d\}$ . By Proposition 3.4, points in this interior belonging to ${{\mathcal{T}}}_d'$ have a strictly increasing underlying $\eta_i$ s. Hence, by Propositions 3.3 and 3.1, we have that any such point is strictly increasing in a strict linear order of $\mathcal{I}_d$ that is consistent with the partial order implied by a strictly increasing underlying $\eta_i$ s. This partial order for $d=5$ is depicted in Figure 5(a). So the number of distinct strict total orders corresponding to such points is bounded above by strict linear orders on $\mathcal{I}_d$ consistent with the partial order implied by a strictly increasing underlying $\eta_i$ s. So to prove (3.3), by symmetry with respect to the permutation implied by $<_\chi$ , it suffices to show that the latter equals $N(d)/d!$ .
Considering the case $d=5$ , we see that the number of strict total orders satisfying the partial order of Figure 5(a) equals the number of regular labelings with $\{1,\ldots,10\}$ of a Thrall graph such as that shown in Table 2(c). In general, it should be now clear that we seek the number of regular labelings of a Thrall graph associated with $\tilde{n}_d=(d-1,d-2,\ldots,1)$ , a partition of $n_d=d(d-1)/2$ , with label set $\{1,\ldots,n_d\}$ . The proof of (3.3) is completed by an application of Theorem 3.3, and by observing that
The super-exponential decay conveyed by (3.4) follows from Lemma A.5 in the Appendix.
While Theorem 3.4 provides an upper bound, it is unlikely that it is exact. Consider a similar problem of the attainable orderings of $\{a_ia_j\}_{1\leq i\leq j\leq n}$ for n positive reals $a_1>\ldots>a_n>0$ . For the case $n=4$ , the corresponding poset is denoted by the following Hasse diagram of Figure 6 (which is isomorphic to that in Figure 5(a)). But as discussed in Johnston (Reference Johnston2014), the number of attainable linear orderings is 10 and not 12 (the total number of linear orderings consistent with the partial order of Figure 6). In this problem, while the diagonal is involved, the problem is seemingly simpler to analyze than the one we consider as instead of our $\Psi(\cdot,\cdot)$ it involves the simpler binary operator, the product. In the specific case of $d=5$ , all of the 12 possible linear orderings are attainable.
In Fiebig et al. (Reference Fiebig, Strokorb and Schlather2017), they investigate the set of all $\{0,1\}$ valued vertices of ${\mathcal{T}}_d$ as ${\mathcal{T}}_d$ being a convex set contains the convex hull of such vertices. In fact, Proposition 21 therein shows by such an analysis that the clique partition polytope is contained in ${\mathcal{T}}_d$ . While the non-convexity of ${\mathcal{T}}'_d$ voids such a rationale, it is nevertheless interesting as a simple measure of the coverage of ${\mathcal{T}}_d$ by ${\mathcal{T}}_d'$ . For this to be meaningful, for this discussion, we extend the class of BB1 copulas to include the lower tail-independent Gumbel copula to accommodate lower tail independence (i.e., tail dependence coefficient of zero). We note that if one of the linking copulae is Gumbel, then the corresponding coordinate is lower tail independent from the rest of the coordinates. This means that the set of all feasible $\{0,1\}$ valued TDMs corresponds to a subset of coordinates being perfectly tail dependent with each other (tail dependence coefficient of 1; comonotonicity copula; $\theta=\infty$ ) and the rest of the coordinates being tail independent with the other $d-1$ coordinates. In particular, the set of feasible $\{0,1\}$ valued TDMs contains $2^d-d$ members out of the dth Bell number of many such vertices of ${\mathcal{T}}_d$ . Using the lower bound for the dth Bell number provided by Berend and Tassa (Reference Berend and Tassa2010), we conclude that that the proportion of $\{0,1\}$ -valued vertices that are feasible is bounded above by
Table 3 presents the exact results for $d\leq 6$ .
4. Modeling with one-factor copula
From Theorem 3.1, we know that $\mathcal{T}_d'$ has zero volume for $d \geq 4$ , and moreover, that feasible TDMs are confined to a relatively few sub-regions of ${\mathcal{T}}_d$ . This, in particular, implies that the one-factor copula model’s ability to handle arbitrary tail dependence is quite limited in higher dimensions, similar to the findings in the study by Shyamalkumar and Tao (Reference Shyamalkumar and Tao2022) on t-copula. When an actuarial modeler uses the one-factor copula and relies on actuarial opinion to obtain a target TDM or estimates the TDM using data, there is a potential issue when the copula is unable to accommodate the target or estimated TDM. This is because expert opinion and estimates are inherently imprecise, and the modeler may need to carefully search for a closely approximating TDM within ${\mathcal{T}}_d'$ . In this section, we present an approach for finding such a TDM.
Given an infeasible d-dimensional target TDM $\mathbf{T}_g=(t_{ij})$ , we aim to find the closest feasible TDM in Frobenius norm. To achieve this, we formulate an optimization problem as the following:
Here, $\mathbf{\Psi}_d(\mathbf{x})$ is the TDM generated by the one-factor copula model with parameters $\mathbf{x}$ , and $\Psi(x_i,x_j)$ denotes the (i,j)-th entry of $\mathbf{\Psi}_d(\mathbf{x})$ . The objective function $\mathbf{h}_d(\mathbf{x})$ measures the distance between the target TDM $\mathbf{T}_g$ and the TDM generated by the one-factor copula model with parameters $\mathbf{x}$ . The optimization problem is subject to the non-negative constraints on $\mathbf{x}$ to ensure the feasibility of the resulting TDM.
Clearly, the definition of $\Psi$ as a definite integral without a closed-form value rules out a closed-form solution to the above optimization problem. Hence, we must rely on numerical optimization methods to solve the optimization problem (4.1). The fmincon function of MATLAB (see MathWorks, 2021) finds local solutions to nonlinear non-convex problems using an interior-point algorithm. It is worth noting that by providing the gradient,
where
we can significantly decrease the computational time. However, the built-in trust-region algorithm of the fmincon function performs worse; thus, we do not report its results.
To test the algorithm, we do not have a standard TDM test library available for $d \geq 7$ . Therefore, we utilize two parametric families of TDMs presented in Shyamalkumar and Tao (Reference Shyamalkumar and Tao2020). Firstly, we consider the two-sector matrices:
where $(\alpha, \beta) \in [0,1]^2$ and $\gamma \in [0, \gamma_u(\alpha,\beta)]$ . The value of $\gamma_u(\alpha,\beta)$ can be computed using a linear programming formulation, as detailed in Shyamalkumar and Tao (Reference Shyamalkumar and Tao2020). In this two-section case, it is natural to model it using a two-factor model. However, in our analysis, we utilize a one-factor copula to model this situation. Secondly, in order to test the algorithm, we consider the two-dependent matrices of the form:
where the matrix of the form (4.3) is in $\mathcal{T}_d$ if and only if $\alpha$ and $\beta$ satisfy the following constraints:
It is important to note that all of the matrices of the form (4.3) are not in ${\mathcal{T}}_d'$ , except when $\alpha=\beta=0$ , by Lemma A.6 of the Appendix.
All the computations in this section were performed on a computer equipped with a single Intel Quad Core i7 2.7GHz processor and 16GB of RAM. To investigate the computational performance of the fmincon function in MATLAB for both the two-sector and two-dependence cases, we conducted experiments with various dimensions. For the two-sector case, we considered dimensions $d=5,10,\ldots,60$ and randomly generated 25 pairs of $(\alpha, \beta) \in [0,1]^2$ for each dimension. We calculated $\gamma_u(\alpha,\beta)$ for each pair and then randomly generated $\gamma \in [0, \gamma_u(\alpha,\beta)]$ . Similarly, for the two-dependence case, we randomly generated 25 pairs $(\alpha, \beta)$ from show convex polytope (4.4) dimensions $5, 10, \ldots, 60$ . For each dimension, we report the running times across all non-linear instances but in Figure 7.
As shown in the figure, the running times are under four minutes for all instances from both sets of TDMs, for dimensions up to 60. These experimental results show that the interior-point algorithm demonstrates good scaling for our problem. It is important to highlight, however, that the running times exhibit a non-linear, but less than cubic, growth.
5. An application to weather-related insurance
The global increase in both the frequency and severity of natural catastrophes, due to climate change or otherwise, and accompanying increased competition for food resources with the shrinking of arable land have contributed to the importance of crop insurance. Excessive rain and, in extreme cases, flooding or near-drought conditions are key risk events covered by crop insurance, as these can severely disrupt a farming business. From the point of view of an insurer or a reinsurer with a portfolio of crop insurance policies spanning a geographical area, an accurate assessment of the dependence structure of rainfall at different locations, especially in the tails, is a critical input to evaluate the geographical diversification or lack thereof, the latter affecting both pricing and capital considerations. In the following, we show how the methodology above can help develop a one-factor copula model that embodies the tail dependence exhibited in a rainfall dataset. We consider below the daily rainfall data from the European Climate Assessment & Dataset project (Klein Tank et al., Reference Klein Tank, Wijngaard, Können, Böhm, Demarée, Gocheva, Mileta, Pashiardis, Hejkrlik, Kern-Hansen, Heino, Bessemoulin, Müller-Westermeier, Tzanakou, Szalai, Pálsdóttir, Fitzgerald, Rubin, Capaldo, Maugeri, Leitass, Bukantis, Aberfeld, van Engelen, Forland, Mietus, Coelho, Mares, Razuvaev, Nieplova, Cegnar, Antonio López, Dahlström, Moberg, Kirchhofer, Ceylan, Pachaliuk, Alexander and Petrovic2002).Footnote 1 More precisely, we focus on a recent ten-year data (2011/1/1–2020/12/31) of 33 French cities with 1% of missing value.
A random vector (X,Y) of rainfall in two cites on a given day, as suggested by Shimizu (Reference Shimizu1993), is modeled as described below:
where $x,y,\lambda_i>0$ , $i=0,\ldots,3$ satisfy $\sum_{i=0}^3 \lambda_i=1$ , $F_1$ and $F_2$ are univariate continuous distribution functions supported on $(0,\infty)$ , and F is a bivariate continuous joint distribution function supported on the positive quadrant. In the following, as we restrict our interest to the upper tail dependence of rainfall in each pair of cities, we only need to model the joint cumulative distribution function F.
Our approach is to model F by choosing a one-factor copula with parameters $(\tilde{\theta},\tilde{\delta})$ and appropriate marginals $F_X$ and $F_Y$ . Following Serinaldi (Reference Serinaldi2008), we use the L-moments ratio diagrams (see Hosking (Reference Hosking1990)) for the purpose of choosing an appropriate distribution family for modeling the marginals. In Figure 8, the L-skewness versus L-kurtosis graph, we plot the $2\times\binom{33}{2}$ empirical estimates from the marginals of truncated (to $(0,\infty)^2$ ) bivariate distributions along with the curves from some commonly used distribution families. Figure 8 suggests the three-parameter Weibull distribution as a good fit. To estimate the parameters, we use the fact that the Weibull distribution is a reverse Generalized Extreme Value distribution. This allows us to employ the L-moments-based estimators for the parameters of Generalized Extreme Value distributions listed in Table 2 of Hosking (Reference Hosking1990). These estimators are conveniently made available on the R statistical environment by the packages lmom and lmomco (see Hosking, Reference Hosking2019 and Asquith, Reference Asquith2020).
Having estimated the marginals as described above, since we seek a model that captures well the tail dependence, we begin by estimating the tail dependence matrix. Among many nonparametric estimators for the tail dependence coefficient, the Reference Capéraà, Fougères and GenestCapéraà-Fougères-Genest estimator of the tail dependence coefficient (see Capéraà et al., Reference Capéraà, Fougères and Genest1997) given by
where n is the sample size, $u_{1,i}=\hat{F}_X(x_{1,i})$ , and $u_{2,i}=\hat{F}_Y(x_{2,i})$ , for $i=1,\ldots,n$ , shows the best performance in, for example Frahm et al. (Reference Frahm, Junker and Schmidt2005). Using this estimator we compute the $33 \times 33$ estimated TDM, denoted by $\widehat{\mathbf{T}}_{33}^{\text{CFG}}$ , which interestingly is a valid TDM.
On the other hand, there are 236 of $5{}456$ , $3 \times 3$ principal submatrices that are determined to be not feasible TDMs by Proposition 3.2. It is noteworthy that 236 represents only about $4.3\%$ of 5456, far smaller a proportion than the 26% relative volume of ${\mathcal{T}}_3'$ reported in Theorem 3.1. This, in particular, implies that $\widehat{\mathbf{T}}_{33}^{\text{CFG}} \in \mathcal{T}_{33} \setminus\mathcal{T}_{33}'$ , suggesting the use of the Frobenius norm closest one-factor feasible TDM, say $\widetilde{\mathbf{T}}_{33}$ , to $\widehat{\mathbf{T}}_{33}^{\text{CFG}}$ for estimating the parameters of a one-factor copula. As the upper tail-dependence coefficient only determines $\theta\delta$ , for our illustrative purposes we fix $\delta = 1.2$ , but in practice, one can use constrained maximum likelihood or any other suitable statistical paradigms.
To illustrate the goodness of fit resulting from the above inference procedure, we consider two sets of three cities. The first set consisting of Bourges, Nice, and Saint-Girons is chosen as its corresponding principal submatrix of $\widehat{\mathbf{T}}_{33}^{\text{CFG}}$ is not in $\mathcal{T}_{3}'$ ; this is so as according to Proposition 3.2, infeasibility is implied by
This matrix, its counterpart in the feasible TDM $\widetilde{\mathbf{T}}_{33}$ , and the corresponding one-factor copula parameters are given by
respectively. The above being quite similar suggests a good fit in the upper tails.
In Figure 9, the contour plots of the bivariate marginals from the fitted one-factor copula, with Weibull marginals, for these three cities being quite alike the contours of the kernel density estimate suggests a good overall fit. For this plot, we used the function sim1fact in the R package CopulaModel, see Joe (Reference Joe2014).
The second set of cities consisting of Bourges, Lyon, and Strasbourg was chosen as its corresponding principal submatrix of $\widehat{\mathbf{T}}_{33}^{\text{CFG}}$ is in $\mathcal{T}_{3}'$ ; this is so as according to Proposition 3.2, feasibility is implied by
This matrix, its counterpart in $\widetilde{\mathbf{T}}_{33}$ , and the corresponding one-factor copula parameters are given by
respectively. Note that these matrices are nearly identical. Also, the contour plots of the bivariate marginals from the fitted one-factor copula for these three cities are similar to the contours of the kernel density estimate, as shown in Figure 10, suggesting a good overall fit. Relatively speaking, notice that we see a better alignment of contours in Figure 10 than in Figure 9, agreeing with the observations on the empirical TDM.
Among the 40,920, $4 \times 4$ principal submatrices, as expected by Theorem 3.1, none are feasible as their distance to the nearest feasible matrix using the optimization problem of (4.1) is positive. Propositions 3.3 and 3.4, by partitioning these matrices into cells depicted in Figure A1, give a qualitative description of proximity to feasibility. In particular, 5691 are located in cells 1 or 7. Additionally, 11,117 matrices are situated in cells adjacent to cells 1 or 7, namely cells 2, 3, 8, and 13. To be precise, what we mean by the qualitative statement is the stochastic ordering depicted in Figure 11, which plots the kernel density estimates of the log-squared distance of each member to feasibility categorized by their membership in the cells. These distances were computed using the optimization problem of (4.1).
Consider the following set of four cities: Strasbourg, Nantes, Troyes, and Caen. Their principal submatrix within $\widehat{\mathbf{T}}_{33}^{\text{CFG}}$ is not in the cells 1 or 7 of Table A2. This matrix, its counterpart in $\widetilde{\mathbf{T}}_{33}$ , and the corresponding one-factor copula parameters are given by
Proposition 3.4 indicates that $\mathcal{T}_4'$ can be divided into $4!=24$ distinct subsets, each reflecting a permutation of the set $\{1,2,3,4\}$ , aligning with the sequence of the $\eta_i$ values. Furthermore, according to Proposition 3.3, these subsets can be subdivided based on permutations of $\mathcal{I}_4$ , arranged according to the tail dependence coefficients. Due to symmetry, this results in $6!/4! = 30$ such subdivisions. Notably, among these, only two subsets are nonempty. Located in cell 9 of Table A2 and Figure A1, the matrix’s projection falls within cell 1, preserving the ordering of the underlying $\eta_i$ ’s. Given the proximity of the TDM projection, this is not surprising.
6. Broader implications of our analysis
To begin with, we note that the tail dependence coefficients of the following linking copulas share the same functional form as that of the BB1 family: Clayton copula (see, section 7.1 of McNeil et al., Reference McNeil, Frey and Embrechts2015 or Family B4 in section 5.1 of Joe, Reference Joe1997), Galambos survival copula (see, section 4.9 of Joe, Reference Joe2014), BB4 survival copula (see, section 5.2 of Joe, Reference Joe1997), and BB7 copula (see, Table A1 in the Appendix). Hence, our results in Section 3 carry over to the one-factor model with these linking copulas as well.
Suppose that the tail dependence coefficient is defined by $\Psi\;:\; \Theta^2 \to [0,1]$ , with the parameter space $\Theta$ being an interval of $\mathbb{R}$ . In other words, we suppose that the i-th coordinate is associated with a parameter $\theta_i$ (possibly with parametrization, see FDG copulas below) such that the tail dependence coefficient between the i-th and the j-th coordinate equals $\Psi(\theta_i,\theta_j)$ . Further, if $\Psi$ is a symmetric, strictly monotone (not necessarily increasing) function with continuous derivatives, then the results outlined in Section 3 materially continue to hold. Only the computation of the volume and the non-convexity of $\mathcal{O}'_{\!\!3}$ referred to in Remark 3.1 use the specificity of the BB1 copula.
Examples of such linking copulas include the Gumbel survival copula (see, Section 7.1 of McNeil et al., Reference McNeil, Frey and Embrechts2015), B5 survival copula (see, Section 5.1 of Joe, Reference Joe1997), BB1 survival copula, and BB3 survival copula (see, Section 5.2 of Joe, Reference Joe1997). Interestingly, for all of these examples, the tail dependence coefficient between two coordinates has the form,
Note that the above $\Psi$ is a symmetric strictly increasing function on $[1,\infty]^2$ with continuous partial derivatives in the interior.
FDG copula family – one-factor copula with Durante Generators – is a one-factor copula model with the Durante class of copulas, a tractable nonparametric class, as linking copulas. See Mazo et al. (Reference Mazo, Girard and Forbes2016) for the FDG copula and Durante (Reference Durante2006) for the Durante class.
The Durante class of copulas is given by
where $f\;:\;[0,1] \to [0,1]$ , the generator, is a differentiable increasing function satisfying $f(1)=1$ with $u\mapsto f(u)/u$ a decreasing function. Note that the copula represents an exchangeable distribution, and by considering the conditional distribution of X given $Y\leq v$ , it is easy to see the need for the restrictions on f with differentiability added for the existence of the density. The lower and upper tail dependence coefficients are f(0) and $1-f'(1)$ , respectively.
It is interesting to note that the bivariate margins of an FDG copula are themselves members of the Durante class. Hence, if $f_1$ and $f_2$ are the generators of the first two coordinates, then their bivariate margin is a Durante class copula with generator
In particular, $f_1(0)\cdot f_2(0)$ and $(1-f'_1(1))(1-f'_2(1))$ are its lower and upper tail dependence coefficients, respectively. This implies that the ordering of the tail dependence coefficients of the linking copula imposes a partial ordering on the tail dependence coefficients of the FDG copula. Some parametric examples of generators are the Cuadras-Augé generator $u\mapsto u^{1-\theta}$ for $\theta\in[0,1]$ , Fréchet generator $u\mapsto (1-\theta)u+\theta$ , for $\theta \in[0,1]$ and the Durante-exponential generators
Finally, since for any copula C, $C^\ast(u_1,u_2)\;:\!=\;C(1-u_1,1-u_2)$ also forms a copula, and the lower tail dependence coefficient with respect to C corresponds to the upper tail dependence coefficient with respect to $C^\ast$ , and vice versa, the applicability of our results is invariant to the focus being on the left or the right tail. Moreover, for copula families like the FDG copula, the results apply to both the tails of the copula.
7. Conclusion
In statistical modeling with reasonable complexity, it is common that explicit modeling choices sweep under the carpet implicit model assumptions, some of which may be unjustified. In this study, we investigate if this happens in the context of modeling high-dimensional data that exhibits nonlinear tail-dependence when we use a one-factor copula model with a parametric family for the linking copulas. Apart from deriving basic tail dependence properties for this mode, we precisely describe the implicit ordering of the tail dependence coefficients, an implementation of a solution to the formulated optimization problem to help model data from a tail dependence perspective and conclude with an illustrative application.
We believe this paper’s qualitative implications are broad; as a next step, an extension to the multifactor setting will be interesting as we continue to retain the linear-in-dimension parsimonious parameterization of the one-factor copula models.
Acknowledgments
We extend our sincere gratitude to the editor for choosing two excellent referees and to the referees whose detailed review led to a significantly improved article.
Competing interests
There are no competing interest to declare.
A. Appendix
Lemma A.1. $C_{i|0}(u|uw)$ is bounded above by the function $g_i$ , for $i=1,\ldots,d$ , respectively, where
for $\theta_i\in (0,\infty)$ .
Proof. Since, $C_{i|0}(u|uw)$ is a conditional distribution function, it is clearly bounded by 1. Hence, it suffice to consider the case $1< w <(1/u)$ . Since,
the first term of (2.4) is bounded above by $2^{1/\delta_i-1}$ . Using the fact that $1-(uw)^{\theta_i} \geq 0$ , we can bound the second term of (2.4) above by $w^{-(1+\theta_i)}$ , completing the proof.
The following lemma is a version of Exercise 3.3 in Shorack (Reference Shorack2017).
Lemma A.2. Let $X(t,\cdot)$ be a measurable function on the measure space $(\Omega,{\mathcal{A}},\mu)$ for each $t\in[a,b]$ . Moreover, for a.e. $\omega$ , the partial derivative $\frac{\partial}{\partial t} X(t,\omega)$ exists for all t in the non-degenerate interval [a,b], and that
for an integrable Y. Then
Proof of Proposition 2.1. By splitting the integral representation of $\Psi(\cdot,\cdot)$ (see (2.6)) and change of variables on the integral part on $[1,\infty)$ , we get
This yields, for $(x,y)\in(0,\infty)^2$ ,
Hence, it suffices to show that $\Psi$ has continuous partial derivatives which are positive on $(0,\infty)^2$ . Toward this, we claim that,
Note that the integrand is positive. Hence, it suffices to establish the above expression and its continuity. Toward this, note that the integrated is bounded by
and since this is monotone decreasing in x and integrable for $x>0$ , we have by Lemma A.2 the validity of (A1) on $(0,\infty)^2$ . Moreover, since the integrate in (A1) is continuous, the bound in (A2) and the dominated convergence theorem together imply the continuity of the partial derivatives of $\Psi$ on $(0,\infty)^2$ .
Proof. (Corollary 2.1.) Part (i) directly follows from Proposition 2.1. For part (ii), the proof follows from observing that by Proposition 2.1, $x\geq x'$ (resp., $x< x'$ ) implies $y \leq y'$ (resp., $y > y'$ ).
Lemma A.3. Let $0 <a \leq b<1$ and $w \in (\Psi_1^{-1}(a), \infty]$ such that $a=\Psi(\Psi_1^{-1}(b),w)$ , and define the function $\eta_{a,b}\;:\;(0,\infty)\to (\Psi_1^{-1}(a),w)$ as $a=\Psi(\Psi_1^{-1}(b)+x,\eta_{a,b}(x))$ . Then the function $\eta_{a,b}$ is continuous.
Proof. Let $s > 0$ , $\epsilon>0$ , and $u\;:\!=\;\eta_{{a},b}(s/2) \vee (\Psi_1 ^{-1}(b)+s+\epsilon)$ . According to Lemma A.4, with $\alpha$ increasing in its first argument and $\beta$ decreasing, we have
The continuity of $\alpha(\Psi_1^{-1}({a}), \cdot)$ implies that
For $\delta\;:\!=\;\min\{s/2,m\epsilon/M\}$ and $t \in (s-\delta,s+\delta)$ , we observe that
This implies $|\eta_{{a},b}(t)-\eta_{{a},b}(s)| \leq \frac{M}{m} |t-s| <\epsilon$ , demonstrating the continuity of $\eta_{{a},b}$ on $(0,\infty)$ .
Lemma A.4. A lower and an upper bound for the partial derivative $\frac{\partial}{\partial y} \Psi(x,y)$ are given by functions $\alpha(x,y)$ and $\beta(y)$ , where
Proof. By (A1), we establish a lower bound of $\frac{\partial}{\partial y} \Psi(x,y)$ as
Similarly, according to equation (A2), the upper bound is calculated to be
Theorem A.1 (Sard’s Theorem). Let $f\;:\;\mathbb{R}^n \to \mathbb{R}^m$ be $C^k$ , where $k \geq \max\{n-m+1,1\}$ . Let $M \subseteq \mathbb{R}^n$ denote the critical set of f, which is the set of points $x \in \mathbb {R} ^{n}$ at which the Jacobian matrix of f has rank $< m$ . Then the image f(M) has Lebesgue measure 0 in $\mathbb {R} ^{m}$ .
Lemma A.5. For $d \geq 3$ , we have
Proof. We use a looser version of the bound presented in Robbins (Reference Robbins1955) that the following is valid for all positive integers n:
Then
The following lemma follows directly from Proposition 3.2.
Lemma A.6. If
is in $\mathcal{T}'_{\!\!3}$ , then at least one of a and b must be equal to zero.