Hostname: page-component-745bb68f8f-mzp66 Total loading time: 0 Render date: 2025-01-22T19:51:37.119Z Has data issue: false hasContentIssue false

Modeling of spatial extremes in environmental data science: time to move away from max-stable processes

Published online by Cambridge University Press:  15 January 2025

Raphaël Huser*
Affiliation:
Statistics Program, Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Thomas Opitz
Affiliation:
INRAE, Biostatistics and Spatial Processes (BioSP, UR546), Avignon, France
Jennifer L. Wadsworth
Affiliation:
School of Mathematical Sciences, Fylde College, Lancaster University, Lancaster, UK
*
Corresponding author: Raphaël Huser; Email: [email protected]

Abstract

Environmental data science for spatial extremes has traditionally relied heavily on max-stable processes. Even though the popularity of these models has perhaps peaked with statisticians, they are still perceived and considered as the “state of the art” in many applied fields. However, while the asymptotic theory supporting the use of max-stable processes is mathematically rigorous and comprehensive, we think that it has also been overused, if not misused, in environmental applications, to the detriment of more purposeful and meticulously validated models. In this article, we review the main limitations of max-stable process models, and strongly argue against their systematic use in environmental studies. Alternative solutions based on more flexible frameworks using the exceedances of variables above appropriately chosen high thresholds are discussed, and an outlook on future research is given. We consider the opportunities offered by hybridizing machine learning with extreme-value statistics, highlighting seven key recommendations moving forward.

Type
Position Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Impact Statement

This position paper reviews the severe limitations of max-stable processes for environmental extreme data science, and discusses more appropriate alternative statistical frameworks for the modeling of spatial extremes that have emerged recently. Use of machine learning and artificial intelligence methods in spatial extreme-value modeling and inference is also discussed, and seven key recommendations to push the field forward are given.

1. Introduction

With the rise of statistical machine learning that marks the “data science revolution” (Donoho, Reference Donoho2017), and the increasing availability of massive high-quality environmental data products based on observation and simulation (e.g., large climate model ensembles, see Danabasoglu et al., Reference Danabasoglu, Lamarque, Bacmeister, Bailey, DuVivier, Edwards, Emmons, Fasullo, Garcia, Gettelman, Hannay, Holland, Large, Lauritzen, Lawrence, Lenaerts, Lindsay, Lipscomb, Mills, Neale, Oleson, Otto-Bliesner, Phillips, Sacks, Tilmes, van Kampenhout, Vertenstein, Bertini, Dennis, Deser, Fischer, Fox-Kemper, Kay, Kinnison, Kushner, Larson, Long, Mickelson, Moore, Nienhouse, Polvani, Rasch and Strand2020; reanalysis data, see Hersbach et al., Reference Hersbach, Bell, Berrisford, Hirahara, Horányi, Muñoz-Sabater, Nicolas, Peubey, Radu, Schepers, Simmons, Soci, Abdalla, Abellan, Balsamo, Bechtold, Biavati, Bidlot, Bonavita, De Chiara, Dahlgren, Dee, Diamantakis, Dragani, Flemming, Forbes, Fuentes, Geer, Haimberger, Healy, Hogan, Hólm, Janisková, Keeley, Laloyaux, Lopez, Lupu, Radnoti, de Rosnay, Rozum, Vamborg, Villaume and Thépaut2020; remote sensing, see OCO-2 Science Team et al., Reference Gunson and Eldering2020; wide in-situ observation networks, see Menne et al., Reference Menne, Williams and Vose2009; mobile sensors or citizen-science data, see https://www.nesdis.noaa.gov/about/citizen-science), the relevance of traditional statistical models is at stake more than ever. This is especially true with the modeling and prediction of environmental extreme events, where assumptions are crucial for accurate risk assessment and mitigation, and where applied findings are of key societal importance on a global scale (IPCC, Reference Romero2023). On the one hand, sophisticated models that are well supported by probability theory are desired, in order to provide sound probability estimates of future low-likelihood-high-impact events that occur in the tail of the distribution. On the other hand, practical considerations should guide the model construction to ensure that it can efficiently utilize available data and provide the answers we need to appropriately address the specific scientific problem of interest. In particular, the variability of processes along the space and time dimensions usually plays a key role in environmental science, and a given statistical model should capture the most important marginal and dependence features of the data, such as spatiotemporal trends and nonstationarity, non-Gaussianity, and subasymptotic forms of tail dependence (see Section 3 for details). Such models should also enable fast-enough inference, which includes model fitting, validation, simulation, and prediction. The speed at which a statistical extreme-value analysis must be performed and the amount of human and material resources required to achieve it depend strongly on the context; while spending months or years could be acceptable for academic purposes or for retrospective studies, it is crucial in some cases to do it within just a few days or weeks (as, e.g., with rapid extreme-event attribution studies to respond to the media about the role of anthropogenic forcings in the occurrence of a recent catastrophic event; see Stott et al., Reference Stott, Stone and Allen2004; Risser and Wehner, Reference Risser and Wehner2017) or even “online” (as, e.g., with operational early warning systems predicting natural hazards in real time, where the safety of people or infrastructure is at risk; see Nguyen et al., Reference Nguyen, Veraart, Taisne, Tan and Lallemant2023). Oftentimes, however, the requirement for both rigor and speed are at odds with each other: popular spatial model classes arising from asymptotic extreme-value theory are often computationally prohibitive due to their intricate probabilistic structure, or subject to important practical restrictions that hamper their widespread application in real operational settings, where data are generally big and complex. In this article, we recall that an asymptotic motivation should never supersede applied scientific considerations and proper model checking. Importantly, we argue that the class of max-stable processes (MSPs), characterized by extreme-value copulas, whose practical usage by statisticians and climate scientists has been multiplied since the article of Padoan et al. (Reference Padoan, Ribatet and Sisson2010), not only has many severe built-in limitations, but also fails to address the basic purpose it was made for: namely, to provide a suitable statistical framework for modeling spatial extremes and estimating small joint tail probabilities (or, similarly, high return levels of spatial aggregates) far beyond observed levels. The enthusiasm about, and adoption of, MSP models in environmental studies—for which we are partly responsible—is due to their solid theoretical foundation, which makes them appear as “natural” models to use, the fact that the extreme-value community has traditionally been more theory-oriented while being somewhat “detached” from concrete issues arising in real applications, and the availability of convenient user-friendly software such as the R package SpatialExtremes (Ribatet, Reference Ribatet2022) to fit and simulate these models. While (part of) the statistics of extremes community has already realized some of their limitations (Davison et al., Reference Davison, Huser, Thibaud, Gelfand, Fuentes, Hoeting and Smith2019; Huang et al., Reference Huang, Monahan and Zwiers2021; Huser and Wadsworth, Reference Huser and Wadsworth2022) and started to develop alternative modeling strategies that transcend the classical framework (Wadsworth and Tawn, Reference Wadsworth and Tawn2012; Opitz, Reference Opitz2016; de Fondeville and Davison, Reference de Fondeville and Davison2018; Huser and Wadsworth, Reference Huser and Wadsworth2019; Engelke and Hitz, Reference Engelke and Hitz2020; Huser et al., Reference Huser, Opitz and Thibaud2021; Wadsworth and Tawn, Reference Wadsworth and Tawn2022; Castro-Camilo et al., Reference Castro-Camilo, Huser and Rue2022), MSPs and extreme-value copulas still continue to be used in many spatial data applications and considered in simulation studies as the “default” option.

This article aims to openly discuss the known deficiencies of MSPs, and strongly encourage statisticians and climate scientists to move away from them in real applications unless better alternative solutions are not available. We argue that, as a community, it is now time to reflect and act upon the lessons learned over the past two decades, and move on with the broader adoption of more recent, flexible, efficient, and pragmatic solutions for the modeling of extremes in operational risk assessment studies and, more generally, environmental data science. We stress that extreme-value theory remains the appropriate framework to conduct reliable statistical analyses in the data-scarce setting of extreme events, but its historical standard models, MSPs, are often not the most appropriate tools, especially for environmental data applications.

The rest of this article is organized as follows. In Section 2, we review MSPs and their main limitations. In Section 3, we discuss alternative modeling strategies. In Section 4, we conclude with some final remarks, and an outlook on the future of environmental extreme data science, with a particular view on advances at the interface between statistics of extremes and modern machine learning. We also list seven key recommendations moving forward.

2. Max-stable processes: a restrictive tool for the wrong problem?

The theoretical foundations underpinning MSPs start with the wish to generalize univariate extreme-value theory (Davison and Huser, Reference Davison and Huser2015) to the spatial context. Consider a sequence of independent and identically distributed random processes, $ {Y}_1\left(\boldsymbol{s}\right),{Y}_2\left(\boldsymbol{s}\right),\dots $ , defined over a spatial domain $ \mathcal{S} $ . Extreme-value theory states that under broad conditions, the only possible limits, $ Z\left(\boldsymbol{s}\right) $ , for the process of pointwise maxima, $ {Z}_n\left(\boldsymbol{s}\right)=\max \left\{{Y}_1\left(\boldsymbol{s}\right),\dots, {Y}_n\left(\boldsymbol{s}\right)\right\} $ as $ n\to \infty $ , when appropriately affinely renormalized, are MSPs. This result implies that all univariate margins of $ Z\left(\boldsymbol{s}\right) $ follow the generalized extreme-value (GEV) distribution, while all finite-dimensional margins are characterized by an extreme-value copula (Davison et al., Reference Davison, Padoan and Ribatet2012; Segers, Reference Segers2012). This asymptotic characterization (as the block size $ n $ tends to infinity) has been the principal argument for fitting max-stable models and extreme-value copulas in practice (with fixed and finite $ n $ ). While the MSP theory was established in the 1980s, their popularity started to grow with Schlather (Reference Schlather2002) who showed how to construct max-stable models with realistic-looking realizations based on de Haan’s (Reference de Haan1984) spectral representation. Padoan et al. (Reference Padoan, Ribatet and Sisson2010) later promoted their adoption in environmental applications by proposing a method of inference for spatial max-stable models based on pairwise likelihoods, and Davison et al. (Reference Davison, Padoan and Ribatet2012) further advocated their use against other natural alternatives available at the time. The spectral representation of MSPs (de Haan, Reference de Haan1984; Schlather, Reference Schlather2002) essentially states that, on the unit Fréchet scale, they can be constructed as

(1) $$ Z\left(\boldsymbol{s}\right)=\underset{i\ge 1}{\sup}\;{\xi}_i{W}_i\left(\boldsymbol{s}\right), $$

for a Poisson point process $ {\left\{{\xi}_i\right\}}_{i\ge 1} $ on $ \left(0,\infty \right) $ with intensity $ {\xi}^{-2}\hskip0.1em d\hskip0.1em \xi $ , and independent copies $ {W}_i\left(\boldsymbol{s}\right) $ of a spatial process $ W\left(\boldsymbol{s}\right) $ satisfying $ \hskip0.1em E\hskip0.1em \left[\max \left\{W\left(\boldsymbol{s}\right),0\right\}\right]=1 $ . The unit Fréchet scale is a common standard since it permits simple expression of the finite-dimensional distribution functions as $ G\left(\boldsymbol{z}\right)=\exp \left\{-V\left(\boldsymbol{z}\right)\right\} $ , with $ V $ the so-called exponent function, which is homogeneous of order $ -1 $ , meaning that $ t\times V\left(t\boldsymbol{z}\right)=V\left(\boldsymbol{z}\right) $ for any positive $ t $ and $ \boldsymbol{z} $ . This construction principle has led to various MSP models, the most popular ones being the Smith (Reference Smith1990) model, the Schlather (Reference Schlather2002) model, the extremal- $ t $ model (Opitz, Reference Opitz2013), the Brown and Resnick (Reference Brown and Resnick1977) model (Kabluchko et al., Reference Kabluchko, Schlather and de Haan2009), and the Reich and Shaby (Reference Reich and Shaby2012) model, which have been widely used in numerous applications. However, MSPs have many intrinsic limitations and practical restrictions, and we now summarize the most critical ones. These restrictions are related to the actual definition of MSPs, the form of their dependence structure, and the cost of likelihood inference and simulation algorithms.

Definition of max-stable processes: A first drawback of MSPs goes back to their basic theoretical motivation, and the implicit definition of a spatial extreme event in this framework. By construction, MSPs $ Z\left(\boldsymbol{s}\right) $ approximate the distribution of pointwise maxima $ {Z}_n\left(\boldsymbol{s}\right) $ for large $ n $ . However, while the $ {Y}_i\left(\boldsymbol{s}\right) $ processes represent the original individual (e.g., daily) spatial events, which are directly observable and may or may not be extreme, the block (e.g., yearly) maximum process $ {Z}_n\left(\boldsymbol{s}\right) $ does not correspond to a real event, unless a single individual process, say $ {Y}_j\left(\boldsymbol{s}\right) $ , is more extreme than all other processes $ {Y}_i\left(\boldsymbol{s}\right) $ , $ i\in \{1,\dots, n\}\hskip0.1em \mathrm{\backslash}\hskip0.1em \{j\} $ , simultaneously at all sites $ \boldsymbol{s} $ within the domain $ \mathcal{S} $ , such that $ {Z}_n\left(\boldsymbol{s}\right)={Y}_j\left(\boldsymbol{s}\right) $ for all $ \boldsymbol{s} $ . This situation rarely occurs in practice, especially for large domains. Therefore, in fitting an MSP to realizations of $ {Z}_n\left(\boldsymbol{s}\right) $ , we effectively model artificially created spatial extreme data that may have little to do with real observations. This is illustrated in Figure 1 for daily summer precipitation data from the United Kingdom (UK) Climate Projections (see https://www.metoffice.gov.uk/research/approach/collaboration/ukcp) over the historical period 1981–2000. An argument used to persist and still fit MSPs to extremes of individual observations (e.g., Huser and Davison, Reference Huser and Davison2014) is that they provide, in some sense, an approximation to the joint tail of $ {Y}_i\left(\boldsymbol{s}\right) $ itself. Specifically, if a vector $ \boldsymbol{Y}\sim F $ has a joint distribution $ F $ with unit Fréchet margins attracted to the max-stable distribution $ G\left(\boldsymbol{z}\right)=\exp \left\{-V\left(\boldsymbol{z}\right)\right\} $ , then we have $ F\left(n\boldsymbol{y}\right)={\left[{\left\{F\left(n\boldsymbol{y}\right)\right\}}^n\right]}^{1/n}\approx {\left\{G\left(\boldsymbol{y}\right)\right\}}^{1/n}=G\left(n\boldsymbol{y}\right) $ , where the last equality uses the homogeneity of $ V $ , and the approximation is justified when $ n $ is large. However, as is clear from this simple derivation, the max-stable distribution $ G $ only provides an accurate approximation to $ F $ when all observations are large at the same time, and it does not provide a suitable approximation in the much more common situation where only a subset of variables are extreme. This issue can be partly dealt with by censoring small observations when performing inference; nevertheless, we argue that, by design, MSPs address the “wrong problem,” and that the very statistical framework motivating their practical use is flawed.

Figure 1. Left: Observational grid (gray) of the UK Climate Projections (UKCP) daily precipitation data, showing a particular horizontal transect (red) and a $ 32\times 32 $ subregion (blue). Middle: Illustration of underlying daily random fields $ {Y}_i\left(\boldsymbol{s}\right) $ (gray) for the UKCP data over the summers 1981–2000 along the selected transect, with three of the daily fields that contribute to the pointwise annual maximum for the years 1988 (yellow), 1990 (red), and 1996 (blue). Right: Illustration of underlying random fields $ {Y}_i\left(\boldsymbol{s}\right) $ (gray) for the UKCP data over the summers 1981–2000 along the selected transect, with the pointwise annual maximum $ {Z}_n\left(\boldsymbol{s}\right) $ for the years 1988 (yellow), 1990 (red), and 1996 (blue), corresponding to the three individual extreme events shown in the middle panel. In each case, there is no $ j=1,\dots, n $ such that $ {Z}_n\left(\boldsymbol{s}\right)={Y}_j\left(\boldsymbol{s}\right) $ for all $ \boldsymbol{s} $ .

The rigidity of max-stability: MSPs possess a dependence structure that is too rigid for most environmental applications. A max-stable distribution $ G $ is one for which $ {G}^t $ (with $ {G}^t(x) $ defined as $ {\left\{G(x)\right\}}^t $ ) remains a valid distribution within the same location-scale family for all $ t>0 $ . This implies that the dependence structure of MSPs is invariant to the maximum operator; in other words, if a specific MSP is an appropriate model for annual maxima of a given variable, then the same model also provides an appropriate characterization of the dependence structure for 10-year maxima, 50-year maxima, or even 1000-year maxima. This means that, upon marginal standardization, the dependence patterns of a spatial extreme event do not change with the severity of the event, no matter how extreme it is. This rigidity often contradicts empirical findings, where extreme events are dependent but tend to become spatially more localized as they become more extreme (Huser and Wadsworth, Reference Huser and Wadsworth2019; Castro-Camilo and Huser, Reference Castro-Camilo and Huser2020; Zhong et al., Reference Zhong, Huser and Opitz2022). Very few studies which employ MSPs actually scrutinize this stability property, but diagnostic tools can be found in Gabda et al. (Reference Gabda, Towe, Wadsworth and Tawn2012) and Huser et al. (Reference Huser, Opitz and Thibaud2021), or could be adapted from available multivariate hypothesis testing tools (Bücher and Kojadinovic, Reference Bücher and Kojadinovic2016).

Models only for limited subclasses of possible tail structures: The asymptotic world in which MSPs live is also “black and white”: max-stable models are indeed always asymptotically dependent unless they are exactly independent, and they lack more nuanced representation of the important case of asymptotic independence. Asymptotic independence means that the limiting dependence structure of the normalized $ {Z}_n\left(\boldsymbol{s}\right) $ corresponds to the independence copula, yet in practice, there is almost always residual positive dependence present in $ {Z}_n\left(\boldsymbol{s}\right) $ for finite $ n $ , even when the limit would be independence. By fitting an MSP model to maxima of asymptotically independent data, one incorrectly models this residual dependence as asymptotic dependence, leading to biased extrapolation further into the tail of the distribution. Put differently, a max-stable model fitted to block maxima stemming from data with a weakening tail dependence structure is inevitably misspecified; the effect of this misspecification is that MSPs will capture an “average” strength of dependence and, therefore, tend to slightly underestimate the occurrence probability of joint extreme events at relatively low levels but potentially grossly overestimate them at high levels situated far beyond the observed range of data. Another way to define asymptotic dependence of the process $ Y\left(\boldsymbol{s}\right) $ is to consider the coefficient $ {\chi}_{\left\{i,j\right\}}(u)=\Pr \left\{Y\left({\boldsymbol{s}}_i\right)>{F}_i^{-1}(u),Y\left({\boldsymbol{s}}_j\right)>{F}_j^{-1}(u)\right\}/\left(1-u\right) $ , where $ Y\left({\boldsymbol{s}}_i\right)\sim {F}_i $ and $ Y\left({\boldsymbol{s}}_j\right)\sim {F}_j $ . Asymptotic dependence is present if this converges to a positive limit $ {\chi}_{\left\{i,j\right\}}>0 $ as $ u\to 1 $ . In other words, the joint tail decay rate is proportional to the marginal tail decay rate, and any dependence at moderate levels never vanishes completely but it remains in the limiting tail. While asymptotic dependence (or independence) is a property that is difficult to verify or test in practice, models allowing for asymptotic independence often have richer tail decay rates than MSPs, which are strongly limited due to their focus on the possible asymptotic structures rather than the subasymptotic tail behavior. This difference is crucial in practice, because the flexibility of a model in its joint tail dictates extrapolations to higher levels, and thus impacts risk assessment. Being always asymptotically dependent, MSPs will thus have a tendency to overestimate the risk of very large joint extremes. Studies advising against asymptotic dependence models for environmental risk assessment include Bortot et al. (Reference Bortot, Coles and Tawn2000) for oceanographic processes, and Opitz (Reference Opitz2016) and Dawkins and Stephenson (Reference Dawkins and Stephenson2018) for wind gusts.

Lack of flexible and physically realistic models: Some of the most popular max-stable models are also characterized by unphysical properties. The Schlather and extremal- $ t $ max-stable models, for example, are non-ergodic, which implies that they cannot approach full independence between infinitely distanced sites. With such models, spatial extreme events have a positive probability to be “infinitely wide” in extent since the $ {\chi}_{\left\{i,j\right\}} $ coefficient for $ Z\left(\boldsymbol{s}\right) $ is uniformly bounded below by a positive constant for any two locations $ {\boldsymbol{s}}_i $ and $ {\boldsymbol{s}}_j $ whatever their distance $ \parallel {\boldsymbol{s}}_i-{\boldsymbol{s}}_j\parallel $ . On the other hand, the Smith model has overly smooth realizations based on analytical spatial profiles $ {W}_i $ in (1), and the original Reich–Shaby model is a noisy version of the Smith model with artificial nonstationary artifacts, though a variant based on random basis functions producing more realistic realizations was proposed by Bopp et al. (Reference Bopp, Shaby and Huser2021). Overall, the Brown–Resnick model, which is constructed from intrinsically stationary log-Gaussian processes $ {W}_i $ in (1), seems to be the most physically reasonable one. Nevertheless, the very broad subclass of MSPs possessing a positive continuous density in all of their finite-dimensional distributions (including the Brown–Resnick model itself) shares the common drawback that conditional independence implies full independence (Papastathopoulos and Strokorb, Reference Papastathopoulos and Strokorb2016). This “entanglement effect” means that most max-stable models cannot exhibit any interesting Markov structure directly in $ Z\left(\boldsymbol{s}\right) $ , which is a major impediment to leveraging such Markov structures for efficient inference and validation of max-stable models. This also implies that physically meaningful stochastic partial differential equation models (see, e.g., Lindgren et al., Reference Lindgren, Rue and Lindström2009; Bolin and Wallin, Reference Bolin and Wallin2020; Zhang et al., Reference Zhang, Bolin, Engelke and Huser2023b), which commonly lead to conditional independencies and graphical structures when discretized, are—at least on the surface—directly incompatible with MSPs. Although conditional independence can be enforced at the deeper level of the latent variables $ {W}_i $ used to construct the MSP, or in the underlying exponent measure (Engelke and Hitz, Reference Engelke and Hitz2020; Engelke et al., Reference Engelke, Ivanovs and Strokorb2022), it will never manifest at the data level except in special cases such as max-linear models (Améndola et al., Reference Améndola, Klüppelberg, Lauritzen and Tran2022). Furthermore, given that the total vapor holding capacity of the atmosphere and the total energy in the climate system are finite, the property of asymptotic dependence that characterizes all MSPs may also be questioned and intuitively considered as “unphysical” in typical environmental applications.

Computational complexity: Finally, MSPs are also notoriously computationally cumbersome to fit and simulate from. Fully Bayesian inference in high dimensions, as well as efficient conditional simulation, are possible for the Reich–Shaby model by taking advantage of its simple hierarchical construction (see also Bopp et al., Reference Bopp, Shaby and Huser2021), but this is more the exception than the rule: for most other MSPs, likelihood-based inference is very challenging, even with advanced supercomputers (Castruccio et al., Reference Castruccio, Huser and Genton2016), because the full likelihood function contains a number of terms that grows super-exponentially fast with the number of observed locations. Therefore, alternative inference solutions are generally required. Proposed approaches include pairwise likelihoods (Padoan et al., Reference Padoan, Ribatet and Sisson2010), higher-order composite likelihoods (Genton et al., Reference Genton, Ma and Sang2011; Huser and Davison, Reference Huser and Davison2013; Huser et al., Reference Huser, Stein and Zhong2024), M-estimators (Yuen and Stoev, Reference Yuen and Stoev2014; Einmahl et al., Reference Einmahl, Kiriliouk, Krajina and Segers2016), a customized stochastic expectation–maximization algorithm (Huser et al., Reference Huser, Dombry, Ribatet and Genton2019) or its Bayesian inference counterpart (Thibaud et al., Reference Thibaud, Aalta, Cooley, Davison and Heikkinen2016; Dombry et al., Reference Dombry, Engelke and Oesting2017), distributed inference through a divide-and-conquer strategy (Hector and Reich, Reference Hector and Reich2024), or more recently neural Bayes estimators (Lenzi et al., Reference Lenzi, Bessac, Rudi and Stein2023; Sainsbury-Dale et al., Reference Sainsbury-Dale, Zammit-Mangion and Huser2024). However, except for the latter which can be trained offline, these approaches all face a delicate trade-off between computational and statistical efficiency, and they often remain quite expensive to apply in moderate-to-high dimensions (i.e., from a few dozens to a few hundreds of spatial locations depending on the method and specific setting). Likelihood-based inference can be simplified by including event times in the dataset (Stephenson and Tawn, Reference Stephenson and Tawn2005), but this can lead to bias if the number of locations is large in comparison to the number of replicates over which maxima are taken (Wadsworth, Reference Wadsworth2015). Nonetheless, the inclusion of event times starts to mimic the paradigm of peaks-over-threshold modeling, which we argue in Section 3 is a more sensible and natural approach.

The stochastic representation (1) involving an infinite number of processes over which the pointwise maximum is taken also makes simulation cumbersome (apart from the Reich–Shaby model and its variants). While various types of approximate and exact simulation algorithms have been developed for certain max-stable families (Schlather, Reference Schlather2002; Oesting et al., Reference Oesting, Kabluchko and Schlather2012; Thibaud and Opitz, Reference Thibaud and Opitz2015; Dombry et al., Reference Dombry, Engelke and Oesting2016; Oesting and Strokorb, Reference Oesting and Strokorb2022), conditional simulation remains computationally laborious (Dombry et al., Reference Dombry, Éyi-Minko and Ribatet2013; Oesting and Schlather, Reference Oesting and Schlather2013) and does not scale well with the dimension. To construct more “generic” stochastic generators for spatial extremes, it is also possible to directly exploit generative artificial-intelligence techniques from the machine learning literature, such as generative adversarial networks (see, e.g., Boulaguiem et al., Reference Boulaguiem, Zscheischler, Vignotto, van der Wiel and Engelke2022) which bypass the need to specify a parametric dependence structure for extremes at the expense of completely abandoning any known structure, or variational autoencoders (VAEs; see, e.g., Lafon et al., Reference Lafon, Naveau and Fablet2023; Zhang et al., Reference Zhang, Ma, Wikle and Huser2023a), though these recent machine-learning-based approaches are sometimes difficult to train, are always data-hungry, and are often challenging to study from a theoretical perspective, for example, in terms of the approximation quality of the trained generator, or its ability to accurately reproduce joint tail decay rates.

Discussion: Overall, the difficulties with MSPs are “built-in”: they are a direct consequence of their basic definition leading to the complex structure in (1), and the fact that they are not meant to describe extremes of the original individual events. The action of computing block maxima indeed masks information about the timings of events and temporal dependence, and specifically about co-occurrence of maxima at different spatial locations, which has implications for modeling, inference, and simulation. Max-stability arises as an “asymptotic artifact” resulting from taking the limit of block maxima as the block size $ n $ goes to infinity; in practice, however, interest often lies in the original events themselves, rather than maxima. Moreover, even when the modeling of maxima (or minima) may be desired (e.g., when assessing the survival probability of certain species over an entire region, see Thibaud et al., Reference Thibaud, Aalta, Cooley, Davison and Heikkinen2016), the effective block size is often quite moderate in most environmental data due to serial dependence and seasonality, which can in some cases create a severe mismatch between theory and practice, thus casting serious doubts on the suitability of MSPs in such a case. Using the limit model for data in this context severely constrains the form of dependence structures that can be obtained, in a way that is unrealistic in most environmental applications, while simultaneously complicating statistical inference significantly. Simply put, we think that MSPs are often “not worth the trouble,” as the benefits they bring do not counterbalance their other limitations. In the next section, we summarize more recent modeling strategies that bypass several of the above roadblocks by going beyond max-stability, and that are thus better suited for the spatial modeling of extremes.

3. Solutions beyond max-stability

In Section 2, we argued strongly against the continued use of MSPs in all but exceptional circumstances. Here we outline the breadth of alternative models for spatially indexed environmental data, highlighting their relative merits and potential drawbacks.

Peaks-over-threshold versus maxima: Unlike approaches based on block maxima, peaks-over-threshold approaches focus on modeling extremes of the original spatial events that effectively took place; in the context of Figure 1, this includes the fields highlighted in color in the center panel, for example. Therefore, they are not only more meaningful from a practical perspective, but they also offer ways to customize the definition of a “spatial extreme event” to the specific problem of interest. In particular, peaks-over-threshold approaches do not only provide valid probability approximations when all variables are simultaneously large (which rarely occurs in practice), but they can be adapted to events where only a subset of variables are extreme. This is illustrated in Figure 2 for the two main peaks-over-threshold approaches, namely the (generalized) Pareto process (Ferreira and de Haan, Reference Ferreira and de Haan2014; Dombry and Ribatet, Reference Dombry and Ribatet2015) and the spatial conditional extremes process (Wadsworth and Tawn, Reference Wadsworth and Tawn2022), which are justified by different asymptotic paradigms.

Figure 2. For a pair of variables $ \left\{Y\left({\boldsymbol{s}}_1\right),Y\left({\boldsymbol{s}}_2\right)\right\} $ from various extreme-value models, illustration of the domains (colored areas) over which each model is meant to provide accurate tail probability approximations. Left: max-stable process; Middle: Pareto process for various aggregation functionals, $ r $ ; and Right: Spatial conditional extremes process, for each conditioning variable.

Pareto processes (Ferreira and de Haan, Reference Ferreira and de Haan2014; Dombry and Ribatet, Reference Dombry and Ribatet2015; de Fondeville and Davison, Reference de Fondeville and Davison2018, Reference de Fondeville and Davison2022) are usually viewed as the peaks-over-threshold analogue of MSPs. Both classes of models are grounded in the theory of functional regular variation, but Pareto processes are, in principle, applicable to all data which are in some sense extreme. As a result, they possess much simpler likelihoods than max-stable models, and make a more efficient use of data. Slightly different formulations of Pareto processes arise depending on the so-called aggregation (or risk) functional, usually denoted by $ r\left(\cdot \right) $ , used to define a functional extreme event (such as the spatial maximum); see, for example, de Fondeville and Davison (Reference de Fondeville and Davison2018, Reference de Fondeville and Davison2022) for more details.

The spatial conditional extremes model (Wadsworth and Tawn, Reference Wadsworth and Tawn2022), on the other hand, is a different peaks-over-threshold approach that only applies to the case where a single variable (at a fixed chosen location) exceeds a high threshold; however, it offers improved tail flexibility and other benefits compared to Pareto processes, as discussed below.

Improved tail flexibility with “subasymptotic” models: While Pareto processes have simpler likelihoods and permit the use of more data than MSPs, they are unfortunately seldom appropriate models in practice when used for tail extrapolation and estimation of small probabilities associated with environmental extreme events that lie far in the upper joint tail. This is because their threshold-stability property, analogous to the max-stability property of MSPs, is rarely satisfied by the environmental data available at observed levels of extremity. The threshold-stability property will never be satisfied by data that exhibit asymptotic independence, and represents a very strong additional assumption over the presence of asymptotic dependence. Let $ {Y}_j=Y\left({\boldsymbol{s}}_j\right)\sim {F}_j,j\in D=\left\{1,\dots, d\right\} $ , represent a spatial process observed at $ d $ locations. A simple way to assess whether this property may hold is to consider the quantity $ {\chi}_D(u) $ , defined similarly to $ {\chi}_{\left\{i,j\right\}}(u) $ in Section 2, as

(2) $$ {\chi}_D(u)=\Pr \left\{{Y}_1>{F}_1^{-1}(u),\dots, {Y}_d>{F}_d^{-1}(u)\right\}/\left(1-u\right),\hskip2em u\in \left(0,1\right). $$

If a Pareto process is applicable, then for any $ d $ there always exist $ d $ distinct locations such that $ {\chi}_D(u)\equiv {\chi}_D>0 $ for all sufficiently large $ u<1 $ . Our collective experience is that, in contrast, it is almost always the case that $ {\chi}_D(u) $ decreases as $ u\to 1 $ , representing a weakening of spatial dependence at extreme levels. This is illustrated in Figure 3 for datasets of precipitation (using the UKCP data from Figure 1), as well as air temperature, sea surface temperature, and windspeed.

Figure 3. Examples of estimates of $ {\chi}_D(u) $ for four environmental datasets. Solid black lines represent point estimates, whereas dashed black lines are approximate $ 95\% $ pointwise confidence intervals based on block bootstrapped estimates. From left to right: UKCP daily precipitation data from Figure 1 at $ d=1024 $ sites (within the blue subregion shown in the left panel of Figure 1), with the model fit in blue (based on the Huser and Wadsworth, Reference Huser and Wadsworth2019 model fitted using a pretrained neural Bayes estimator); gridded conditionally simulated E-OBS Irish summer temperature data at $ d=178 $ locations; (detrended) Red Sea surface temperature data at $ d=144 $ locations in the Gulf of Aqaba; daily mean windspeed at $ d=7 $ locations in the Vaucluse “département” in France.

To deal with this deficiency, some authors—including ourselves—have advocated the use of what are often termed subasymptotic models. This terminology is used because such models can be seen to bridge the gap between the world of finite-level data and the “mythical land” of asymptopia, where models such as max-stable and Pareto processes should be applicable. However, it is perhaps a misnomer, since if data exhibit asymptotic independence, then Pareto processes are not well defined for domains $ \mathcal{S} $ composed of an infinite number of locations, and max-stable models offer no benefits. Therefore, aside from conditional extremes models, alternatives such as these are currently the only approach to performing useful extreme value inference.

Broadly, these subasymptotic models are designed to represent flexible forms of tail decay, permitting extrapolation from observed levels to more extreme levels. A typical consideration is that probabilities such as $ \Pr \left\{{Y}_1>{F}_1^{-1}(u),\dots, {Y}_d>{F}_d^{-1}(u)\right\} $ or $ \Pr \left\{{Y}_i>{F}_i^{-1}(u),{Y}_j>{F}_j^{-1}(u)\right\} $ should have flexible forms as $ u\to 1 $ , and these forms fit with the assumptions of regular variation and/or hidden regular variation (Ledford and Tawn, Reference Ledford and Tawn1996, Reference Ledford and Tawn1997; Resnick, Reference Resnick2002) to provide some theoretical grounding. Where modeling componentwise maxima remains relevant, we can relax the max-stability property to obtain more realistic dependence structures with constructions such as max-infinitely divisible processes (Huser et al., Reference Huser, Opitz and Thibaud2021) or max-mixtures (Wadsworth and Tawn, Reference Wadsworth and Tawn2012; Bacro et al., Reference Bacro, Gaetan and Toulemonde2016); however, working with such extensions of MSPs can further exacerbate the issues of model interpretation and computational cost outlined before.

To date, most subasymptotic models have a random scale construction: extreme data are modeled using the spatial copula of the process $ X\left(\boldsymbol{s}\right)= RW\left(\boldsymbol{s}\right) $ , where the scalar random variable $ R>0 $ is independent of the spatial process $ W $ . The relative tail heaviness of $ R $ and $ W $ , together with the dependence structure of $ W $ , provides a rich array of dependence possibilities (see Engelke et al. (Reference Engelke, Opitz and Wadsworth2019) for an almost-exhaustive description). Examples of this class of models include Opitz (Reference Opitz2016), Huser et al. (Reference Huser, Opitz and Thibaud2017), and Huser and Wadsworth (Reference Huser and Wadsworth2019). With appropriate specification of $ R $ and $ W $ , Pareto processes also possess a random scale representation (Ferreira and de Haan, Reference Ferreira and de Haan2014). Limitations of simple random scale models include their inability to capture complex dependence structures observed over large domains and a gradual decay of positive dependence to independence at large distances (because of the spatially constant $ R $ variable, which makes them non-ergodic), as well as the incorporation of nonstationarity. These limitations also make it difficult to adapt such models to the case of spatiotemporal data. Hazra et al. (Reference Hazra, Huser and Bolin2024) recently attempted to address these issues by proposing a Gaussian scale mixture model extension, which can capture short-range asymptotic dependence, mid-range asymptotic independence, and long-range exact independence, by replacing $ R $ with a suitable spatial process $ R\left(\boldsymbol{s}\right) $ (see also Krupskii and Huser (Reference Krupskii and Huser2022)).

Conditional spatial extremes model: The recently introduced conditional spatial extremes model (Wadsworth and Tawn, Reference Wadsworth and Tawn2022) is another class of models with flexible tail structures, which has been adapted to the spatiotemporal case in Simpson and Wadsworth (Reference Simpson and Wadsworth2021). These models are based on the assumption of a limiting process for suitably normalized $ Y\left(\boldsymbol{s}\right) $ , conditional upon the event $ Y\left({\boldsymbol{s}}_0\right)>t $ , for some conditioning location $ {\boldsymbol{s}}_0 $ . The formulation permits modeling of both asymptotically dependent and asymptotically independent data, while the most commonly used version of the likelihood is relatively simple. The “price” for these two major gains is the necessity of conditioning on a particular location being large, rather than, say, any location being large, though Wadsworth and Tawn (Reference Wadsworth and Tawn2022) outline ways in which this can be mitigated if it is an issue.

Considerations for inference: As mentioned, the fitting of peaks-over-threshold models via likelihood is much simpler than max-stable models. However, to avoid bias in estimation of the tail properties, it is necessary to fit the models only to extreme data. For generalized Pareto process and subasymptotic models, this is usually done using an appropriately censored likelihood (see, e.g., Huser and Wadsworth, Reference Huser and Wadsworth2022) to avoid influence of small values. Censoring can be done in different ways, either by applying a risk functional such as the maximum to the observation vector and fully censoring or discarding the vectors for which the risk is below a fixed high threshold, or using an approach focusing on marginal exceedances, without the need to define a global risk functional, where any component of the vector that falls below its marginal threshold is censored. Especially in the latter case, this act of censoring makes likelihood evaluation significantly more computationally intensive as it can require calculating numerous potentially high-dimensional integrals of the density function. Therefore, this typically limits likelihood-based inference for such models to numbers of observation locations less than about $ 30 $ . One way to circumvent this difficulty is to add a measurement-error (nugget effect) term and then use a Markov chain Monte Carlo algorithm to approximate these multifold integrals numerically (Morris et al., Reference Morris, Reich, Thibaud and Cooley2017; Zhang et al., Reference Zhang, Shaby and Wadsworth2022a; Hazra et al., Reference Hazra, Huser and Bolin2024). Another possibility is to use a weighted gradient scoring approach for inference, as advocated by de Fondeville and Davison (Reference de Fondeville and Davison2018), which mimics smooth censoring, while remaining relatively cheap computationally.

The spatial conditional extremes model can usually be fitted to data from hundreds of observation locations since its common variants take the form of a nonstationary and marginally transformed Gaussian process. The different nature of the asymptotics means that an extreme event is one that is extreme at the conditioning location $ {\boldsymbol{s}}_0 $ , but that may be large or small elsewhere, which (often) avoids the need for censoring in the likelihood. Simpson et al. (Reference Simpson, Opitz and Wadsworth2023) outline the extensions to thousands of dimensions via a slight change in formulation and use of Gaussian Markov random fields, which have a sparse probabilistic structure, constructed from stochastic partial differential equation models (see also Vandeskog et al. (Reference Vandeskog, Martino and Huser2024) for related methodology).

Techniques from the machine learning literature have recently permeated the world of spatial extreme value modeling: Sainsbury-Dale et al. (Reference Sainsbury-Dale, Zammit-Mangion and Huser2024), Sainsbury-Dale et al. (Reference Sainsbury-Dale, Richards, Zammit-Mangion and Huser2024), and Walchessen et al. (Reference Walchessen, Lenzi and Kuusela2024) describe the use of neural networks (NNs) for likelihood-free inference on the parameters of spatial models whose likelihoods are costly to evaluate because of high dimensionality. Richards et al. (Reference Richards, Sainsbury-Dale, Zammit-Mangion and Huser2024) extend these ideas to incorporate censoring of small values, a key consideration for extremes. These NN-based estimators are “amortized” (Zammit-Mangion et al., Reference Zammit-Mangion, Sainsbury-Dale and Huser2025), meaning that all computational effort is encapsulated in a training period, after which estimates can be obtained in a fraction of a second; this is in comparison to hours or sometimes days for likelihood-based estimates. To illustrate the potential of such amortized inference approaches, Figure 4 displays the estimated parameters obtained by fitting the copula associated with the anisotropic Huser and Wadsworth (Reference Huser and Wadsworth2019) model to the UKCP precipitation data from Figure 1 locally in all subgrids of size $ 32\times 32 $ within the study domain; these estimates were obtained by reusing the pretrained censored neural Bayes estimators from Richards et al. (Reference Richards, Sainsbury-Dale, Zammit-Mangion and Huser2024), thus bypassing the expensive training time. In this implementation, data below a marginal 0.9-quantile are censored. Each estimate was obtained in about 38 milliseconds, which amounts to less than 20 minutes overall for the entire domain (requiring about $ 30\hskip0.1em 000 $ fits to censored peaks-over-threshold data in dimension $ d=1024 $ ). While parameter estimates are quite variable and model fits would need to be more carefully validated using advanced diagnostics, the results make a degree of sense at first sight, as the parameter estimates are partially aligned with topography and the coastline; in particular, larger range parameters can be observed over the sea (e.g., over the Channel), as one would expect. However, the estimated shape parameters controlling the asymptotic dependence type are somewhat too large for precipitation data. This may be caused by a number of factors, including out-of-the-box model parameter estimation and use of relatively large subregions. Nevertheless, the model-based $ {\chi}_D(u) $ curve displayed in Figure 3 shows a decent fit for the particular highlighted subregion over the Channel, for which that parameter estimate is quite small (about $ 0.39 $ ). Overall, we emphasize that these results only serve to showcase the potential capability of flexible modeling strategies and modern inference approaches; they should thus not be over-interpreted nor used in practical risk assessment studies without further model validation.

Figure 4. Three of the five estimated parameters obtained from local fits of the anisotropic Huser and Wadsworth (Reference Huser and Wadsworth2019) copula model in all subgrids of size $ 32\times 32 $ ( $ d=1024 $ ) within the study domain for the UKCP precipitation example from Figure 1. The plots display, for each local subgrid, the estimated range parameter (left), smoothness parameter (middle), and shape parameter (right) controlling the asymptotic dependence type; the two anisotropy parameters (stretch and rotation) are not shown for brevity. All of these estimates were obtained in a few minutes in total by reusing the pretrained censored neural Bayes estimators from Richards et al. (Reference Richards, Sainsbury-Dale, Zammit-Mangion and Huser2024).

In practical use, a major consideration is the availability of robust and user-friendly code for implementation. This is where alternative models are playing catch-up to MSPs, inference for which can be done in the well-established SpatialExtremes package in R, based on pairwise likelihood. For Pareto processes, the mvPot package (de Fondeville and Belzile, Reference de Fondeville and Belzile2023) offers useful options. Belzile et al. (Reference Belzile, Dutang, Northrop and Opitz2023) provide a recent tour through extreme-value software, including for spatial extremes. They highlight the general lack of packages for implementing modern spatial extremes approaches, although point to supplementary material and unpublished packages in several cases. Recently, the NeuralEstimators package has filled a gap by providing user-friendly functions to facilitate the construction and training of neural Bayes estimators that can, in particular, handle spatial censored peaks-over-threshold data from general spatial extremes models (see also Zammit-Mangion et al. (Reference Zammit-Mangion, Sainsbury-Dale and Huser2025)), who summarize available software for general amortized inference methods, including neural posterior and full likelihood approximations. Nonetheless, further efforts are needed to address the relative scarcity of software to facilitate uptake of novel modeling approaches.

Discussion: It is almost always the case that interest lies in understanding the extremes of original events, thus it makes sense to model their extremal behavior directly. This usually leads to simpler inference and more flexible classes of models. One reason sometimes cited for preferring a block maximum approach is that the resulting $ {Z}_n\left(\boldsymbol{s}\right) $ should be independent in time, as the environmental events comprising the block maxima will fall in different years. Although the modeling of original events does lead to more temporal dependence in extremes, it is typically preferable to ignore this when fitting via likelihood and adjust the uncertainty of parameter estimates post hoc (Fawcett and Walshaw, Reference Fawcett and Walshaw2007).

Recent developments in machine-learning-based inference begin to circumvent some computational difficulties for inference with both peaks-over-threshold models and MSPs alike. Nonetheless, the conceptual drawbacks of MSPs remain, together with potential simulation difficulties. These recent machine-learning-based developments have largely thus far focused on demonstrating how to fit existing models in previously unfeasible scenarios. An exciting possibility is their potential to facilitate generation of flexible new models with desirable tail properties: it is often easier to write down a stochastic representation for a flexible model than to derive (and evaluate) its likelihood function. For example, this could permit specification of and inference on models exhibiting asymptotic dependence at short range, with asymptotic independence and exact independence at longer range. The modeling of spatiotemporal extreme dependence, thus far tackled in relatively few cases, may also be facilitated via this route.

4. Conclusion

The probability theory supporting MSPs is rich and should not be despised, as it has contributed significantly to extreme-value theory and led to important advances as well as a better understanding of extremes in stochastic processes; we thus do not question the rigor or historical developments of this theory itself, but rather its relevance in concrete environmental applications. While the systematic use of MSPs in environmental studies needs to be gradually phased out, we also do not categorically advocate against the use of asymptotically justified models. Asymptotic theory can indeed be very useful provided the asymptotic paradigm directly responds to a concrete need posed by the applied scientific problem at hand. In particular, peaks-over-threshold approaches should be prioritized over block maxima approaches whenever possible. Pareto processes, and the more recent spatial conditional extremes model, are two possible frameworks that stem from more helpful asymptotic regimes. However, while asymptotic guarantees are in principle desired for tail extrapolation, we also stress that they should never supersede careful model checking. Oftentimes, more pragmatic solutions (e.g., certain types of random location and scale constructions, or physics-informed models) that display improved subasymptotic tail flexibility, physically more realistic properties, or computationally more affordable inference, can be better suited to address the specific scientific problem and should thus not be disregarded. When risk assessment of future extreme events (within a reasonable time frame) is of interest, the incorporation of nonstationary climate change signals (e.g., from climate model outputs under various greenhouse gas emission scenarios) and the development of asymptotic independence models with a flexible joint tail decay rate is more important than focusing on accurately identifying the asymptotic dependence class. Furthermore, when models are intended to be used in operational settings, some accuracy must sometimes be traded for speed of inference; in this context, geostatistical models constructed from Gaussian building blocks and/or based on sparse probabilistic structures, and fast approximate inference and simulation techniques (e.g., based on deep learning), can be particularly helpful. Nonparametric extreme-value frameworks for tail extrapolation that rely on broad regular variation (or hidden regular variation) assumptions, which are model-free and thus likelihood-free, are also promising approaches when the dataset is massive and the inference target is too complex to be captured by simple statistical models (Oesting and Huser, Reference Oesting and Huser2022).

We also mention that if interest lies only in the marginal properties of the estimated distributions (e.g., pointwise return-level maps), and not event-level dependence, then alternative simpler approaches are likely warranted. For example, the spatial dependence between parameters of a GEV distribution for maxima, or of a generalized Pareto distribution for threshold exceedances, may be well captured by a model that assumes conditional independence of observations given parameters, which are themselves modeled via Gaussian random fields (see Sang and Gelfand (Reference Sang and Gelfand2010), Geirsson et al. (Reference Geirsson, Hrafnkelsson and Simpson2015), Jalbert et al. (Reference Jalbert, Favre, Bélisle and Angers2017), and Jóhannesson et al. (Reference Jóhannesson, Siegert, Huser, Bakka and Hrafnkelsson2022) for GEV-focused and Cooley et al. (Reference Cooley, Nychka and Naveau2007) and Opitz et al. (Reference Opitz, Huser, Bakka and Rue2018) for generalized-Pareto-focused approaches).

An interesting direction for future research is to develop models and methods deeply rooted in extreme-value theory that harness the power and computational efficiency of advanced machine learning (e.g., deep extreme quantile regression, deep nonstationary spatial models, neural inference, and generative approaches) to better address problems in environmental data science; machine learning tools can indeed overcome limitations of classical statistical tools designed for estimating few model parameters from datasets of only moderate size. Wikle and Zammit-Mangion (Reference Wikle and Zammit-Mangion2023) review statistical deep learning methods in classical spatial statistics; with some suitable adjustments, these methods could potentially be adapted to the extreme-value context.

In the future, it would also be interesting to develop unified spatial frameworks for tractably modeling the full data range in a way that offers high flexibility in the lower joint tail, bulk, and upper joint tail, in the same vein as Naveau et al. (Reference Naveau, Huser, Ribereau and Hannart2016) in the univariate context. For example, normal mean–variance mixtures allow for both asymptotic dependence and independence as well as for controlling asymmetry in lower- and upper-tail dependence (Zhang et al., Reference Zhang, Huser, Opitz and Wadsworth2022b). Another related future line of research is to build upon recent advances in the geometric approach for multivariate extremes (Nolde, Reference Nolde2014; Nolde and Wadsworth, Reference Nolde and Wadsworth2022). The great benefit of this new asymptotic framework is that it provides a unified representation of multivariate extremes approaches (Wadsworth and Campbell, Reference Wadsworth and Campbell2024) and a flexible strategy for the joint modeling of extremes in “all directions”—including the lower and upper tails (Papastathopoulos et al., Reference Papastathopoulos, de Monte, Campbell and Rue2023; see also Mackay and Jonathan (Reference Mackay and Jonathan2023) and Murphy-Barltrop et al. (Reference Murphy-Barltrop, Mackay and Jonathan2024) for related methodology). Extending this approach to the spatial context and to capturing both bulk and tail behaviors simultaneously is an interesting area of investigation. Capturing the full range of values and their dependence relationships is particularly important for modeling compound extremes (AghaKouchak et al., Reference AghaKouchak, Chiang, Huning, Love, Mallakpour, Mazdiyasni, Moftakhari, Papalexiou, Ragno and Sadegh2020; Zscheischler et al., Reference Zscheischler, Martius, Westra, Bevacqua, Raymond and Horton2020) defined as the combination of conditions (over space, time, or several variables) leading to extreme impacts but where the contributing events are not necessarily extreme individually. With spatial data increasingly available on regular grids at relatively high resolution, such as climate model output and remote sensing data, the study of geometric properties of the pixellated exceedance sets at increasingly high threshold levels provides another line of research toward better understanding and modeling of the behavior of joint extremes and could offer methods that scale well to very large data volumes (Cotsakis et al., Reference Cotsakis, Di Bernardino and Opitz2023).

Although highly promising, neural likelihood-free parameter inference, mentioned in both Sections 2 and 3, still requires several developments to replace traditional likelihood-based inference. For example, theoretical guarantees on the accuracy of neural estimators in terms of the chosen NN architecture and number of training samples remain to be established. Furthermore, while uncertainty quantification can be handled via the bootstrap, for example, more needs to be understood about its properties. The effects of model misspecification also need to be studied more comprehensively: with likelihood-based inference, it is known that parameter point estimates are robust to certain types of misspecification, and that adjustments can be made for properly handling uncertainty (White, Reference White1982; Chandler and Bate, Reference Chandler and Bate2007). Model selection and comparison techniques, which have often relied on log-likelihood values, also require attention. Finally, the estimation of a relatively large number of parameters in flexible models for spatial tail dependence remains challenging. While some of these challenges can be addressed by adopting some recent amortized fully Bayes neural methods, such as BayesFlow (Radev et al., Reference Radev, Mertens, Voss, Ardizzone and Köthe2020, Reference Radev, Schmitt, Schumacher, Elsemüller, Pratz, Schälte, Köthe and Bürkner2023b) or JANA (Radev et al., Reference Radev, Schmitt, Pratz, Picchini, Koethe and Buerkner2023a), their use in extreme-value applications remains to be carefully investigated.

Our main recommendations moving forward can be summarized with the acronym MACHINE, which highlights seven “golden rules” for extreme-value analyses and the special role that machine learning is likely to play in the future of environmental extreme data science:

  • Move away from MSPs;

  • Adopt a peaks-over-threshold approach or a unified bulk-tail model whenever possible;

  • Capture subasymptotic behavior rather than focusing on the asymptotic structure and the dichotomy between asymptotic dependence and independence;

  • Harness specialized models with a sparse and numerically convenient probabilistic structure for speed and interpretation;

  • Incorporate physics/climate knowledge into probability models as much as possible;

  • Never-prioritize asymptotic justification over careful model checking;

  • Embrace modern machine learning and artificial intelligence methods to enhance the modeling and inference of extreme events in complex settings.

“Starting the MACHINE” (or keeping it on) is key, in our opinion, to remain relevant and maximize the impact of extreme-value theory across statistics and applied environmental sciences, especially in the face of the escalating challenges posed by today’s world of extreme climate-driven events.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/eds.2024.54.

Acknowledgments

The authors warmly thank the Environmental Data Science (EDS) editorial board and D. Nychka from the advisory board for inviting them to write this opinion piece. The authors also thank Jordan Richards for his help to fit models in the UKCP precipitation data illustration using neural Bayes estimators he had trained previously in the paper Richards et al. (Reference Richards, Sainsbury-Dale, Zammit-Mangion and Huser2024).

Author contribution

Conceptualization: J.L.W., T.O., R.H.; Writing – original draft: J.L.W., T.O., R.H.; Writing – review & editing: J.L.W., T.O., R.H.; Funding acquisition: R.H. All authors contributed equally to this work.

Competing interest

The authors have no competing interests to declare.

Data availability statement

Data to reproduce Figures 1 and 3 can be freely obtained upon reasonable request from the authors or through the following websites: https://www.metoffice.gov.uk/research/approach/collaboration/ukcp (UKCP precipitation data); https://data.marine.copernicus.eu/product/SST_GLO_SST_L4_NRT_OBSERVATIONS_010_001/description (Red Sea surface temperature data); https://www.ecad.eu/download/ensembles/download.php (E-OBS Irish temperature data); and https://meteo.data.gouv.fr/form (French windspeed data).

Funding statement

This publication is based upon work supported by the King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) under Award No. OSR-CRG2020-4394.

References

AghaKouchak, A, Chiang, F, Huning, LS, Love, CA, Mallakpour, I, Mazdiyasni, O, Moftakhari, H, Papalexiou, SM, Ragno, E and Sadegh, M (2020) Climate extremes and compound hazards in a warming world. Annual Review of Earth and Planetary Sciences 48, 519548.CrossRefGoogle Scholar
Améndola, C, Klüppelberg, C, Lauritzen, S and Tran, NM (2022) Conditional independence in max-linear Bayesian networks. The Annals of Applied Probability 32, 145.CrossRefGoogle Scholar
Bacro, J-N, Gaetan, C and Toulemonde, G (2016) A flexible dependence model for spatial extremes. Journal of Statistical Planning and Inference 172, 3652.CrossRefGoogle Scholar
Belzile, LR, Dutang, C, Northrop, PJ and Opitz, T (2023) A modeler’s guide to extreme value software. Extremes 26, 595638.CrossRefGoogle Scholar
Bolin, D and Wallin, J (2020) Multivariate type g matérn stochastic partial differential equation random fields. Journal of the Royal Statistical Society: Series B 82, 215239.CrossRefGoogle Scholar
Bopp, G, Shaby, B and Huser, R (2021) A hierarchical max-infinitely divisible spatial model for extreme precipitation. Journal of the American Statistical Association 116, 93106.CrossRefGoogle Scholar
Bortot, P, Coles, S and Tawn, JA (2000) The multivariate Gaussian tail model: An application to oceanographic data. Journal of the Royal Statistical Society: Series C (Applied Statistics) 49, 31049.Google Scholar
Boulaguiem, Y, Zscheischler, J, Vignotto, E, van der Wiel, K and Engelke, S (2022) Modeling and simulating spatial extremes by combining extreme value theory with generative adversarial networks. Environmental Data Science 1, e5.CrossRefGoogle Scholar
Brown, BM and Resnick, SI (1977) Extreme values of independent stochastic processes. Journal of Applied Probability 14, 732739.CrossRefGoogle Scholar
Bücher, A and Kojadinovic, I (2016) An overview of nonparametric tests of extreme-value dependence and of some related statistical procedures. In Extreme Value Modeling and Risk Analysis: Methods and Applications, pp. 377398. CRC Press.Google Scholar
Castro-Camilo, D and Huser, R (2020) Local likelihood estimation of complex tail dependence structures, applied to us precipitation extremes. Journal of the American Statistical Association 115, 10371054.CrossRefGoogle Scholar
Castro-Camilo, D, Huser, R and Rue, H (2022) Practical strategies for generalized extreme value-based regression models for extremes. Environmetrics 33, e2742.CrossRefGoogle Scholar
Castruccio, S, Huser, R and Genton, MG (2016) High-order composite likelihood inference for max-stable distributions and processes. Journal of Computational and Graphical Statistics 25, 12121229.CrossRefGoogle Scholar
Chandler, RE and Bate, S (2007) Inference for clustered data using the independence loglikelihood. Biometrika 94, 167183.CrossRefGoogle Scholar
Cooley, D, Nychka, D and Naveau, P (2007) Bayesian spatial modeling of extreme precipitation return levels. Journal of the American Statistical Association 102, 824840.CrossRefGoogle Scholar
Cotsakis, R, Di Bernardino, E and Opitz, T (2023) A local statistic for the spatial extent of extreme threshold exceedances. arXiv preprint arXiv:2310.09075.Google Scholar
Danabasoglu, G, Lamarque, J-F, Bacmeister, J, Bailey, D A, DuVivier, A K, Edwards, J, Emmons, L K, Fasullo, J, Garcia, R, Gettelman, A, Hannay, C, Holland, M M, Large, W G, Lauritzen, P H, Lawrence, D M, Lenaerts, J T M, Lindsay, K, Lipscomb, W H, Mills, M J, Neale, R, Oleson, K W, Otto-Bliesner, B, Phillips, A S, Sacks, W, Tilmes, S, van Kampenhout, L, Vertenstein, M, Bertini, A, Dennis, J, Deser, C, Fischer, C, Fox-Kemper, B, Kay, J E, Kinnison, D, Kushner, P J, Larson, V E, Long, M C, Mickelson, S, Moore, J K, Nienhouse, E, Polvani, L, Rasch, P J and Strand, W G (2020) The community earth system model version 2 (CESM2). Journal of Advances in Modeling Earth Systems 12, e2019MS001916.CrossRefGoogle Scholar
Davison, AC and Huser, R (2015) Statistics of extremes. Annual Review of Statistics and Its Application 2, 203235.CrossRefGoogle Scholar
Davison, A C, Huser, R and Thibaud, E (2019) Spatial extremes. In Gelfand, AE, Fuentes, M, Hoeting, JA and Smith, RL (eds.), Handbook of Environmental and Ecological Statistics. CRC Press, pp. 711744.CrossRefGoogle Scholar
Davison, AC, Padoan, SA and Ribatet, M (2012) Statistical modeling of spatial extremes. Statistical Science 27, 161186.CrossRefGoogle Scholar
Dawkins, LC and Stephenson, DB (2018) Quantification of extremal dependence in spatial natural hazard footprints: Independence of windstorm gust speeds and its impact on aggregate losses. Natural Hazards and Earth System Sciences 18, 29332949.CrossRefGoogle Scholar
de Fondeville, R and Belzile, L (2023) mvPot: Multivariate Peaks-Over-Threshold Modelling for Spatial Extreme Events R package version 0.1.6.Google Scholar
Dombry, C, Engelke, S and Oesting, M (2016) Exact simulation of max-stable processes. Biometrika 103, 303317.CrossRefGoogle ScholarPubMed
Dombry, C, Engelke, S and Oesting, M (2017) Bayesian inference for multivariate extreme value distributions. Electronic Journal of Statistics 11, 48134844.CrossRefGoogle Scholar
Dombry, C, Éyi-Minko, F and Ribatet, M (2013) Conditional simulation of max-stable processes. Biometrika 100, 111124.CrossRefGoogle Scholar
Dombry, C and Ribatet, M (2015) Functional regular variations, pareto processes and peaks over threshold. Statistics and Its Interface 8, 917.CrossRefGoogle Scholar
Donoho, D (2017) 50 years of data science. Journal of Computational and Graphical Statistics 26, 745766.CrossRefGoogle Scholar
Einmahl, J H J, Kiriliouk, A, Krajina, A and Segers, J (2016) An $ m $ -estimator of spatial tail dependence. Journal of the Royal Statistical Society: Series B 78, 275298.CrossRefGoogle Scholar
Engelke, S and Hitz, A (2020) Graphical models for extremes (with discussion). Journal of the Royal Statistical Society: Series B 82, 871932.CrossRefGoogle Scholar
Engelke, S, Ivanovs, J and Strokorb, K (2022) Graphical models for infinite measures with applications to extremes and l’evy processes. arXiv preprint arXiv:2211.15769.Google Scholar
Engelke, S, Opitz, T and Wadsworth, JL (2019) Extremal dependence of random scale constructions. Extremes 22, 623666.CrossRefGoogle Scholar
Fawcett, L and Walshaw, D (2007) Improved estimation for temporally clustered extremes. Environmetrics: The official journal of the International Environmetrics Society 18, 173188.CrossRefGoogle Scholar
Ferreira, A and de Haan, L (2014) The generalized pareto process; with a view towards application and simulation. Bernoulli 20, 17171737.CrossRefGoogle Scholar
de Fondeville, R and Davison, AC (2018) High-dimensional peaks-over-threshold inference. Biometrika 105, 575592.CrossRefGoogle Scholar
de Fondeville, R and Davison, AC (2022) Functional peaks-over-threshold analysis. Journal of the Royal Statistical Society Series B: Statistical Methodology 84, 13921422.CrossRefGoogle Scholar
Gabda, D, Towe, R, Wadsworth, JL and Tawn, JA (2012) Discussion of “statistical modeling of spatial extremes”’ by A.C. Davison, S.A. Padoan and M. Ribatet. Statistical Science 27, 189192.CrossRefGoogle Scholar
Geirsson, ÓP, Hrafnkelsson, B and Simpson, D (2015) Computationally efficient spatial modeling of annual maximum 24-h precipitation on a fine grid. Environmetrics 26, 339353.CrossRefGoogle Scholar
Genton, MG, Ma, Y and Sang, H (2011) On the likelihood function of gaussian max-stable processes. Biometrika 98, 481488.CrossRefGoogle Scholar
de Haan, L (1984) A spectral representation for max-stable processes. The Annals of Probability 12, 11941204.Google Scholar
Hazra, A, Huser, R and Bolin, D (2024) Efficient modeling of spatial extremes over large geographical domains. Journal of Computational and Graphical Statistics, to appear.CrossRefGoogle Scholar
Hector, EC and Reich, BJ (2024) Distributed inference for spatial extremes modeling in high dimensions. Journal of the American Statistical Association 119, 12971308.CrossRefGoogle ScholarPubMed
Hersbach, H, Bell, B, Berrisford, P, Hirahara, S, Horányi, A, Muñoz-Sabater, J, Nicolas, J, Peubey, C, Radu, R, Schepers, D, Simmons, A, Soci, C, Abdalla, S, Abellan, X, Balsamo, G, Bechtold, P, Biavati, G, Bidlot, J, Bonavita, M, De Chiara, G, Dahlgren, P, Dee, D, Diamantakis, M, Dragani, R, Flemming, J, Forbes, R, Fuentes, M, Geer, A, Haimberger, L, Healy, S, Hogan, RJ, Hólm, E, Janisková, M, Keeley, S, Laloyaux, P, Lopez, P, Lupu, C, Radnoti, G, de Rosnay, P, Rozum, I, Vamborg, F, Villaume, S and Thépaut, J-N (2020) The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society 146, 19992049.CrossRefGoogle Scholar
Huang, WK, Monahan, AH and Zwiers, FW (2021) Estimating concurrent climate extremes: A conditional approach. Weather and Climate Extremes 33, 100332.CrossRefGoogle Scholar
Huser, R and Davison, AC (2013) Composite likelihood estimation for the Brown–Resnick process. Biometrika 100, 511518.CrossRefGoogle Scholar
Huser, R and Davison, AC (2014) Space-time modelling of extreme events. Journal of the Royal Statistical Society: Series B 76, 439461.CrossRefGoogle Scholar
Huser, R, Dombry, C, Ribatet, M and Genton, MG (2019) Full likelihood inference for max-stable data. Stat 8, e218.CrossRefGoogle Scholar
Huser, R, Opitz, T and Thibaud, E (2017) Bridging asymptotic independence and dependence in spatial extremes using Gaussian scale mixtures. Spatial Statistics 21, 166186.CrossRefGoogle Scholar
Huser, R, Opitz, T and Thibaud, E (2021) Max-infinitely divisible models and inference for spatial extremes. Scandinavian Journal of Statistics 48, 321348.CrossRefGoogle Scholar
Huser, R, Stein, ML and Zhong, P (2024) Vecchia likelihood approximation for accurate and fast inference in intractable spatial max-stable models. Journal of Computational and Graphical Statistics 33, 978990.CrossRefGoogle Scholar
Huser, R and Wadsworth, JL (2019) Modeling spatial processes with unknown extremal dependence class. Journal of the American Statistical Association 114, 434444.CrossRefGoogle Scholar
Huser, R and Wadsworth, JL (2022) Advances in statistical modeling of spatial extremes. Wiley Interdisciplinary Reviews: Computational Statistics 14, e1537.CrossRefGoogle Scholar
IPCC (2023) Climate change 2023: Synthesis report. In H. L. Core Writing Team and Romero, J (eds.), Contribution of Working Groups I, II, and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Geneva, Switzerland: IPCC, pp. 35115.Google Scholar
Jalbert, J, Favre, A-C, Bélisle, C and Angers, J-F (2017) A spatiotemporal model for extreme precipitation simulated by a climate model, with an application to assessing changes in return levels over North America. Journal of the Royal Statistical Society Series C: Applied Statistics 66, 941962.CrossRefGoogle Scholar
Jóhannesson, ÁV, Siegert, S, Huser, R, Bakka, H and Hrafnkelsson, B (2022) Approximate bayesian inference for analysis of spatiotemporal flood frequency data. The Annals of Applied Statistics 16, 905935.CrossRefGoogle Scholar
Kabluchko, Z, Schlather, M and de Haan, L (2009) Stationary max-stable fields associated to negative definite functions. The Annals of Probability 37, 20422065.CrossRefGoogle Scholar
Krupskii, P and Huser, R (2022) Modeling spatial tail dependence with Cauchy convolution processes. Electronic Journal of Statistics 16, 61356174.CrossRefGoogle Scholar
Lafon, N, Naveau, P and Fablet, R (2023) A VAE approach to sample multivariate extremes. arXiv preprint arXiv:2306.10987.Google Scholar
Ledford, AW and Tawn, JA (1996) Statistics for near independence in multivariate extreme values. Biometrika 83, 169187.CrossRefGoogle Scholar
Ledford, AW and Tawn, JA (1997) Modelling dependence within joint tail regions. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 59, 475499.CrossRefGoogle Scholar
Lenzi, A, Bessac, J, Rudi, J and Stein, ML (2023) Neural networks for parameter estimation in intractable models. Computational Statistics and Data Analysis 185, 107762.CrossRefGoogle Scholar
Lindgren, F, Rue, H and Lindström, J (2009) An explicit link between gaussian fields and gaussian markov random fields: The stochastic partial differential equation approach (with discussion). Journal of the Royal Statistical Society: Series B 73, 423498.CrossRefGoogle Scholar
Mackay, E and Jonathan, P (2023) Modelling multivariate extremes through angular-radial decomposition of the density function. arXiv preprint arXiv:2310.12711.Google Scholar
Menne, MJ, Williams, CN and Vose, RS (2009) The U.S. historical climatology network monthly temperature data, version 2. Bulletin of the American Meteorological Society 90, 9931008.CrossRefGoogle Scholar
Morris, SA, Reich, BJ, Thibaud, E and Cooley, DS (2017) A space-time skew-t model for threshold exceedances. Biometrics 73, 749758.CrossRefGoogle ScholarPubMed
Murphy-Barltrop, CJR, Mackay, E and Jonathan, P (2024) Inference for multivariate extremes via a semi-parametric angular-radial model. arXiv preprint arXiv:2401.07259.Google Scholar
Naveau, P, Huser, R, Ribereau, P and Hannart, A (2016) Modeling jointly low, moderate and heavy rainfall intensities without a threshold selection. Water Resources Research 52, 27532769.CrossRefGoogle Scholar
Nguyen, M, Veraart, AED, Taisne, B, Tan, CT and Lallemant, D (2023) A dynamic extreme value model with application to volcanic eruption forecasting. Mathematical Geosciences to appear.CrossRefGoogle Scholar
Nolde, N (2014) Geometric interpretation of the residual dependence coefficient. Journal of Multivariate Analysis 123, 8595.CrossRefGoogle Scholar
Nolde, N and Wadsworth, JL (2022) Linking representations for multivariate extremes via a limit set. Advances in Applied Probability 54, 688717.CrossRefGoogle Scholar
OCO-2 Science Team, Gunson, M and Eldering, A (2020) OCO-2 level 2 bias-corrected solar-induced fluorescence and other select fields from the IMAP-DOAS algorithm aggregated as daily files, retrospective processing V10r. Available at https://disc.gsfc.nasa.gov/datasets/OCO2_L2_Lite_SIF_10r/summary.Google Scholar
Oesting, M and Huser, R (2022) Patterns in spatio-temporal extremes. arXiv preprint arXiv:2212.11001.Google Scholar
Oesting, M, Kabluchko, Z and Schlather, M (2012) Simulation of Brown–Resnick processes. Extremes 15, 89107.CrossRefGoogle Scholar
Oesting, M and Schlather, M (2013) Conditional sampling for max-stable processes with a mixed moving maxima representation. Extremes 17, 157192.CrossRefGoogle Scholar
Oesting, M and Strokorb, K (2022) A comparative tour through the simulation algorithms for max-stable processes. Statistical Science 37, 4263.CrossRefGoogle Scholar
Opitz, T (2013) Extremal t processes: Elliptical domain of attraction and a spectral representation. Journal of Multivariate Analysis 122, 409413.CrossRefGoogle Scholar
Opitz, T (2016) Modeling asymptotically independent spatial extremes based on Laplace random fields. Spatial Statistics 16, 118.CrossRefGoogle Scholar
Opitz, T, Huser, R, Bakka, H and Rue, H (2018) INLA goes extreme: Bayesian tail regression for the estimation of high spatio-temporal quantiles. Extremes 21, 441462.CrossRefGoogle Scholar
Padoan, SA, Ribatet, M and Sisson, SA (2010) Likelihood-based inference for max-stable processes. Journal of the American Statistical Association 105, 263277.CrossRefGoogle Scholar
Papastathopoulos, I, de Monte, L, Campbell, R and Rue, H (2023) Statistical inference for radially-stable generalized pareto distributions and return level-sets in geometric extremes. arXiv preprint arXiv:2310.06130.Google Scholar
Papastathopoulos, I and Strokorb, K (2016) Conditional independence among max-stable laws. Statistics and Probability Letters 108, 915.CrossRefGoogle Scholar
Radev, ST, Mertens, UK, Voss, A, Ardizzone, L and Köthe, U (2020) BayesFlow: Learning complex stochastic models with invertible neural networks. IEEE Transactions on Neural Networks and Learning Systems 33, 14521466.CrossRefGoogle Scholar
Radev, S. T, Schmitt, M, Pratz, V, Picchini, U, Koethe, U and Buerkner, P.-C (2023a) JANA: Jointly amortized neural approximation of complex Bayesian models. In The 39th Conference on Uncertainty in Artificial Intelligence.Google Scholar
Radev, ST, Schmitt, M, Schumacher, L, Elsemüller, L, Pratz, V, Schälte, Y, Köthe, U and Bürkner, P-C (2023b) BayesFlow: Amortized Bayesian workflows with neural networks. arXiv preprint arXiv:2306.16015.CrossRefGoogle Scholar
Reich, BJ and Shaby, BA (2012) A hierarchical max-stable spatial model for extreme precipitation. The Annals of Applied Statistics 6, 1430.CrossRefGoogle ScholarPubMed
Resnick, S (2002) Hidden regular variation, second order regular variation and asymptotic independence. Extremes 5, 303336.CrossRefGoogle Scholar
Ribatet, M (2022) SpatialExtremes: Modelling Spatial Extremes. R package version 2.1–0. https://CRAN.R-project.org/package=SpatialExtremes.Google Scholar
Richards, J, Sainsbury-Dale, M, Zammit-Mangion, A and Huser, R (2024) Likelihood-free neural Bayes estimators for censored peaks-over-threshold models. Journal of Machine Learning Research, to appearGoogle Scholar
Risser, MD and Wehner, MF (2017) Attributable human-induced changes in the likelihood and magnitude of the observed extreme precipitation during hurricane Harvey. Geophysical Research Letters 44, 1245712464.CrossRefGoogle Scholar
Sainsbury-Dale, M, Richards, J, Zammit-Mangion, A and Huser, R (2024) Neural Bayes estimators for irregular spatial data using graph neural networks. Journal of Computational and Graphical Statistics, to appear.CrossRefGoogle Scholar
Sainsbury-Dale, M, Zammit-Mangion, A and Huser, R (2024) Likelihood-free parameter estimation with neural Bayes estimators. The American Statistician 78, 114.CrossRefGoogle Scholar
Sang, H and Gelfand, AE (2010) Continuous spatial process models for spatial extreme values. Journal of Agricultural, Biological, and Environmental Statistics 15, 4965.CrossRefGoogle Scholar
Schlather, M (2002) Models for stationary max-stable random fields. Extremes 5, 3344.CrossRefGoogle Scholar
Segers, J (2012) Max-stable models for multivariate extremes. REVSTAT 10, 6182.Google Scholar
Simpson, ES, Opitz, T and Wadsworth, JL (2023) High-dimensional modeling of spatial and spatio-temporal conditional extremes using INLA and Gaussian Markov random fields. Extremes, 145.CrossRefGoogle Scholar
Simpson, ES and Wadsworth, JL (2021) Conditional modelling of spatio-temporal extremes for Red Sea surface temperatures. Spatial Statistics 41, 100482.CrossRefGoogle Scholar
Smith, R L (1990) Max-stable processes and spatial extremes. Unpublished manuscript. https://www.rls.sites.oasis.unc.edu/postscript/rs/spatex.pdf.Google Scholar
Stephenson, A and Tawn, J (2005) Exploiting occurrence times in likelihood inference for componentwise maxima. Biometrika 92, 213227.CrossRefGoogle Scholar
Stott, PA, Stone, DA and Allen, MR (2004) Human contribution to the european heatwave of 2003. Nature 432, 610614.CrossRefGoogle Scholar
Thibaud, E, Aalta, J, Cooley, DS, Davison, AC and Heikkinen, J (2016) Bayesian inference for the Brown–Resnick process, with an application to extreme low temperatures. Journal of the American Statistical Association 10, 23032324.Google Scholar
Thibaud, E and Opitz, T (2015) Efficient inference and simulation for elliptical Pareto processes. Biometrika 102, 855870.CrossRefGoogle Scholar
Vandeskog, S, Martino, S and Huser, R (2024) An efficient workflow for modelling high-dimensional spatial extremes. Statistics and Computing 34, 137.CrossRefGoogle Scholar
Wadsworth, JL and Campbell, R (2024) Statistical inference for multivariate extremes via a geometric approach. Journal of the Royal Statistical Society Series B: Statistical Methodology 86, 12431265.CrossRefGoogle Scholar
Wadsworth, JL (2015) On the occurrence times of componentwise maxima and bias in likelihood inference for multivariate max-stable distributions. Biometrika 102, 705711.CrossRefGoogle Scholar
Wadsworth, JL and Tawn, JA (2012) Dependence modelling for spatial extremes. Biometrika 99, 253272.CrossRefGoogle Scholar
Wadsworth, JL and Tawn, JA (2022) Higher-dimensional spatial extremes via single-site conditioning. Spatial Statistics 51, 100677.CrossRefGoogle Scholar
Walchessen, J, Lenzi, A and Kuusela, M (2024) Neural likelihood surfaces for spatial processes with computationally intensive or intractable likelihoods. Spatial Statistics 62, 100848.CrossRefGoogle Scholar
White, H (1982) Maximum likelihood estimation of misspecified models. Econometrica: Journal of the Econometric Society, 125.CrossRefGoogle Scholar
Wikle, C and Zammit-Mangion, A (2023) Statistical deep learning for spatial and spatiotemporal data. Annual Review of Statistics and Its Application 10, 247270.CrossRefGoogle Scholar
Yuen, R and Stoev, S (2014) CRPS M-estimation for max-stable models. Extremes 17, 387410.CrossRefGoogle Scholar
Zammit-Mangion, A, Sainsbury-Dale, M and Huser, R (2025) Neural methods for amortized inference. Annual Reviews of Statistics and Its Application 12, to appear.CrossRefGoogle Scholar
Zhang, L, Ma, X, Wikle, CK and Huser, R (2023a) Flexible and efficient spatial extremes emulation via variational autoencoders. arXiv preprint arXiv:2307.08079.Google Scholar
Zhang, L, Shaby, B and Wadsworth, JL (2022a) Hierarchical transformed scale mixtures for flexible modeling of spatial extremes on datasets with many locations. Journal of the American Statistical Association 117, 13571369.CrossRefGoogle Scholar
Zhang, Z, Bolin, D, Engelke, S and Huser, R (2023b) Extremal dependence of moving average processes driven by exponential-tailed Lévy noise. arXiv preprint arXiv:2307.15796.Google Scholar
Zhang, Z, Huser, R, Opitz, T and Wadsworth, JL (2022b) Modeling spatial extremes using normal mean-variance mixtures. Extremes 25, 175197.CrossRefGoogle Scholar
Zhong, P, Huser, R and Opitz, T (2022) Modeling non-stationary temperature maxima based on extremal dependence changing with event magnitude. Annals of Applied Statistics 16, 272299.CrossRefGoogle Scholar
Zscheischler, J, Martius, O, Westra, S, Bevacqua, E, Raymond, C, Horton, RM, et al. (2020) A typology of compound weather and climate events. Nature Reviews Earth & Environment 1, 333347.CrossRefGoogle Scholar
Figure 0

Figure 1. Left: Observational grid (gray) of the UK Climate Projections (UKCP) daily precipitation data, showing a particular horizontal transect (red) and a $ 32\times 32 $ subregion (blue). Middle: Illustration of underlying daily random fields $ {Y}_i\left(\boldsymbol{s}\right) $ (gray) for the UKCP data over the summers 1981–2000 along the selected transect, with three of the daily fields that contribute to the pointwise annual maximum for the years 1988 (yellow), 1990 (red), and 1996 (blue). Right: Illustration of underlying random fields $ {Y}_i\left(\boldsymbol{s}\right) $ (gray) for the UKCP data over the summers 1981–2000 along the selected transect, with the pointwise annual maximum $ {Z}_n\left(\boldsymbol{s}\right) $ for the years 1988 (yellow), 1990 (red), and 1996 (blue), corresponding to the three individual extreme events shown in the middle panel. In each case, there is no $ j=1,\dots, n $ such that $ {Z}_n\left(\boldsymbol{s}\right)={Y}_j\left(\boldsymbol{s}\right) $ for all $ \boldsymbol{s} $.

Figure 1

Figure 2. For a pair of variables $ \left\{Y\left({\boldsymbol{s}}_1\right),Y\left({\boldsymbol{s}}_2\right)\right\} $ from various extreme-value models, illustration of the domains (colored areas) over which each model is meant to provide accurate tail probability approximations. Left: max-stable process; Middle: Pareto process for various aggregation functionals, $ r $; and Right: Spatial conditional extremes process, for each conditioning variable.

Figure 2

Figure 3. Examples of estimates of $ {\chi}_D(u) $ for four environmental datasets. Solid black lines represent point estimates, whereas dashed black lines are approximate $ 95\% $ pointwise confidence intervals based on block bootstrapped estimates. From left to right: UKCP daily precipitation data from Figure 1 at $ d=1024 $ sites (within the blue subregion shown in the left panel of Figure 1), with the model fit in blue (based on the Huser and Wadsworth, 2019 model fitted using a pretrained neural Bayes estimator); gridded conditionally simulated E-OBS Irish summer temperature data at $ d=178 $ locations; (detrended) Red Sea surface temperature data at $ d=144 $ locations in the Gulf of Aqaba; daily mean windspeed at $ d=7 $ locations in the Vaucluse “département” in France.

Figure 3

Figure 4. Three of the five estimated parameters obtained from local fits of the anisotropic Huser and Wadsworth (2019) copula model in all subgrids of size $ 32\times 32 $ ($ d=1024 $) within the study domain for the UKCP precipitation example from Figure 1. The plots display, for each local subgrid, the estimated range parameter (left), smoothness parameter (middle), and shape parameter (right) controlling the asymptotic dependence type; the two anisotropy parameters (stretch and rotation) are not shown for brevity. All of these estimates were obtained in a few minutes in total by reusing the pretrained censored neural Bayes estimators from Richards et al. (2024).

Author comment: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR1

Comments

Dear Editor,

We thank you for the invitation to write this opinion piece about environmental data science and extremes. In this paper, we discuss the severe limitations of max-stable processes, which are currently widely used in environmental applications. After strongly arguing against their systematic use in environmental extreme data science, we discuss more appropriate modeling framework based on peaks-over-thresholds, as well as the use of modern hybrid statistical machine learning approaches that enhance the modeling and inference of extreme events.

We hope you will appreciate our contributions, and look forward to hearing from you.

With thanks and best wishes,

Raphael Huser (on behalf of all authors)

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

Summary:

This position paper offers a nice and concise overview of the major limitations associated with utilizing the class of max-stable process models for spatial extremes, along with presenting several viable alternative models and methods. Such a contribution is timely and holds significant interest, particularly considering the rapidly expanding focus on modeling spatially-natured climate and weather extremes across several research communities. Below, I provide some comments and suggestions aimed at enhancing accessibility and appreciation for a broader audience.

Major comment:

Given the potentially diverse background of the readers this position paper aims to reach, I believe it would be highly beneficial to use real data as much as possible for selected variables where spatial extreme analysis is to be performed. Specifically, Fig. 1 could be replaced by real data (or a gridded data product to avoid spatial gaps). I believe daily precipitation could be a suitable variable to illustrate the point that pointwise maximum is unlikely to be obtained from the original data, as well as making the main message for Fig. 3.

In general, I believe it would be beneficial to focus on one (or more) representative running examples, similar to what Davison et al. (2012) did. Specifically, the selected running example(s) can be used to not only demonstrate the named limitations but also illustrate the benefits of using alternative methods in Section 3.

Ref:

Davison, Anthony C., Simone A. Padoan, and Mathieu Ribatet. ``Statistical modeling of spatial extremes.‘’ (2012): 161-186.

Minor comments:

1. page 2, lines 8-11, ``massive high-quality environmental data products based on observation and simulation...‘’

Should provide some representative references for each data type here.

2. page 2, line 19, ``in order to provide sound extrapolation into the tail of the distribution...‘’

The authors may want to make this more concise to ensure that the interdisciplinary readership understands precisely what it means.

3. page 2, line 29 ``and subasymptotic forms of tail dependence‘’

Provide a pointer to the relevant subsection in this paper.

4. page 3, line 20, ``has already realized some of their limitations‘’

Add the reference to Huang et al. (2021) here. Specifically, Figs. 1 and 2 in that paper concretely demonstrate the time-mismatch issue in the context of concurrent extremes, which can easily arise in a spatial setting.

Ref:

Huang, Whitney K., Adam H. Monahan, and Francis W. Zwiers. “Estimating concurrent climate extremes: A conditional approach.” Weather and Climate Extremes 33 (2021): 100332.

5. page 5, Fig. 1

Related to the major point I arise, it would be more comprehensible, especially for the dominant scientists who are the ‘users’ for applying statistical models and methods, to utilize real data applications such as extreme precipitation, temperature, or wind speed.

6. The dividing line between the limitations ``The rigidity of stability‘’ and ``Models only for limited sub-classes of possible tail structures‘’ is not that clear to me, and the authors may consider combining them into one section.

7. page 7

I find the argument of ``Lack of physically realistic models‘’ a bit weak on the physical side, and it would benefit from elaborating on the potential use of physically-meaningful stochastic partial differential equation models in modeling extremes to make it more accessible to a wider audience.

8. page 8, lines 25-26 ``they often remain quite expensive to apply in moderate-to-high dimensions in terms of number of spatial locations‘’

It would be ideal to provide a numerical range to give domain users an idea of what ``moderate-to-high dimensions‘’ might mean in terms of spatial locations.

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

As the manuscript’s title suggests, the objective of this paper is argue for the field to move away from MSP based analyses and towards other approaches. To this end, the authors highlight the drawbacks of MSP based modeling approaches and then propose alternative approaches.

The authors' thesis is important and interesting, and may be of interest to Environmetal Data Science readers in general. After addressing the comments below the paper should be considered for publication.

Comments:

* It is important to keep the target audience in mind for a paper such as this, because readers from differing backgrounds may receive different take home messages. Readers from the extremes community are likely well aware of many of the drawbacks that you mention, and would likely be supportive of rigorous model checking to ensure that the assumed model is a reasonable fit for the data. However, applied scientists may not be able to follow all aspects of the paper and may miss some of the important take away messages.

For example, I often collaborate with hydrologists and engineers for the purpose of modeling environmental extremes, and I have found that is still not uncommon for researchers like this to rely on subjective approaches, such as regional frequency analysis. I am a bit concerned that readers from this community may read this paper and abandon extremes based approaches in general. I would urge you to revise your paper to keep this perspective in mind.

* For applied researchers to embrace an approach such as MACHINE, they will need statistical software packages. One reason for the ubiquity of MSP based modeling approaches is the fitmaxstab function in the SpatialExtremes R package (as you mention in the paper). What software do you recommend for applied researchers to make use of your proposed approaches? If well established user friendly packages do not exist yet you should mention this as well.

* It may also be worth pointing out that other spatial extremes based approaches may be appropriate in certain circumstances. For example, if pointwise return level estimates are the desired output, a latent process model (or the so called spatial GEV model) may work well. This is important to point out as well.

Minor Comments:

* Maybe define MSP to denote `max stable process' at the beginning since you use it so often.

* Maybe mention that you propose a seven step procedure in the abstract. This seems to be an important take away of the paper.

* Define AI at first use (first page).

* l18-24: you use ‘hand’ three times in two sentences. This reads a bit awkward. Could you reword without losing anything?

* p4 l18: can you reword to avoid having to make the citation possessive?

* p4 l20: change `boosted' to something else (maybe buoyed?).

* p5 l29: should not be possessive

* p7 l40: artifacts is misspelled

* p8 l43: here you use double quotes whereas you’ve previously used single quotes (need to be consistent)

* p9 l12: artifacts is misspelled again

* p14 l46: I’m not sure if you mean systemic or systematic?

Recommendation: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR4

Comments

Both reviewers highlighted the relevance of the manuscript but also pointed out several aspects that need to be addressed by the authors before it can be accepted for publication in EDS. Based on their feedback, my recommendation is for a minor revision.

Decision: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R0/PR5

Comments

No accompanying comment.

Author comment: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR6

Comments

Dear Editors, and Reviewers,

We have now revised our opinion paper, and have thoroughly addressed all comments raised by the reviewers.

We hope you will appreciate our efforts to make the paper more accessible to a broad applied audience, and that you will find it satisfactory for publication in Environmental Data Science.

We have attached our point-by-point response, as well as a Diff file highlighting all our changes.

We look forward to hearing back from you, and thank you for your consideration.

With many thanks and best wishes,

Raphaël Huser (on behalf of all authors)

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

This version has sufficiently addressed my comments and, in my opinion, should be accepted for publication.

Review: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

Thank you for adequately addressing my concerns.

Recommendation: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR9

Comments

Both reviewers were highly satisfied with the revised manuscript, so I am pleased to recommend it for publication in EDS.

Decision: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes — R1/PR10

Comments

No accompanying comment.