Introduction
The basis for modern avalanche forecasting may be derived from the work of Reference LaChapelleLaChapelle (1980). LaChapelle rationalized avalanche forecasting on the basis of three data classes and his work was extended by Reference McClung and SchaererMcClung and Schaerer (1993). The basic idea is that data available to an avalanche forecaster are in three classes. The three data classes are defined by their ease of interpretation and direct relevance to avalanche prediction. For example, a report of wind speed and direction may be difficult to relate directly to avalanche possibility, whereas a report of avalanche occurrences leaves no doubt as to the current stability. Therefore, data are roughly classified into three classes. The higher the class number, the less directly relevant are the data and the less direct is the connection to avalanche prediction. A brief description of factors follows: (i) stability factors: those factors such as results of stability tests and avalanche-occurrence information which give direct inferences about snow stability (Reference McClung and SchaererMcClung and Schaerer (1993) should be consulted for further details about this approach and the method of classifying data); (ii) snowpack parameters including results of snow-stratigraphy studies, snowpack temperatures and others; and (iii) meteorological and snow parameters measured at or above the snow surface: data largely of numerical character and amenable to numerical analysis.
One theme which develops from LaChapelle’s method outlined above is that avalanche forecasters have access to varying amounts of information of varying quality, depending on the scale of the problem they are working with. For example, those attempting to predict avalanches for an entire mountain range from a meteorological office as described by Reference Ferguson, Moore, Marriott and Speers-HayesFerguson and others (1990) might have better access to high-quality meteorological information with less relevant information about local snow-stability factors. At the other extreme of the scale, a skier attempting to analyze stability potentially has access to snow-stability information of the highest quality of relevance to his/her problem (class i) but meteorological data and forecasts may be lacking.
In this paper, an objective forecast scheme is partly developed for highway-forecasting application at Kootenay Pass in southeastern British Columbia (Reference McClung and TweedyMcClung and Tweedy, 1993). The scale of the problem is intermediate between the two scales described above. Forecasting at Kootenay Pass involves collection of data from all three classes.
Of the three classes, the most difficult data for a forecaster to interpret are those from class iii. The reason is that class iii data are continuously recorded and the variables are potentially correlated. Fortunately, class iii is the category most amenable to use of numerical techniques. In this paper, a numerical model is dev-eloped for application at Kootenay Pass based largely on class iii data. The method chosen is a parametric discriminant analysis similar to those presented by Reference Bois, Obled and GoodBois and others (1975), Reference BovisBovis (1977) and Reference Obled and GoodObled and Good (1980). For illustration, discriminant functions are developed based on magnitude and frequency of aval-anching as well as stratification of prediction for dry and moist-wet avalanches. The present analysis differs from past attempts in two important ways: (1) the transitional climate at Kootenay Pass (Reference McClung and TweedyMcClung and Tweedy, 1993) does not allow stratification into distinct dry and wet parts of the avalanche season (November-April); rainfall and moist-wet avalanching can occur at any time of the year. Due to the absence of distinct parts of the snow season for dry and wet avalanching, the character of modeling potentially differs from that described by Reference Obled and GoodObled and Good (1980) and Reference BovisBovis (1977); (2) Bayesian statistics are employed to estimate the probability that a data vector belongs to groups of avalanche or non-avalanche periods measured in the past. Application of Bayesian statistics allows a forecaster to input his/her a priori probability of avalanching based on empirical data which potentially allows use of non-numerical data from classes outside those used in numerical prediction to improve the prediction. This approach can synthesize numerical and conventional avalanche forecasting (see Reference LaChapelleLaChapelle, 1980) and it can forge the link for the future development of a coupled expert system using empirical and numerical data from all three classes.
An additional feature of our model is the use of cluster techniques to allow analysis of the “nearest neighbours” of data vectors from an historical data base consisting of snow, avalanche and weather data (Reference BuserBuser and Good, 1985; Reference Buser, Bütler and GoodBuser and others, 1987). Our application of cluster techniques differs from previous approaches because we have employed the Mahalanobis distance as the distance metric between data vectors instead of the Euclidean distance. This assumption allows us to account for correlation between variables when calculating distances between data vectors in discriminant space and it provides a consistent mathematical framework for linking the cluster techniques and the parametric discriminant analysis.
Description of Data Base
The data in this study consisted of records collected at Kootenay Pass, British Columbia, of ten winters (November-April 1982-–82 through 1990–91). Reference McClung and TweedyMcClung and Tweedy (1993) defined the climate area as approximately midway between maritime and continental. Snow and weather data were collected twice daily (early morning and late afternoon) to form a data base of approximately 3300 records. The reader is referred to Reference McClung and TweedyMcClung and Tweedy (1993) for more details about the data at Kootenay Pass and avalanche characteristics based on a single-variable analysis.
In addition to snow and weather data, avalanche events were recorded as soon as possible after they occurred according to estimated occurrence time: those recorded between 1200 h midnight and 1200 h noon were grouped with morning observations; the remainder were grouped with afternoon observations. Avalanche occurrences were recorded (regardless of size) if they had potential for affecting the highway. All avalanches were given an empirical size based on the five-class Canadian size system (see Reference McClung and TweedyMcClung and Tweedy, 1993). Two empirical indices were formed to represent the character of avalanching: (1) an avalanche-activity index (AAI) defined as the sum of avalanche sizes recorded in the time periods described above. The AAI gives a simple index which represents the magnitude and frequency of avalanching within a time period; (2) a moisture index (MI) to differentiate between avalanches with dry, moist or wet debris. The MI categories were assigned the following numbers: 1, dry; 2, moist; 3, wet. The MI is defined as the average value of moisture content for all avalanches recorded in a time period. Two categories of moisture index for a time period were defined as dry (MI < 1.5); moist-wet (MI > 1.5).
Snow and weather data initially used in the analysis were of two types: (1) data taken regularly at the twice-daily intervals, and (2) lagging variables: data averaged over the present time period and several previous time periods. Data used in the study are listed in Table 1. Lagging variables and the number of time periods (12 h) used in the analysis are listed by including a parenthesis such as (L,3) meaning lagging variable averaged over three time periods.
Lagging variables present the possibility of including temperature trends (present value minus previous value) as well as memory into a model constructed from them. The variable for storm-board total includes all snowfall deposited during a storm so that a memory effect for snowfall deposition during storms is included.
Discriminant Analysis
Discriminant analysis seeks a linear combination of measurements in which the sum of squared differences between group means is maximized relative to within groups variance. It is assumed that the variancecovariance matrices for both groups can be represented by one common matrix in the simplest analysis. For the cases presented in this paper, a single function (discriminant function) based on components of a measurement vector is used to account for all differences between groups.
In the analysis, we divide the population of time vectors into two groups at each stage of the analysis. A vector of data at a given time is denoted by If each group (i) has a mean vector given by and, if ∑ represents the variance—covariance matrix representing both groups, then the task is to find a linear combination of so that the ratio of the difference in the group means to the common variance is maximized. If linear combinations are given by then a vector of weights is sought to maximize the ratio:
When ∆ is maximized, the maximum ratio of between-population dispersions to within-population dispersions is obtained. Before application, the samples rij from each group are used to replace parameters (μ,∑) with sample-based estimates. The μ are replaced by:
i = 1,2 and
where and S is the pooled sample variance—covariance matrix.
In this study, it was necessary to derive the standardized discriminant coefficients to define group differences, as well as individual group classification functions. The latter enable conditional probabilities for group membership (avalanche/non-avalanche time periods) to be calculated for each vector. Given a data vector for a given time, it is then possible to estimate the probability that a linear combination of variables in the time vector describes an avalanche situation.
Discriminant Function and Application of Bayesian Statistics
In the present work, for two-group discrimination, the discriminant function is defined by: and the estimate of is given by:
where are vectors representing the group means. The components of are equivalent to canonical coefficients standardized by conditional (within-groups) standard deviations.
In addition to Fisher’s discriminant function (defined above), the Mahalanobis distance squared between all data vectors Xi and the group centroids were calculated in discriminant space. These distances are presented by variables given by:
From the Mahalanobis distances determined by Equation (4), it may be shown (e.g. Reference TatsuokaTatsuoka, 1971) that the likelihood ratio of conditional probabilities P(D\Gi) relating membership in groups 1 and 2 is given by:
where
and
By application of Bayes’ rule, then the posterior probability for membership in group i is given by:
where P(Gi) is the a priori probability of group membership and P(G1)+ P(G2) = 1.
The rule for classification into avalanche/non-avalanche periods is then to classify in group i if:
For each calculation, a test of the null hypothesis that the population means are equal was conducted. The appropriate Ζ statistic may be calculated in terms of the Mahalanobis distance between group means by forming the D2 statistic:
where D2 is the Mahalanobis distance between group centroids, n2, n2 are group-sample sizes, p is the number of independent variables. Within the null hypothesis and a common variance—covariance matrix, Ζ is distributed as an F distribution with p, n1 + n2 − p − 1 degrees of freedom:
where α is the upper α percentage point of the F distribution. In all cases, α was less than 0.001, indicating that the null hypothesis is rejected and the group means are significantly different.
The assumption of homogeneity of variance of the groups was tested for all individual variables. The Bartlett (e.g. Reference Eisenbeis and AveryEisenbeis and Avery, 1972) test for homogeneity of group variances and an independent groups t-test was calculated for each variable. The results indicate no reason to reject the null hypothesis that the variances of the two groups are equivalent.
Variable Transformations
The usual assumption in discriminant analysis is that the variables are from a Gaussian distribution. Examination of variables in the analysis showed that the precipitation variables all had positive skewness. This same condition was found by Reference Bovis, Armstrong and IvesBovis (1976). Following Reference Bovis, Armstrong and IvesBovis (1976), we applied the transformation Xi → ln(Xi + 1) to the precipitation variables used for dry avalanche and dry time-period prediction. This transformation improved the percentage of data vectors correctly classified by approximately 5%. It also made the discriminant functions very nearly linear with very similar correct prediction percentages for avalanche and non-avalanche time periods. For wet-moist avalanche predictions, it was found that prediction and quality of the discriminant functions were better without application of the transformation, so it was not applied. This may reflect the relative lack of precipitation variables in our moist/wet avalanche variable sets.
Results of Parametric Discriminant Analysis
For the analysis here, groups of nearly equal size were selected for time vectors describing avalanche and non-avalanche time periods by deleting non-avalanche data vectors with a random-number generator. At Kootenay Pass, avalanche-data vectors constitute about 20% of the total during a winter period. This analysis produced discriminant functions derived from nearly equal a priori probabilities concerning population size. Separate analyses were performed by stratifying time periods according to whether avalanches were dry or moist-wet and by the avalanche-activity index (AAI).
Ten winters of data were included in the calculations: 1981-82 through 1990-91. In addition to avalanche-non-avalanche groups, an analysis was also employed to allow discrimination between dry and moist-wet avalanche periods. This result is used later by application of Bayesian statistics to help decide on the use of predictor-variable groups for dry or moist-wet cases in operational use (discussed below).
Table 2 contains information about the percentages of avalanche and non-avalanche time periods misclassified, the variables and the sample sizes in the analysis. Variables followed by (L) were transformed according to the logarithm transformation described above.
Operational Avalanche-Prediction Scheme and Numerical Testing
The prediction scheme to be employed is the result of a two-step procedure using Bayesian parametric and non-parametric discriminant analysis. Figure 1 shows a schematic of group separation achieved by discriminant analysis and cluster (non-parametric) analysis for a data vector represented as a point in multi-dimensional discriminant space. Figure 2 shows avalanche/non-avalanche groups calculated numerically from Kootenay Pass data. The groups in Figure 2 are fuzzy sets (Reference Klir and FeglerKlir and Fegler, 1988) depicted on a two-dimensional graph but calculated in a six-dimensional discriminant space.
The first step in the analysis involves employing Bayesian statistics coupled to parametric discriminant analysis to decide whether a given data vector would be more likely to belong to a group to forecast dry avalanches or moist-wet avalanches (set III, Table 2). We tested 30% of the data base on 3 years’ of data (1 year at a time) using an a priori probability of a dry-avalanche time period as 0.5 to calculate the posterior probability of a dry/moist-wet period. The method of testing consisted of holding out the year in question. The data from each data vector in the year held out were then tested by application of Bayesian statistics. Cluster calculations were performed between each data vector in the year held out and all other data vectors in the remaining 9 years of data. The results were then compared with what actually happened on the day in question by application of mathematically determined probability-warning levels (discussed below). This gave an overall success rate of 88% with 85% of dry-avalanche periods selected correctly and 98% of moist-wet periods selected correctly. A data vector was specified dry if the posterior probability of branching to dry conditions was ≥0.5. These results are slightly better than group classification based on equal sample sizes (86%, set III, Table 2) as the discriminant functions are defined. When used operationally, we expect a better success rate when a forecaster applies Bayesian statistics by specifying an a priori probability. Variables employed in this first part of the analysis are listed as set III (Table 2).
The second step of analysis involves application of Bayesian statistics to determine the probability of avalanching. Either set I (dry avalanching) or set II (moist-wet avalanching) of variables are selected depending on the results of the first step of the analysis.
We tested the results of steps one and two of the analysis by applying Bayesian statistics (a priori probability = 0.5) to 3years of the data set (1984–85, 1987–88 and 1990–91). Λ warning (avalanche prediction) was issued if the probability of dry avalanching was ≥0.6 and if the probability of moist-wet avalanching was ≥0.7. These levels (0.6 and 0.7) were determined mathematically to achieve an approximate equal balance between the percentage of avalanche and non-avalanche periods predicted correctly. The predicted results were then compared with what actually occurred (avalanching or not) during the time period. The results gave 79% success rate for dry-avalanche prediction (79% of non-avalanche periods; 78% of avalanche periods). For moist-wet avalanche periods, 75% success rate was achieved (75% non-avalanche periods, 71% wet-moist avalanche periods). These results are comparable to sample classification based on equal sizes during development of the discriminant functions (see Table 2) using equal sample sizes. Figure 3 and 4 show the results of our numerical testing for 2 years of observations.
Cluster Analysis of a Data Vector
In addition to calculating the probability of occurrence of avalanching, we also use cluster analysis to provide a further description of “nearest neighbours” to the time vector (e.g. Reference Obled and GoodObled and Good, 1980; Reference BuserBuser, 1983; Reference Buser, Bütler and GoodBuser and others, 1987). Our method of determining nearest neighbours consists of calculating the Mahalanobis distance D2 between all vectors in the data Xi base and the current one: Xp
where S−1 is the inverse of the pooled within-groups variance—covariance matrix calculated as a part of the discriminant analysis. This distance and variance-covariance matrices are calculated using the variables of either set I or set II in Table 2, depending on whether dry avalanching or moist-wet avalanching may be expected. This distance metric reduces to the Euclidean distance metric assumed by Reference Buser, Bütler and GoodBuser and others (1987) if the variance—covariance matrix is taken to be the identity matrix. Therefore, our distance matrix retains the assumption that the variables are correlated and distances are calculated in discriminant space in contrast to the implicit assumption of no correlation between variables if a Euclidean metric is chosen.
On calculation of distances between data vectors, the distances are ranked and the 30 nearest neighbours are retained and analyzed for information as to whether they have avalanche occurrences associated with them, and information about the avalanches is summarized. This information includes: size, moisture (dry, moist, wet), trigger, aspect, location, type (slab or loose snow) and other parameters. The fraction of data vectors in the nearest-neighbour cluster which contain avalanching may be regarded as similar to a conditional probability of avalanching for the cluster analysis. We have studied two values of probability (P10 and P30), which describe the conditional probability of avalanching for the ten and 30 nearest neighbours to the present data vector. Our study had two objectives: (1) to determine whether the Bayesian discriminant probability or conditional cluster probabilities correlated best with avalanche activity; (2) to assess whether P10 and P30 correlated best with the Bayesian discriminant probability.
The answer to objective (1) was sought in two parts:
(a) by a simple correlation, e.g. Spearman Rank correlation, and (b) by comparing conditional probability of avalanching calculated by both methods. Both (a) and (b) showed that Bayesian discriminant probabilities were superior to conditional cluster probabilities in our study. Rank correlation between magnitude of avalanching and Bayesian probabilities is shown in the correlation Table 3.
These data show that the Bayesian probabilities correlate best with the avalanche magnitude—frequency index but the differences are minor. In order to compare predictions from the cluster probabilities with Bayesian probabilities, it was necessary to establish a prediction level (warning level) for the cluster predictions. It was found mathematically that the optimal prediction level was obtained when P30 = 0.20; when 20% of the 30 nearest neighbours contained avalanche occurrences, the best balance between successful prediction of avalanche and non-avalanche periods was achieved. This “warning level” is comparable to the optimum warning levels of 0.6 for dry avalanches and 0.7 for moist-wet avalanches. Table 4 gives a summary of results for data from the winters of 1985, 1988 and 1991. The results are tabulated in numbers of data vectors.
It is important to note that, for the results in Table 4, there were 12 dry avalanche periods found in periods which were predicted by use of discriminant probabilities to be more likely to produce dry-avalanche periods. When these are taken into account, the fraction of avalanche periods correctly predicted by the two methods is virtually identical but non-avalanche periods are best predicted by the Bayesian-discriminant probabilities (77%) vs (73%) by using cluster techniques. Our conclusion then is: for the analysis using Kootenay Pass data, application of Bayesian discriminant technique is superior but there is not much difference between the two methods. Since our operational forecasting procedures will use both methods, Bayesian statistics to predict avalanches and cluster techniques to analyze expected character of avalanche occurrences, the consistency between the two methods is both advantageous and necessary.
Time-Series Analysis
Our model contains temperature and snowfall variables (and perhaps others) which allow a memory effect in the prediction capabilities. To study the time relation between probabilities and the AAI (see Table 3), we calculated the cross-correlation time series on a year-by-year basis for the numerical experiments given in Table 4. Figure 5 and 6 show the correlation coefficients for 1984– 85 calculated by cross-correlation of AAI and Ρ (Fig. 5) and P30 (Fig. 6). Figure 5 and 6 also show the limits of ±2 standard errors above zero correlation: when the correlation falls inside these ranges, it is not considered significant. The results shown indicate that correlation between avalanching and the probability from the model becomes significant about two time periods (1 d) prior to the present (zero lag). In addition, the results in Figure 5 and 6 show that the correlation between avalanching and the probabilities approaches insignificance after about one time period past maximum correlation (zero lag).
Summary and Discussion
We have modified some of the numerical procedures for dealing with the numerical part of the avalanche-forecasting problem. Our methods are derived from the suggestions of Reference Obled and GoodObled and Good (1980), Reference Buser, Bütler and GoodBuser and others (1987) and Reference LaChapelleLaChapelle (1980). Our resulting model involves using both parametric discriminant-analysis techniques (incorporating Bayesian statistics) and cluster techniques and “nearest neighbours” calculated in discriminant space using the Mahalanobis distance as the distance metric.
Reference LaChapelleLaChapelle (1980) first advocated the use of Bayesian statistics in avalanche forecasting in the manner in which we intend to employ it by field testing our model. It is our belief that by combining a forecaster’s a priori probability with our numerical estimate of the conditional probability of avalanching, we can assess the potential power of an expert system employing symbolic logic to non-numerical data of classes i and ii. We have been able to include some data from these classes (e.g. avalanche occurrences) but not enough to produce a forecasting tool of the accuracy desired in applications.
The consistency of the results presented in comparing the cluster analysis and parametric discriminant analysis prove that these techniques are nearly identical when calculations are done in discriminant space. Application of the cluster techniques (along with parametric discriminant analysis) allows an operational forecaster to analyze the character of expected avalanche occurrences if any appear in the clusters. For example, triggering mechanism, avalanche type and terrain information, such as avalanche-path location and aspect of avalanche release. The consistency in our comparison of results adds confidence that the mathematical procedures are correct in both cases.
Even though we have not field-tested our model, we believe that our numerical testing of the model is among the most extensive reported for numerical forecasting models. When Bayesian statistics are actually employed by forecasters, we have good reason to believe that the predictive accuracy we have estimated numerically will be exceeded. Since groups calculated by discriminant techniques (e.g. Fig. 2) are fuzzy, a future goal for avalanche forecasting may be to provide estimates of a priori probabilities from application of fuzzy logic procedures coupled with symbolic computing.
Acknowledgements
We wish to thank Dr O. Buser, Swiss Federal Institute for Snow and Avalanche Research, for his encouragement and help with this research. This research was supported by the British Columbia Ministry of Transportation and Highways, the National Research Council of Canada and the Natural Sciences and Engineering Research Council of Canada. We are indebted to the staff of the Snow Avalanche Section of the British Columbia Ministry of Transportation and Highways for providing data-handling and avalanche-forecasting expertise. We are also grateful for the vision of E.A. Lund, Chief Highway Engineer, British Columbia Ministry of Transportation and Highways, for making this study possible.
The accuracy of the the references in the text and in this list is the responsibility of the authors, to whom queries should be addressed