Notation
Introduction
The most common way to evaluate ice ablation (kg m −1) is to measure the surface lowering (m) relative to an ablation stake and to multiply the surface lowering by the density of glacier ice (900 kg m −3).
Bauer (1961) discussed annual net ablation measurements with six stakes at a single site- in Greenland and interpreted differences between stakes as errors. His error standard deviation was ± 230 kg m −2a−1, or ± 15% of the annual ablation. Generally similar errors of ±200 to ±400 kg m−2a−2 have been reported for annual balances (Reference LliboutryLliboutry, 1974; Reference Reynaud, Vallon and LetrReynaud and others, 1986; Reference Braithwaite and OlesenBraithwaite and Olesen, 1989b) using Lliboutry's linear model, and a similar error of ±200 to ± 250 kg m −2 a−1 is given by Gogley and others (1995).
The above errors are much larger than the simple error in measuring from the top of an ablation stake to the ice surface, and presumably reflect gross errors as well as the failure of the data to represent correctly annual ablation in the immediate surroundings of the stake where they are measured. In the present paper, the term “measurement error” is used in this broader sense. This usage is justified because one assumes (or hopes) that a single ablation measurement represents a certain minimum area around the stake-, ranging from about 0.1 km2 on a well-stuthed glacier (Reference SchyttSchytt, 1962) to many km2 in a sparse stake network as in Greenland. Braithwaite and Olesen (1989b) showed that errors in annual ablation data reduce correlations between ablation data and climate variables according to the ratio of error variance to ablation variance.
Short-term variations in ablation (e.g. daily variations) have also been measured as part of energy-balance stuthes (Reference AmbachAmbach, 1963; Reference Muller and KeelerMuller and Keeler, 1969; Braithwaite and Olesen. 1990). In such cases, one would like the measured ablation to be as accurate as possible so that it can be used as a check on the energy balance, which may not be completely accurate due to measurement errors and various unsatisfied assumptions.
Reference Muller and KeelerMuller and Keeler (1969) stuthed errors in short-term ablation measurements in some detail. The main errors in surface lowering are due to ablation hollows, which commonly form around stakes, and to the complex micro-relief of the ice surface. A straight edge on the ice surface at the stake can be used as a datum and gives an error of ± 0.005 m in surface lowering, which can be further reduced to ±0.003 m by the star ablatograph (Reference Muller and KeelerMuller and Keeler, 1969). There is a further error, in dial the density of the glacier surface layer can be less than that of glacier ice (e.g. as low as 500-600 kg m−3 due to “weathering crust” caused by melting within the top few cm of the glacier surface layer. Any density changes in this layer are not detected by measurements of surface lowering but arc so difficult to measure that most people would regard them as unavoidable.
The effects of errors on daily ablation measurements were investigated at two locations in northern Greenland (Fig. 1) in the summers of 1993 and 1994, respectively. The field stuthes were part of a 2 year programme on world sea-level changes supported by the European Union. Northern Greenland was chosen as the target because there is little information about ablation in this area except for a qualitative description by Reference FristrupFristrup (1951). The present paper examines the accuracy of daily ablation measurements at the two sites and discusses their implications for glacier climate stuthes.
The Data
Daily measurements of ablation were made at two locations (Fig. 1) in northern Greenland: at the margin of the Greenland ice sheet in Kronprins Christian Land (KPCL) in 1993, and at the margin of the Hans Tausen Ice Gap (HTIC) in 1994. Locations and periods of measurements are shown in Table 1 where the coordinates were determined by repeated positioning with a hand-held receiver for the global positioning system (GPS).
Previous experience has shown that ablation measurements involve considerable error, and that ablation itself varies greatly even on a scale of metres, suggesting that many measurements are needed to obtain a representative value for a site (Olcsen and Braithwaite, 1989). Ablation in the present study was therefore measured at ten stakes placed within an area of only about 100 m2. The ablation-measurement sites were located beside climate and radiation stations on the ice, which are a few minutes’ walk from field camps established on the tundra. The ten stakes were read daily at close to 1900 UTC (about 1715 h solar time in KPCL and 1630 h solar lime at HTIC).
Stakes are usually surrounded by an “ablation hollow” of 04-0.3 m diameter, and slake readings are made by the straight-edge method. Differences in successive daily readings are converted into ablation values using an ice density of 900 kg m −3. For convenience, ablation is treated in this paper as positive rather than negative, as recommended by Anonymous (1969).
An attempt was made to choose the stake sites at random. However, summer melting was already well under way at the start of the KPCL data collection, with well-developed microtopography, melt streams and cryconite holes, so that subconscious bias in avoiding these features cannot be excluded. By contrast, melting had hardly started in 1994 when the HTIC! stakes were established, and melt streams and cryconite holes subsequently developed around several stakes.
At both sites, ablation crust was ubiquitous, but an aller-nation between a white crusty surface and blue ice, as described by Reference Muller and KeelerMuller and Keeler (1969), was never observed. This suggests that large changes in surface density did not occur in the present eases.
The first data analysis involves calculating the mean and standard deviation of the ten ablation readings for each day. The daily mean represents the best estimate of the true ablation on that day, and I he standard deviation expresses the spread of individual readings around this mean, reflecting the effects of measurement errors as well as any small-scale variations in ablation. For KPCL 1993 (Fig. 2), mean ablation was well above zero for the whole 20 day record, while ablation for HTIC 1994 (Fig. 3) was generally lower and approaches zero several times during the 35 day record. However, day-to-day fluctuations in ablation have a similar amplitude in both cases.
For both datasets (figs 2 and 3) the daily standard deviation is usually quite small, i.e. around ±5 kg m−2d −1for most days, but there are also days with much larger standard deviations (e.g. well over ±10 kg m−2 d−1. Presumably the former is due to a random process with a fairly small amplitude, while the latter is caused by a larger, more sporadic effect, i.e.”errors”and “mistakes”.
Theory
The ablation a it measured at stake i on day tis not the true ablation, because of measurement errors and the effects of small-scale variations in ablation. It is assumed that, the ablation is given by:
where xt is the true ablation for the day, and eit is a random error. It is assumed here that there are no real differences between stakes, and that any apparent differences are due to the statistical sampling of the error term.
In principle, Equation (1) is not solvable, because it involves two unknown quantities xt and eit However, with a large number of stakes (M) within a small area, the errors can be assumed to average out, and the true ablation is approximately equal to the average ablation at for the M stakes on any particular day:
It is assumed that the error eit has zero mean for M stakes, is uncorrelated with the true ablation xt, and has the same standard deviation Se for both time and space variations, The standard deviation of the error is t hen equal to the standard deviation St of the ablation for the M stakes:
Under the same assumptions, the standard deviation St of ablation time series for N days is:
where Sx is the standard deviation of true ablation xt for N days. The correlation coefficient R(ait,yt between measured ablation and any time series yt is:
where R(xtyt) is the correlation coefficient between true ablation and the time series. As Sx/Si must be less than unity, Equation (5) means that ablation errors reduce correlations between measured ablation and other variables. In the particular case where yt=xt the correlation coefficient on the righthand side of Equation (5) is unity, and
This represents the highest correlation that can be found between ablation at any individual stake and any other time series.
Ideally, the random error, as expressed by the standard deviation Se, should be as small as possible compared with true ablation variations. Otherwise, the presence of large errors affects the interpretation of the ablation measurements. For example, according to Equation (5) with typical values of ±5 and ±10 kg m−2 in d −1 for Se and Sx, correlation coefficients involving measured ablation are reduced by a factor of 0.89 compared with true ablation. For a larger random error (e.g. Se = Sx = ±10 kg m −2 d −1), the reduction factor is 0.71. As the percentage of variance that is “explained” by the correlation depends upon the square of the correlation coefficient, i.e. 79% and 50% in the two examples here, measurement errors in daily ablation can easily reduce the predictive power of ablation climate correlations by 20-50%.
Equation (l) assumes that the true ablation is the same at all stakes, so that observed differences are due to statistical sampling alone; however, ablation at the different stakes need not be the same. For example, in the linear model:
xt and eit are the same as before, but the new term zi represents possible differences in ablation between stakes. This equation was first applied to stake networks by Reference LliboutryLliboutry (1974).
Magnitude of Errors
According to the theory, random errors in ablation measurements should have similar effects for all stakes, and similar effects for all days. This principle can be used for detection of gross errors. For example, the few large values of daily standard deviations in the upper parts of figures 2 and 3 indicate the operation of something other than a random error, i.e. a relatively large error that is more sporadic.
Correlation coefficients between ablation series at individual stakes and the mean ablation for all stakes (table 2) are generally similar, as predicted by Equation (6), but there are some cases with very low correlations. For example, stake B in the KPCL 1993 dataset has a correlation coefficient of only 0.35 compared to an average of 0.84 for all ten stakes in Table 2. Re-examination of the data for individual cases shows that anomalous data were collected at this stake on 2 out of 20 days, i.e. with errors of 28 and -38 kg m −2d−1, but it is impossible to guess the correct values. Data for stake B were therefore excluded from the dataset, and calculations were repealed for the other nine stakes, raising the average correlation from 0.84 to 0.91 (table 2). The few large daily-standard deviations in the upper pari of Figure 2 are also reduced, although variations of mean ablation are little changed (because the effect of the original errors is “diluted”).
In the HTIC 1994 dataset, there are low correlation coefficients of 0.61 and 0.68 at stakes B and E compared to a mean of 0.88 for all ten stakes (table 2). In the case of stake B, there are two large anomalies with similar magnitude and opposite signs, i.e. errors of+48 and -42 kg m −2d−1, consistent with a single wildly inaccurate stake reading. At stake E there are three anomalies, errors of-46, +66 and -43 kg m −2 d−1, which are harder to interpret. Instead of trying to guess the correct data, we excluded stakes B and F from the dataset, and calculations were repeated for the remaining eight stakes, raising the average correlation from 0.88 to 0.94 (table 2). The group of very large standard deviations in the upper pan of Figure 3 completely disappears.
In the reduced datasets, errors outside the range ± 10 kg m −2 d−1 occur with frequencies of only 9% and 5%, and the corresponding error standard deviations arc only ±6 and ± 5 kg m −2 d−1. The accuracy of an individual stake measurement can be taken as the same magnitude.
The true ablation for any day was assumed to be given by the mean ablation at M stakes (Equation (2)), but this invokes a sampling error of Se/M0.5 which is ±2 kg m −2 d −1 for eight or nine stakes when S e = ±6 kg m−2d−2. The error in mean ablation for any day can be taken as the same magnitude.
Differences between stakes
The mean ablation appears to be different at different stakes (table 3). The statistical significance of the differences is studied by a two-way analysis of variance (Reference KreyszigKreyszig, 1970, p. 274-287) using Equation (7). The resulting sources of variance are shown in Table 4. As one might expect, the largest source of variance in both cases is the day effect, i.e. the day-to-day change in ablation according to weather. Next largest is the unexplained variance, mainly due to measurement errors, and the smallest is the stake effect, reflecting only small differences between stakes.
According to the F-test (Reference KreyszigKreyszig, 1970, p. 277), the differences between days arc significantly different at a very low probability level for both samples. The situation for stake differences is less clear: for KPCL 1993 (with 8 by 152 degrees of freedom) the differences are significantly different al 5% level; but differences are not significant for HTIC 1994 (7 by 238 degrees of freedom).
The alternative to a purely statistical treatment is to look for a physical difference between stakes. For example, Reference Konzelmann and BraithwaiteKonzelmann and Braithwaite (1995) have already noted the significant difference in ablation at the different stakes for KPCL 1993 and explained it by small-scale variations in albedo, causing differences in absorbed shortwave radiation.
It is not possible to measure albedo exactly at a stake. because the slake and albedometer interfere with each other, but albedo was assessed indirectly by subjectively classifying the immediate surroundings of each stake as “dark” or “light”. In both cases, the mean ablation for “dark” stakes is higher than for “light” stakes (table 5). The range in measured albedo values is 0.1 in the general area of the KPCL 1993 stakes (Reference Konzelmann and BraithwaiteKonzelmann and Braithwaite, 1995) and is similar for HTIC 1994, This refers to the difference between minimum and maximum albedo, and the difference between “light” and “dark” stakes must be smaller (e.g. of the order of 0.05). For KPCL 1993 the mean global radiation is 317 W m 2 and an albedo difference of 0.05 gives a difference of 16W m2 in ablation energy, equivalent to an ablation difference of 4.1 kgm2 d −1. For HTIC 1994 the mean global radiation is only 223Wm −2 giving an ablation difference of only 2.9 kg m2 d−1. These figures are in rough agreement with the ablation differences in Table 5, supporting the notion that (1) mean ablation differences between close stakes are partly caused by albedo variations, and (2) the interstake ablation differences for HTIC 1994 are less than those for KPCL 1993 because of lower global radiation in 1994.
Implications for degree-day modelling
The validity of degree-day modelling rests partly upon a relatively high correlation between ablation and air temperature [Reference Braithwaite and OlesenBraithwaite and Olesen, 1989a). However, the correlation coefficients in Table 2 suggest that very low correlations can occur with data from single stakes. For example, daily ablation at stake B for KPCL 1993 would have a low correlation with air temperature (or any other climate element) due to the effects of errors, and one would mistakenly infer only a weak correlation between ablation and temperature (or any other climate variable).
Measured ablation in West Greenland has been used to estimate positive degree-day factors (Reference Braithwaite and OlesenBraithwaite and Olesen, 1989a) that are then applied to the whole Greenland ice sheet (Reference Huybrechts, Letr and ReehHuybrechts and others, 1991). For these purposes, ablation data should be as accurate as possible so that degree-day factors are not unduly affected by errors.
Positive degree-day factors are calculated for the two datasets by dividing the mean ablation values in Table 3 by the corresponding mean of positive temperatures for the datasets. These are 4.18° C for KPCL 1993 and 3.10°C for HTIC 1994, based on hourly air temperatures recorded by data loggers (at about 2 m over the glacier surface) only a few metres from where the ablation stakes were measured.
The resulting positive degree-day factors (table 6) show not only a clear difference between the two sites, but also a substantial variation of about ±10% between stakes at the same site. As different degree-day factors can be calculated from the ablation for “dark” and “light” stakes (table 5), it can be concluded that small-scale albedo differences partly explain these interstake variations in positive degree-day factors. However, the difference in mean degree-day factors at the two sites (9.8 and 5.9kgm−2d−1 deg−1) is too large to be caused by errors in ablation measurements. Variations in positive degree-day factors at other sites in Greenland (Reference BraithwaiteBraithwaite, 1995, Fig. 3) are also too large to be explained by a ± 10% error m ablation measurements.
Implications for energy-balance modelling
Another important reason for measuring ablation is to check the accuracy of energy-balance measurements (Reference Streten and WendlerStreten and Wendler, 1968; Muller and Keeler. 1969; Reference FohnFohn, 1973; Reference Wendler and WellerWendler and Weller, 1974; Munro, 1990), or energy-balance models (Reference Escher-VetterEscher-Vetter, 1985; Reference Hay and FitzharrisHay and Fitzharris, 1988; Reference Braithwaite and OlesenBraithwaite and Olesen, 1990).
Each component of the energy balance is measured or estimated as accurately as possible, but due to various errors the sources and sinks do not exactly balance. This effect is expressed by the standard deviation of the “error” terms in Table 6 for West Greenland (Reference Braithwaite and OlesenBraithwaite and Olesen, 1990) and for the two sites in northern Greenland. The latter are documented in a separate paper ι Braithwaite and others, 1998), but the KPCL 1993 values here are essentially based on Reference Konzelmann and BraithwaiteKonzelmann and Braithwaite (1995) with recalculation of the turbulent fluxes using log-linear profiles and a larger surface roughness.
Small mean errors in energy-balance modelling are easily achieved by suitable choices of parameters in the model, and can always be reduced to zero by suitable “model tuning”. However, the standard deviations of the errors in Table 7 do express the overall accuracy of the energy balance. It is noteworthy that the energy balance is much more accurate at the two northern Greenland sites with a standard deviation of only ± 20 W m −2, equivalent to about ± 5 kg m −2 d−1. This is comparable to the reduced error in the ablation measurements (±2kgm−2 d−1), suggesting that improved ablation measurements do partly contribute to a more accurate energy balance than in the West Greenland studies, where ablation was only measured on three stakes without any effort to detect and eliminate gross errors.
Conclusions
Measurement errors in glacier climate studies can be reduced by measuring daily ablation at many stakes close together (e.g. at least ten stakes within a small area) and comparing the data to detect gross errors. For example, a high standard deviation for ablation data taken on the same day, or a low correlation between data series at different stakes, indicates erroneous data. These data should be discarded if it is too late to repeat the measurements. In the two cases studied, random errors in daily ablation data for individual stakes are about ±5 kg m−2d −1 after discarding erroneous data. The error is further reduced to only about ± 2 kg m −2 d−1 by averaging over eight or nine stakes.
Random errors in calculated energy balances using the present ablation data are much lower than found in earlier studies in West Greenland where ablation was only measured on three stakes without any attempt to detect and eliminate gross errors.
Aside from day-to-day errors, there are ± 10% differences in mean ablation at different stakes, which are probably caused by small-scale variations in surface albedo. Such interstake differences give a ± 10% uncertainly in the positive degree-day factors, which are 9.8 ±0.9 and 5.9 ±0.6 kg m−2 d −1 deg−1 for the two sites.
Acknowledgements
This paper is published by permission of the Geological Survey of Denmark and Greenland (GEUS, formerly GGU). Fieldwork in both 1993 and 1994 was supported by the European Union under contract number EV5V-CT91-0051, and co-ordinated by the Climate Research Unit, University of East Anglia, U.K. Additional support for the 1994 work on the HTIC was provided by the Nordic Council under a contract to N. Reeh, Danish Polar Centre, Copenhagen. Logistic support in both years was supplied by GGU's base camp at Centrum So, KPCL, led by GGU Statsgeolog N. Henriksen.