Explicit solutions for a probabilistic moraine preservation model

PAUL MUZIKAR

doi:10.1017/jog.2016.109

Explicit solutions for a probabilistic moraine preservation model

Published online by Cambridge University Press: 26 September 2016

PAUL MUZIKAR

Show author details

PAUL MUZIKAR*: Affiliation:
Department of Physics and Astronomy, Purdue University, West Lafayette, IN, USA
*: Correspondence: Paul Muzikar <[email protected]>

Article contents

Abstract
INTRODUCTION
CALCULATION OF THE GENERATING FUNCTION
RESULTS
DISCUSSION
References

Rights & Permissions

Abstract

If a series of glacial advances occurs over the same pathway, the moraines that are now present may constitute an incomplete record of the total history. This is because a given advance can destroy the moraine left by a previous one, if the previous advance was less extensive. Gibbons, Megeath and Pierce (GMP) formulated an elegant stochastic model for this process; the key quantity in their analysis is $\bi P(n\vert N)$, the probability that n moraines are preserved after N glacial advances. In their paper, GMP derive a recursion formula satisfied by $\bi P(n\vert N)$, and use this formula to compute values of P for a range of values of n and N. In the present paper, we derive an explicit general answer for $\bi P(n\vert N)$, and show explicit, exact results for the mean value and standard deviation of n. We use these results to develop more insight into the consequences of the GMP model; for example, to a good approximation, 〈n〉 increases as ln(N). We explain how a Bayesian approach can be used to analyze $\bi P(N\vert n)$, the probability that there were N advances, given that we now observe n moraines.

Keywords

glaciation moraines stochastic model

Type: Papers
Information: Journal of Glaciology , Volume 62 , Issue 236 , December 2016 , pp. 1181 - 1185

DOI: https://doi.org/10.1017/jog.2016.109 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright: Copyright © The Author(s) 2016

1. INTRODUCTION

One difficulty in reconstructing the history of glaciations in a given area is that a moraine deposited by a glacial advance can be destroyed by a subsequent glacial advance. The paper by Gibbons and others (Reference Gibbons, Megeath and Pierce1984) (we will refer to this paper of Gibbons, Megeath and Pierce as GMP) refers to this as ‘obliterative overlap.’ For example, if there have been ten glacial advances in a given valley, there may be only three moraines left at the present time. In order to analyze such situations, GMP formulated an elegant model, in which the key quantity of interest is $P(n\vert N)$ , the probability that n moraines are preserved, given that N glacial advances have occurred. Note that within the GMP model we have 1 ≤ n ≤ N, since the most recent moraine is always preserved, and one particular scenario results in all N moraines left intact. For a recent application of the model see, for example, Young and others (Reference Young, Schweinsberg, Briner and Schaefer2015).

The key assumptions of the GMP model are: (1) all N! possible orderings of the extents of the glacial advances are equally likely; or, to quote GMP, ‘moraines were deposited at various distances from their glacial source areas randomly over time.’ (2) A given glacial advance obliterates the moraines of all previous advances of lesser extent.

The degree to which this model fits a given situation depends on many factors. Detailed, careful studies have shown that glacial advances and retreats occur on many timescales, as the glaciers respond to both climate fluctuations, and to more systematic, long-term changes (Reichert and others, Reference Reichert, Bengtsson and Oerlemans2002; Roe and O'Neal, Reference Roe and O'Neal2009; Roe, Reference Roe2011; Anderson and others, Reference Anderson, Roe and Anderson2014). If the net effect of these complicated processes is that the average of L (the length of an advance) is both time-independent and not correlated with the length of other advances, then the GMP model will be suitable. If, however, the statistics of L is time dependent, the GMP model may miss important features of the situation. For example, if the mean value of L decreases gradually with time, we have a long-term glacial retreat scenario, and as GMP point out, their model will underestimate n. A more subtle complication would be the following: even if the average length of an advance had no long-term trend, there could be shorter-term correlations between the successive values of L.

The preceding paragraph discusses assumption (1). We should also note the following: while it may not be very likely, if an overrun moraine is not obliterated, assumption (2) is violated. The judgment of the field geologist can be important in such a case. If a moraine survives, but in a degraded form, which strongly indicates it was overrun by a later one, then it should not be counted as contributing toward n, if the GMP model is being applied.

In this paper, we take the view that the GMP model provides a very useful basic description, and captures important real-world features of the obliterative-overlap phenomenon. Thus, it is worth elucidating in more detail the actual predictions of the model. Here, we will concentrate on deriving explicit formulae, which embody the predictions of the model.

2. CALCULATION OF THE GENERATING FUNCTION

Given the assumptions of their model, GMP were able to derive a recursion equation satisfied by $P(n\vert N)$ (for n ≥ 2):

(1)

$$P(n \vert N) = \displaystyle{1 \over N}\sum\limits_{M = n - 1}^{N - 1} P(n - 1 \vert M)$$

Note that $P(1\vert N) = 1/N$ . To gain insight into Eqn (1), note that if n moraines are to survive, the length of the very last glacial advance can have N − n + 1 possible positions, relative to the lengths of the other advances. That is, its length can be the shortest, the second shortest, the third shortest and up to the $(N - n + 1)$ th shortest. For each of these cases, if we compute how many ways there can be a total of n surviving moraines, divide by N!, and then combine, we arrive at Eqn (1). The authors then used this equation to generate a table of values for $P(n\vert N)$ . The calculations revealed, perhaps surprisingly, that the number of surviving moraines would probably be quite small. For example, when N = 20, the most likely value for n is n = 3. The authors did not present a closed form solution for $P(n\vert N)$ , or for quantities such as 〈n〉 or $\sigma _n^2 = \langle (n - \langle n\rangle )^2\rangle $ .

In order to make progress in analyzing the GMP model, we rely on a widely used approach: we compute the generating function (see, e.g., chapter 7 of Wallace, Reference Wallace1972). The generating function for $P(n\vert N)$ uses a real-valued auxiliary variable x, and is defined as follows:

(2)

$$G(x,N) = \sum\limits_{n = 1}^N x^nP(n \vert N)$$

Once we have an explicit form for G(x,N), we may determine $P(n\vert N)$ from the coefficient of x ⁿ; we also have:

(3)

$$\langle n\rangle = \sum\limits_{n = 1}^N nP(n \vert N) = \left( {\displaystyle{{\partial G(x,N)} \over {\partial x}}} \right)_{x = 1}$$

(4)

$$\langle n(n - 1)\rangle = \left( {\displaystyle{{\partial ^2G(x,N)} \over {\partial x^2}}} \right)_{x = 1}$$

Equations (3) and (4) may be used to calculate $\sigma _n^2 $ .

To compute the generating function, we use the result derived by GMP, Eqn (1):

(5)

$$\eqalign{G(x,N) & = \displaystyle{x \over N} + \sum\limits_{n = 2}^N x^nP(n \vert N) \cr & = \displaystyle{x \over N} + \displaystyle{1 \over N}\sum\limits_{n = 2}^N x^n\sum\limits_{M = n - 1}^{N - 1} P(n - 1 \vert M)}$$

By rearranging the sums in the preceding equation, we may rewrite it as

(6)

$$G(x,N) = \displaystyle{x \over N} + \displaystyle{1 \over N}\sum\limits_{M = 2}^N x\sum\limits_{n = 1}^{M - 1} x^nP(n \vert M - 1)$$

(7)

$$G(x,N) = \displaystyle{x \over N} + \displaystyle{x \over N}\sum\limits_{M = 2}^N G(x,M - 1)$$

(8)

$$G(x,N) = \displaystyle{x \over N}(1 + G(x,1) + G(x,2) + \cdots + G(x,N - 1))$$

Using the previous equation, we can derive an iterative formula:

(9)

$$G(x,N + 1) = \displaystyle{{N + x} \over {N + 1}}G(x,N)$$

This allows us to write down the solution for the generating function:

(10)

$$G(x,N) = \displaystyle{{x(x + 1)(x + 2) \cdots (x + N - 1)} \over {N!}}$$

The numerator in the previous equation is a known mathematical function called Pochhammer's symbol:

(11)

$$(x)_N = x(x + 1)(x + 2) \cdots (x + N - 1) = \displaystyle{{\Gamma (x + N)} \over {\Gamma (x)}}$$

where $\Gamma (x)$ is the Gamma function (see Abramowitz and Stegun, Reference Abramowitz and Stegun1965).

3. RESULTS

The fact that the numerator of the expression given for G(x,N) in Eqn (10) is actually a well-known mathematical function is very useful. The coefficient of x ⁿ gives us $P(n\vert N)$ , and is known to be:

(12)

$$P(n \vert N) = \displaystyle{1 \over {N!}} \vert s(N,n) \vert $$

Let us explain what this equation means. The function s(N,n) is known as a Stirling number of the first kind, and arises in certain areas of mathematics. The book by Abramowitz and Stegun (Reference Abramowitz and Stegun1965) contains information on these numbers, and they are available as built-in functions in programs such as Maple and Mathematica. The website Wolfram MathWorld also contains useful information on both the Pochhammer symbols and the Stirling numbers. As usually defined some of the Stirling numbers are negative, so the magnitude (|s|) is a necessary part of the answer.

In Figure 1, we plot this distribution for several values of N. We reproduce the numerical results of GMP, and can see that even as N grows, the distribution has significant statistical weight only at rather small values of n.

Fig. 1. Plots of $P(n\vert N)$ , the probability that n moraines are preserved, given that N glacial advances have occurred, for various values of N. The exact result, Eqn (12), is used to construct the plots. (a) N = 5. (b) N = 10. (c) N = 30.

Next, the generating function (10) can be used to compute important properties of the distribution given by (12). These results are presented, for example, in the paper by Louchard (Reference Louchard2010), who considers exactly this problem. The average value of n is

(13)

$$\langle n\rangle = \psi (N + 1) + \gamma $$

Here $\psi (x)$ is the digamma function, and γ is Euler's constant, γ = 0.57721, …. The variance is given by

(14)

$$\sigma _n^2 = \psi (1,N + 1) + \psi (N + 1) + \gamma - \displaystyle{{\pi ^2} \over 6}$$

Here $\psi (1,x)$ is the first polygamma function. The digamma function, for integer values of its argument, is defined by

(15)

$$\psi (N) = \sum\limits_{k = 1}^{N - 1} \displaystyle{1 \over k}$$

The first polygamma function is defined by

(16)

$$\psi (1,N) = \sum\limits_{k = 0}^\infty (N + k)^{ - 2}$$

For more information on the digamma and polygamma functions, see Abramowitz and Stegun (Reference Abramowitz and Stegun1965).

In Figure 2, we plot both 〈n〉 and σ _n as functions of N. We can see how slowly 〈n〉 grows as N increases, and that σ _n is smaller than 〈n〉. In Figure 3, we show the ratio σ _n/〈n〉 as a function of N. It has a quite narrow range of values, and for N greater than about five, this ratio has a value in the range 0.38–0.40 over a wide range of N.

Fig. 2. Plot of 〈n〉 and σ as functions of N. The upper curve plots 〈n〉, the lower one plots σ. Equations (13) and (14) were used. In this figure, and in the next two figures, we plot continuous curves, even though N is of course restricted to integer values.

Fig. 3. Plot of the ratio σ/〈n〉 as a function of N.

In Figure 4, we plot 〈n〉, 〈n〉 + σ _n and 〈n〉 − σ _n as functions of N, to see the band of likely values for n, the number of preserved moraines. We see, for example, that for N = 10, n is likely to be between 1 and 4, and even for N = 20, n is likely to be in the range of 2–5.

Fig. 4. Plot of 〈n〉, 〈n〉 + σ and 〈n〉 − σ, as functions of N. This illustrates the range of likely values of n, the number of preserved moraines.

A very nice, simplifying result is presented by Louchard (Reference Louchard2010), for the large-N behavior of the average value of n:

(17)

$$\langle n\rangle \approx \ln (N) + \gamma + \displaystyle{1 \over {2N}}$$

If we plot the exact answer, Eqn (13) and the approximation (17), it turns out that they agree remarkably well even for N as small as N = 2. Thus, the simple, accurate approximation in Eqn (17) conveys the key idea that 〈n〉 grows as ln(N), which of course is much smaller than N as N becomes large. Louchard (Reference Louchard2010) also presents a simple result for the large-N behavior of the variance:

(18)

$$\sigma _n^2 \approx \ln (N) + \gamma - \displaystyle{{\pi ^2} \over 6} + \displaystyle{3 \over {2N}}$$

This is not quite as accurate as the previous equation at N = 2 or 3, but is still very good. Thus, the spread in values of n goes as $\surd \ln (N)$ as N gets large.

4. DISCUSSION

Gibbons and others (Reference Gibbons, Megeath and Pierce1984) invented an elegant probabilistic model to describe glacial moraine preservation in the presence of obliterative overlap. Their scenario captures important features of the geological situation, and is an interesting statistical model in its own right. As their calculations revealed, the typical number of surviving moraines is surprisingly small. We presented explicit, analytic solutions for key quantities of their model, including $P(n\vert N)$ , the mean value 〈n〉 and the variance $\sigma _n^2 $ . The probability distribution is given in terms of the Stirling numbers of the first kind. These explicit results help in understanding the consequences of the GMP model. In particular, the large-N estimates, that 〈n〉 goes as ln(N) and that σ _n goes as $\surd \ln (N)$ , make clear just how small the typical number of preserved moraines will be.

As GMP point out, the observed value of n can serve as a guide to the value of N. One way to do this is to study $P(N\vert n)$ , the probability that there were N glacial advances, given that we now observe n preserved moraines. We will conclude with a discussion of how to do this.

If we have a theory for $P(n\vert N)$ , such as that invented by GMP, but we are interested in $P(N\vert n)$ , it is natural to use Bayes' theorem (see, e.g., D'Agostini, Reference D'Agostini2003) to write

(19)

$$P(N \vert n)P(n) = P(n \vert N)P_0(N)$$

In this equation, $P_0(N)$ is called the prior; it gives the probability we assign to various values of N, before we have the information on n. Since P(n) is not a function of N, we may write

(20)

$$P(N \vert n) = cP(n \vert N)P_0(N)$$

where we select the constant c to normalize $P(N\vert n)$ on N.

To see how this works in a simple example, we can use the explicit result, Eqn (12). Suppose that we have three surviving moraines, so that n = 3. Then

(21)

$$P(N \vert 3) = c\displaystyle{{ \vert s(N,3) \vert} \over {N!}}P_0(N)$$

To complete the analysis, we need to choose a prior. For simplicity, we will assume our background knowledge tells us that N is between 2 and 12, and take $P_0(N)$ to be constant for 2 ≤ N ≤ 12, and zero otherwise. We stress that, by using expert knowledge about the behavior of glaciers, this simple prior could be replaced by a more detailed one. Figure 5 shows a plot of $P(N\vert3)$ , with our simple choice of prior. We can see that it has a broad peak with a maximum at N = 8. Note that the Bayesian approach, embodied in Eqn (20), is not restricted to the GMP model; a more sophisticated model for $P(n\vert N)$ could be used.

Fig. 5. Plot of $P(N\vert3)$ , the probability that there were N glacial advances, given that three moraines have survived. Equation (21) was used.

ACKNOWLEDGEMENTS

The author would like to thank Andrew Mugler for helpful discussions. This work was supported by the National Science Foundation (grant no. EAR 0112480).

References

REFERENCES

Abramowitz, M and Stegun, IA (1965) Handbook of mathematical functions. Dover, New York Google Scholar

Anderson, LS, Roe, GH and Anderson, RS (2014) The effects of interannual climate variability on the moraine record. Geology, 42, 55–58 Google Scholar

D'Agostini, G (2003) Bayesian reasoning in data analysis. World Scientific, Singapore Google Scholar

Gibbons, AB, Megeath, JD and Pierce, KL (1984) Probability of moraine survival in a succession of glacial advances. Geology, 12, 327–330; Correction, Geology 12, 5762.0.CO;2>CrossRef Google Scholar

Louchard, G (2010) Asymptotics of the Stirling numbers of the first kind revisited: a saddle point approach. Discrete Math. Theor. Comput. Sci., 12, 167–184 Google Scholar

Reichert, BK, Bengtsson, L and Oerlemans, J (2002) Recent glacial retreat exceeds internal variability. J. Clim., 15, 3069–3081 Google Scholar

Roe, GH (2011) What do glaciers tell us about climate variability and climate change? J. Glaciol., 57, 567–578 Google Scholar

Roe, GH and O'Neal, MA (2009) The response of glaciers to intrinsic climate variability: observations and models of late-Holocene variations in the Pacific Northwest. J. Glaciol., 55, 839–854 CrossRef Google Scholar

Wallace, PR (1972) Mathematical analysis of physical problems. Holt, Rinehart, Winston, New York Google Scholar

Young, NE, Schweinsberg, AD, Briner, JP and Schaefer, JM (2015) Glacial maxima in Baffin Bay during the Medieval Warm Period coeval with Norse settlement. Sci. Adv., 1, e1500806 Google Scholar

Fig. 1. Plots of $P(n\vert N)$, the probability that n moraines are preserved, given that N glacial advances have occurred, for various values of N. The exact result, Eqn (12), is used to construct the plots. (a) N = 5. (b) N = 10. (c) N = 30.