1. Introduction
Systems with multiple components that evolve randomly through self-reinforcement or reinforcement via interactions with other components of the system have been of great interest for a long time. Interacting urn models are often used to analyse such systems. Recently, there has been a lot of activity in the area of interacting urns [Reference Laruelle and Pagès8, Reference Crimaldi, Dai Pra, Louis and Minelli4, Reference Crimaldi, Dai Pra and Minelli5, Reference Dai Pra, Louis and Minelli6, Reference Benaïm, Benjamini, Chen and Lima2, Reference Sahasrabudhe13]. In simplest terms, an urn model or an urn process refers to a discrete-time random process that involves updating the configuration of an urn consisting of balls of different colours, according to some reinforcement rule. Here by the configuration of an urn at time t, we mean a vector whose ith element represents the number of balls of colour i in the urn at time t. The reinforcement process is assumed to be Markovian; that is, the reinforcement at any time depends only on the present urn configuration. The most common means of self-reinforcement is to select a ball uniformly at random from the urn at time t, and then, depending on the colour of the drawn ball, to add to or remove from the urn a certain number of balls of each colour. Such an urn model can be fully described by the initial configuration of the urn and the associated replacement matrix whose (i, j)th element is the number of balls of colour j added to or removed from the urn when a ball of colour i is drawn.
Traditionally, the study of urn models is classified based on the type of replacement matrix. For instance, the replacement matrix for the classical Pólya urn model is the identity matrix. That is, at every time-step a ball is drawn and is replaced in the urn along with another ball of the same colour. The asymptotic properties of the Pólya urn model have been studied extensively [Reference Mahmoud9]. The best-known result for the Pólya urn model is that the fraction of balls of each colour converges almost surely to a random limit, which is distributed according to a beta distribution with parameters depending on the initial configuration of the urn. An immediate generalisation of the two-colour Pólya urn model is the Friedman urn model [Reference Freedman7], where the chosen ball is replaced with $\alpha\geq 0$ balls of the same colour and $\beta >0 $ balls of the other colour. In this case, the fraction of balls of either colour approaches the deterministic limit $1/2$ with probability 1. Several other generalisations of the Pólya urn model have been studied; we refer the reader to [Reference Mahmoud9] for details.
1.1. General two-colour interacting urn model
In this paper, we study urn processes involving more than one urn. We define a general two-colour interacting urn model as a random process involving N urns, such that the reinforcement in each urn depends on all the urns or on a non-trivial subset of the given set of N urns. More precisely, suppose there are N urns with configurations $(W^t_i, B^t_i) $ , where $W^t_i$ and $B^t_i$ denote the number of white balls and black balls respectively, at time $t\geq 0$ in the ith urn, for every $i\in [N] \,{:\!=}\, \{1, \ldots, N \}$ . Let
be the proportion of white balls in the ith urn at time t and $\mathcal{F}_t = \sigma\!\left( Z_i^s\,{:}\, 0\leq s\leq t, i\in [N] \right)$ . Define $ (I_{i,W}^t, I_{i,B}^t) \,{:\!=}\, (W^t_i, B^t_i) -(W^{t-1}_i, B^{t-1}_i)$ to be the reinforcement in the ith urn at time t. We consider non-negative and finite reinforcement, that is, $0\leq I_{i,W}^t, I_{i,B}^t <\infty $ for every $i \in [N]$ and $t\geq 0$ , such that $\{(I_{i,W}^t, I_{i,B}^t )\}_{i\in [N]}$ are conditionally independent given $\mathcal{F}_{t-1}$ for every $t\geq 1$ . If the evolution of the ith urn depends on urns $\{i_1, \dots,i_{k_i} \} \subseteq [N]$ , then the distribution of $(I^t_{i,W}, I^t_{i,B})$ conditioned on $\mathcal{F}_{t-1}$ is determined by $Z^{t-1}_{i_1}, \dots, Z^{t-1}_{i_{k_i}}$ . We call the set $\{i_1, \dots,i_{k_i} \}$ the dependency set of the ith urn. In particular, in a graph-based general two-colour interacting urn model, a natural choice for the dependency set of an urn is the collection of urns in its neighbourhood.
Several special cases of the general two-colour interacting urn model have been studied recently. In [Reference Crimaldi, Dai Pra and Minelli5, Reference Dai Pra, Louis and Minelli6] and [Reference Sahasrabudhe13], interacting urn models were studied with Pólya- and Friedman-type reinforcements respectively. More precisely, in [Reference Crimaldi, Dai Pra and Minelli5, Reference Dai Pra, Louis and Minelli6] the authors take
and
for every $i\in [N]$ and some fixed $p\in [0,1]$ . Thus, the dependency set of every urn is the entire set of urns. In other words, the underlying network of interactions is a complete undirected graph on N vertices with self-loops at every vertex. Graph-based interactions in urn processes have been studied before in [Reference Aletti, Crimaldi and Ghiglietti1, Reference Benaïm, Benjamini, Chen and Lima2] and [Reference Van der Hofstad, Holmes, Kuznetsov and Ruszel14].
In this paper, we consider N interacting two-colour urns which are placed at the vertices of a finite directed graph. A ball is drawn independently and uniformly at random, simultaneously from every urn. Each urn then reinforces all the urns in its out-neighbourhood according to a reinforcement matrix R. More precisely, if
is a reinforcement matrix, then if a white ball is drawn from an urn, all its out-neighbours are reinforced with $\alpha$ white and $\delta$ black balls; and if the ball drawn is black, the out-neighbours are reinforced with $\gamma$ white and $\beta$ black balls. We restrict our discussion to a non-negative reinforcement matrix which is balanced; that is, $\alpha + \delta = \gamma + \beta$ .
1.2. Summary of results
We state our main results in Section 4. To study the asymptotic properties of $Z_i^t$ for the proposed model with reinforcement matrix R, we divide the discussion into two parts based on the type of reinforcement matrix, namely, Pólya type (when $\delta=\gamma=0$ ) and non-Pólya type (when $\gamma+\delta >0$ and $\alpha+\beta>0$ ). For non-Pólya-type reinforcement, we show that the fraction of white balls in the urns at vertices with non-zero in-degree converges to a deterministic limit almost surely. Furthermore, under certain conditions on the graph and the initial configuration of the urns we observe synchronisation; that is, the almost sure limit is the same across all urns at vertices with non-zero in-degree. We also obtain fluctuation theorems around the limiting vector.
For Pólya-type reinforcement and the case when $\alpha=\beta=0$ , we obtain results depending on the presence of the vertices with zero in-degree in the graph. When there are vertices with zero in-degree such that every vertex with non-zero in-degree in the graph can be reached from at least one vertex with zero in-degree, we show that the fraction of white balls in every urn converges to a deterministic limit almost surely. When there are no vertices with zero in-degree, we limit our discussion of Pólya-type reinforcement to a regular directed graph, that is, when the in-degree and the out-degree of every vertex equal a constant d. In this case we show that the fraction of balls of white colour in every urn converges to the same random limit almost surely, and we also obtain $\mathcal{L}^2$ -rates of synchronisation. However, the limiting distribution in this case is not known. On this note, we remark that while the reinforcement matrix is of Pólya type, at any given time-step an urn may be reinforced with balls of both colours, depending upon the balls drawn at that time from its in-neighbours. Thus, in our model the random process of a single urn is nowhere similar to the classical Pólya process, and we expect the problem of finding the limiting distribution to be fairly challenging.
We use the stochastic approximation method and the martingale method to study the interacting urn model proposed in this paper. The stochastic approximation method has been used before (see [Reference Zhang15, Reference Laruelle and Pagès8]) to obtain several interesting results for random processes with self-reinforcement or interactive reinforcement. We refer the reader to [Reference Borkar3] for details on stochastic approximation theory. For a more extensive discussion on various techniques used to study random processes with reinforcement (including urn models), we refer the reader to [Reference Pemantle11].
1.3. Outline of the paper
The rest of the paper is organised as follows: in Sections 2 and 3, respectively, we describe in detail the model and the corresponding stochastic approximation scheme. In Section 4, we state the main results. In Section 5, we present the proofs of all the results stated in Section 4. In Section 6 we discuss an application of our results to the study of an opinion dynamics model on finite directed networks.
1.4. Notation
Throughout the paper, we use the following notation: for a sequence of random variables $\{ X_t \}_{t \geq 0}$ and a random variable X, $X_t \xrightarrow{d} X$ means that $X_t$ converges to X in distribution, as $t \to \infty$ , and $X_t \xrightarrow{a.s.} X$ means that $X_t$ converges to X almost surely, as $t \to \infty$ . $N(\mu, \Sigma)$ denotes a normal random vector with mean vector $\mu$ and variance–covariance matrix $\Sigma$ . For a complex-valued sequence $\{a_t\}_{t\geq 0}$ and a real-valued sequence $\{b_t\}_{t\geq 0}$ , we write $a_t = \mathcal{O}(b_t)$ if there exists a positive constant C such that $| a_t| \leq C b_t $ for sufficiently large values of t. The symbol $\|\cdot\|$ denotes the standard $\mathcal{L}^2$ norm, and as defined before, [N] denotes the set $\{1,\dots, N\}$ . For a matrix M, $\lambda_{\min}(M)$ and $\lambda_{\max}(M)$ denote the minimum and the maximum, respectively, of the real parts of the eigenvalues of M. For an eigenvalue $\lambda$ , $\Re(\lambda)$ denotes the real part of $\lambda$ .
2. Model dynamics
Let $G=(V, E)$ be a simple directed graph with $V = [N]$ , $E\subseteq V\times V$ . For every vertex $i\in V$ , let $V(i) \,{:\!=}\, \{j \in V\,{:}\, (j,i)\in E\}$ denote the in-neighbourhood of i, and define $d_i^{\textrm{in}}\,{:\!=}\, |\{ j\in V\,{:}\, (j,i)\in E\}| = |V(i)|$ and $d_i^{\textrm{out}} \,{:\!=}\, | \{j\in V\,{:}\, (i,j)\in E \}|$ respectively to be the in-degree and out-degree of i. We write $i\to j$ if there is a directed edge from i to j and $i \rightsquigarrow j$ if there exists a path $i=i_0\to i_1\to \dots \to i_{k-1}\to i_k=j$ from i to j, for some $i_1,\dots, i_{k-1}\in V$ . The (i, j)th entry of the adjacency matrix A of G is equal to $\mathbb{I}_{i \to j}$ . Throughout this paper we use the term d-regular graph for a directed graph if $d_i^{\textrm{in}}= d_i^{\textrm{out}}= d$ for every $i\in V$ .
Suppose there is an urn at every vertex of G and that each urn contains balls of two colours, white and black. Let $(W^t_i, B^t_i)$ be the configuration of the urn at vertex i, where $W^t_i$ and $B^t_i$ denote the number of white balls and black balls respectively, at time $t\geq 0$ . Let $m \in \mathbb{Z}^+ $ and $\alpha, \beta \in \{0, 1, \dots, m\}$ be fixed; then given $\{(W^t_i, B^t_i)\}_{i\in V}$ , we update the configuration of each urn at time $t+1$ as follows:
A ball is selected uniformly at random from every urn simultaneously and independently of every other urn. The colours of these balls are noted and they are replaced in their respective urns. For every $i \in V$ , if the colour of the ball selected from the ith urn is white, then $\alpha$ white and $(m-\alpha)$ black balls are added to each urn j such that $i\to j$ ; and if the colour of the ball selected from the ith urn is black, then $(m-\beta)$ white balls and $\beta$ black balls are added to each urn j such that $i\to j$ .
That is, each urn i reinforces urn j such that $i\to j$ , according to the following reinforcement matrix:
Throughout this paper, we consider only graphs with no isolated vertices (an isolated vertex is any vertex i such that $d_i^{\textrm{in}} = d_i^{\textrm{out}} =0$ ) and no vertices with only self-loops. This is because urns at such vertices will have no interaction with any other urn in the graph. At an isolated vertex, the urn composition would remain constant, and the other case reduces to a single-urn process (independent of every other urn). The asymptotics of such single-urn processes have been studied extensively; we refer the reader to [Reference Mahmoud9] for details.
We call (1) a Pólya-type reinforcement if $\alpha= \beta= m$ (that is, when $R=mI$ ) and a non-Pólya-type reinforcement when $0<\alpha+\beta <2m$ . To simplify the notation, we define $a \,{:\!=}\, \alpha/m$ and $b \,{:\!=}\, \beta/m$ . Let
be the proportion of white balls in the ith urn at time t, and let $Z^t\,{:\!=}\, (Z_1^t, \dots, Z_N^t)$ . Define the $\sigma$ -field $\mathcal{F}_t \,{:\!=}\, \sigma \big( Z^0, Z^1, \ldots, Z^t \big)$ , and let $Y^t_i$ denote the indicator of the event that a white ball is drawn from the ith urn at time t. Then the distribution of $Y_i^{t}$ , conditioned on $\mathcal{F}_{t-1}$ , is given by
Note that conditioned on $\mathcal{F}_{t-1}$ , $\{Y_i^t\}_{i\in V}$ are independent random variables. The proposed model is a general two-colour interacting urn model, as defined in Section 1.1, where the dependency set of the ith urn is V(i) such that, given $\mathcal{F}_{t-1}$ ,
for $t\geq 1$ .
We divide the vertex set of the graph into two disjoint sets, say $ V=S\cup F$ , where $S = \{ i \in V\,{:}\, d_i^{\textrm{in}}=0\}$ and $F = \{ i \in V\,{:}\, d_i^{\textrm{in}}>0\}$ . We call the vertices in S stubborn vertices, as the urns at these vertices are not reinforced by any other urn, i.e. $V(i)=\emptyset$ ; we call F the set of flexible vertices. Clearly, for every $i\in S$
whereas for every $i \in F$ , we have
and
Let $ T_i^t = W_i^t +B_i^t$ be the total number of balls in the ith urn at time t. Then, for every $i\in V$ ,
Thus, the total number of balls in every urn at a flexible vertex is deterministic and increases linearly with time. In the next section, we briefly discuss stochastic approximation theory, a technique which is used in the later sections to obtain the asymptotic results for $Z^t$ .
3. Stochastic approximation scheme
A stochastic approximation scheme refers to a k-dimensional recursion of following type:
where $h\,{:}\, \mathbb{R}^k \to \mathbb{R}^k$ is a Lipschitz function, $\{\Delta M_t\}_{t\geq 1}$ is a bounded square-integrable martingale difference sequence, and $\{\gamma_t\}_{t\geq 1}$ are positive step sizes satisfying conditions that ensure that $ \sum_{t\geq 1} \gamma_t$ diverges, but slowly. More precisely, we assume the following:
-
(i) $\sum_{t\geq 1} \gamma_t = \infty$ and $\sum_{t\geq 1} \gamma_t^2 < \infty$ ;
-
(ii) there exists $ C>0$ such that $\mathbb{E}\left [\|\Delta M_{t+1}\|^2 \vert \mathcal{G}_t\right] \leq C$ almost surely $\forall t \geq 0$ , where $\mathcal{G}_t = {\sigma (x_0, M_1, \ldots, M_t)}$ ;
-
(iii) $\sup_{t\geq 0} \|x_t\| < \infty$ , almost surely;
-
(iv) $\{ \epsilon_t \}_{t\geq 1}$ is a bounded sequence such that $\epsilon_t \to 0$ , almost surely as $t\to \infty$ .
Then the theory of stochastic approximation says that the iterates of (4) converge almost surely to the stable limit points of the solutions of the ordinary differential equation given by $\dot{x}_t = h(x_t)$ . For explicit results and details on a standard stochastic approximation scheme (with $\epsilon_t = 0 \ \forall t \geq 0$ ), we refer the reader to Lemma 1 and Theorem 2 from Chapter 2 of [Reference Borkar3]. Also in Chapter 2, the author discusses various extensions of the standard stochastic approximation scheme, including the stochastic approximation scheme of the form (4). In this section, we use stochastic approximation theory to study the limiting behaviour of $Z^t$ and write a stochastic approximation scheme of the form (4) for $Z^t$ . We first establish some notation.
Without loss of generality, we take $F = \{1, \dots, |F|\}$ and $S = \{|F|+1,\dots, N\}$ . For a vector $g\in \mathbb{R}^N$ and $B \subseteq [N]$ , let $g_B$ denote the $|B|$ -dimensional vector obtained by restricting g to B. Similarly, for a matrix $M\in \mathbb{R}^{N \times N}$ and $B \subseteq [N]$ , let $M_B$ denote the $|B| \times |B|$ matrix obtained by restricting M to the index set $B \times B$ . By $\textbf{1}$ and $\textbf{0}$ we denote matrices of appropriate dimension with all entries equal to 1 and to 0, respectively. We write $Z^t = (Z_1^t, \dots, Z_N^ t)= (Z_F^t, Z_S^0)$ as a row vector and define $N \times N$ matrices
Then
where $\textbf{T}_F^t$ and $\textbf{D}_F$ are both $|F| \times |F|$ matrices. Note that for $i\in S$ there is no $j\in V$ such that $j\to i$ ; therefore we can decompose the adjacency matrix A of the graph as follows:
where $ A_F $ is a $ |F|\times |F|$ matrix and $A_{S,F}$ is an $|S|\times |F|$ matrix. We define the scaled adjacency matrix
where the (i, j)th entry of $\tilde{A}$ is equal to
whenever $d_j^{in}>0$ , and is equal to 0 otherwise. We now write the evolution of $Z^t_i$ , for $i \in F$ , as a stochastic approximation scheme. From (2) and (3) we get
Define $ \Delta M_i^{t+1} \,{:\!=}\, (a+b-1) \left( Y_i^{t+1} - \mathbb{E}\left[Y_i^{t+1}\vert \mathcal{F}_t \right] \right)$ ; then we can write
Let $\Delta M^t \,{:\!=}\, ( \Delta M_1^t, \dots, \Delta M_{N}^t )$ be the martingale difference vector in $\mathbb{R}^N$ . We can write the recursion from (5) in vector form as follows:
where $h\,{:}\,[0,1]^{|F|} \to \mathbb{R}^{|F|}$ is such that
Here $\left((z, Z_S^0) \tilde{A}\right)_F = z\tilde{A}_F + Z_S^0 \tilde{A}_{S,F}$ , for $z\in [0,1]^{|F|}$ . The above recursion can be written as
where
Thus, we get a recursion of the form (4) given by
where $\gamma_t = \dfrac{1}{t}$ and $(\Delta M^{t+1}\tilde{A})_F$ is a bounded martingale difference sequence. Using (3), it can easily be verified that $\epsilon_t \to 0$ as $t \to \infty$ and h is a Lipschitz function. Thus, the recursion in (7) satisfies the required assumptions (i)–(iv). Now, from stochastic approximation theory, we know that the limit points of the recursion in (7) almost surely coincide with the asymptotically stable equilibria of the ordinary differential equation given by
Therefore, one can analyse the limiting behaviour of the interacting urn process by analysing the zeroes of the h function and the eigenvalues of its Jacobian. Furthermore, we use stochastic approximation results from [Reference Zhang15] to establish central limit theorems for $Z_F^t$ .
4. Main results
In this section we state our main results. In Section 4.1 we state an almost sure convergence theorem for $Z_F^t$ and show that synchronisation occurs under certain conditions on the initial configuration of the urns and on the structure of the underlying graph. Recall that by synchronisation we mean that the proportion of white balls converges to the same limit for every urn. Furthermore, in Section 4.2 we state the central limit theorems for $Z_F^t$ for a non-Pólya-type reinforcement. In Section 4.3, we consider a special case, namely Pólya-type reinforcement on a regular graph, and show that $Z_F^t$ converges to a random vector almost surely. In addition, the urns synchronise in the sense that $Z_i^t$ converges to the same random variable for every $i\in F$ .
4.1. Convergence and synchronisation results
Theorem 4.1. Suppose that either of the following conditions holds:
-
(i) The reinforcement is of non-Pólya type, that is, $R\neq mI$ and $a+b>0$ .
-
(ii) $S\neq \emptyset$ is such that for every $f \in F$ there exists $s \in S$ such that $s \rightsquigarrow f$ .
Then $I-(a+b-1) \tilde{A}_F$ is invertible and $ Z_F^t \longrightarrow z^*$ almost surely as $t\to \infty$ , where
Remark 4.1. Theorem 4.1 does not address the case when $S = \emptyset$ and $|a+b-1|=1$ ; in this case the corresponding stochastic approximation scheme as in (7) holds with
Clearly, $h(z)=\textbf{0}$ has a unique solution whenever $I-(a+b-1)\tilde{A}_F$ is invertible. In fact, when $I-(a+b-1) \tilde{A}_F$ is not invertible, $Z_F^t$ need not admit a deterministic limit. However, even in such cases we expect that on a strongly connected graph, $ Z^{t}$ converges almost surely to a random limit.
One straightforward conclusion from the above theorem is that $Z^t \to (z^*, Z_S^0)$ almost surely, as $t\to \infty$ . Furthermore, for synchronisation to occur among the set of flexible vertices F, we expect the reinforcement at any two flexible vertices to be similar. This manifests itself as a condition that $Z_S^0 \tilde{A}_S$ is a constant vector and that for $i, j \in F$ , $i \neq j$ ,
More precisely, we have the following result.
Corollary 4.1. (Synchronisation.) Suppose $Z^0_S \tilde{A}_{S,F} = c_1\textbf{1} $ and $\textbf{1} \tilde{A}_F =c_2 \textbf{1}$ , for some constants $c_1, c_2 \in [0,1]$ ; then under the condition (i) or (ii) of Theorem 4.1,
In particular, if reinforcement is of non-Pólya type and $S=\emptyset$ , then as $t \to \infty$
4.2. Fluctuation results
We now state the fluctuation theorems for $Z_F^t$ around the almost sure limit $z^*$ . Define
where I is an $|F|\times |F|$ identity matrix. As we will see in the results below, the scaling for fluctuation theorems depends on $\rho$ . In the case when $0<\rho < 1/2$ , there exist finitely many complex random vectors $\xi_1,\dots, \xi_l$ such that $(Z_F^t-z^*)$ scaled appropriately can be almost surely approximated by a weighted sum of $\xi_1,\dots, \xi_l$ . The scaling in this case depends explicitly on $\rho$ . For details we refer the reader to Theorem 2.2 of [Reference Zhang15]. In this paper, we only discuss the cases $\rho > 1/2$ and $\rho = 1/2$ and obtain Gaussian limits with appropriate scaling.
Theorem 4.2. Suppose $ Z_F^t \longrightarrow z^*$ almost surely as $t\to \infty$ and $\rho >1/2$ ; then
with
where H is as defined in (9) and $\Theta$ is an $N\times N$ diagonal matrix such that
Corollary 4.2. Suppose the reinforcement is of non-Pólya type and $\tilde{A}=\tilde{A}^\top$ . Then for $\rho >1/2$ , Theorem 4.2 holds with
where
Theorem 4.3. Suppose $ Z_F^t \longrightarrow z^*$ almost surely as $t\to \infty$ and $\rho=1/2$ ; then
with
for H as defined in (9) and $\Theta$ as defined in Theorem 4.2.
Corollary 4.3. Suppose the reinforcement is of non-Pólya type and $\tilde{A}=\tilde{A}^\top$ . Then for $\rho =1/2$ , Theorem 4.3 holds with
where J is an $|F|\times |F|$ matrix with all elements equal to 1 and C(a, b) is as defined in Corollary 4.2.
Remark 4.2. (Friedman-type reinforcement.) As an immediate consequence of Corollary 4.1, we get that with $S= \emptyset$ for the Friedman-type reinforcement, that is, when $a=b \in (0,1)$ , $Z_F^t \longrightarrow \frac{1}{2} \textbf{1} $ almost surely. Furthermore, in this case C(a, b) simplifies and Corollary 4.2 and Corollary 4.3 hold with $C(a,b) = \left( a-\frac{1}{2} \right)^2 $ .
4.3. Convergence results for Pólya-type reinforcement
In this section, we consider the Pólya-type reinforcement with the following assumptions:
-
(A1) G is connected and d-regular, and the scaled adjacency matrix $\tilde{A} = \frac{1}{d} A$ is diagonalisable over $\mathbb{C}$ . More precisely, there exists an invertible matrix P such that $\tilde{A} = P \Lambda P^{-1} $ for a diagonal matrix $ \Lambda$ .
-
(A2) The initial number of balls in each urn is the same; that is, $\textbf{T}^0 = T^0 I$ , for some $ T^0 \geq 1$ .
Note that for a d-regular graph we have $S=\emptyset$ and $F= V$ ; therefore, to simplify the notation, we remove the subscript F throughout this section. In this case, the associated ordinary differential equation obtained from (8) reduces to $ \dot{z} = h(z) = -z( I- \tilde{A}).$ The equilibrium points of this ordinary differential equation are the left eigenvectors of $\tilde{A}$ corresponding to the eigenvalue 1. Observe that these equilibrium points are not stable, as one of the eigenvalues of $\frac{\partial h}{\partial z}$ is 0. Therefore this case requires a different approach. Specifically, we use martingale theory to obtain convergence results.
Under the assumptions (A1) and (A2)
We define
Theorem 4.4. Suppose the reinforcement scheme is of Pólya type, that is, $R = m I$ , and the assumptions (A1) and (A2) hold. Then
Corollary 4.4. Let $\hat{Z}^t = Z^t\tilde{A}$ be the vector of neighbourhood averages of the proportion of white balls in the urns. Then in the setting of Theorem 4.4, $\mathop{\textrm{Var}}\!(\hat{Z}^{t}- \bar{Z}^t \textbf{1}) \longrightarrow \bf{0}_{N\times N} $ , as $t \to \infty.$
The following theorem establishes that for every $i \in V$ , $Z^t_i$ converges almost surely to the same limiting random variable.
Theorem 4.5. (Synchronisation.) Suppose the reinforcement scheme is of Pólya type, that is, $R = m I$ , and the assumptions (A1) and (A2) hold. Then there exists a finite random variable $Z^\infty$ such that as $t \to \infty$ ,
5. Proofs of the main results
In this section, we prove all the results from Section 4. To prove the results from Section 4.1, we use stochastic approximation theory, which was discussed in Section 3. We start with the following lemma.
Lemma 5.1. Let M be an $n \times n$ real non-negative matrix such that each column sum of M is less than or equal to 1. Then $rM-I$ is invertible for all $ r\in \mathbb{R}$ whenever $|r| <1$ .
The above lemma follows from the Perron–Frobenius theorem; however, for the sake of completeness we include a proof here.
Proof of Lemma 5.1. Note that the case $r=0$ is trivial. For $r\neq 0$ , suppose the matrix $(r M-I)$ is not invertible. Then there exists a vector ${\bf w} = (w_1, \ldots, w_n) \in \mathbb{C}^n$ such that $\textbf{w} (r M-I) =\textbf{0}$ , that is, $ \textbf{w} M = \frac{1}{r} \textbf{w}, $ which implies that for every $j\in \{1,\dots, n\}$ ,
The last step follows since each column sum of M is less than or equal to 1. Now if $j = \textrm{argmax}\{|w_k|;\ k =1,\ldots, n\}$ , then from (14) we must have $1/|r| \leq 1$ , which contradicts our assumption of $|r|<1$ .
Proof of Theorem 4.1. As discussed in Section 3, it is enough to study the corresponding ordinary differential equation in (8), of the stochastic approximation scheme for $Z^t_F$ obtained in (7). We know that a point $z^* \in [0, 1]^{|F|}$ is an equilibrium point of the associated ordinary differential equation $\dot{z} = h(z)$ if $h(z^*) = 0$ . For the h function given in Equation (6), we have $h(z^*) = 0$ , if and only if
Thus the unique equilibrium point is given by
whenever the matrix $I- (a+b - 1)\tilde{A}_F $ is invertible. We now show that under the assumptions of the theorem, this is indeed true.
-
(i) When $R\neq mI$ and $a+b>0$ we have $|a+b-1|<1$ . Furthermore, since each column sum of $\tilde{A}_F$ is less than or equal to 1, the invertibility of the matrix $I- (a+b - 1)\tilde{A}_F $ follows from Lemma 5.1.
-
(ii) We now show that if for every $f \in F$ there exists $s \in S$ such that $s \rightsquigarrow f$ , then $I-(a+b-1) \tilde{A}_F$ is invertible. Observe that the graph G restricted to the set of flexible vertices F can be partitioned into strongly connected components $F_1, \dots, F_k$ . Then the scaled adjacency matrix can be written as an upper block-triangular matrix as follows:
\begin{equation*} \tilde{A}_F =\left(\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}A_{F_1} & A_{F_1,F_2} & \dots & A_{F_1,F_k}\\ \\[-7pt]0& A_{F_2} & \dots & A_{F_2,F_k}\\ \\[-7pt]\vdots &\vdots & \ddots & \vdots \\ \\[-7pt]0&\dots & 0& A_{F_k}\end{array}\right) \textbf{D}_F^{-1}\,{=\!:}\, \left(\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}\tilde{A}_{F_1} & \tilde{A}_{F_1,F_2} & \dots & \tilde{A}_{F_1,F_k}\\ \\[-7pt]0&\tilde{A}_{F_2} & \dots & \tilde{A}_{F_2,F_k}\\ \\[-7pt]\vdots &\vdots & \ddots & \vdots \\ \\[-7pt]0&0&\dots & \tilde{A}_{F_k}\end{array}\right), \end{equation*}where $\tilde{A}_{F_i, F_j}$ is an $|F_i|\times |F_j|$ matrix and $\tilde{A}_{F_i}=\tilde{A}_{F_i, F_i}$ . Since each $F_1,\dots, F_k$ is strongly connected, $\tilde{A}_{F_i}$ is an irreducible matrix for every $1\leq i\leq k$ and the matrix $I- (a+b-1) \tilde{A}_F $ is of the form\begin{equation*}\left(\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}I_1- (a+b-1)\tilde{A}_{F_1} &- (a+b-1) \tilde{A}_{F_1,F_2} & \dots & - (a+b-1)\tilde{A}_{F_1,F_k}\\ \\[-7pt]0&I_2- (a+b-1)\tilde{A}_{F_2} & \dots & - (a+b-1)\tilde{A}_{F_2,F_k}\\ \\[-7pt]\vdots &\vdots & \ddots & \vdots \\ \\[-7pt]0&0&\dots & I_k- (a+b-1)\tilde{A}_{F_k}\end{array}\right), \end{equation*}where $I_j$ is the $|F_j| \times |F_j|$ identity matrix. By Schur’s complement, we know that $I- (a+b-1) \tilde{A}_F$ is invertible whenever each block matrix $ I_1-(a+b-1)\tilde{A}_{F_1}, \dots, I_k-(a+b-1)\tilde{A}_{F_k}$ is invertible. Since every $\tilde{A}_{F_j}$ is an irreducible and sub-stochastic matrix (sub-stochasticity follows from the hypothesis in Part (ii) of the theorem), we have that $-1<\lambda_{\min}(\tilde{A}_{F_j})\leq \lambda_{\max}( \tilde{A}_{F_j})<1$ for every j. Hence $ I_j-(a+b-1)\tilde{A}_{F_j}$ is invertible for every $1\leq j\leq k$ .
Finally, using the above arguments we can also conclude that all the eigenvalues of the Jacobian matrix $\frac{\partial h}{\partial z} = (a+b-1)\tilde{A}_F-I$ have negative real parts in both the cases. Therefore $z^*$ is uniformly stable, and this concludes the proof.
Proof of Corollary 4.1. Under the assumptions, the unique solution of (15) is $z^*=c\textbf{1} $ , for
For non-Pólya-type reinforcement we have $0< a+b \neq 2$ , and for $S=\emptyset$ we get $c_1=0$ and $c_2 =1$ . Thus the unique equilibrium point $z^*$ simplifies to
We now prove the scaling limit theorems using tools and results from [Reference Zhang15]. Observe that for $\rho >0$ (where $\rho$ is as defined in (9)), the assumptions 2.2 and 2.3 made in [Reference Zhang15] are satisfied. That is, for some $\delta >0$ ,
and for every $\epsilon >0$ ,
Furthermore, as we shall see in the proofs below,
exists.
As mentioned before, the scaling for the limit theorems is given by the regimes of $\rho$ , where $\rho$ is as defined in (9). We remark that $\rho$ depends on the eigenvalues of $\tilde{A}_F$ as well as on $a+b-1$ , which is in fact the non-principal eigenvalue of the scaled reinforcement matrix $\frac{1}{m} R$ . In particular, we have the following two cases:
-
1. When $\textbf{a + b} - \textbf{1 > 0}$ , we get $\rho = 1-(a+b-1) \lambda_{\max}(\tilde{A}_F).$ Therefore
\begin{align*}& \rho>1/2 \iff \lambda_{\max}(\tilde{A}_F) \in \left[-1, \frac{1}{2(a+b-1)}\right) \\ \text{and} \qquad & \rho=1/2 \iff \lambda_{\max}(\tilde{A}_F) = \frac{1}{2(a+b-1)}.\end{align*} -
2. When $\textbf{a + b} \boldsymbol{-} \textbf{1 < 0}$ , we get $\rho = 1-(a+b-1) \lambda_{\min}(\tilde{A}_F).$ Therefore
\begin{align*}&\rho>1/2 \iff \lambda_{\min}(\tilde{A}_F) \in \left(\frac{1}{2(a+b-1)}, 1\right] \\\text{and} \qquad &\rho=1/2 \iff \lambda_{\min}(\tilde{A}_F) = \frac{1}{2(a+b-1)}.\end{align*}
We now give the proofs of the limit theorems.
Proof of Theorem 4.2. The limit theorem in the case of $\rho>1/2$ holds with scaling $\sqrt{t}$ (see Theorem 2.3 in [Reference Zhang15]), with the limiting variance matrix given by
where
Note that
Now using the fact that for $i\neq j$ , $Y_i^t$ and $Y_j^t$ are conditionally independent, we get
where $\Theta$ is a diagonal matrix given by
Therefore we get
Proof of Corollary 4.2. For $\tilde{A} = \tilde{A}^\top$ , we have $S =\emptyset$ and $\tilde{A}_F = \tilde{A}$ . Therefore, for non-Pólya-type reinforcement, by Corollary 4.1,
and $\Theta= C(a,b)I$ . Then
where
Furthermore, $\tilde{A}$ has a spectral decomposition
where P is an $N \times N$ real orthogonal matrix and $1, \lambda_1, \dots, \lambda_{N-1}$ are eigenvalues of $\tilde{A}$ . Since H commutes with $\tilde{A}$ , the limiting variance matrix is given by
This completes the proof.
Proof of Theorem 4.3. In the case when $\rho =1/2$ , the asymptotic normality holds with scaling $\sqrt{\dfrac{t}{\log t}}$ (see Theorem 2.1 in [Reference Zhang15]) and the limiting variance matrix given by
where $\Gamma = (\tilde{A}^\top \Theta \tilde{A})_F $ .
Proof of Corollary 4.3. For $\tilde{A} = \tilde{A}^\top$ , as in the proof of Corollary 4.2, we get that $\Theta = C(a,b)I$ and $\tilde{A}_F =\tilde{A}$ is a symmetric matrix. Therefore, in the case of $\rho=1/2$ , using the spectral decomposition as in (16), the limiting variance matrix is given by
where
Since for $\rho =1/2$ , $ 1-2(a+b-1)\lambda_i >0$ for every $i=1,\dots, N-1$ , we get
Thus,
and we get
Since the normalised eigenvector corresponding to the maximal eigenvalue 1 of $\tilde{A}$ is $\frac{1}{\sqrt{N}}\textbf{1}$ , we get
where $J= \textbf{1} ^\top \textbf{1} $ is an $N \times N$ matrix with all elements equal to 1.
We now prove the results for Pólya-type reinforcement stated in Section 4.3. Our main aim (as stated in Theorem 4.5) is to show that the process $\{Z_i^t\}_{t\geq 0}$ converges almost surely to a finite random variable $Z^\infty$ for every $i\in [N]$ . In order to show this, we first prove that $Z^t-\bar{Z}^t \textbf{1} $ converges almost surely to $\textbf{0}$ , using the convergence result for almost supermartingales [Reference Robbins and Siegmund12]. Then in Lemma 5.2 and Theorem 4.5, we show that both $Z^t$ and $\bar{Z}^t$ admit an almost sure limit, using convergence theorems for quasi-martingales [Reference Métivier10] and martingales respectively.
Proof of Theorem 4.4. Recall that under the assumption (A2) of the theorem, $ \textbf{T}^t = (T^0 + tmd) I = T^t I$ . Define $\Phi_i^t = Z_i^t-\bar{Z}^t$ ; then in the vector notation we have
where $\Phi^t=\left(\Phi^t_1, \ldots, \Phi^t_N\right )$ , $K = I-\dfrac{1}{N}J $ , and J is the $N\times N$ matrix with all entries equal to 1. We want to show that each element of $\mathop{\textrm{Var}}\!(\Phi^t) = \mathop{\textrm{Var}}\!(Z^{t}- \bar{Z}^t \textbf{1}) \to 0$ and $\Phi^t \xrightarrow{a.s.}0$ . To this end, we write a recursion for $\mathop{\textrm{Var}}\!\left(\Phi^{t+1}\right) $ using the law of total variance given by
From Equation (2) we get
where $U_t = \dfrac{1}{T^{t+1}} (T^t I +m A )$ . This gives
and
For the second term of the expression for $\mathop{\textrm{Var}}\!\left(\Phi^{t+1}\right)$ in Equation (18), we have
where $\textrm{Diag}\!\left( x_i \right)_{1\leq i\leq N} $ denotes the $N\times N$ diagonal matrix with ith diagonal element equal to $x_i$ . Thus
where $V_z^t = \textrm{Diag}\big(\mathbb{E}[Z_i^{t} (1-Z_i^t)]\,\big)_{1\leq i\leq N} $ . Substituting the quantities from equations (21) and (22) in Equation (18), we get
Iterating this we get
where the last equality follows from the fact that
We can write
where $\tilde{A} = \frac{1}{d} A$ is the scaled adjacency matrix. Using the assumption (A1), we get that there exists a matrix P such that
where $1, \lambda_1,\cdots, \lambda_{N-1}$ are the N eigenvalues of $\tilde{A}$ . Since $\tilde{A}$ is irreducible with column sums equal to 1, by Perron–Frobenius, we must have $\Re(\lambda_i)<1$ for all $i=1, \dots, N-1$ . Thus for $j \geq 0.$
Let $S^j \,{:\!=}\, LP^\top A^\top V_z^ j APL$ ; then from (23) and (24) we get that $\mathop{\textrm{Var}} \!(\Phi^{t+1}) = (P^{-1})^\top \Omega^t P^{-1}$ , where $\Omega^t$ equals
The (l, n)th element of $\Omega^t$ is
Using Euler’s approximation, for every $l = 2, \dots, N$ we get
Since the elements of the matrix $S^j$ are bounded, for the non-zero elements of $\Omega^t$ we have
Let $\lambda_{l,n} = \Re(\lambda_{l-1})+\Re(\lambda_{n-1})$ . When $\lambda_{l,n} <1$ , we use the approximation
If $\lambda_{l,n} =1$ , then
Finally, if $\lambda_{l,n} >1$ , then $\sum_{j=1}^t j^{-\lambda_{l,n}}<\infty$ ; therefore,
Since $\lambda_{l,n} <2$ and P and $P^{-1}$ have bounded elements, $\mathop{\textrm{Var}}\!(Z^{t}- \bar{Z}^t \textbf{1}) \to \textbf{0}_{N\times N}$ as $t \to \infty$ . Now from Equation (20) we get
Hence $\Phi^t= Z^{t}- \bar{Z}^t \textbf{1} \to \textbf{0}$ as $t \to \infty$ in $\mathcal{L}^2$ .
Since $0\leq \mathbb{E}((\Phi^t_i)^2) \leq \mathbb{E}(\Phi^t_i) \to 0$ , we get $\mathbb{E}[ \|\Phi^t\|^2] \to 0$ . Therefore, to show that $\Phi^t$ converges almost surely to $\textbf{0}$ , it is enough to show that $\|\Phi^t\|^2 $ admits an almost sure limit:
where $\Delta M_\Phi^{t+1}= m \!\left(Y^{t+1} - \mathbb{E} \left[Y^{t+1}\Big\vert\mathcal{F}_t \right]\right) AK$ is a martingale difference sequence. Therefore we have
where
and the last inequality follows from the fact that $ 2I-\tilde{A}-\tilde{A}^\top$ is a positive semidefinite matrix. Since $\eta_t$ is a bounded random variable, from Equation (27) we get that $\|\Phi^t\|^2$ is an almost supermartingale; therefore $\|\Phi^t\|^2 $ converges almost surely (see Theorem 1 in [Reference Robbins and Siegmund12]). This concludes the proof of the theorem.
Proof of Corollary 4.4. Let $\Psi^t = Z^t \!\left (\tilde{A} - \frac{1}{N} J\right ) = Z^t K' $ , where
with $L' = \textrm{diag} \!\left( 0, \lambda_1, \dots, \lambda_{N-1}\right)$ . The proof follows from the same argument as in the proof of Theorem 4.4, with K $^{\prime}$ and L $^{\prime}$ taking the place of K and L respectively.
Recall that Theorem 4.5 says that for every $i \in V$ , $Z_i^t$ converges to the same random limit. To prove this, we use the fact that $Z_i^t- \bar{Z}^t$ converges to 0 almost surely (Theorem 4.4 proved above), and we show that both $Z^t$ and $\bar Z^t$ admit an almost sure limit. From Equation (19), we have
Therefore, the N-dimensional process $\{Z^{t}\}_{t\geq 0}$ is a martingale if and only if $\tilde{A} = I$ , that is, when each node is isolated and has a self-loop. While $\{Z^{t}\}_{t\geq 0}$ is not a martingale in general, in the next lemma we show that $Z^t$ admits an almost sure limit.
Lemma 5.2. Under the assumptions (A1) and (A2), $Z^t$ admits an almost sure limit.
Proof. Using Theorem 11.7 in [Reference Métivier10], it is enough to show that process $(Z^t)_{t\geq 0}$ satisfies the following two conditions:
-
(i) $\sum_{t=0}^\infty \mathbb{E}\left [\| \mathbb{E}[Z^{t+1} | \mathcal{F}_t]-Z_t \|\right ] <\infty$ ;
-
(ii) $\sup_{t\geq 0} \mathbb{E} [\|Z^t\|] <\infty.$
Note that $Z^t$ satisfies (ii) trivially. Let $\Phi_t$ and $\Psi_t$ be as defined in the proofs of Theorem 4.4 and Corollary 4.4 respectively. Now,
where the last inequality follows by Jensen’s inequality. The fact that the first two terms of (28) are finite follows from Theorem 4.4 and Corollary 4.4. We will now show that the last term in (28) is also finite. We have
Using the assumptions (A1) and (A2) we get
where $Q^t$ is an $N \times N$ diagonal matrix with
and $Q^t(1,1)= 0$ . This implies $ \| \mathbb{E}[\Psi^t - \Phi^t]\| = \mathcal{O}\left (t^{\Re(\lambda_{(2)})-1}\right )$ . Thus the sum in (28) is finite. This completes the proof.
Proof of Theorem 4.5. From Equation (19) we have
since $\tilde{A} \textbf{1}^\top =\textbf{1}^\top$ for a regular graph. Thus, $\bar{Z}^t$ is a bounded martingale; therefore by the martingale convergence theorem, there exists a finite random variable $Z^\infty$ such that $\bar{Z}^t \to Z^\infty $ almost surely. Using Theorem 4.4 and Lemma 5.2 we conclude that $Z^t \to Z^\infty \textbf{1} $ almost surely.
Observe that for Pólya-type reinforcement, convergence and synchronisation results are obtained only for regular graphs. This restriction was needed to obtain explicit expressions by using the symmetry and hence the spectral decomposition of the adjacency matrix. We believe that the above results can be extended to more general graphs using similar ideas.
6. Application to opinion dynamics on networks
In this section, we briefly demonstrate how these models can be used for modelling opinion dynamics on networks. We consider urns at vertices of a directed graph and assume that urns represent individuals, with the proportion of white balls (respectively, black balls) in an urn quantifying the positive (respectively, negative) inclination of that individual on a fixed subject. The graph structure determines the interactions between the individuals. That is, if there is an edge from node i to node j, the individual/urn at node i can influence the individual/urn at j via a chosen (but fixed) reinforcement matrix. We define the opinion of an individual i at time t by $O^t_i = \mbox{Sign }\!(Z^t_i-1/2)$ , where for convenience we assume $\mbox{Sign} (0) = 1$ . The asymptotic results for the urn process obtained in this paper can be used to study the evolution of opinions defined this way. Note that this process of evolution of opinions is very different from the traditional voter model or its extensions. In our model, the opinion of an individual does not flip frequently; rather, it evolves slowly, as inclinations change depending on neighbourhood interactions. For instance, at time t consider an individual i with opinion 0 (and $Z_i^t << \frac{1}{2}$ ) such that all of her in-neighbours have opinion 1. After the reinforcement at time $t+1$ , it is quite probable that the positive inclination $Z_i^{t+1}$ of the individual at i gets closer to $\frac{1}{2}$ but the opinion $O_i^{t+1}$ may still remain 0. Thus, the opinion evolution model based on the interacting urn process on a network discussed in this paper models more realistic human behaviour.
The convergence results for the urn process could be used to answer some interesting questions about opinion evolution on a network. Given the reinforcement matrix, we can determine whether the limiting opinion of the majority is 1 or 0. Put differently, we can find conditions on the reinforcement matrix such that the limiting opinion of the majority is 1 or 0. To illustrate this we consider the following example.
Example 6.1. Consider a directed star graph $G=(V, E)$ on N vertices such that the center vertex is labelled 1 and the rest of the vertices are labelled $\{2, 3, \dots, N \}$ . Then 1 is the only flexible vertex, with in-degree $N-1$ and $S= \{2, 3, \dots, N \}$ . Suppose the reinforcement matrix is $R = \begin{pmatrix} \alpha &\quad m-\alpha \\ m-\beta &\quad \beta \end{pmatrix}$ for $\alpha, \beta \in \{0,1,\dots, m\}$ . Then from Theorem 4.1 we get that as $t\to \infty$
where
Thus the asymptotic opinion of vertex 1, that is, $\mbox{Sign}\!\left(z^*- \frac{1}{2}\right)$ , depends on a, b and the average initial inclinations of stubborn individuals.
Acknowledgements
We would like to thank Antar Bandyopadhyay for introducing us to this problem and for his insightful remarks. We would also like to thank the reviewers for their valuable comments, which led to significant improvement of this work.
Funding information
The work of G. Kaur was supported by NUS Research Grant R-155-000-198-114. The work of N. Sahasrabudhe was supported in part by the DST-INSPIRE Faculty Fellowship from the Government of India and in part by CEFIPRA Grant No. IFC/DST-Inria-2016-01/448, ‘Machine Learning for Network Analytics’.
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.