Efficiency of reversible MCMC methods: elementary derivations and applications to composite methods

Radford M. Neal; Jeffrey S. Rosenthal

doi:10.1017/jpr.2024.48

Efficiency of reversible MCMC methods: elementary derivations and applications to composite methods

Part of: Markov processes

Published online by Cambridge University Press: 18 September 2024

Radford M. Neal and

Jeffrey S. Rosenthal

Show author details

Radford M. Neal*: Affiliation:
University of Toronto
Jeffrey S. Rosenthal*: Affiliation:
University of Toronto
*: *Postal address: Department of Statistical Sciences, University of Toronto.
*Postal address: Department of Statistical Sciences, University of Toronto.

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

We review criteria for comparing the efficiency of Markov chain Monte Carlo (MCMC) methods with respect to the asymptotic variance of estimates of expectations of functions of state, and show how such criteria can justify ways of combining improvements to MCMC methods. We say that a chain on a finite state space with transition matrix P efficiency-dominates one with transition matrix Q if for every function of state it has lower (or equal) asymptotic variance. We give elementary proofs of some previous results regarding efficiency dominance, leading to a self-contained demonstration that a reversible chain with transition matrix P efficiency-dominates a reversible chain with transition matrix Q if and only if none of the eigenvalues of $Q-P$ are negative. This allows us to conclude that modifying a reversible MCMC method to improve its efficiency will also improve the efficiency of a method that randomly chooses either this or some other reversible method, and to conclude that improving the efficiency of a reversible update for one component of state (as in Gibbs sampling) will improve the overall efficiency of a reversible method that combines this and other updates. It also explains how antithetic MCMC can be more efficient than independent and identically distributed sampling. We also establish conditions that can guarantee that a method is not efficiency-dominated by any other method.

Keywords

Markov chain asymptotic variance efficiency domination Peskun ordering

MSC classification

Primary: 60J10: Markov chains (discrete-time Markov processes on discrete state spaces)

Secondary: 60J22: Computational methods in Markov chains

Type: Original Article
Information: Journal of Applied Probability , Volume 62 , Issue 1 , March 2025 , pp. 188 - 208

DOI: https://doi.org/10.1017/jpr.2024.48 [Opens in a new window]
Copyright: © The Author(s), 2024. Published by Cambridge University Press on behalf of Applied Probability Trust

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Andrieu, C. and Livingstone, S. (2021). Peskun–Tierney ordering for Markovian Monte Carlo: Beyond the reversible scenario. Ann. Stat. 49, 1958–1981.CrossRef Google Scholar

Bartlett, M. S. (1946). On the theoretical specification and sampling properties of autocorrelated time-series. Suppl. J. Roy. Stat. Soc. 8, 27–41.CrossRef Google Scholar

Bendat, J. and Sherman, S. (1955). Monotone and convex operator functions. Trans. Amer. Math. Soc. 79, 58–71.CrossRef Google Scholar

Bhatia, R. (1997). Matrix Analysis (Graduate Texts in Mathematics 169). Springer, New York.Google Scholar

Billingsley, P. (1968). Convergence of Probability Measures. Wiley, New York.Google Scholar

Brooks, S., Gelman, A., Jones, G. and Meng, X.-L. (eds) (2011). Handbook of Markov Chain Monte Carlo. Chapman & Hall, New York.CrossRef Google Scholar

Chan, K. S. and Geyer, C. J. (1994). Discussion: Markov chains for exploring posterior distributions. Ann. Stat. 22, 1747–1758.CrossRef Google Scholar

Frigessi, A., Hwang, C.-R. and Younes, L. (1992). Optimal spectral structure of reversible stochastic matrices, Monte Carlo methods and the simulation of Markov random fields. Ann. Appl. Prob. 2, 610–628.CrossRef Google Scholar

Gagnon, P. and Maire, F. (2023). An asymptotic Peskun ordering and its application to lifted samplers. Bernoulli 30, 2301–2325.Google Scholar

Geyer, C. (1992). Practical Markov chain Monte Carlo. Stat. Sci. 7. 473–483.Google Scholar

Green, P. J. and Han, X.-L. (1992). Metropolis methods, Gaussian proposals, and antithetic variables. In Stochastic Models, Statistical Methods and Algorithms in Image Analysis, eds P. Barone, A. Frigessi and M. Piccione. Springer, Berlin.CrossRef Google Scholar

Häggström, O. and Rosenthal, J. S. (2007). On variance conditions for Markov chain CLTs. Electron. Commun. Prob. 12, 454–464.CrossRef Google Scholar

Horn, R. A. and Johnson, C. R. (2013). Matrix Analysis, 2nd edn. Cambridge University Press.Google Scholar

Huang, L.-J. and Mao, Y.-H. (2023). Variational formulas for asymptotic variance of general discrete-time Markov chains. Bernoulli 29, 300–322.CrossRef Google Scholar

Kipnis, C. and Varadhan, S. R. S. (1986). Central limit theorem for additive functionals of reversible Markov processes and applications to simple exclusions. Commun. Math. Phys. 104, 1–19.CrossRef Google Scholar

Li, G., Smith, A. and Zhou, Q. (2023). Importance is important: A guide to informed importance tempering methods. Preprint, arXiv:2304.06251.Google Scholar

Löwner, K. (1934). Über monotone Matrixfunktionen. Math. Z. 38, 177–216.CrossRef Google Scholar

Mira, A. (2001). Ordering and improving the performance of Monte Carlo Markov chains. Stat. Sci. 16, 340–350.CrossRef Google Scholar

Mira, A. and Geyer, C. J. (1999). Ordering Monte Carlo Markov chains. Technical Report No. 632, School of Statistics, University of Minnesota.Google Scholar

Neal, R. M. (2004). Improving asymptotic variance of MCMC estimators: Non-reversible chains are better. Preprint, arXiv:math/0407281.Google Scholar

Neal, R. M. (2024). Modifying Gibbs sampling to avoid self transitions. Preprint, arXiv:2403.18054.Google Scholar

Peskun, P. H. (1973). Optimum Monte Carlo sampling using Markov chains. Biometrika 60, 607–612.CrossRef Google Scholar

Roberts, G. O. and Rosenthal, J. S. (2008). Variance bounding Markov chains. Ann. Appl. Prob. 18, 1201–1214.CrossRef Google Scholar

Suwa, H. (2022). Reducing rejection exponentially improves Markov chain Monte Carlo sampling. Preprint, arxiv:2208.03935.Google Scholar

Tierney, L. (1994). Markov chains for exploring posterior distributions. Ann. Stat. 22, 1701–1762.Google Scholar

Tierney, L. (1998). A note on Metropolis–Hastings kernels for general state spaces. Ann. Appl. Prob. 8, 1–9.CrossRef Google Scholar

Zanella, G. (2019). Informed proposals for local MCMC in discrete spaces. J. Amer. Stat. Assoc. 115, 852–865.CrossRef Google Scholar

Article contents

Efficiency of reversible MCMC methods: elementary derivations and applications to composite methods

Abstract

Keywords

MSC classification

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests