1. Introduction
Over the past few decades, Markov chain Monte Carlo (MCMC) methods have become an enormously popular computational tool, enabling practitioners to conveniently sample from complicated target distributions [Reference Brooks, Gelman, Jones and Meng5, Reference Meyn and Tweedie21, Reference Robert and Casella26]. This popularity can be attributed to easy-to-implement accept–reject-based MCMC algorithms, which require the target density only up to a proportionality constant. Here, draws from a proposal kernel are accepted with a certain acceptance probability. The choices of acceptance probability and proposal kernel can markedly affect the performance of the resulting MCMC samplers.
Unarguably, the most popular acceptance probability is Metropolis–Hastings (MH), of [Reference Hastings14, Reference Metropolis20], owing to its acknowledged optimality [Reference Billera and Diaconis4, Reference Peskun25]. Efficient implementation of the MH algorithm requires tuning within the chosen family of proposal kernels. For the MH acceptance function, various optimal scaling results have been obtained under assumptions on the proposal and the target distribution. This includes the works of [Reference Bédard3, Reference Neal and Roberts24, Reference Roberts, Gelman and Gilks27, Reference Roberts and Rosenthal28, Reference Roberts and Rosenthal29, Reference Sherlock and Roberts34, Reference Yang, Roberts and Rosenthal40, Reference Zanella, Bédard and Kendall41], among others.
Despite the popularity of the MH acceptance function, other acceptance probabilities remain practically and theoretically relevant. Recently, Barker’s acceptance rule [Reference Barker2] and lazy MH [Reference Łatuszyński and Roberts18] have found use in Bernoulli-factory-based MCMC algorithms for intractable posteriors [Reference Gonçalves, Łatuszyński and Roberts12, Reference Gonçalves, Łatuszyński and Roberts13, Reference Herbei and Berliner15, Reference Smith37, Reference Vats, Gonçalves, Łatuszyński and Roberts39]. Barker’s acceptance function has also proven to be optimal with respect to search efficiency [Reference Menezes and Kabamba19], and it guarantees variance improvements for waste-recycled Monte Carlo estimators [Reference Delmas and Jourdain7]. Further, a class of acceptance probabilities from [Reference Bédard3] has been of independent theoretical interest. We also introduce a new family of generalized Barker’s acceptance probabilities and present a Bernoulli factory for use in problems with intractable posteriors.
To the best of our knowledge, there are no theoretical and practical guidelines concerning optimal scaling outside of MH and its variants (although see [Reference Sherlock, Thiery and Golightly35] for a discussion on delayed-acceptance MH and [Reference Doucet, Pitt, Deligiannidis and Kohn8, Reference Schmon, Deligiannidis, Doucet and Pitt32, Reference Sherlock, Thiery, Roberts and Rosenthal36] for analyses pertaining to pseudo-marginal MCMC). We obtain optimal scaling results for a large class of acceptance functions; Barker’s, lazy MH, and MH are members of this class.
We restrict our attention to the framework of [Reference Roberts, Gelman and Gilks27] with a random-walk Gaussian proposal kernel and a d-dimensional decomposable target distribution. Similar to MH, our general class of acceptance functions requires the proposal variance to be scaled by $1/d$ . We find that, typically, for acceptance functions that are pointwise smaller than MH, the optimal proposal variance is larger than that for MH, implying the need for larger jumps. For Barker’s acceptance rule, the asymptotically optimal acceptance rate (AOAR) is approximately $0.158$ , in comparison to the $0.234$ rate for MH [Reference Roberts, Gelman and Gilks27]. Similar AOARs are presented for other acceptance functions.
In Section 2 we describe our class of acceptance probabilities, with the main results presented in Section 3. AOARs for Barker’s and other functions are obtained in Section 3.1. In Section 4 we present numerical results in some settings that comply with our assumptions and others that do not. A trailing discussion on the scaling factor for different acceptance functions and generalizations of our results is provided in the last section. All proofs are in the appendices.
2. Class of acceptance functions
Let $\boldsymbol{\pi}$ be the target distribution, with corresponding Lebesgue density $\pi$ and support $\mathcal{X}$ , so that an MCMC algorithm aims to generate a $\boldsymbol{\pi}$ -ergodic Markov chain, $\{X_n\}$ . Let Q be a Markov kernel with an associated Lebesgue density $q(x, \cdot)$ for each $x \in \mathcal{X}$ . We assume throughout that q is symmetric. Furthermore, let the acceptance probability function be $\alpha(x, y)\,:\, \mathcal{X} \times \mathcal{X} \to [0, 1]$ . Starting from an $X_0 \in \mathcal{X}$ , at the nth step, a typical accept–reject MCMC algorithm proposes $y \sim q(X_{n-1}, \cdot)$ . The proposed value is accepted with probability $\alpha(X_{n-1}, y)$ and rejected otherwise, implying that $X_n = X_{n-1}$ . The acceptance function $\alpha$ is responsible for guaranteeing $\boldsymbol{\pi}$ -reversibility and thus $\boldsymbol{\pi}$ -invariance of the Markov chain. Let $a \wedge b$ denote $\min\!(a, b)$ and $s(x,y) = \pi(y) / \pi(x)$ . We define $\mathcal{A}$ , the class of acceptance functions for which our optimal scaling results will hold, as follows.
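The accept–reject mechanism just described can be sketched generically, with the balancing function $g_{\alpha}$ left as a pluggable argument. This is our own minimal illustration (not the authors' code); the target, proposal scale, and function names are chosen for the example, and only the density ratio $s(x,y)$ is ever evaluated, so normalizing constants cancel:

```python
import numpy as np

def accept_reject_mcmc(log_pi, g, x0, sigma, n_iter, rng):
    """Random-walk accept-reject MCMC with a generic balancing function g.

    A proposed move x -> y is accepted with probability g(s(x, y)),
    where s(x, y) = pi(y) / pi(x); only the ratio is ever needed.
    """
    x = np.atleast_1d(np.asarray(x0, dtype=float))
    chain = np.empty((n_iter, x.size))
    n_accept = 0
    for n in range(n_iter):
        y = x + sigma * rng.standard_normal(x.size)   # symmetric Gaussian proposal
        s = np.exp(log_pi(y) - log_pi(x))             # density ratio pi(y) / pi(x)
        if rng.uniform() < g(s):                      # accept with prob. g(s)
            x, n_accept = y, n_accept + 1
        chain[n] = x                                  # on rejection, X_n = X_{n-1}
    return chain, n_accept / n_iter

# Metropolis-Hastings balancing function g_MH(s) = min(1, s)
g_mh = lambda s: min(1.0, s)

rng = np.random.default_rng(1)
chain, acc = accept_reject_mcmc(lambda x: -0.5 * np.sum(x**2), g_mh,
                                x0=0.0, sigma=2.4, n_iter=50_000, rng=rng)
```

Swapping `g_mh` for any other balancing function in $\mathcal{A}$ changes only the acceptance step, not the structure of the sampler.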
Definition 1. Each $\alpha \in \mathcal{A}$ is a map $\alpha(x,y)\,:\, \mathcal{X} \times \mathcal{X} \to [0, 1]$ , and for every $\alpha \in \mathcal{A}$ there exists a balancing function $g_{\alpha}\,:\, [0, \infty) \to [0, 1]$ such that
Properties (1) and (2) are standard and easy to verify, with (1) ensuring that intractable constants in $\pi$ cancel away and (2) ensuring $\boldsymbol{\pi}$ -reversibility. Property (3) is not required for $\alpha$ to be a valid acceptance function; however, we need it for our optimal scaling results (to establish Lemma 4), and it holds true for all common acceptance probabilities. Moreover, each $\alpha \in \mathcal{A}$ can be identified by the corresponding $g_{\alpha}$ , and we will use $\alpha$ and $g_{\alpha}$ interchangeably. If $g_{\text{MH}}$ denotes the balancing function for the MH acceptance function $(\alpha_{\text{MH}})$ , then
It is easy to see that $\alpha_{\text{MH}} \in \mathcal{A}$ . The lazy MH $(\alpha_{\text{L}})$ acceptance of [Reference Herbei and Berliner15, Reference Łatuszyński and Roberts18] also belongs to $\mathcal{A}$ . For a fixed $\varepsilon \in [0, 1]$ , it is defined using
Barker’s acceptance function is $\alpha_{\text{B}}(x, y) = g_{\text{B}}(s(x, y))$ for all $x, y \in \mathcal{X}$ , where
Then (2) follows immediately. For differentiable balancing functions, Property (3), i.e. Lipschitz continuity of $g_{\alpha}(e^z)$ , can be verified by bounding the first derivative. In particular, the derivative of $g_{\text{B}}(e^z)$ , given by $e^z/(1 + e^z)^2$ , is bounded by $1/4$ for all $z \in \mathbb{R}$ , and hence $\alpha_{\text{B}} \in \mathcal{A}$ . From [Reference Peskun25], it is well known that, in terms of the Monte Carlo variability of ergodic averages, MH is superior to Barker’s. Even so, Barker’s acceptance function has seen a recent resurgence, aided by its use in Bernoulli-factory MCMC algorithms for intractable Bayesian posteriors where MH algorithms cannot be implemented.
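These properties are easy to check numerically. The following sketch (our own, using $g_{\text{MH}}(s) = \min(1, s)$ and $g_{\text{B}}(s) = s/(1+s)$) verifies the pointwise Peskun ordering $g_{\text{B}} \le g_{\text{MH}}$, the derivative bound $1/4$ on $g_{\text{B}}(e^z)$ that delivers the Lipschitz property, and the balance condition $g(s) = s\,g(1/s)$ underlying Property (2):

```python
import numpy as np

g_mh = lambda s: np.minimum(1.0, s)          # g_MH(s) = min(1, s)
g_b  = lambda s: s / (1.0 + s)               # g_B(s)  = s / (1 + s)

s = np.exp(np.linspace(-8.0, 8.0, 2001))     # grid of density ratios s = e^z
peskun_gap = np.max(g_b(s) - g_mh(s))        # should be <= 0: g_B <= g_MH pointwise

# derivative of g_B(e^z) is e^z / (1 + e^z)^2, maximized at z = 0 with value 1/4
z = np.linspace(-8.0, 8.0, 2001)
max_deriv = np.max(np.exp(z) / (1.0 + np.exp(z))**2)

# balance condition g(s) = s * g(1/s), which guarantees pi-reversibility
balance_gap = max(np.max(np.abs(g_b(s) - s * g_b(1.0 / s))),
                  np.max(np.abs(g_mh(s) - s * g_mh(1.0 / s))))
```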
We present a generalization of (6): for $r \geq 1$ define
For $r \in \mathbb{N}$ , the above can be rewritten as
If $\alpha_r^{\text{R}}$ is the associated acceptance function, then $\alpha_r^{\text{R}} \in \mathcal{A}$ for all $r \geq 1$ . Moreover, $g_1^{\text{R}} \equiv g_{\text{B}}$ and $g_r^{\text{R}} \uparrow g_{\text{MH}}$ as $r \to \infty$ . For $r \in \mathbb{N}$ , we present a natural Bernoulli factory in the spirit of [Reference Gonçalves, Łatuszyński and Roberts13] that generates events of probability $\alpha_r^{\text{R}}$ without explicitly evaluating it; see Appendix D. An alternative approach would be to follow the general sampling algorithm of [Reference Morina, Łatuszyński, Nayar and Wendland23] for rational functions.
Let $\boldsymbol{\Phi}({\cdot})$ be the standard normal distribution function. For a theoretical exposition, [Reference Bédard3] defines the following acceptance probability for some $h > 0$ :
For each $h > 0$ , $\alpha_h^{\text{H}} \in \mathcal{A}$ , and observe that as $h \to 0$ , $g_h^{\text{H}} \to g_{\text{MH}}$ , while as $h \to \infty$ , $g_h^{\text{H}} \to 0$ ; i.e. the chain never moves. Similar examples can be constructed by considering other well-behaved distribution functions in place of $\boldsymbol{\Phi}$ . Lastly, it is easy to see that $\mathcal{A}$ is convex. Thus, it also includes situations when each update of the algorithm randomly chooses an acceptance probability. Moreover, as evidenced in (5), $\mathcal{A}$ is also closed under scalar multiplication as long as the resulting function lies in [0, 1].
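Convexity of $\mathcal{A}$ can also be checked directly: any mixture $\lambda g_{\text{MH}} + (1-\lambda) g_{\text{B}}$ is again a valid balancing function taking values in $[0,1]$ and satisfying the balance condition $g(s) = s\,g(1/s)$. A small self-contained check (the mixture weight below is arbitrary):

```python
import numpy as np

g_mh = lambda s: np.minimum(1.0, s)              # g_MH(s) = min(1, s)
g_b  = lambda s: s / (1.0 + s)                   # g_B(s)  = s / (1 + s)
lam  = 0.3                                       # arbitrary mixture weight in [0, 1]
g_mix = lambda s: lam * g_mh(s) + (1 - lam) * g_b(s)

s = np.exp(np.linspace(-6.0, 6.0, 1001))         # grid of density ratios
# balance condition g(s) = s * g(1/s): preserved under convex combination
gap = np.max(np.abs(g_mix(s) - s * g_mix(1.0 / s)))
in_range = np.all((g_mix(s) >= 0.0) & (g_mix(s) <= 1.0))
```

This corresponds to an algorithm that, at each update, uses $g_{\text{MH}}$ with probability $\lambda$ and $g_{\text{B}}$ otherwise.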
3. Main theorem
Let f be a 1-dimensional density function and consider a sequence of target distributions $\{ \boldsymbol{\pi}_d \}$ such that for each d, the joint density is
Assumption 1. The density f is positive and in $C^2$ —the class of all real-valued functions with continuous second-order derivatives. Furthermore, $f^{\prime}/f$ is Lipschitz, and the following moment conditions hold:
Consider the sequence of Gaussian proposal kernels $\big\{ Q_d\big(\boldsymbol{{x}}^d, \cdot\big) \big\}$ with associated density sequence $\{q_d\}$ , so that $Q_d\big(\boldsymbol{{x}}^d, \cdot\big) = N\big(\boldsymbol{{x}}^d, \sigma^2_d\textbf{I}_d\big)$ , where for some constant $l \in \mathbb{R}^{+}$ ,
The proposal $Q_d$ is used to generate a d-dimensional Markov chain, $\boldsymbol{{X}}^d = \big\{\boldsymbol{{X}}^d_n, n \ge 0\big\}$ , following the accept–reject mechanism with acceptance function $\alpha$ . Under these conditions and with $\alpha = \alpha_{\text{MH}}$ , [Reference Roberts, Gelman and Gilks27] established weak convergence to an appropriate Langevin diffusion for the sequence of 1-dimensional stochastic processes constructed from the first component of these Markov chains. Since the coordinates are independent and identically distributed (i.i.d.), this limit informs the limiting behaviour of the full Markov chain in high dimensions. In what follows, we extend the results of [Reference Roberts, Gelman and Gilks27] to the class of acceptance functions $\mathcal{A}$ as defined in Definition 1.
Let $\big\{\boldsymbol{{Z}}^d, d > 1\big\}$ be a sequence of processes constructed by speeding up the Markov chains by a factor of d as follows:
Suppose $\big\{\eta_d\,:\, \mathbb{R}^d \to \mathbb{R}\big\}$ is a sequence of projection maps such that $\eta_d\big(\boldsymbol{{x}}^d\big) = x_1^d$ . Define a new sequence of 1-dimensional processes $\big\{U^d, d > 1\big\}$ as follows:
Under stationarity, we show that $\big\{U^d, d > 1\big\}$ converges weakly in the Skorokhod topology [Reference Ethier and Kurtz10] to a Markovian limit U. We denote weak convergence of processes in the Skorokhod topology by ‘ $\Rightarrow$ ’ and standard Brownian motion at time t by $B_t$ . The proofs are in the appendices.
Theorem 1. Let $\big\{\boldsymbol{{X}}^d, d \ge 1\big\}$ be the sequence of $\boldsymbol{\pi}_d$ -invariant Markov chains constructed using the acceptance function $\alpha$ and proposal $Q_d$ such that $\boldsymbol{{X}}^d_0 \sim \boldsymbol{\pi}_d$ . Further, suppose $\alpha \in \mathcal{A}$ and $\boldsymbol{\pi}_d$ satisfies Assumption 1. Then $U^d \Rightarrow U$ , where U is a diffusion process that satisfies the Langevin stochastic differential equation,
with $h_{\alpha}(l) = l^2 M_{\alpha}(l)$ , where
and
Remark 1. Since $\alpha_{\text{MH}} \in \mathcal{A}$ , our result aligns with [Reference Roberts, Gelman and Gilks27], because
Remark 2. For symmetric proposals, Definition 1 requires $\alpha$ to be a function of only the ratio of the target densities at the two contested points. Thus, the result is not applicable to acceptances in [Reference Banterle, Grazian, Lee and Robert1, Reference Mira22, Reference Vats, Gonçalves, Łatuszyński and Roberts39].
In Theorem 1, $h_{\alpha}(l)$ is the speed measure of the limiting diffusion process and so the optimal choice of l is $l^*$ such that
Denote the average acceptance probability by
and the asymptotic acceptance probability by $\alpha(l) \,:\!=\, \lim_{d \to \infty} \alpha_{d}(l)$ . The dependence on l is through the variance of the proposal kernel. We then have the following corollary.
Corollary 1. Under the setting of Theorem 1, we obtain $\alpha(l) = M_{\alpha}(l)$ , and the asymptotically optimal acceptance probability is $M_{\alpha}(l^*)$ .
Corollary 1 is of considerable practical relevance, since for different acceptance functions it yields the optimal target acceptance probability to tune to.
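Corollary 1 is easy to probe empirically. For a standard Gaussian target, the roughness constant $I$ of [Reference Roberts, Gelman and Gilks27] equals 1, and random-walk MH with $\sigma_d^2 = l^2/d$ at the well-known optimal value $l \approx 2.38$ should accept at close to the asymptotic $0.234$ rate even at moderate d. A simulation sketch (our own code, under these stated assumptions):

```python
import numpy as np

def rw_chain(g, d, l, n_iter, rng):
    """Average acceptance rate of a random-walk chain with balancing
    function g on a standard Gaussian target in dimension d."""
    sigma = l / np.sqrt(d)                    # sigma_d^2 = l^2 / d
    x = rng.standard_normal(d)                # start in stationarity
    log_pi = lambda v: -0.5 * np.dot(v, v)    # log-density up to a constant
    acc = 0
    for _ in range(n_iter):
        y = x + sigma * rng.standard_normal(d)
        s = np.exp(log_pi(y) - log_pi(x))
        if rng.uniform() < g(s):
            x = y
            acc += 1
    return acc / n_iter

rng = np.random.default_rng(7)
# empirical average acceptance rate; should sit near 0.234 for d = 50
acc_mh = rw_chain(lambda s: min(1.0, s), d=50, l=2.38, n_iter=20_000, rng=rng)
```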
3.1. Optimal results for some acceptance functions
In Section 2, we discussed some important members of the class $\mathcal{A}$ . Corollary 1 can then be used to obtain the AOAR for them by maximizing the speed measure of the limiting diffusion process. For Barker’s algorithm, from Theorem 1 and (6), the speed measure $h_{\text{B}}(l)$ of the corresponding limiting process is $h_{\text{B}}(l) = l^2M_{\text{B}}(l)$ , where
Maximizing $h_{\text{B}}(l)$ , the optimal value $l^*$ is approximately (see Appendix C)
By Corollary 1, using this $l^*$ yields an asymptotic acceptance rate of approximately $0.158$ . Hence, when the optimal variance is not analytically tractable in high dimensions, one may consider tuning the algorithm so as to achieve an acceptance probability of approximately $0.158$ . Additionally, the right plot in Figure 1 verifies that the relative efficiency of Barker’s versus MH, as measured by the ratio of their respective speed measures for a fixed l, remains above $0.5$ [Reference Łatuszyński and Roberts18, Theorem 4]; this relative efficiency increases as l increases. The ratio of the speed measures of Barker’s versus MH at their respective optimal scaling is $0.72$ . This quantifies the loss in efficiency in running the best version of Barker’s compared to the best version of the MH algorithm. We can also study the respective speed measures as a function of the acceptance rate; this is given in the left plot in Figure 1. We find that as the asymptotic acceptance rate increases, the speed measure for Barker’s decreases more rapidly than that of MH. This suggests that there is much to gain by appropriately tuning Barker’s algorithm.
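The $0.72$ figure can be reproduced numerically. Writing $h(l) = l^2 M(l)$ with $M(l) = \mathbb{E}[g(e^X)]$ for $X \sim N(-\theta/2, \theta)$ and $\theta = l^2 I$ (as in Theorem 1 and Proposition 1), and normalizing $I = 1$, the sketch below (our own, using standard quadrature and bounded scalar minimization) maximizes the speed measures of MH and Barker's and compares them at their respective optima:

```python
import numpy as np
from scipy.integrate import quad
from scipy.optimize import minimize_scalar
from scipy.stats import norm

def M(g, theta):
    """E[g(e^X)] for X ~ N(-theta/2, theta), via X = sqrt(theta) Z - theta/2."""
    f = lambda z: g(np.exp(np.sqrt(theta) * z - theta / 2.0)) * norm.pdf(z)
    return quad(f, -10.0, 10.0)[0]

def optimal_speed(g):
    # h(l) = l^2 M(l) = theta * M(theta) when I = 1; maximize over theta > 0
    res = minimize_scalar(lambda t: -t * M(g, t), bounds=(0.1, 30.0),
                          method="bounded")
    return res.x, -res.fun                    # (theta*, optimal speed)

theta_mh, h_mh = optimal_speed(lambda s: min(1.0, s))     # MH
theta_b,  h_b  = optimal_speed(lambda s: s / (1.0 + s))   # Barker's
rel_eff = h_b / h_mh                          # approx. 0.72 at the optima
```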
For lower dimensions, the optimal acceptance rate is higher than the AOAR. Figure 2 shows optimal values for the MH and Barker’s algorithms on isotropic Gaussian targets in dimensions 1 to 10, the proposal kernel being the same as in the setting of Theorem 1. This plot is produced using the criterion of minimizing first-order autocorrelations in each component [Reference Gelman, Roberts and Gilks11, Reference Roberts and Rosenthal28, Reference Roberts and Rosenthal29]. For $\alpha_{\text{MH}}$ and $\alpha_{\text{B}}$ , the optimal acceptance rates in one dimension are $0.43$ and $0.27$ respectively.
For lazy MH with $\varepsilon \in [0, 1]$ , Corollary 1 implies that the AOAR of the algorithm is $(1 - \varepsilon)0.234$ with the same optimal $l^*$ as MH. For the acceptance functions $\alpha_h^{\text{H}}$ in (8),
With $h = 0$ , we obtain the result of [Reference Roberts, Gelman and Gilks27] for MH. Further, the left panel of Figure 3 highlights that as $h \to 0$ , the AOAR increases to $0.234$ and the algorithm worsens as h increases. Moreover, for $h \approx 1.913$ , the AOAR is roughly $0.158$ , i.e. equivalent to Barker’s acceptance function.
Lastly, the AOARs for $\alpha_r^{\text{R}}$ in (7) are available. For $r = 1, \dots, 10$ , these are plotted in the right plot of Figure 3. As anticipated, the AOAR approaches $0.234$ as r increases. Notice that $\alpha_2^{\text{R}}$ yields an AOAR of $0.197$ , a considerable increase over $\alpha_{\text{B}} = \alpha_1^{\text{R}}$ . Table 1 below summarizes the results of this section. (Code for all plots and tables is available at https://github.com/Sanket-Ag/BarkerScaling.)
4. Numerical results
We study the estimation quality for different expectations as a function of the proposal variance (acceptance rate) for the generalized Barker acceptance function, $\alpha_r^{\text{R}}$ . We focus on $r = 1$ (Barker’s algorithm) and $r = 2$ . Suppose $f\,:\,\mathbb{R}^d \to \mathbb{R}$ is the function whose expectation with respect to $\boldsymbol{\pi}_d$ is of interest. Let $\{f(\boldsymbol{{X}}_n)\}$ be the mapped process. Similarly to [Reference Roberts and Rosenthal29], we assess the choice of proposal variance by the convergence time:
where $\rho_k$ is the lag-k autocorrelation in $\{f(\boldsymbol{{X}}_n)\}$ . In each of the following simulations, the convergence time is estimated by averaging over $10^3$ replications of Markov chains, each of length $10^6$ , with $k = 1$ . We consider a range of values of l, where $\sigma^2_d = l^2/d$ in the Gaussian proposal kernel $Q_d\big(\boldsymbol{{x}}^d, \cdot\big) = N\big(\boldsymbol{{x}}^d, \sigma^2_d \textbf{I}_d\big)$ .

Consider first the case of an isotropic target, $\boldsymbol{\pi}_{d} = N_{d}(\boldsymbol{0}, \textbf{I}_{d})$ , with isotropic Gaussian proposals; the conditions of Theorem 1 are satisfied. The estimated convergence time for $f(\boldsymbol{{x}}) = x_1$ and $f(\boldsymbol{{x}}) = \bar{{x}}$ , where $\bar{{x}}$ is the mean of all components $x_1, \dots, x_d$ , is plotted in Figure 4 (top row). Here, $d = 50$ . For both functions of interest, the optimal performance, i.e. the minimum convergence time, corresponds to an acceptance rate of approximately $0.158$ for $\alpha_{\text{B}}$ and $0.197$ for $\alpha_2^{\text{R}}$ ; the slight overestimation is due to the finite-dimensional setting.

Next, we consider $\boldsymbol{\pi}_d = N_d(\boldsymbol{0}, \boldsymbol{\Sigma}_d)$ , where $\boldsymbol{\Sigma}_d$ is a $d \times d$ matrix with 1 on its diagonal and all off-diagonal elements equal to some non-zero $\rho$ . Here, the assumptions of Theorem 1 are not satisfied. For such a target and for $\alpha_{\text{MH}}$ , [Reference Roberts and Rosenthal29] showed that the rate of convergence of the algorithm is governed by the eigenvalues of $\boldsymbol{\Sigma}_d$ . In particular, the eigenvalues of $\boldsymbol{\Sigma}_d$ are $d\rho + 1 - \rho$ and $1 - \rho$ , with associated eigenvectors $\boldsymbol{{y}}$ such that $\boldsymbol{{y}}^T\boldsymbol{{x}}$ yields $\bar{{x}}$ and $x_i - \bar{{x}}$ (for $i = 1,\dots,d$ ), respectively.
It was further shown that the algorithm converges quickly for functions orthogonal to $\bar{{x}}$ , but much more slowly for $\bar{{x}}$ itself. Despite the differing rates of convergence, the optimal acceptance rate, corresponding to the minimum convergence time, remains the same. We find this to be true also for $\alpha_{\text{B}}$ and $\alpha^{\text{R}}_2$ , as illustrated in Figure 4 (bottom row), where we present convergence times for $x_1 - \bar{{x}}$ and $\bar{{x}}$ . Once again, $d = 50$ . The large difference between the convergence times of the two functions is evident from the y-axes of the two plots. The minimum again lies in a region around the asymptotic optimum. We note that, because of the slow convergence for $\bar{{x}}$ , the process mixes slowly, yielding more variable estimates of the convergence time. In both simulation settings, we see the expected improvement in convergence time for $\alpha_2^{\text{R}}$ compared to $\alpha_{\text{B}}$ .
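A stripped-down version of the first experiment can be run in a few lines (our own code, with a single short chain rather than the $10^3$ replications of length $10^6$ used above): a Barker chain on $N_{50}(\boldsymbol{0}, \textbf{I}_{50})$ with $l^2 \approx 6.03$ should accept at close to the asymptotic $0.158$ rate, and its first component should be heavily autocorrelated at lag 1:

```python
import numpy as np

def barker_chain(d, l, n_iter, rng):
    """Barker random-walk chain on a standard Gaussian target in dimension d."""
    sigma = l / np.sqrt(d)                      # sigma_d^2 = l^2 / d
    x = rng.standard_normal(d)                  # start in stationarity
    first, acc = np.empty(n_iter), 0
    for n in range(n_iter):
        y = x + sigma * rng.standard_normal(d)
        log_s = -0.5 * (np.dot(y, y) - np.dot(x, x))       # log pi(y) - log pi(x)
        if rng.uniform() < 1.0 / (1.0 + np.exp(-log_s)):   # g_B(s) = s / (1 + s)
            x = y
            acc += 1
        first[n] = x[0]                         # track the first component
    return first, acc / n_iter

rng = np.random.default_rng(3)
first, acc_b = barker_chain(d=50, l=np.sqrt(6.03), n_iter=20_000, rng=rng)
rho1 = np.corrcoef(first[:-1], first[1:])[0, 1]     # lag-1 autocorrelation
```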
4.1. A Bayesian logistic regression example
We consider fitting a Bayesian logistic regression model to the famous Titanic dataset, which contains information on crew and passengers aboard the 1912 RMS Titanic. Let $\boldsymbol{{y}}$ denote the response vector (indicating whether each person survived), and let $\boldsymbol{{X}}$ denote the $n \times d$ model matrix; here $d = 10$ . We assume a multivariate zero-mean Gaussian prior on the regression coefficient vector $\boldsymbol{\beta}$ , with covariance $100\textbf{I}_{10}$ . The resulting target density is
For the Titanic dataset, the resulting posterior has a complicated covariance structure, with many pairs of components exhibiting absolute correlations beyond $0.50$ . The posterior is also ill-conditioned, the condition number of the estimated target covariance matrix being $\approx 10^5$ . As seen in the bottom row of Figure 4, in such situations an isotropic proposal kernel may perform poorly for most functions. We instead consider a Gaussian proposal scheme in which the proposal covariance matrix is proportional to the target covariance matrix. This is a common strategy for dealing with targets with correlated components and forms the basis of many adaptive MCMC kernels [Reference Roberts and Rosenthal30]. We implement Barker’s algorithm to sample from the posterior. Let $\boldsymbol{\Sigma}_d$ denote the covariance matrix of the posterior distribution of $\boldsymbol{\beta}$ ; the proposal kernel is then $Q_d\big(\boldsymbol{{x}}^d, \cdot\big) = N\big(\boldsymbol{{x}}^d, \sigma^2_d \boldsymbol{\Sigma}_d\big)$ . Since $\boldsymbol{\Sigma}_d$ is unavailable, we estimate it from a pilot MCMC run of size $10^7$ . We then consider various values of $\sigma^2_d = l^2/d$ .
The performance of the algorithm for different functions of interest is plotted in Figure 5. Since this is a 10-dimensional problem, the optimal acceptance rate from Figure 2 is approximately $0.18$ . The convergence times for $\beta_1 - \bar{\beta}$ and $\bar{\beta}$ are similar, and both are minimized at approximately the same acceptance rate of $0.18$ . It is natural here to be interested in estimating the posterior mean vector, so we also study estimation of the full vector $\boldsymbol{\beta}$ , with efficiency measured via the multivariate effective sample size (ESS) [Reference Vats, Flegal and Jones38]. The ESS returns the equivalent number of i.i.d. samples from $\boldsymbol{\pi}$ that would yield the same variability in estimating the posterior mean as the given set of MCMC samples. In Figure 5, we see that the optimal acceptance rate, corresponding to the highest ESS values, is again achieved around $0.18$ .
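The preconditioned proposal used here amounts to adding $\sigma_d L Z$ to the current state, with L the Cholesky factor of the (estimated) covariance and Z standard normal. The sketch below is our own illustration with a synthetic 2-dimensional correlated Gaussian standing in for the posterior; the pilot-run covariance estimate is replaced by the true covariance, and all numbers are for the example only:

```python
import numpy as np

rho = 0.9
Sigma = np.array([[1.0, rho], [rho, 1.0]])     # stand-in for the estimated covariance
L = np.linalg.cholesky(Sigma)
Sigma_inv = np.linalg.inv(Sigma)
log_pi = lambda v: -0.5 * v @ Sigma_inv @ v    # N(0, Sigma) target, up to a constant

d, l = 2, 2.4
sigma = l / np.sqrt(d)                          # sigma_d^2 = l^2 / d
rng = np.random.default_rng(11)
x = L @ rng.standard_normal(d)                  # start in stationarity
draws, acc = np.empty((20_000, d)), 0
for n in range(draws.shape[0]):
    y = x + sigma * (L @ rng.standard_normal(d))        # proposal N(x, sigma^2 Sigma)
    log_s = log_pi(y) - log_pi(x)
    if rng.uniform() < 1.0 / (1.0 + np.exp(-log_s)):    # Barker's rule
        x = y
        acc += 1
    draws[n] = x
acc_rate = acc / draws.shape[0]
corr_hat = np.corrcoef(draws[:, 0], draws[:, 1])[0, 1]  # recovers the target correlation
```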
5. Conclusions
We have obtained optimal scaling and acceptance rates for a large class of acceptance functions. In doing so, we have found that the scaling factor of $1/d$ for the proposal variance holds for all acceptance functions, indicating that the acceptance functions are not likely to affect the rate of convergence, just the constants associated with that rate. Thus, practitioners need not hesitate in switching to other acceptance functions when the MH acceptance probability is not tractable, as long as Corollary 1 is used to tune their algorithm accordingly. There is also an inverse relationship between the optimal variance and the AOAR (see Table 1), implying that when dealing with sub-optimal acceptance functions, the algorithm seeks larger jumps. The computational cost of the Bernoulli factory that we present for $\alpha_r^{\text{R}}$ in Appendix D increases with r. Given the large jump in the optimal acceptance probability from $r = 1$ to $r = 2$ , the development of more efficient Bernoulli factories is an important problem for future work.

The assumption of starting from stationarity is a restrictive one. For MH with Gaussian proposals, the scaling factor of $1/d$ is still optimal when the algorithm is in the transient phase [Reference Christensen, Roberts and Rosenthal6, Reference Jourdain, Lelièvre and Miasojedow16, Reference Kuntz, Ottobre and Stuart17]. The optimal acceptance probability may, however, vary depending on the starting distribution. We envision that similar results are viable for the general class of acceptance functions, and this is important future work.

Our results are limited to Gaussian proposals and trivially decomposable target densities. Other proposal distributions may make use of the gradient of the target, e.g. the Metropolis-adjusted Langevin algorithm [Reference Roberts and Tweedie31] and Hamiltonian Monte Carlo [Reference Duane, Kennedy, Pendleton and Roweth9].
In problems where $\alpha_{\text{MH}}$ cannot be used, the gradient of the target density is likely unavailable; thus it is reasonable to limit our attention to a Gaussian proposal. On the other hand, generalizations to other target distributions are important. For MH algorithms, [Reference Bédard3, Reference Sherlock and Roberts34] relax the independence assumption, while [Reference Roberts and Rosenthal29] relax the identically distributed assumption. Additionally, [Reference Yang, Roberts and Rosenthal40] present a proof of weak convergence for MH for more general targets, and [Reference Schmon and Gagnon33] provide optimal scaling results for general Bayesian targets using large-sample asymptotics. In these situations, extensions to other acceptance probabilities are similarly possible. Additionally, we encourage future work in optimal scaling to leverage our proof technique to demonstrate results for the wider class of acceptance probabilities.
Appendix A. Proof of Theorem 1
The proof is structurally similar to the seminal work of [Reference Roberts, Gelman and Gilks27], in that we will show that the generator of the sped-up process, $\textbf{Z}^d$ , converges to the generator of an appropriate Langevin diffusion. Define the discrete-time generator of $\boldsymbol{{Z}}^d$ as
for all those V for which the limit exists. Since our interest is in the first component of $\boldsymbol{{Z}}^d$ , we consider only those V which are functions of the first component only. Now, define the generator of the limiting Langevin diffusion process with speed measure $h_{\alpha}(l)$ as
The unique challenge in our result is identifying the speed measure $h_{\alpha}(l)$ for a general acceptance function $\alpha \in \mathcal{A}$ . Proposition 1 is a key result that helps us obtain a form of $h_{\alpha}(l)$ without resorting to approximations.
To prove Theorem 1, we will show that there are events $F_d \subseteq \mathbb{R}^d$ such that for all t,
for a suitably large class of real-valued functions V. Moreover, since $f^{\prime}/f$ is Lipschitz, the class $C_c^{\infty}$ of infinitely differentiable functions with compact support is a core for the generator G [Reference Ethier and Kurtz10, Theorem 2.1, Chapter 8]. Thus, we can limit our attention to only those $V \in C_c^{\infty}$ that are functions of the first component.
Consider now the setup of Theorem 1. Let $w = \log f$ and $\alpha \in \mathcal{A}$ with the balancing function $g_{\alpha}$ . Let w ′ and w ′′ be the first and second derivatives of w, respectively. Define the sequence of sets $\{F_d \subseteq \mathbb{R}^d, d > 1 \}$ by
The following results from [Reference Roberts, Gelman and Gilks27] will be needed.
Lemma 1. ([Reference Roberts, Gelman and Gilks27].) Let Assumption 1 hold. If $\boldsymbol{{X}}^d_0 \sim \boldsymbol{\pi}_d$ for all d, then, for a fixed t, $\mathbb{P}[\boldsymbol{{Z}}_s^d \in F_d,\ 0 \le s \le t] \to 1 \text{ as } d \to \infty\,.$
Lemma 2. ([Reference Roberts, Gelman and Gilks27].) Let Assumption 1 hold. Also, let
where $Y_i \overset{\text{ind}}{\sim} N\big(x_i, \sigma^2_d\big)$ , $i = 2, \dots, d$ . Then $ \sup_{\boldsymbol{{x}}^d \in F_d}\mathbb{E}\!\left[\left|W_d\big(\boldsymbol{{x}}^d\big)\right|\right] \to 0\,$ .
Lemma 3. ([Reference Roberts, Gelman and Gilks27].) For $Y \sim N\big(x, \sigma^2_d\big)$ and $V \in C_c^{\infty}$ ,
For the following proposition, we will utilize the property (2) imposed on $\mathcal{A}$ . This proposition is the key to obtaining our main result in such generality.
Proposition 1. Let $X \sim N({-}\theta/2, \theta)$ for some $\theta > 0$ . Let $\alpha \in \mathcal{A}$ with the corresponding balancing function $g_\alpha$ . Then $\mathbb{E}\!\left[ Xg_{\alpha}\big(e^X\big)\right] = 0$ .
Proof. We have
the second inequality follows from the assumption that $g_{\alpha}$ lies in [0, 1]. Hence, the expectation exists and is equal to the integral
Observe that, using (2),
Hence, the result follows.
Lemma 4. Suppose $V \in C_c^{\infty}$ is restricted to only the first component of $\boldsymbol{{Z}}^d$ . Then
Proof. In the expression for $G_dV\big(\boldsymbol{{x}}^d\big)$ given in (11), we can decompose the proposal $\boldsymbol{{Y}}^d$ into $(Y_1^d, \boldsymbol{{Y}}^{d-})$ and thus rewrite the expectation as follows:
Let $E^{d, \alpha}$ denote the inner expectation in (13) and define $E_{lim}^{d, \alpha}$ as
Also, a Taylor series expansion of w about $x_i^d$ for $i = 2, \dots, d$ gives
for $Z_i$ lying between $x_i^d$ and $Y_i^d$ . Hence, the triangle inequality and the Lipschitz continuity of $g_{\alpha}(e^z)$ give, for some Lipschitz constant $K < \infty$ ,
where $W_d\big(\boldsymbol{{x}}^d\big)$ is as defined in Lemma 2. From Lemma 2, Lemma 3, and (15),
Now let $\varepsilon(y) = \log f(y) - \log f\big(x_1^d\big)$ . Also, from (14), it is clear that given $\boldsymbol{{x}}^d$ , $E^{d,\alpha}_{lim}$ is a function of $Y_1^d$ alone, to wit,
where $B_d \sim N(\mu_d, \Sigma_d)$ with $ \mu_d = \varepsilon\big(Y_1^d\big) - l^2R_d/2$ and $\Sigma_d = l^2R_d$ . Thus, by (15), it is enough to consider the asymptotic behaviour of
Let $N_{d, \alpha} = M_{d, \alpha} \circ \varepsilon$ and apply Taylor series expansion on the inner term to obtain
where $K_d, L_d \in \big[Y_1^d, x_1^d\big]$ or $\big[x_1^d, Y_1^d\big]$ , and
Now, for all d,
The derivatives and integral here can be exchanged thanks to the dominated convergence theorem, which yields
where the first term vanishes by Proposition 1. Hence, for all d,
Now, we plug the expressions obtained above into the Taylor series expansion of $\left(V\big(Y^d_1\big) - V\big(x^d_1\big)\right)M_{d, \alpha}(\varepsilon\big(Y_1^d\big))$ . The rest of the proof, with the help of Assumption 1, follows similarly as in [Reference Roberts, Gelman and Gilks27, Lemma 2.6].
Proof of Theorem 1. From Lemma 4, we have uniform convergence of generators on the sequence of sets with limiting probability 1. Thus, by Corollary 8.7 in [Reference Ethier and Kurtz10, Chapter 4], we have the required result of weak convergence (the condition that $C_c^{\infty}$ separates points was verified by [Reference Roberts, Gelman and Gilks27]).
Appendix B. Proof of Corollary 1
Lemma 5. Let $E^{d, \alpha}$ be the inner expectation in (13), and let $E_{lim}^{d, \alpha}$ be from (14). Then
Proof. Consider
The second term goes to 0, since the expectation is bounded and by construction $P\big(\boldsymbol{{x}}^d \in F_d^C\big)\to 0$ as $d \to \infty$ . Also, following [Reference Roberts, Gelman and Gilks27],
Then
Proof of Corollary 1. Consider Equation (17). Using the Taylor series approximation of second order around $x_1$ ,
where $W_{d,1} \in \big[x_1^d, Y_1^d\big]$ or $\big[Y_1^d, x_1^d\big]$ . Since N ′′ is bounded [Reference Roberts, Gelman and Gilks27],
As all expectations exist, we can split the inner expectation and use Lemma 5, so that
The last equality is by the law of large numbers and the continuous mapping theorem.
Appendix C. Optimizing speed for Barker’s acceptance
We need to maximize $h_{\text{B}}(l) = l^2M_{\text{B}}(l)$ . Let I be fixed arbitrarily. Then
For a fixed I, we can reparametrize the function by taking $\theta = l^2I$ , and so maximizing $h_{\text{B}}(l)$ over positive l will be equivalent to maximizing $h^1_{\text{B}}(\theta)$ over positive $\theta$ , where
We make the substitution $z = (b + \theta/2)/\sqrt{\theta}$ in the integrand to obtain
where the expectation is taken with respect to $Z \sim N(0,1)$ . This expectation is not available in closed form. However, standard numerical integration routines yield the optimal value of $\theta$ to be $6.028$ . This implies that the optimal value of l, say $l^*$ , is approximately equal to
Using this $l^*$ yields an AOAR of approximately $0.158$ .
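The numerical routine referred to above can be reproduced in a few lines. This is our own sketch (not the authors' code), using standard quadrature and bounded scalar minimization; any constant factor in front of $\theta\,\mathbb{E}[\,\cdot\,]$ is irrelevant to the argmax, so it is dropped:

```python
import numpy as np
from scipy.integrate import quad
from scipy.optimize import minimize_scalar
from scipy.stats import norm

def M_barker(theta):
    """E[g_B(e^X)] for X ~ N(-theta/2, theta), via X = sqrt(theta) Z - theta/2."""
    f = lambda z: norm.pdf(z) / (1.0 + np.exp(-(np.sqrt(theta) * z - theta / 2.0)))
    return quad(f, -10.0, 10.0)[0]

# maximize theta * M_barker(theta) over theta > 0 (h_B^1 up to a constant)
res = minimize_scalar(lambda t: -t * M_barker(t), bounds=(0.1, 30.0),
                      method="bounded")
theta_star = res.x                 # approx. 6.028
aoar = M_barker(theta_star)        # approx. 0.158, the AOAR of Corollary 1
```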
Appendix D. Bernoulli factory
To sample events of probability $\alpha_{\text{B}}$ , the two-coin algorithm, an efficient Bernoulli factory, was presented in [Reference Gonçalves, Łatuszyński and Roberts13]. Generalizing this to a die-coin algorithm, we present a Bernoulli factory for $\alpha_r^{\text{R}}$ with $r = 2$ ; extensions to other r can be done similarly. Let $\pi(x) = c_x p_x$ with $p_x \in [0, 1]$ and $c_x > 0$ . Then
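The two-coin algorithm itself can be sketched as follows (our own minimal implementation of the $r = 1$ case; `coin_x` and `coin_y` are assumed samplers of Bernoulli( $p_x$ ) and Bernoulli( $p_y$ ) events, and only the constants $c_x$ and $c_y$ need be known explicitly). It returns an event of probability $c_y p_y / (c_x p_x + c_y p_y) = \alpha_{\text{B}}(x, y)$ without ever evaluating $p_x$ or $p_y$:

```python
import numpy as np

def two_coin(c_x, c_y, coin_x, coin_y, rng):
    """Event of probability c_y p_y / (c_x p_x + c_y p_y), i.e. Barker's rule."""
    while True:
        if rng.uniform() < c_y / (c_x + c_y):   # propose the "y" side
            if coin_y():                        # p_y-coin heads: accept
                return True
        else:                                   # propose the "x" side
            if coin_x():                        # p_x-coin heads: reject
                return False
        # tails on either coin: discard and start over

# toy check: p_x, p_y are known here only to build the coins themselves
rng = np.random.default_rng(5)
c_x, p_x, c_y, p_y = 1.5, 0.4, 2.0, 0.7
coin_x = lambda: rng.uniform() < p_x
coin_y = lambda: rng.uniform() < p_y
est = np.mean([two_coin(c_x, c_y, coin_x, coin_y, rng) for _ in range(20_000)])
target = c_y * p_y / (c_x * p_x + c_y * p_y)    # the exact acceptance probability
```

The die-coin generalization replaces the initial binary choice by a multi-sided one; its expected number of coin flips grows with r.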
Acknowledgements
The authors thank the referees and the editor for comments that helped improve the work.
Funding information
D. Vats is supported by SERB grant SPG/2021/001322; K. Łatuszyński is supported by the Royal Society through the Royal Society University Research Fellowship; and G. Roberts is supported by the EPSRC grants CoSInES (EP/R034710/1) and Bayes for Health (EP/R018561/1).
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.