DERGMs: Degeneracy-restricted exponential family random graph models

Vishesh Karwa; Sonja Petrović; Denis Bajić

doi:10.1017/nws.2022.5

DERGMs: Degeneracy-restricted exponential family random graph models

Published online by Cambridge University Press: 31 March 2022

and

Vishesh Karwa: Affiliation:
Temple University, Philadelphia, PA 19122, USA
Sonja Petrović*: Affiliation:
Illinois Institute of Technology, Chicago, IL 60616, USA
Denis Bajić: Affiliation:
Illinois Institute of Technology, Chicago, IL 60616, USA
*: *Corresponding author. Email: [email protected]

Article contents

Abstract
Footnotes
References

Get access

Rights & Permissions

Abstract

Exponential random graph models, or ERGMs, are a flexible and general class of models for modeling dependent data. While the early literature has shown them to be powerful in capturing many network features of interest, recent work highlights difficulties related to the models’ ill behavior, such as most of the probability mass being concentrated on a very small subset of the parameter space. This behavior limits both the applicability of an ERGM as a model for real data and inference and parameter estimation via the usual Markov chain Monte Carlo algorithms. To address this problem, we propose a new exponential family of models for random graphs that build on the standard ERGM framework. Specifically, we solve the problem of computational intractability and “degenerate” model behavior by an interpretable support restriction. We introduce a new parameter based on the graph-theoretic notion of degeneracy, a measure of sparsity whose value is commonly low in real-world networks. The new model family is supported on the sample space of graphs with bounded degeneracy and is called degeneracy-restricted ERGMs, or DERGMs for short. Since DERGMs generalize ERGMs—the latter is obtained from the former by setting the degeneracy parameter to be maximal—they inherit good theoretical properties, while at the same time place their mass more uniformly over realistic graphs. The support restriction allows the use of new (and fast) Monte Carlo methods for inference, thus making the models scalable and computationally tractable. We study various theoretical properties of DERGMs and illustrate how the support restriction improves the model behavior. We also present a fast Monte Carlo algorithm for parameter estimation that avoids many issues faced by Markov Chain Monte Carlo algorithms used for inference in ERGMs.

Keywords

network ERGM degeneracy stability sufficiency

Type: Research Article
Information: Network Science , Volume 10 , Issue 1 , March 2022 , pp. 82 - 110

DOI: https://doi.org/10.1017/nws.2022.5 [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

Footnotes

†

Karwa was partially supported by NSF TRIPODS+X grant number 1947919.

‡

SP is partially supported by the Simons Foundation’s the Collaboration Grant forMathematicians 854770. This work was initially supported by U.S. Air Force Office of Scientific Research Grant #FA9550-14-1-0141 to Illinois Tech. A small subset of the simulations for this work were completed on Illinois Tech’s Karlin cluster.

Action Editor: Stanley Wasserman

References

Bajić, D. (2016). Dergms: Supplementary material on GitHub. https://github.com/dbajic/degen.Google Scholar

Bannister, M. J., Devanny, W. E., & Eppstein, D. (2014). ERGMs are hard. Preprint arXiv:1412.1787 [cs.DS].Google Scholar

Barndorff-Nielsen, O. (2014). Information and exponential families in statistical theory. John Wiley & Sons.CrossRef Google Scholar

Batagelj, V., & Mrvar, A. (2006). Pajek datasets. http://vlado.fmf.uni-lj.si/pub/networks/data/.Google Scholar

Batagelj, V., & Zaversnik, M. (2003). An o (m) algorithm for cores decomposition of networks. arxiv preprint cs/0310049.Google Scholar

Bauer, R., Krug, M., & Wagner, D. (2010). Enumerating and generating labeled k-degenerate graphs. In Proceedings of the seventh workshop on analytic algorithmics and combinatorics (analco).CrossRef Google Scholar

Caimo, A., & Friel, N. (2011). Bayesian inference for exponential random graph models. Social Networks, 33(1), 41–55.CrossRef Google Scholar

Chatterjee, S., & Diaconis, P. (2013). Estimating and understanding exponential random graph models. Annals of Statistics, 41(5), 2428–2461.CrossRef Google Scholar

Engström, A., & Norén, P. (2011). Polytopes from subgraph statistics. Discrete Mathematics and Theoretical Computer Science.CrossRef Google Scholar

Fellows, I., & Handcock, M. (2017). Removing phase transitions from gibbs measures. In Artificial intelligence and statistics (pp. 289–297).Google Scholar

Frank, O., & Strauss, D. (1986). Markov graphs. Journal of the American Statistical Association, 81(395), 832–842.CrossRef Google Scholar

Geyer, C. J., & Thompson, E. A. (1992). Constrained monte carlo maximum likelihood for dependent data. Journal of the Royal Statistical Society. Series B (Methodological), 657–699.CrossRef Google Scholar

Giatsidis, C., Thilikos, D. M., & Vazirgiannis, M. (2011). D-cores: Measuring collaboration of directed graphs based on degeneracy. In IEEE 11th international conference on data mining.CrossRef Google Scholar

Goldenberg, A., Zheng, A. X., Fienberg, S. E., & Airoldi, E. M. (2009). A survey of statistical network models. Foundations and Trends in Machine Learning, 2(2), 129–233.CrossRef Google Scholar

Goodreau, S. M., Kitts, J. A., & Morris, M. (2009). Birds of a feather, or friend of a friend? using exponential random graph models to investigate adolescent social networks*. Demography, 46(1), 103–125.Google Scholar PubMed

Handcock, M. S. (2003). Assessing degeneracy in statistical models of social networks. Center for Statistics and the Social Sciences, University of Washington, Working Paper No. 39.Google Scholar

Horvát, S., Czabarka, É., & Toroczkai, Z. (2015). Reducing degeneracy in maximum entropy models of networks. Physical Review Letters, 114(15), 158701.CrossRef Google Scholar

Hummel, R. M., Hunter, D. R., & Handcock, M. S. (2012). Improving simulation-based algorithms for fitting ergms. Journal of Computational and Graphical Statistics, 21(4), 920–939.CrossRef Google Scholar PubMed

Hunter, D. R, & Handcock, M. S. (2006). Inference in curved exponential family models for networks. Journal of Computational and Graphical Statistics, 15(3), 565–583.CrossRef Google Scholar

Hunter, D. R, Handcock, M. S, Butts, C. T, Goodreau, S. M, & Morris, M. (2008a). ergm: A package to fit, simulate and diagnose exponential-family models for networks. Journal of Statistical Software, 24(3). http://www.jstatsoft.org/v24/i03.CrossRef Google Scholar

Hunter, D. R., Goodreau, S. M., & Handcock, M. S. (2008b). Goodness of fit of social network models. Journal of the American Statistical Association, 103(481), 248–258.CrossRef Google Scholar

Hunter, D. R., Krivitsky, P. N., & Schweinberger, M. (2012). Computational statistical methods for social network models. Journal of Computational and Graphical Statistics, 21(4), 856–882.CrossRef Google Scholar PubMed

Karwa, V., & Slavković, A. (2016). Inference using noisy degrees: Differentially private

$\beta$ -model and synthetic graphs. The Annals of Statistics, 44(1), 87–112.CrossRef Google Scholar

Karwa, V., Pelsmajer, M. J., Petrović, S., Stasi, D., & Wilburne, D. (2017). Statistical models for cores decomposition of an undirected random graph. Electronic Journal of Statistics, 11(1), 1949–1982. Preprint, arXiv:1410.7357 [math.ST].CrossRef Google Scholar

Kolaczyk, E. D., & Krivitsky, P. N. (2015). On the question of effective sample size in network modeling: An asymptotic inquiry. Statistical Science: A Review Journal of the Institute of Mathematical Statistics, 30(2), 184.Google Scholar PubMed

Krivitsky, P. N. (2012). Exponential-family random graph models for valued networks. Electronic Journal of Statistics, 6(none), 1100–1128.CrossRef Google Scholar PubMed

Krivitsky, P. N., Handcock, M. S., & Morris, M. (2011). Adjusting for network size and composition effects in exponential-family random graph models. Statistical Methodology, 8(4), 319–339.CrossRef Google Scholar

Lauritzen, S., Rinaldo, A., & Sadeghi, K. (2018). Random networks, graphical models and exchangeability. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 80(3), 481–508.CrossRef Google Scholar

Lick, D. R., & White, A. T. (1970). k-degenerate graphs. Canadian Journal of Mathematics, 22, 1082–1096.CrossRef Google Scholar

Rinaldo, A., Fienberg, S. E., & Zhou, Y. (2009). On the geometry of discrete exponential families with application to exponential random graph models. Electronic Journal of Statistics, 3, 446–484.CrossRef Google Scholar

Rinaldo, A., Petrović, S., & Fienberg, S. E. (2013). Maximum lilkelihood estimation in the

$\beta$ -model. The Annals of Statistics, 41(3), 1085–1110.CrossRef Google Scholar

Robbins, H., & Monro, S. (1985). A stochastic approximation method. In Herbert robbins selected papers (pp. 102–109). Springer.CrossRef Google Scholar

Sampson, S. F. (1968). A novitiate in a period of change: An experimental and case study of relationships. Ph.D. thesis, Department of Sociology, Cornell University.Google Scholar

Saul, Z. M., & Filkov, V. (2007). Exploring biological network structure using exponential random graph models. Bioinformatics, 23(19), 2604–2611.CrossRef Google Scholar PubMed

Schweinberger, M. (2011). Instability, sensitivity, and degeneracy of discrete exponential families. Journal of the American Statistical Association, 106(496), 1361–1370.CrossRef Google Scholar PubMed

Schweinberger, M., & Handcock, M. S. (2015). Local dependence in random graph models: Characterization, properties and statistical inference. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 77(3), 647–676.CrossRef Google Scholar PubMed

Seidman, S. B. (1983). Network structure and minimum degree. Social Networks, 5(3), 269–287.CrossRef Google Scholar

Snijders, T. A. B. (2002). Markov chain Monte Carlo estimation of exponential random graph models. Journal of Social Structure, 3(2), 1–40.Google Scholar

Snijders, T. A. B., Pattison, P. E., Robins, G. L., & Handcock, M. S. (2006). New specifications for exponential random graph models. Sociological Methodology, 36(1), 99–153.CrossRef Google Scholar

Thiemichen, S., & Kauermann, G. (2017). Stable exponential random graph models with non-parametric components for large dense networks. Social Networks, 49, 67–80.CrossRef Google Scholar

Yin, M., Rinaldo, A., Fadnavis, S., et al. (2016). Asymptotic quantization of exponential random graphs. The Annals of Applied Probability, 26(6), 3251–3285.CrossRef Google Scholar

Karwa et al. supplementary material

PDF 624 KB

Article contents

DERGMs: Degeneracy-restricted exponential family random graph models

Abstract

Keywords

Access options

Article purchase

Temporarily unavailable

Footnotes

References

Karwa et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests