Hostname: page-component-cd9895bd7-p9bg8 Total loading time: 0 Render date: 2024-12-23T03:40:31.310Z Has data issue: false hasContentIssue false

LEARNING TO SIGNAL WITH PROBE AND ADJUST

Published online by Cambridge University Press:  11 July 2012

Abstract

This is an investigation of the emergence of signaling using one kind of trial and error learning: probe and adjust.

Type
Signaling and Information Transmission
Copyright
Copyright © Cambridge University Press 2012

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

REFERENCES

Argiento, R., Pemantle, R., Skyrms, B., and Volkov, S. 2009. ‘Learning to Signal: Analysis of a Micro-Level Reinforcement Model.’ Stochastic Processes and their Applications, 119: 319–73.Google Scholar
Barrett, J. A., and Zollman, K. 2009. ‘The Role of Forgetting in the Evolution and Learning of Language.’ Journal of Experimental and Theoretical Artificial Intelligence, 21: 293309.Google Scholar
Donaldson, M., Lachmann, M., and Bergstrom, C. T. 2007. ‘The Evolution of Functionally Referential Meaning in a Structured World.’ Journal of Theoretical Biology, 246: 225–33.Google Scholar
Erev, I., and Roth, A. 1998. ‘Predicting How People Play Games: Reinforcement Learning in Games with Unique Mixed-Strategy Equilibria.’ American Economic Review, 88: 848–81.Google Scholar
Estes, W. K. 1950. ‘Toward a Statistical Theory of Learning.’ Psychological Review, 57: 94107.Google Scholar
Hofbauer, J., and Huttegger, S. 2008. ‘Feasibility of Communication in Binary Signaling Games.’ Journal of Theoretical Biology, 254: 843–9.Google Scholar
Hu, Y. 2010. ‘Essays on Random Processes with Reinforcement.’ Ph.D. thesis, St Anne's College, Oxford University.Google Scholar
Hu, Y., Skyrms, B., and Tarrés, P. In preparation. ‘Reinforcement Learning in Signaling Games.’Google Scholar
Huttegger, S., and Skyrms, B. Forthcoming. ‘Emergence of a Signaling Network with Probe and Adjust.’ In Calcott, B., Joyce, R. and Sterelney, K. (eds), Cooperation, Complexity, and Signaling. Cambridge, MA: MIT Press.Google Scholar
Kimbrough, S. O., and Murphy, F. H. 2009. ‘Learning to Collude Tacitly on Production Levels by Oligopolistic Agents.’ Computational Economics, 33: 4778.CrossRefGoogle Scholar
Lewis, D. K. 1969. Convention: A Philosophical Study. Cambridge, MA: Harvard University Press.Google Scholar
Marden, J. P., Young, H. P., Arslan, G., and Shamma, J. S. 2009. ‘Payoff-based dynamics for Multiplayer Weakly Acyclic Games.’ SIAM Journal on Control and Optimization, 48: 373–96.Google Scholar
Roth, A., and Erev, I. 1995. ‘Learning in Extensive Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term.’ Games and Economic Behavior, 8: 164212.Google Scholar
Skyrms, B. 2010. Signals: Evolution, Learning and Information. London and New York: Oxford University Press.Google Scholar
Suppes, P., and Atkinson, R. C. 1960. Markov Learning Models for Multiperson Interactions. Stanford, CA: Stanford University Press.Google Scholar
Vulkan, N. 2000. ‘An Economist's Perspective on Probability Matching.’ Journal of Economic Surveys, 14: 101–18.Google Scholar
Young, H. P. 2004. Strategic Learning and its Limits. London and New York: Oxford University Press.Google Scholar
Young, H. P. 2009. ‘Learning by Trial and Error.’ Games and Economic Behavior, 65: 626–43.Google Scholar