When will's wont wants wanting

Peter Dayan

doi:10.1017/S0140525X20001508

Published online by Cambridge University Press: 26 April 2021

Affiliation: Max Planck Institute for Biological Cybernetics & University of Tuebingen, 72076 Tuebingen, Germany.

Abstract

We use neural reinforcement learning concepts, including Pavlovian versus instrumental control, liking versus wanting, model-based versus model-free control, online versus offline learning and planning, and internal versus external actions and control, to reflect on putative conflicts between short-term temptations and long-term goals.

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences, Volume 44, 2021, e35

DOI: https://doi.org/10.1017/S0140525X20001508
Creative Commons: The target article and response article are works of the U.S. Government and are not subject to copyright protection in the United States.
Copyright: Copyright © The Author(s), 2021. Published by Cambridge University Press
