Toward biologically plausible artificial vision

Mason Westfall

doi:10.1017/S0140525X23001930

Toward biologically plausible artificial vision

Published online by Cambridge University Press: 28 September 2023

Mason Westfall

Show author details

Mason Westfall*: Affiliation:
Department of Philosophy, Philosophy–Neuroscience–Psychology Program, Washington University in St. Louis, St. Louis, MO, USA [email protected] http://www.masonwestfall.com

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Quilty-Dunn et al. argue that deep convolutional neural networks (DCNNs) optimized for image classification exemplify structural disanalogies to human vision. A different kind of artificial vision – found in reinforcement-learning agents navigating artificial three-dimensional environments – can be expected to be more human-like. Recent work suggests that language-like representations substantially improves these agents’ performance, lending some indirect support to the language-of-thought hypothesis (LoTH).

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences , Volume 46 , 2023 , e290

DOI: https://doi.org/10.1017/S0140525X23001930 [Opens in a new window]
Copyright: Copyright © The Author(s), 2023. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Bowers, J. S., Malhorta, G., Dujmović, M., Montero, M. L., Tsvetkov, C., Biscione, V., …Blything, R. (2022). Deep problems with neural network models of human vision. Behavioral and Brain Sciences, 1–74.CrossRef Google Scholar PubMed

Chang, A., Dai, A., Funkhouser, T., Halber, M., Nießner, M., Savva, M., … Zhang, Y. (2017). Matterport3D: Learning from RGB-D data in indoor environments. arXiv preprint, arXiv:1709.06158.Google Scholar

Du, Y., Gan, C., & Isola, P. (2021). Curious representation learning for embodied intelligence. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 10408–10417.CrossRef Google Scholar

Green, E. J. (2020). The perception–cognition border: A case for architectural division. Philosophical Review, 129(3), 323–393.CrossRef Google Scholar

Gupta, A., Kumar, V., Lynch, C., Levine, S., & Hausman, K. (2019). Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning. arXiv preprint, arXiv:1910.11956.Google Scholar

Hafri, A., & Firestone, C. (2021). The perception of relations. Trends in Cognitive Sciences, 25(6), 475–492.CrossRef Google Scholar PubMed

Kempka, M., Wydmuch, M., Runc, G., Toczek, J., & Jaśkowski, W. (2016). Vizdoom: A doom-based AI research platform for visual reinforcement learning. 2016 IEEE Conference on Computational Intelligence and Games (CIG). IEEE, pp. 1–8.CrossRef Google Scholar

Mandelbaum, E., Dunham, Y., Feiman, R., Firestone, C., Green, E., Harris, D.,… Quilty-Dunn, J. (2022). Problems and mysteries of the many languages of thought. Cognitive Science, 46(12), e13225.CrossRef Google Scholar PubMed

Mu, J., Zhong, V., Raileanu, R., Jiang, M., Goodman, N., Rocktäschel, T., & Grefenstette, E. (2022). Improving intrinsic exploration with language abstractions. arXiv preprint, arXiv:2202.08938.Google Scholar

Parisi, S., Rajeswaran, A., Purushwalkam, S., & Gupta, A. (2022). The unsurprising effectiveness of pre-trained vision models for control. International Conference on Machine Learning. PMLR, pp. 17359–17371.Google Scholar

Savva, M., Kadian, A., Maksymets, O., Zhao, Y., Wijmans, E., Jain, B., … Batra, D. (2019). Habitat: A platform for embodied AI research. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 9339–9347.CrossRef Google Scholar

Schwartz, E., Tennenholtz, G., Tessler, Chen, & Mannor, S. (2019). Language is power: Representing states using natural language in reinforcement learning. arXiv preprint, arXiv:1910.02789.Google Scholar

Tam, A. C., Rabinowitz, N. C., Lampinen, A. K., Roy, N. A., Chan, S. C. Y., Strouse, D.,…Hill, F. (2022). Semantic exploration from language abstractions and pretrained representations. arXiv preprint, arXiv:2204.05080.Google Scholar

Xia, F., Zamier, A., He, Z., Sax, A., Malik, J., & Savarese, S. (2018). Gibson Env: Real-world perception for embodied agents. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (CVPR), 2018, pp. 9068–9079.CrossRef Google Scholar