Hostname: page-component-78c5997874-lj6df Total loading time: 0 Render date: 2024-11-19T12:30:35.442Z Has data issue: false hasContentIssue false

Are schematic diagrams valid visual representations of concepts? Evidence from mental imagery in online processing of English prepositions

Published online by Cambridge University Press:  02 April 2024

Menghan Wang
Affiliation:
School of Languages and Linguistics, The University of Melbourne, Parkville, VIC, Australia
Helen Zhao*
Affiliation:
School of Languages and Linguistics, The University of Melbourne, Parkville, VIC, Australia
*
Corresponding author: Helen Zhao; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Embodied imagery hypothesis proposes the activation of perceptual-motor systems during language processing. Previous studies primarily used concrete visual stimuli to investigate mental imagery in language processing by native speakers (NSs) and second language (L2) learners, but few studies employed schematic diagrams. The study aims to investigate mental imagery in processing prepositional phrases by English NSs and L2 learners. Using image-schematic diagrams as primes, we examine whether any mental imagery effect is modulated by target preposition (over, in), the abstractness of meaning (spatial, extended), and stimulus onset asynchrony (SOA; 1,040 ms, 2,040 ms). A total of 79 adult L2 learners and 100 NSs of English completed diagram–picture matching and semantic priming phrasal decision tasks. Results revealed interference effects on L2 processing of over phrases and under 2,040 ms SOA, but no such effects were observed in the NS group. The selective interference effects in L2 suggest different mental imagery patterns between L1 and L2 processing, and processing schematic diagram primes requires high cognitive demands, potentially leading to difficulties in integrating visual and linguistic information and making grammaticality judgments. The findings partially validate schematic diagrams as visual representations of concepts and suggest the need for further examination of schematic diagrams with varying degrees of complexity.

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press

1. Introduction

As a language-induced cognitive phenomenon, mental imagery is defined as the ability to mentally resemble and internally recreate perceptual-motor experiences in the absence of external physical stimuli (Barsalou, Reference Barsalou1999, Reference Barsalou2008). It posits that comprehending linguistic input about a perceived event involves a mental enactment of corresponding perceptual-motor components in the brain (Bergen, Reference Bergen, Gonzalez-Marquez, Mittelberg, Coulson and Spivey2007, Reference Bergen, Dąbrowska and Divjak2019). Research on mental imagery has predominantly used concrete pictures to explore its effects on the first language (L1) processing in adult native speakers (NSs; de Koning et al., Reference de Koning, Wassenburg, Bos and Van der Schoot2017a; Schütt et al., Reference Schütt, Dudschig, Bergen and Kaup2023; Stanfield & Zwaan, Reference Stanfield and Zwaan2001; Zwaan et al., Reference Zwaan, Stanfield and Yaxley2002). A few L1 mental imagery studies have used visual schematic diagrams to examine whether they can activate offline and online language processing (Richardson et al., Reference Richardson, Spivey, Edelman and Naples2001, Reference Richardson, Spivey, Barsalou and McRae2003). Compared to the abundant research on L1-based embodied mental imagery, there are far fewer studies on mental imagery in second language (L2) processing (Ahn & Jiang, Reference Ahn and Jiang2018; Norman & Peleg, Reference Norman and Peleg2022; Wheeler & Stojanovic, Reference Wheeler and Stojanovic2006). No prior studies have adopted schematic diagrams to examine L2 mental imagery.

Intertwining with human cognition and arising from sensorimotor experiences, image schemas are recurring and dynamic patterns of human perception (Lakoff & Johnson, Reference Lakoff and Johnson1980; Mandler, Reference Mandler and Hampe2005). These schemas, such as UP-DOWN, CONTAINMENT and FORCE, are multimodal analog representations that provide a holistic conceptual summary of perceived spatial relations and movements (Mandler, Reference Mandler2004). A schematic diagram is what cognitive linguists often use to visually illustrate the underlying spatiotemporal relationships of these schemas and externalized linguistic representations. Recently, schematic diagrams have been increasingly used in cognitive linguistics (CL)-inspired L2 pedagogy, but have produced very mixed findings (Boers et al., Reference Boers, Lindstromberg, Stengers, Eyckmans, Boers and Lindstromberg2008, Reference Boers, Piquer Píriz, Stengers and Eyckmans2009; Takimoto, Reference Takimoto2021; Wong et al., Reference Wong, Zhao and MacWhinney2018; Zhao et al., Reference Zhao, Huang, Zhou and Wang2020). Despite these growing applications of schematic diagrams, whether diagrams are appropriate visual representations of linguistic concepts and whether the schematic meanings conveyed by diagrams are comprehensible to L2 learners are often not empirically validated.

Currently, there is a great dearth of empirical studies employing reaction-time (RT)-based behavioral experiments to evaluate the validity of schematic diagrams in CL analysis, as well as how L1 and L2 speakers mentally represent the meanings conveyed by diagrams. These investigations are pivotal for establishing the psychological basis of diagrams as pedagogical aids in L2 instruction. To address these research gaps, the present study aims to examine whether and how mental imagery is triggered by diagrams during the online processing of English prepositional phrases among both L1 and L2 speakers.

We adopted a priming paradigm and utilized schematic diagrams illustrating the spatial configurations of prepositional senses derived from published CL diagrams (Tyler & Evans, Reference Tyler and Evans2003) as visual primes. Behavioral patterns of both L1 and L2 speakers were measured in response to prepositional phrases that shared similar spatial configurations as the schematic diagrams. By employing a priming paradigm, we can capture the real-time perception and cognition of diagrams. Additionally, this paradigm allows us to explore the extent to which the encoding of a diagram facilitates or interferes with the online processing of spatial configurations associated with prepositions. Ultimately, these findings will provide empirical validation for the integration of CL diagrams in language instruction.

1.1 Mental imagery in language processing and influential factors

Previous research on L1-based mental imagery has observed both compatibility and interference effects (Bergen et al., Reference Bergen, Lindsay, Matlock and Narayanan2007, Reference Bergen, Lau, Narayan, Stojanovic and Wheeler2010; Kaschak et al., Reference Kaschak, Madden, Therriault, Yaxley, Aveyard, Blanchard and Zwaan2005; Liu & Bergen, Reference Liu and Bergen2016; Stanfield & Zwaan, Reference Stanfield and Zwaan2001). The compatibility effect suggests a shared neural structure for language comprehension and spatial perception (Bergen, Reference Bergen, Gonzalez-Marquez, Mittelberg, Coulson and Spivey2007). For instance, processing a sentence like A boy climbs a mountain activates the UP-DOWN schema, leading to faster responses to a compatible vertical spatial configuration than to an incompatible horizontal one. In contrast, the interference effect implies mutual inhibition between language comprehension and spatial perceptions, resulting in slower responses when both processes recruit the same perceptual neurons concurrently (Bergen, Reference Bergen, Gonzalez-Marquez, Mittelberg, Coulson and Spivey2007). The precise mechanisms underlying these effects of mental imagery are not fully specified, but facilitation of imagery in tasks like object discrimination and image–word matching occurs when there are shared properties of identity, shape or location between the real and imaginary objects (Craver-Lemley & Arterberry, Reference Craver-Lemley and Arterberry2001). Interference effects are also observed when there is temporal overlap and weak integrability between matching tasks (Bergen, Reference Bergen, Gonzalez-Marquez, Mittelberg, Coulson and Spivey2007).

A widely used task in mental imagery studies is the priming-based sentence–picture verification task. Participants read or hear a prime sentence or a word and then verify whether a subsequent picture matches the meaning of the prime. Some studies reverse the presentation sequence, where participants saw the picture before the verbal stimuli (Bergen et al., Reference Bergen, Narayan and Feldman2003; Lindsay, Reference Lindsay2003). These task paradigms yielded mental imagery effects on processing language encoding orientation (Engelen et al., Reference Engelen, Bouwmeester, de Bruin and Zwaan2011; Richardson et al., Reference Richardson, Spivey, Barsalou and McRae2003; Stanfield & Zwaan, Reference Stanfield and Zwaan2001), shape (de Koning et al., Reference de Koning, Wassenburg, Bos and Van der Schoot2017a; Rommers et al., Reference Rommers, Meyer and Huettig2013; Schütt et al., Reference Schütt, Dudschig, Bergen and Kaup2023; Zwaan et al., Reference Zwaan, Stanfield and Yaxley2002) and size (Chen et al., Reference Chen, de Koning and Zwaan2020; de Koning et al., Reference de Koning, Wassenburg, Bos and Van der Schoot2017b).

Bergen et al. (Reference Bergen, Narayan and Feldman2003) found interference in an image–verb matching task, where English NSs primed with a verb image (e.g. run) took longer to reject effector-matching verbs (e.g. kick) than effector-mismatching verbs (e.g. drink). The interference was attributed to competition and mutual inhibition in brain circuits when words and pictorial representations for similar actions are presented (Bergen et al., Reference Bergen, Lau, Narayan, Stojanovic and Wheeler2010; Kaschak et al., Reference Kaschak, Madden, Therriault, Yaxley, Aveyard, Blanchard and Zwaan2005). Narayan et al. (Reference Narayan, Bergen and Weinberg2004) using a lexical matching task with English NSs also observed a similar interference effect.

Many factors influence the outcome of mental imagery, one of which is the abstractness of meanings. In a study by Liu and Bergen (Reference Liu and Bergen2016), English NSs judged sentence sensibility by pressing a button at different distances from their body. Their RT results showed that processing concrete language (e.g. Jeffery has thrown the pills onto the floor) led to compatibility effects, whereas processing abstract language (e.g. Dan is confessing his secret to the courtroom) resulted in interference effects. The authors argued that the spatial opaqueness of abstract language made it harder to process and mentally simulate simultaneously. In contrast, Richardson et al. (Reference Richardson, Spivey, Barsalou and McRae2003) reported no NS mental imagery difference based on semantic abstractness. They used concrete and abstract verbs normed for vertical (bomb, respect) or horizontal (pull, argue) image schemas and observed interference in a visual discrimination task but facilitation in a visual memory task, regardless of the encoded literal or metaphorical spatial meanings. The finding was attributed to the cognitive process underlying literal and metaphorical connections (Lakoff, Reference Lakoff1987).

Stimulus onset asynchrony (SOA), the time between prime and target onset, is another experimental influential factor that affects mental imagery outcomes (Bergen et al., Reference Bergen, Lindsay, Matlock and Narayanan2007; Masson et al., Reference Masson, Bub and Newton-Taylor2008). Bergen et al. (Reference Bergen, Lindsay, Matlock and Narayanan2007) noted interference with a shorter SOA (50–200 ms) suggesting resource conflict, while compatibility was more likely with sufficient priming time. However, interference effects were observed in longer SOA conditions (e.g. 1,500 ms in Bergen et al., Reference Bergen, Narayan and Feldman2003; Wheeler & Stojanovic, Reference Wheeler and Stojanovic2006), highlighting the complexity of SOA impact. Considering the diverse primes used in mental imagery experiments, ranging from words to images, concrete to abstract, an oversimplified conclusion about SOA is less informative. It is more meaningful to treat SOA as a variable and explore its effects on mental imagery outcomes in perceptual and language processing.

1.2 Mental imagery in L2 processing

Compared to the abundant research on L1-based embodied mental imagery, there are significantly fewer studies on L2 mental imagery (Feng & Zhou, Reference Feng and Zhou2021; Vukovic & Williams, Reference Vukovic and Williams2014; Wheeler & Stojanovic, Reference Wheeler and Stojanovic2006). Similar to the patterns of L1 mental imagery, both compatibility (Ahn & Jiang, Reference Ahn and Jiang2018; Dudschig et al., Reference Dudschig, de la Vega and Kaup2014; Koster et al., Reference Koster, Cadierno and Chiarandini2018; Tomczak & Ewert, Reference Tomczak and Ewert2015) and interference effects (Bergen et al., Reference Bergen, Lau, Narayan, Stojanovic and Wheeler2010; Buccino et al., Reference Buccino, Marino, Bulgarelli and Mezzadri2017; Vukovic & Williams, Reference Vukovic and Williams2014; Wheeler & Stojanovic, Reference Wheeler and Stojanovic2006) have been found in L2 mental imagery research. However, some studies have reported no effects or attenuated effects in L2 mental imagery relative to L1 mental imagery (Chen et al., Reference Chen, Wang, Zhang and Liu2019; Norman & Peleg, Reference Norman and Peleg2022; Wu, Reference Wu2016) due to a reduced degree of embodiment in an L2 compared to L1 (Foroni, Reference Foroni2015), or the formation of different mental representations in L1 and L2 (Dudschig et al., Reference Dudschig, de la Vega and Kaup2014; Monaco et al., Reference Monaco, Jost, Gygax and Annoni2019). Most existing studies using linguistic and visual stimuli support the perceptual-motor activation in L2 processing, covering various aspects like size and orientation (Koster et al., Reference Koster, Cadierno and Chiarandini2018), shape (Ahn & Jiang, Reference Ahn and Jiang2018), motion (Tomczak & Ewert, Reference Tomczak and Ewert2015) and emotion (Dudschig et al., Reference Dudschig, de la Vega and Kaup2014; Foroni, Reference Foroni2015). However, no studies have utilized diagrams to examine the extent to which perceptual-motor and linguistic information can be integrated into L2 processing and the potential factors that might influence this integration.

L2 proficiency has been identified as a key factor influencing the outcomes of L2 mental imagery. Wheeler and Stojanovic (Reference Wheeler and Stojanovic2006), for instance, employed the same image–verb matching task and stimuli as Bergen et al. (Reference Bergen, Narayan and Feldman2003) and found similar interference effects on L2 learners’ processing of English verbs. They also observed that the size of mental imagery effects increased as L2 proficiency developed. In other words, as L2 learners became more familiar with the target verbs, they were better able to fully identify and understand them, resulting in stronger interference effects. Ahn and Jiang (Reference Ahn and Jiang2018) reported compatibility effects, extending the scope to orientation and shape in a sentence–picture matching task. The authors argued that their L2 participants had developed native-like semantic integration abilities for processing linguistic and real-world knowledge and forming semantic representations of sentences. Consequently, sufficient L2 semantic integration competence contributed to the enactment of mental imagery.

In contrast, Chen et al. (Reference Chen, Wang, Zhang and Liu2019) and Norman and Peleg (Reference Norman and Peleg2022) conducted studies that did not find evidence of L2 (and L3) mental imagery, and their findings led to the argument that L2 proficiency may not significantly impact mental imagery in non-native language processing. Chen et al. (Reference Chen, Wang, Zhang and Liu2019) focused on the shape feature in multilingual speakers’ comprehension of L1 Cantonese, L2 Mandarin and L3 English. They discovered a compatibility effect in L1 processing but observed no such effect in L2 or L3, despite participants having higher proficiency in L2 than L3. The authors suggested that different conceptual systems were at play in L1, L2 and L3, with strong evidence of L1 mental imagery but a lack of embodied imagery in non-native language processing irrespective of proficiency levels. Norman and Peleg (Reference Norman and Peleg2022) also explored mental imagery with regard to shape. They detected compatibility effects in L1 Hebrew sentence processing but found no effects in L2 English. Their conclusions supported the idea of distinct L1 versus L2 conceptual systems and suggested that L2 comprehension might not be grounded in sensorimotor knowledge as L1 comprehension. The authors attributed these differences to the language acquisition settings, as their participants were late Hebrew–English bilinguals who acquired Hebrew in naturalistic settings but primarily received English instructions in formal settings. Consequently, they argued that the manner of language acquisition could modulate the presence of mental imagery effects, resulting in a reduced degree of embodiment of conceptual representations in L2 compared to L1 (Dudschig et al., Reference Dudschig, de la Vega and Kaup2014; Foroni, Reference Foroni2015). The debate over whether L2 learners’ processing patterns and behaviors align with those of NSs remains a topic of controversy in the field.

1.3 The application of schematic diagrams

Diagrams are graphic representations composed of simple symbols, such as lines, dots, arrows and boxes, that are used to convey meanings and represent spatial relations (Bryant & Tversky, Reference Bryant and Tversky1999; Tversky et al., Reference Tversky, Zacks, Lee, Heiser, Anderson, Cheng and Haarslev2000). These symbols, while basic, have the versatility to represent a wide range of concepts. For example, circles may denote plates and nodes, while lines can signify boundaries, limits and divisions. Serving as external graphic aids, diagrams visualize our internal mental representations of both concrete (e.g. people, space) and abstract concepts (e.g. time, quantity). By mapping diagram elements onto real-world referents, they facilitate inferences about spatial relationships (Tversky, Reference Tversky and Gero2015; Tversky et al., Reference Tversky, Gao, Corter, Tanaka, Nickerson, Jamnik, Uesaka and Schwartz2016). Additionally, diagrams bridge the gap between abstract information and the tangible world (Karaca, Reference Karaca2012), making abstract properties and relations more accessible (Hutchins, Reference Hutchins2005). However, as abstractions of concepts, diagrams may invite multiple interpretations due to their one-to-many mappings, leading to ambiguity in diagram interpretations (Tversky, Reference Tversky and Gero2015). For example, an over diagram that represents the concept above could be interpreted as spatial representations of other prepositions, such as up or on, resulting in reduced reliability of the over diagram and corresponding spatial meanings.

Diagrams play a crucial role in supporting cognitive processes involved in expressing and understanding complex linguistic meanings (Tylén et al., Reference Tylén, Fusaroli, Bundgaard and Østergaard2013). Unlike written language, diagrams are a unique type of pictorial language capable of elucidating phenomena by encapsulating specific instances within a prototype or schemata of a category (Churchland, Reference Churchland1992). Tyler and Evans (Reference Tyler and Evans2003) expanded on this concept by introducing diagrams to represent the proto-scene of English prepositions. They defined proto-scene as ‘an idealized mental representation across the recurring spatial scenes associated with a particular spatial particle’ (pp. 52) and as the ‘abstract mental representation of the primary sense’ of a preposition (pp. 65). Central to a proto-scene are the trajector (TR) and landmark (LM), which represent the fundamental spatial relationships evoked by the preposition. This proto-scene also provides a conceptual structure for understanding the spatial meaning conveyed by the preposition across different contexts. Understanding a diagram involves a prototype activation process, extracting general information from its well-constructed schematic representation with added unique features to aid interpretation (Craver, Reference Craver2007). Diagrams also exemplify the least ‘embedded’ level of mental representation, being the least situated and context-specific, most symbolic, abstract and imaginative (Zwaan, Reference Zwaan2014).

In theoretical CL literature, diagrams are viewed as imagistic tools extensively used to illustrate complex linguistic systems by providing heuristic representations of their meanings (Lakoff, Reference Lakoff1987; Langacker, Reference Langacker2013; Lindstromberg, Reference Lindstromberg2010; Tyler & Evans, Reference Tyler and Evans2003). Schematic diagrams, a CL creation, represent conceptual spatiotemporal configurations of abstract schematic meanings, such as CONTAINMENT, PATH, FORCE and so forth. Despite doubts about their ability to completely and realistically envision semantic concepts (Marshall & Freitas, Reference Marshall and Freitas2021), diagrams are supported by Langacker (Reference Langacker2013), noting that they excel in ‘elaborating technical displays of great complexity’ and ‘provide a level of precision and explicitness sufficient for most purposes, together with a kind of usability that facilitates discovery’ (pp. 10).

Diagrams have been employed in empirical research as visual stimuli to demonstrate the relationship between abstract schemas and linguistic instantiations (Richardson et al., Reference Richardson, Spivey, Barsalou and McRae2003; Spivey et al., Reference Spivey, Richardson, Gonzalez-Marquez, Pecher and Zwaan2005). Richardson et al. (Reference Richardson, Spivey, Edelman and Naples2001) conducted a norming study and found English NSs exhibited highly consistent diagrammatic representations of concrete and abstract verbs, with greater consistency in verbs associated with vertical schemas (bomb, respect) compared to verbs linked with predicted horizontal schemas (push, offend), providing empirical support for using schematic diagrams to represent both concrete and abstract meanings. Nevertheless, the diagrams generated by cognitive linguists are lack of enough empirical validation for their conceptual meanings (Tyler et al., Reference Tyler, Mueller and Ho2011).

Recently, diagrams have become increasingly prevalent in CL-inspired L2 pedagogy for teaching abstract concepts, grammar and vocabulary (Wong et al., Reference Wong, Zhao and MacWhinney2018; Zhao et al., Reference Zhao, Huang, Zhou and Wang2020). It was believed that diagrams assist learners in establishing conceptual mappings between physical and non-physical domains of human experience, encouraging deep processing and dual coding of language and images to enhance the mnemonic benefits for learning outcomes (Boers et al., Reference Boers, Lindstromberg, Stengers, Eyckmans, Boers and Lindstromberg2008). These diagrams vary in composition, some being purely graphic, while others incorporated word annotations alongside graphic symbols. These highly symbolic diagrams may possess limited visual saliency for prototype activation, and their less intuitive intended meaning may invite more interpretations. However, the accessibility of meaning to L2 learners remains an empirical question to be investigated.

1.4 The present study

The current study is the first investigation into mental imagery in the online processing of English prepositions, specifically primed by schematic diagrams. Our research aims to address several critical research gaps. Firstly, previous research on mental imagery in spatial language processing has primarily focused on verb or noun comprehension (Bergen, Reference Bergen2005; Bergen et al., Reference Bergen, Narayan and Feldman2003; Richardson et al., Reference Richardson, Spivey, Edelman and Naples2001, Reference Richardson, Spivey, Barsalou and McRae2003). Despite the crucial role prepositions play in expressing spatiality in English, previous studies have scarcely explored embodied mental imagery effects on preposition processing. Secondly, diagrams, as a widely used form of visualizing conceptual and schematic representations, have not been sufficiently investigated in mental imagery studies (Richardson et al., Reference Richardson, Spivey, Barsalou and McRae2003), and none in L2-based studies. However, diagram-based research will provide valuable insights into the psychological reality of abstract image schemas in language processing (Spivey et al., Reference Spivey, Richardson, Gonzalez-Marquez, Pecher and Zwaan2005). Investigation into diagrams in L2 processing will offer plausible explanations for the effectiveness (or lack thereof) of diagram-based CL pedagogy. Thirdly, while factors such as semantic abstractness and SOA have been recognized as important determinants of L1 mental imagery effects, these factors have not been thoroughly examined in previous L2 mental imagery studies.

Therefore, our study aims to address these gaps and examine mental imagery effects in the context of online processing of English prepositions. English prepositions are known to be challenging for many L2 learners (Tyler et al., Reference Tyler, Mueller and Ho2011). They have also garnered significant attention from cognitive linguists, who have created rich materials, including schematic diagrams and explanations that clarify the relationship between the spatial and extended (metaphorical) senses (Tyler & Evans, Reference Tyler and Evans2003). Despite the abundance of diagrams in the CL literature, the present study focused on over and in as target prepositions. These diagrams have previously demonstrated efficacy as L2 instructional aids in classroom settings (Wong et al., Reference Wong, Zhao and MacWhinney2018; Zhao et al., Reference Zhao, Huang, Zhou and Wang2020), making them intriguing candidates for further exploration of their psychological underpinnings. To investigate the extent to which L2 learners’ mental imagery patterns resemble those of NSs, we have collected data from adult English NSs to establish a baseline. Specifically, the study investigates the following research questions:

  1. a) Do schematic diagrams yield a mental imagery effect in adult NS speakers’ and L2 learners’ online processing of English prepositional phrases (PPs)?

  2. b) To what extent is any observed mental imagery effect modulated by target prepositions, abstractness of senses and SOAs?

2. Method

2.1 Participants

A total of 79 Chinese adult L2 learners of English (4 males and 75 females, mean age = 24.7, SD = 2.1) participated in the study. All were postgraduate students pursuing master’s degrees at a large Australian public university. Their first language was Mandarin Chinese and their average self-rated Chinese proficiency was 93.4 out of 100 (SD = 10.25). These participants achieved an English proficiency level of IELTS overall 6.0 (M = 6.9, SD = 0.4), corresponding to the B2 to C2 levels of the Common European Framework of Reference (Council of Europe, 2020). On average, they had 15.4 years of English learning experience (SD = 3.8) and an average of 12.9 months of residence in English-speaking countries (SD = 15.8). They reported an average of 10 hours per week of English communication (SD = 11.8). The baseline group consisted of 100 adult NSs of English (47 males and 53 females) with an average age of 44.1 (SD = 14.4). These NSs were recruited from Amazon Mechanical Turk and reported residing in one of the four English-speaking countries (87 in the USA, 10 in the UK, 2 in Canada and 1 in Australia). Their average self-rated English proficiency was 97.5 (SD = 7.5).

All participants were randomly allocated to one of two SOA conditions (1,040 ms vs. 2,040 ms). L2 learner participants in the two SOA conditions demonstrated comparable English proficiency, as indicated by their IELTS overall scores (t = −1.230, p = 0.223). A total of 21 participants (18 NSs and 3 NNSs) with accuracy rates below 70% in the priming task were excluded from the analysis due to data reliability concerns. After this exclusion, the final dataset included 76 L2 learners and 82 NSs of English (with 38 NNSs and 41 NSs in each SOA condition). All participants provided written informed consent and completed the same tasks, receiving monetary compensation for their participation.

2.2 Design and materials

We chose to focus on the prepositions over and in as they represent distinct spatial schemas in mental imagery: UP-DOWN and CONTAINMENT, respectively. Over exhibits a more complex prepositional polysemy, encompassing both a prominent location sense (e.g. The lamp is over the table) and an equally notable motion sense (e.g. He walks over the hill; Herskovits, Reference Herskovits and Stock1997). Despite these variations, both senses imply that the TR is positioned higher than the LM, leading Tyler and Evans (Reference Tyler and Evans2003) to argue that the locational HIGHER THAN sense is the primary sense of over Footnote 1. This primary sense is abstracted from specific spatial scenes and is referred to as the proto-scene of over. On the other hand, in, along with on and at, constitutes a fundamental set of prepositions in English (Kelleher et al., Reference Kelleher, Sloan and Mac Namee2009). The prototypical sense of in (CONTAINMENT) is topological in nature (Kelleher et al., Reference Kelleher, Sloan and Mac Namee2009), with minimal variation on its LM dimensionality. The LM can represent a canonical three-dimensional space (e.g. in the box) or a non-canonical two-dimensional planar surface (e.g. in the street; Tyler & Evans, Reference Tyler and Evans2003). In the polysemy systems of these prepositions, at least one extended sense is chained to its prototypical spatial sense, a connection motivated by conceptual metaphors (MORE IS UP and STATE IS A CONTAINER for over and in, respectively; Lakoff, Reference Lakoff1987; Lakoff & Johnson, Reference Lakoff and Johnson1980).

The schematic diagrams depicting the prototypical spatial configurations of the two prepositions were adopted from Tyler and Evans (Reference Tyler and Evans2003) (Table 1). Each diagram contains a TR and an LM, with the TR being conceptually movable and the LM stationary (Herskovits, Reference Herskovits and Stock1997). The original schematic diagrams of over and in in Tyler and Evans (Reference Tyler and Evans2003) were presented as black-and-white images. In our version, we enhanced the visual contrast saliency between the TR and LM by highlighting the TR with a red dot and thickening the black lines to represent the LM. In the over diagram, the area below the dashed line represents potential contact between the TR and LM (Tyler & Evans, Reference Tyler and Evans2003). In the in diagram, the space below the dashed line represents where the movable TR may be contained by the LM (Tyler & Evans, Reference Tyler and Evans2003).

Table 1. Target prepositions, schematic diagrams, and senses (Tyler & Evans, Reference Tyler and Evans2003)

Note: LM, landmark (ground); TR, trajector (figure).

We opted for the two diagrams provided by Tyler and Evans (Reference Tyler and Evans2003) for several important reasons. Firstly, these diagrams align with our focus on the two target prepositions and encapsulate their central senses as identified by Tyler and Evans (Reference Tyler and Evans2003). Their CL analysis of prepositions is argued to be grounded in speaker usage and holds potential relevance for L2 speakers (Tyler, Reference Tyler2012). Furthermore, we avoided diagrams containing graphic elements such as axes and arrows, as they do not pertain directly to the spatial relations inherent in our target prepositional meanings. We also avoided diagrams with explanatory text to prevent the introduction of additional cues that could interfere with the processing of the target prepositional phrase. Additionally, we eschewed diagrams featuring realistic picture elements, as our investigation encompassed both the spatial and metaphorically extended non-spatial sense of each preposition. Opting for abstract diagrams without realistic pictures allows for the same graphic representation of both concrete and abstract senses of the prepositions. Lastly, the distinct visual presentation of the over and in diagrams reduces the likelihood of confusion during rapid processing.

In the current study, participants first undertook a diagram–picture matching task, designed as training to ensure a consistent understanding of target schematic diagrams before the main semantic priming task. Both tasks were programmed and administered using PsyToolkit (version 3.4.2; Stoet, Reference Stoet2010, Reference Stoet2017).

2.2.1 The diagram–picture matching task

Given that schematic diagrams are abstract images with potentially ambiguous meanings that are not inherently self-explanatory, the diagram–picture matching task was formulated to aid participants in grasping the TR–LM spatial configurations depicted in the target diagrams. This was achieved by presenting participants with concrete pictures featuring drawings of real-world objects, which served as visual representations of sentences illustrating the spatial senses of over and in. As participants associated these scenes with schematic diagrams, it was anticipated that their relevant sensorimotor experiences would be activated and retrieved from long-term memory, enhancing their understanding of the spatial elements in the diagrams.

Task materials were two target diagrams and 30 pictures. Initially, 20 target sentences (10 per target preposition) and 10 filler sentences (involving non-target prepositions behind, between, out, at and towards) were created, all representing proto-spatial senses (Supplementary Material 1). Sentences were sourced from CL literature (Lindstromberg, Reference Lindstromberg2010; Tyler & Evans, Reference Tyler and Evans2003) and the Corpus of Contemporary American English (COCA; Davies, Reference Davies2008). A diagram–picture pair was deemed matched if the picture depicted a scene that illustrated the proto-scene of the target preposition, which is what the diagram represents. Figure 1 presents examples of matched pairs for over (Figure 1a: The bird flies over the city) and in (Figure 1b: A mole lives in a burrow). Conversely, a diagram–picture pair was considered mismatched if the picture portrayed a scene that did not illustrate the proto-scene of the target preposition (e.g. over diagram – in picture: A mole lives in a burrow). All pictures were black-and-white and created by a contracted professional graphic designer.

Figure 1. Sample stimuli of the diagram–picture matching task for over (a) and in (b).

Participants received written descriptions in their L1 before diagram–picture matching, explaining key features of the diagrams (e.g. the denotations of red dots, solid black lines and dashed lines). During the task, they assessed diagram–picture pairs (20 matched and 20 mismatched) with the diagram and picture randomly placed on either side of the screen. Participants determined if the pairs matched and received immediate corrective feedback. If a participant made a ‘matched’ judgment for a matched diagram–picture matching pair, the trial was recorded as accurate; otherwise, it was recorded as incorrect. Correct judgments indicated participants’ ability to comprehend the spatial configuration of diagrams by correctly associating the TR and LM in diagrams with real-world objects in realistic pictures. Participants who achieved accuracy above 70% after the 40 trials proceeded to the priming task. If not, they continued until reaching the required accuracy.

2.2.2 The semantic priming task

The task adopted a 2 relatedness (related, unrelated) × 2 target preposition (over, in) × 2 sense (spatial, extended) × 2 SOA (1,040 ms, 2,040 ms) factorial design, structured as a binary acceptability judgment task. Participants were presented with either a related or unrelated diagrammatic prime and then instructed to judge the acceptability of target prepositional phrases (PPs). Target PPs (e.g. over the horizon) were selected from COCA (Davies, Reference Davies2008) and consisted of three words (Preposition + Determiner/Adjective + Noun), where the noun served as the LM in the configuration. Each prime was displayed as a schematic diagram with an embedded word. The embedded word representing the TR was annotated in the red dot. The related and unrelated primes shared the same TR word but were paired with different diagrams. For instance, the TR word sun annotated in the over diagram is a related prime for the target PP over the horizon (Figure 2), whereas the same TR word annotated in the in diagram served as an unrelated prime for the same target PP (Figure 3).

Figure 2. A related trial of the semantic priming task (over diagram – over phrase).

Figure 3. An unrelated trial of the semantic priming task (in diagram – over phrase).

Two task stimuli versions were created using a Latin square design, ensuring each prime word was presented to each participant only once, without overlap across prepositions and senses. Each version consisted of 152 stimuli, including 88 target PPs (22 target phrases × 2 prepositions × 2 senses) and 64 fillers (Supplementary Material 1). Fillers were intentionally constructed as ungrammatical phrases (e.g. the to city, in room our), encompassing both target prepositions (over, in) and other fillers (on, to, from, under, across, through, below, before, after, between). The task stimuli were presented in a random order.

We maintained matching prime word frequency, target PP frequency and event plausibility of the prime–PP combinations (e.g. sun | over the horizon) across different stimulus conditions (see Supplementary Material 1). These frequencies were verified using COCA (Davies, Reference Davies2008). No significant differences were found in prime word frequency between prepositions (t = 1.251, p = 0.215) or between senses (t = 1.452, p = 0.151). Similarly, no significant differences were observed in target PP frequency between prepositions (t = −0.150, p = 0.881) or between senses (t = 0.037, p = 0.971).

An event plausibility task was conducted online using Qualtrics (Provo, UT, USA, version 12.2020). Additional 58 adult NSs of English were recruited from Amazon Mechanical Turk to participate in this norming task. Data from 4 participants were excluded due to unreliability. Participants were asked to rate the event plausibility of prime–PP combinations (e.g. helicopter | over the city) on a 7-point Likert scale, ranging from 1 (absolutely impossible) to 7 (definitely possible). No significant differences were observed between prepositions (t = −0.012, p = 0.990) or between senses (t = 1.050, p = 0.297).

The semantic priming task procedure (Figures 2 and 3) followed de Wit and Kinoshita (Reference de Wit and Kinoshita2015), with fonts being set as Arial 30. The task initiated with a 250-ms spot fixation (+) at the screen center, followed by a prime displayed for 1,000 or 2,000 ms, separated by a 40 ms interstimulus interval. Subsequently, a target phrase (e.g. over the horizon) appeared, and participants judged its acceptability by pressing the ‘F’ or ‘J’ as quickly as possible. The target phrase remained until participants responded or a maximum of 5,000 ms had elapsed. Each trial concluded with a 500-ms blank page before transitioning to the spot fixation for the next trial. Five training trials were set before the formal trials, with data excluded from the analysis. Only the RTs of correct responses were included in the analysis.

The two SOA conditions were designed following the SOA of 1,500 ms in Lindsay (Reference Lindsay2003) and Wheeler and Stojanovic (Reference Wheeler and Stojanovic2006). The design was also based on feedback from a pilot study involving three advanced L2 learners who completed the two tasks. The pilot participants found a prime presentation time of less than 1 s to be too short for meaningful processing of the prime and expressed greater comfort and better experiences with an extended 2-s presentation time. Pilot study data were excluded from the final analysis.

2.3 Statistical analysis

The R statistical platform (version 4.0.3; R Core Team, 2020) was used for data analysis. Before analysis, data trimming removed responses shorter than 200 ms and longer than 3,000 ms (2.3% of the data points). The lme4 package (version 1.1–29; Bates et al., Reference Bates, Maechler, Bolker, Walker, Christensen, Singmann, Dai, Scheipl, Grothendieck, Green, Fox, Bauer and Krivitsky2022) and lmerTest package (version 3.1–3; Kuznetsova et al., Reference Kuznetsova, Brockhoff and Christensen2017) were used to construct mixed-effects models to examine fixed effects and random effects on RTs (Linck & Cunnings, Reference Linck and Cunnings2015), with an alpha value set at 0.05 for all models. Separate models were built to analyze the RTs of NS and L2 learner groups respectively.

A simple contrast-coding scheme (−0.5, 0.5) was applied to categorical variables, including preposition, sense, SOA and relatedness. Dummy coding (1, 0) was applied to accuracy status in the priming task (correct coded as 1; incorrect as 0). Numerical variables including RTs and covariates (i.e. prime word frequency, target PP string frequency, syllable length of target PPs, prime–PP event plausibility, number of matching task trials, matching task accuracy) were log-transformed (natural log). All models were fitted with participants and items as random effects, with random intercepts for participants and items, a by-item random slope for SOA and by-participant random slopes for prepositions and senses. Model convergence was confirmed, and statistical assumptions were checked. The emmeans package (version 1.8.4; Lenth et al., Reference Lenth, Buerkner, Giné-Vázquez, Herve, Jung, Love, Miguez, Riebl and Singmann2023) applied Tukey correction for pairwise comparisons. Cohen’s d (Reference Cohen1977) was reported as the effect size for RTs, interpreted based on Plonsky and Oswald’s (Reference Plonsky and Oswald2014) recommendations: 0.60, 1.00, 1.40 corresponding to small, medium and large effect sizes for within-subject contrasts, and 0.40, 0.70 and 1.00 as small, medium and large effect sizes for between-group contrasts. Graphics were generated using the ggplot2 package (version 3.4.0; Wickham, Reference Wickham2016).

3. Results

3.1 The diagram–picture matching task

Descriptive statistics of the number of trials, RTs and accuracy rates (ARs) of the diagram–picture matching task are presented in Supplementary Material 2. For the L2 learner group, results of paired-sample t-tests showed the RTs for judging over diagrams were significantly longer than in diagrams (MD = 1,024.54, t = 0.32, p < 0.001, 95% CI [0.58, 1.10], Cohen’s d = 0.84). It was also found the acceptance ARs of the over diagram were significantly higher than the in diagram (MD = 0.05, t = 2.52, p = 0.01, 95% CI [0.06, 0.52], Cohen’s d = 0.29) but the rejection ARs of judging the over diagram were significantly lower than the in diagram (MD = 0.04, t = 2.03, p = 0.046, 95% CI [0.00, 0.46], Cohen’s d = 0.23). For the NS group, results of paired-sampled t-tests showed no difference between the RTs (t = 1.91, p = 0.06), acceptance ARs (t = 1.31, p = 0.20) or rejection ARs (t = −0.70, p = 0.49) of over and in diagrams.

3.2 The semantic priming task

Descriptive statistics of the RTs and ARs of the semantic priming task performance of the L2 learner and NS groups are presented in Supplementary Material 2.

3.2.1 The L2 learner group: Response times results

A full linear mixed-effects model on L2 RTs was built by including all the fixed effects (with interactions) and covariates (Supplementary Material 3). The proficiency (log(IELTS)) × relatedness interaction was built into the full model but turned out to be not significant. We then removed this interaction and only kept proficiency as a covariate. All the non-significant covariates were removed to construct the second modelFootnote 4, which proved to be a better one than the full model according to both Akaike information criterion (AIC) and Bayesian information criterion (BIC) values. Table 2 presents the model 2 output.

Table 2. L2 learners’ response time results of linear mixed effects modeling

Note: ***p < 0.001, **0.001 < p < 0.01, *0.01 < p < 0.05.

Preposition and sense had significant fixed effects on RTs, while relatedness and SOA did not create significant fixed effects. The post hoc analyses showed the mean RTs for over phrases (M = 1,061, SE = 34.0, 95% CI [987, 1,139]) were estimated to be 81 ms longer than those of the in phrases (M = 980, SE = 28.7, 95% CI [918, 1,046]; Cohen’s d = 0.28, SE = 0.10, 95% CI [0.09, 0.47], corresponding to a small effect). The mean RTs for extended senses (M = 1,072, SE = 32.9, 95% CI [1,001, 1,148]) were estimated to be 103 ms longer than spatial senses (M = 969, SE = 29.0, 95% CI [906, 1,036]; Cohen’s d = 0.36, SE = 0.09, 95% CI [0.19, 0.54], corresponding to a small effect). English proficiency indicated by the IELTS overall score was a significant covariate, indicating participants with a higher L2 proficiency spent less time processing English PPs than lower-proficiency participants.

The model also yielded significant two-way interactions. A preposition × relatedness interaction (Figure 4) revealed that the mean RTs of judging over phrases in related trials (M = 1,077, SE = 34.9, 95% CI [1,002, 1,158]) were estimated to be 33 ms slower than over unrelated trials (M = 1,044, SE = 33.9, 95% CI [971, 1,123]; t = 3.07, p = 0.002, Cohen’s d = 0.11, SE = 0.04, 95% CI [0.04, 0.18], corresponding to a small effect) but there was no difference between related and unrelated trials in the RTs of judging in phrases (t = −1.21, p = 0.23). A SOA × relatedness interaction (Figure 5) suggested that the mean RTs in related trials (M = 1,067, SE = 39.3, 95% CI [983, 1,159]) were estimated to be 25 ms longer than in unrelated trials (M = 1,042, SE = 38.4, 95% CI [960, 1,132]) in the 2,040 ms SOA condition (t = 2.36, p = 0.02, Cohen’s d = 0.08, SE = 0.04, 95% CI [0.01, 0.16], corresponding to a small effect), but not such a difference was observed in the 1,040 ms SOA (t = −0.50, p = 0.62).

Figure 4. Response times of related and unrelated trials by preposition. Black bars represent medians, and green dots represent the individual response times per participant per trial.

Figure 5. Response times of related and unrelated trials by stimulus onset asynchrony. Black bars represent medians, and blue dots represent the individual response times per participant per trial.

3.2.2 The native speaker group: Response times results

A full linear mixed-effects model on English NSs’ RTs was built by including all the fixed effects (with interactions) and covariates. All the non-significant covariates were removed to construct the model, which proved to be better than the full model according to both AIC and BIC values (Supplementary Material 3).

Among the fixed effects, target preposition (b = −0.04, SE = 0.01, 95% CI [−0.06, −0.01], t = −2.53, p = 0.012) was found to be a significant predictor of the RTs of NSs’ PP judgment. NSs spent significantly longer time (35 ms) judging over phrases (M = 993, SE = 30.4, 95% CI [927, 1,065]) than judging in phrases (M = 958, SE = 27.9, 95% CI [897, 1,024]; Cohen’s d = 0.15, SE = 0.06, 95% CI [0.03, 0.26], corresponding to a small effect). The event plausibility of prime–PP combinations was the only significant covariate (b = −0.51, SE = 0.16, 95% CI [−0.82, −0.19], t = −3.16, p = 0.002), implying the prime–PP combinations with a higher event plausibility were processed faster than those with a lower event plausibility.

4. Discussion

The study investigates Chinese L2 English learners’ mental imagery in processing English PPs in comparison with English NSs’ mental imagery in L1 processing via visual abstraction of schematic diagrams. RT results revealed interference effects on processing of over phrases and PPs under 2,040 ms SOA in the L2 learner group but not in the English NS group. Several factors influenced RTs of phrasal judgment. Preposition had a main effect on RTs in both L1 and L2 processing, resulting in longer RTs for processing over phrases. Sense influenced RTs in L2 PP processing, with extended senses leading to longer RTs compared to spatial senses. Besides, L2 learners’ RTs decreased with higher L2 proficiency, while NSs’ RTs decreased with greater event plausibility.

4.1 Selective interference effects of diagrams on L2 online processing

The interference in the L2 learner group but not in the NS group aligns partially with previous findings in L2 mental imagery (Bergen et al., Reference Bergen, Lau, Narayan, Stojanovic and Wheeler2010; Wheeler & Stojanovic, Reference Wheeler and Stojanovic2006). The discrepancy in mental imagery effects between the two participant groups could be attributed to the varying cognitive demands involved in processing schematic diagrams and linguistic stimuli. Many L1 mental imagery studies reported compatibility effects using concrete pictures (de Koning et al., Reference de Koning, Wassenburg, Bos and Van der Schoot2017a; Engelen et al., Reference Engelen, Bouwmeester, de Bruin and Zwaan2011; Schütt et al., Reference Schütt, Dudschig, Bergen and Kaup2023) with a higher level of embeddedness and proximity to contextualized situations (Zwaan, Reference Zwaan2014), thus resulting in the least cognitive demands and less difficulty integrating the pictures with linguistic contexts, whereas the current study used schematic diagrams, representing symbolic mental representations with lower embeddedness and higher abstraction, making the processing more cognitively demanding than pictures, which offsets the compatibility effects in the NS group.

The interference effect in the L2 learner group may be due to mutual inhibition and information integration. Firstly, the interference effects might stem from mutual inhibition arising from the competition between simultaneously processing visual and linguistic input through the same perceptual-motor neural circuit (Bergen et al., Reference Bergen, Lau, Narayan, Stojanovic and Wheeler2010; Kaschak et al., Reference Kaschak, Madden, Therriault, Yaxley, Aveyard, Blanchard and Zwaan2005). According to the theory of mirror neurons (Perry et al., Reference Perry, Stiso, Chang, Lin, Parvizi and Knight2018), neural structures can be activated when closely related words and pictorial representations of similar actions are presented (Bergen et al., Reference Bergen, Narayan and Feldman2003). In the current design, the TR word was embedded in the diagram and presented simultaneously as the prime. This activation process may lead to mutual inhibition between two mirror neural representations, one for the TR word (linguistic representation) and the other for the diagrammatic element representing the TR word (visual representation), as they compete with one another.

Additionally, L2 learners might be more affected in integrating information from various multimodal sources (Martin et al., Reference Martin, Garcia, Breton, Thierry and Costa2016; Romero-Rivas et al., Reference Romero-Rivas, Corey, Garcia, Thierry, Martin and Costa2017), given their reduced degree of embodiment in L2 compared to L1 (Dudschig et al., Reference Dudschig, de la Vega and Kaup2014; Foroni, Reference Foroni2015). For late bilinguals who acquired their L2 in a formal school setting, L2 lexical concepts were initially stored as mental representations in their L1 (Silverberg & Samuel, Reference Silverberg and Samuel2004), resulting in weak associations between L2 forms and mental representations (Dudschig et al., Reference Dudschig, de la Vega and Kaup2014; Monaco et al., Reference Monaco, Jost, Gygax and Annoni2019). Another contributing factor is that L2 learners tend to have a relatively limited working memory capacity for L2 processing compared to that of NSs for L1 processing, and these discrepancies are particularly noticeable in late bilinguals (Michael & Gollan, Reference Michael, Gollan, Kroll and de Groot2005; Zhou et al., Reference Zhou, Rossi and Chen2017). The current priming design involves the separate presentation of the TR and LM words in the diagrammatic prime and target phrase. This separation may hinder participants’ mental construction of the complete TR–LM spatial configuration when they solely view the diagrammatic prime. Consequently, this complexity places greater demands on cognitive resources for processing the initial stimuli, which limits the cognitive resources to process the subsequent stimuli.

The study identifies selective interference effects in L2 learners’ processing of over phrases, when presented with a related prime of the over diagram embedded with a TR word, but relatedness was not significant for the in-phrase processing. This preposition-specific interference effect can be attributed to both the lexical properties of over and the over diagram. As a preposition, over is less frequent but more semantically complex than in. The frequency difference (1,222,405 instances for over versus 16,541,037 instances for in according to COCA) implies that speakers have fewer encounters with the various uses of over, potentially leading to a lower quality of conceptual understanding (Bybee, Reference Bybee2010).

Additionally, the preposition over exhibits greater semantic complexity and polysemy compared to in. In terms of dynamicity, the prototypical sense of over (i.e. HIGHER THAN) is abstracted from both static location (e.g. The lamp is over the table) and dynamic motion senses (e.g. He walks over the hill; Herskovits, Reference Herskovits and Stock1997). Conversely, the prototypical sense of in (CONTAINMENT) is abstracted solely from static senses (e.g. in the car, in danger), as dynamic containment is typically expressed by the preposition into rather than in (Tyler & Evans, Reference Tyler and Evans2003). Therefore, over, incorporating both static and dynamic senses in its prototypical sense, is semantically more intricate than in, whose extended senses remain static. Additionally, Tyler and Evans’ (Reference Tyler and Evans2003) principled polysemy model indicates that over (with 14 extended senses in a five-cluster semantic network) demonstrates a higher degree of semantic polysemy than in (with 7 extended senses in four clusters), suggesting a more complex semantic structure for over. Given its greater polysemy and less direct chaining between its proto-scene and extended senses, comprehending the semantics of over can pose a challenge, even for NSs (Coventry & Mather, Reference Coventry, Mather, Coventry and Olivier2002). This complexity is evidenced by the longer RTs observed for judging over phrases in both NSs and L2 learners in our study.

As a diagram, over is more ambiguous and less reliable than in. The fuzziness in the vertical separation of TR and LM complicates precise localization in a vertically oriented image (Hampe & Grady, Reference Hampe and Grady2005). As a result, the over diagram may project the central sense of above (Lakoff, Reference Lakoff1987). Speakers might also interpret it as a spatial representation of up or on. Thus, the over diagram could lead to ambiguity with multiple interpretations (Tversky, Reference Tversky and Gero2015), and pose challenges in its processing. In comparison, the in diagram serves as a more reliable cue for the preposition in and containment interpretation. It cannot be used for other prepositions, like into, out, out of and through, maintaining a clear prototypical meaning and interpretation.

We contend that the potentially ambiguous over diagram is more cognitively demanding than the in diagram, supported by diagram–picture matching results. While NSs showed no differences in RTs, acceptance or rejection ARs between over and in diagrams, L2 learners spent significantly more time processing over diagrams and had lower rejection ARs. The longer RTs suggest deliberation and higher rejection ARs indicate the greater ambiguity of the over diagram. Rejecting diagram–picture mismatching pairs demands a higher level of understanding of the diagram, requiring participants to grasp both the TR and LM while distinguishing the depicted spatial configuration of the over diagram from others. Therefore, the diagram–picture matching results underscore that L2 learners invest more cognitive effort in processing the over diagram.

Finally, selective interference effects of diagrams emerged in L2 learners’ PP processing only at a 2,040 ms SOA, differing from previous patterns observed at shorter SOA (e.g. Bergen et al., Reference Bergen, Lindsay, Matlock and Narayanan2007). This finding contrasts with prior studies but aligns with Bergen et al. (Reference Bergen, Narayan and Feldman2003) and Wheeler and Stojanovic (Reference Wheeler and Stojanovic2006), where a longer SOA (up to 1,500 ms) revealed interference effects. The longer SOA enables deeper prime processing (Shaki & Fischer, Reference Shaki and Fischer2023), allowing L2 learners more time to digest the primes by integrating the TR word with diagrams. This higher processing depth also demands greater cognitive effort in mapping integrated information onto the LM in the target phrase.

4.2 Factors that influence L1 and L2 processing

The study identified that target prepositions influenced the RTs of phrasal judgment in both participant groups, with a greater impact on L2 learners compared to NSs. Despite advanced English proficiency in L2 learners, they exhibited nativelike sensitivity to prepositions, but with a slower processing speed. The faster responses to over phrases were not due to the differences in syllable length between the over versus in phrases, as it was not a significant covariate in both mixed models, thus statistically controlling this potential confounding variable (Sonbul, Reference Sonbul2015). Instead, we attribute the phenomenon to the higher complexity and lower frequency of over in corpus, which has been explained in the previous discussion.

In terms of abstractness of sense, L2 learners processed extended senses slower than spatial senses, which was not observed in NSs. This difference could stem from the figurative nature of extended senses, rooted in conventional conceptual metaphors MORE IS UP and STATE IS A CONTAINER (Lakoff, Reference Lakoff1987). Understanding figurative language involves cognitive effort in activating conceptual mapping between target and source domains (Lai & Curran, Reference Lai and Curran2013), posing a greater challenge for L2 learners who lack sensitivity and familiarity with the metaphors in the target language (Horvat et al., Reference Horvat, Bolognesi and Lahiri2021; Littlemore et al., Reference Littlemore, Chen, Koester and Barnden2011). It implies that L2 learners may compute the meaning of word components to reach an integral phrasal comprehension, contrasting with NSs who tend to process it as a whole with minimal computation during the phrasal processing (Shi et al., Reference Shi, Peng and Li2023).

The findings on covariates also highlight a significant distinction between L1 and L2 processing. The RTs of NSs were influenced by the event plausibility of prime–PP combinations (e.g. sun | over the horizon), while L2 learners did not display a strong sensitivity to this input factor. This finding aligns with usage-based studies of language acquisition (Ellis et al., Reference Ellis, Simpson‐Vlach and Maynard2008; Gries et al., Reference Gries, Hampe and Schönefeld2005; Zhao et al., Reference Zhao, Huang, Zhou and Wang2020), which demonstrate that NSs and L2 learners are sensitive to different input-based statistical information. NSs are primarily influenced by associational information (event plausibility defined by the association between the prime word and the phrasal context), while L2 learners often fail to show the same degree of sensitivity to contextualized probability as NSs.

Finally, the quality of L2 phrasal processing was modulated by the level of L2 proficiency (Wolter & Yamashita, Reference Wolter and Yamashita2018), as evidenced by the current finding that higher L2 proficiency leads to faster RTs in L2 learners’ PP judgment. This suggests that higher L2 proficiency enhances grammatical sensitivity in PP judgment.

4.3 The validity of schematic diagrams as representations of concepts

The discovery of selective interference effects in the L2 learner group provides compelling evidence of the presence of mental imagery in L2 processing and bolsters the validity of schematic diagrams as conceptual representations. Future studies may identify compatibility effects, serving as additional evidence to substantiate the validity of schematic diagrams. We acknowledge the current limitations and provide recommendations for future researchers and implications for future L2 instructors.

Firstly, we suggest future studies to increase the integrability of visual and linguistic information in the prime with the target. Considering that linguistic information could be more readily comprehensible compared to diagrammatic visual information, reversing the current visual–linguistic stimuli sequence by presenting linguistic stimuli (i.e. PP) first as the prime followed by diagrams embedded with TR word (and LM word) as the target might lead to the emergence of compatibility effects. This paradigm finds support in previous studies that presented sentences as primes and pictures as targets in the sentence–picture verification tasks (Ahn & Jiang, Reference Ahn and Jiang2018; Bergen, Reference Bergen, Gonzalez-Marquez, Mittelberg, Coulson and Spivey2007; Chen et al., Reference Chen, de Koning and Zwaan2020; de Koning et al., Reference de Koning, Wassenburg, Bos and Van der Schoot2017a; Wang & Zhao, Reference Wang and ZhaoUnder Review).

Secondly, it is advisable for future research to explore diagrams with varying degrees of transparency and graphic diversity. The current study only examined the over and in diagrams provided by Tyler and Evans (Reference Tyler and Evans2003) as exemplars among the abundant diagrams in CL literature and empirical research. These two diagrams are somehow abstract in nature, employing only fundamental graphic symbols like lines and dots to convey meanings, without incorporating supplementary graphic elements such as explanatory text or realistic imagery. We encourage future research to investigate a broader range of diagrams in subsequent experiments, encompassing other varieties of graphic elements including arrows, axes, text labels, shapes, symbols, colors, realistic pictures and so forth. This investigation can contribute to the field of mental imagery by addressing whether the graphic diversity of diagrams affects cognitive processing demands and their interaction with language processing.

Thirdly, we encourage future research in mental imagery to incorporate diagrammatic and linguistic stimuli that pertain to other abstract domains. Schematic diagrams, composed of abstract visual symbols, may be more suitable than pictures to investigate mental representations associated with abstract grammatical and semantic domains such as time, force dynamics, countability and figurativeness. These abstract domains have their roots in concrete spatial domains (Lakoff, Reference Lakoff1987), such as motion paths (Wang & Zhao, Reference Wang and ZhaoUnder Review). We argue that diagrams may offer a more appropriate means of representing abstract concepts that are invisible and challenging to describe solely through language, potentially leading to more pronounced mental imagery effects.

Finally, in terms of pedagogical implications, the present study contributes to the existing CL literature and provides a rationale for incorporating schematic diagrams into CL-based L2 instructions. These findings are in line with previous CL-based L2 instructional studies, which have suggested that the use of pictorial elucidations can potentially lead to distraction effects on L2 learners’ acquisition of relatively complex linguistic items (Boers et al., Reference Boers, Lindstromberg, Stengers, Eyckmans, Boers and Lindstromberg2008, Reference Boers, Piquer Píriz, Stengers and Eyckmans2009). Given the observed selective interference effects, caution can be warranted in selecting and using diagrams as instructional tools. For instance, when employing schematic diagrams to teach prepositions like over to L2 learners, providing explicit explanations is advisable to mitigate potential ambiguity or confusion.

In conclusion, the study introduces an innovative experimental paradigm, integrating semantic information of the TR word with image-schematic representation, influencing real-time language processing by encapsulating a figure–ground relation. Selective interference effects were observed in L2 learners’ processing of over phrases and PPs under the longer SOA condition, which suggests that mental imagery effects were modulated by target preposition and SOA but not sense. The findings support the mental imagery process in the L2 processing of English prepositions and the psychological reality of image-based semantic representations. The study provides new empirical evidence on the interaction between visual perception, mental imagery and linguistic processes, and inspires the application of schematic diagrams in future psycholinguistic studies.

Data availability statement

Data and supplementary materials are accessible from https://osf.io/v6ath/.

Acknowledgments

We would like to thank two anonymous reviewers for their valuable comments on this paper.

Footnotes

1 The prototypical meaning of over itself is debatable (Deane, Reference Deane, Hampe and Grady2005), with some arguing for ABOVE AND ACROSS (Lakoff, Reference Lakoff1987) and others for HIGHER THAN (Tyler & Evans, Reference Tyler and Evans2003).

2The bee’ is a movable TR, and ‘the flower’ is a stationary LM. The space below the dashed line and above the thickened black line signifies where ‘the bee’ may access nectar from ‘the flower’.

3The cow’ is a movable TR. Despite ‘the field’ being a stationary and physically planar container, it can be construed as a non-canonical bounded LM due to the flexible nature of human conceptualization of spatial components (Tyler & Evans, Reference Tyler and Evans2003). A canonical bounded LM would be ‘the bowl’ as in the sentence ‘A pear is in the bowl’.

4 Model equation: log(RT) ~ Sumtargetprep*Sumrelatedness + Sumsense*Sumrelatedness + SumSOA*Sumrelatedness + log(IELTS) + (1 + Sumtargetprep + Sumsense | Subjects) + (SumSOA | item)

References

Ahn, S., & Jiang, N. (2018). Automatic semantic integration during L2 sentential reading. Bilingualism: Language and Cognition, 21(2), 375383. https://doi.org/10.1017/S1366728917000256CrossRefGoogle Scholar
Barsalou, L. W. (1999). Perceptual symbol systems. Behavioral and Brain Sciences, 22(4), 577660. https://doi.org/10.1017/S0140525X99002149CrossRefGoogle ScholarPubMed
Barsalou, L. W. (2008). Grounded cognition. Annual Review of Psychology, 59(1), 617645. https://doi.org/10.1146/annurev.psych.59.103006.093639CrossRefGoogle ScholarPubMed
Bates, D., Maechler, M., Bolker, B., Walker, S., Christensen, R. H. B., Singmann, H., Dai, B., Scheipl, F., Grothendieck, G., Green, P., Fox, J., Bauer, A., & Krivitsky, P. N. (2022). Package ‘lme4’ (Version 1.1-29) [Computer software]. Comprehensive R Archive Network. https://cran.r-project.org/web/packages/lme4/lme4.pdfGoogle Scholar
Bergen, B. K. (2005). Mental simulation in spatial language processing. Proceedings of the Annual Meeting of the Cognitive Science Society, 27. https://escholarship.org/uc/item/5fn5t33s.Google Scholar
Bergen, B. K. (2007). Experimental methods for simulation semantics. In Gonzalez-Marquez, M., Mittelberg, I., Coulson, S., & Spivey, M. J. (Eds.), Human cognitive processing (Vol. 18, pp. 277301). John Benjamins Publishing Company.Google Scholar
Bergen, B. K. (2019). Embodiment. In Dąbrowska, E. & Divjak, D. (Eds.), Cognitive linguistics—Foundations of language (pp. 1135). De Gruyter Mouton.CrossRefGoogle ScholarPubMed
Bergen, B. K., Lau, T.-T. C., Narayan, S., Stojanovic, D., & Wheeler, K. (2010). Body part representations in verbal semantics. Memory & Cognition, 38(7), 969981. https://doi.org/10.3758/MC.38.7.969CrossRefGoogle ScholarPubMed
Bergen, B. K., Lindsay, S., Matlock, T., & Narayanan, S. (2007). Spatial and linguistic aspects of visual imagery in sentence comprehension. Cognitive Science, 31(5), 733764. https://doi.org/10.1080/03640210701530748CrossRefGoogle ScholarPubMed
Bergen, B. K., Narayan, S., & Feldman, J. (2003). Embodied verbal semantics: Evidence from an image-verb matching task. Proceedings of the Annual Meeting of the Cognitive Science Society, 25. https://escholarship.org/uc/item/1hg3n38mGoogle Scholar
Boers, F., Lindstromberg, S., Stengers, H., & Eyckmans, J. (2008). Variables in the mnemonic effectiveness of pictorial elucidation. In Boers, F. & Lindstromberg, S. (Eds.), Cognitive linguistic approaches to teaching vocabulary and phraseology (pp. 189216). De Gruyter Mouton.CrossRefGoogle Scholar
Boers, F., Piquer Píriz, A. M., Stengers, H., & Eyckmans, J. (2009). Does pictorial elucidation foster recollection of idioms? Language Teaching Research, 13(4), 367382. https://doi.org/10.1177/1362168809341505CrossRefGoogle Scholar
Bryant, D. J., & Tversky, B. (1999). Mental representations of perspective and spatial relations from diagrams and models. Journal of Experimental Psychology: Learning, Memory, and Cognition, 25(1), 137156. https://doi.org/10.1037/0278-7393.25.1.137Google ScholarPubMed
Buccino, G., Marino, B. F., Bulgarelli, C., & Mezzadri, M. (2017). Fluent speakers of a second language process graspable nouns expressed in L2 like in their native language. Frontiers in Psychology, 8, 1306. https://doi.org/10.3389/fpsyg.2017.01306CrossRefGoogle ScholarPubMed
Bybee, J. (2010). Language, usage and cognition. Cambridge University Press.CrossRefGoogle Scholar
Chen, D., Wang, R., Zhang, J., & Liu, C. (2019). Perceptual representations in L1, L2 and L3 comprehension: Delayed sentence–picture verification. Journal of Psycholinguistic Research, 49(1), 4157. https://doi.org/10.1007/s10936-019-09670-xCrossRefGoogle Scholar
Chen, S.-C., de Koning, B. B., & Zwaan, R. A. (2020). Does object size matter with regard to the mental simulation of object orientation? Experimental Psychology, 67(1), 5672. https://doi.org/10.1027/1618-3169/a000468CrossRefGoogle Scholar
Churchland, P. M. (1992). A neurocomputational perspective: The nature of mind and the structure of science. The MIT Press.CrossRefGoogle Scholar
Cohen, J. (1977). Statistical power analysis for the behavioral sciences. Routledge.Google Scholar
Council of Europe (Ed.). (2020). Common European Framework of Reference for languages: Learning, teaching, assessment: Companion volume with new descriptors. Council of Europe Publishing.Google Scholar
Coventry, K. R., & Mather, G. (2002). The real story of over? In Coventry, K. R. & Olivier, P. (Eds.), Spatial language (pp. 165184). Springer Netherlands.CrossRefGoogle Scholar
Craver, C. F. (2007). Explaining the brain: Mechanisms and the mosaic unity of neuroscience. Oxford University Press.CrossRefGoogle Scholar
Craver-Lemley, C., & Arterberry, M. (2001). Imagery-induced interference on a visual detection task. Spatial Vision, 14(2), 101119. https://doi.org/10.1163/156856801300202887Google ScholarPubMed
Davies, M. (2008). The Corpus of Contemporary American English (COCA). https://www.english-corpora.org/coca/Google Scholar
de Koning, B. B., Wassenburg, S. I., Bos, L. T., & Van der Schoot, M. (2017a). Mental simulation of four visual object properties: Similarities and differences as assessed by the sentence–picture verification task. Journal of Cognitive Psychology, 29(4), 420432. https://doi.org/10.1080/20445911.2017.1281283CrossRefGoogle Scholar
de Koning, B. B., Wassenburg, S. I., Bos, L. T., & Van der Schoot, M. (2017b). Size does matter: Implied object size is mentally simulated during language comprehension. Discourse Processes, 54(7), 493503. https://doi.org/10.1080/0163853X.2015.1119604CrossRefGoogle Scholar
de Wit, B., & Kinoshita, S. (2015). The masked semantic priming effect is task dependent: Reconsidering the automatic spreading activation process. Journal of Experimental Psychology: Learning, Memory, and Cognition, 41(4), 10621075. https://doi.org/10.1037/xlm0000074Google ScholarPubMed
Deane, P. (2005). Multimodal spatial representation: On the semantic unity of over. In Hampe, B. & Grady, J. E. (Eds.), From perception to meaning (pp. 235284). De Gruyter Mouton.CrossRefGoogle Scholar
Dudschig, C., de la Vega, I., & Kaup, B. (2014). Embodiment and second-language: Automatic activation of motor responses during processing spatially associated L2 words and emotion L2 words in a vertical Stroop paradigm. Brain and Language, 132, 1421. https://doi.org/10.1016/j.bandl.2014.02.002CrossRefGoogle Scholar
Ellis, N. C., Simpson‐Vlach, R., & Maynard, C. (2008). Formulaic language in native and second language speakers: Psycholinguistics, corpus linguistics, and TESOL. TESOL Quarterly, 42(3), 375396. https://doi.org/10.1002/j.1545-7249.2008.tb00137.xCrossRefGoogle Scholar
Engelen, J. A. A., Bouwmeester, S., de Bruin, A. B. H., & Zwaan, R. A. (2011). Perceptual simulation in developing language comprehension. Journal of Experimental Child Psychology, 110(4), 659675. https://doi.org/10.1016/j.jecp.2011.06.009CrossRefGoogle ScholarPubMed
Feng, Y., & Zhou, R. (2021). Does embodiment of verbs influence predicate metaphor processing in a second language? Evidence from picture priming. Frontiers in Psychology, 12(759175), 112. https://doi.org/10.3389/fpsyg.2021.759175CrossRefGoogle Scholar
Foroni, F. (2015). Do we embody second language? Evidence for ‘partial’ simulation during processing of a second language. Brain and Cognition, 99, 816. https://doi.org/10.1016/j.bandc.2015.06.006CrossRefGoogle ScholarPubMed
Gries, S. T., Hampe, B., & Schönefeld, D. (2005). Converging evidence: Bringing together experimental and corpus data on the association of verbs and constructions. Cognitive Linguistics, 16(4), 635676. https://doi.org/10.1515/cogl.2005.16.4.635CrossRefGoogle Scholar
Hampe, B., & Grady, J. E. (Eds.). (2005). From perception to meaning: Image schemas in cognitive linguistics. De Gruyter Mouton. https://doi.org/10.1515/9783110197532CrossRefGoogle Scholar
Herskovits, A. (1997). Language, spatial cognition, and vision. In Stock, O. (Ed.), Spatial and temporal reasoning (pp. 155202). Springer Netherlands.CrossRefGoogle Scholar
Horvat, W., Bolognesi, M., & Lahiri, A. (2021). Processing of literal and metaphorical meanings in polysemous verbs: An experiment and its methodological implications. Journal of Pragmatics, 171, 131146. https://doi.org/10.1016/j.pragma.2020.10.007CrossRefGoogle Scholar
Hutchins, E. (2005). Material anchors for conceptual blends. Journal of Pragmatics, 37(10), 15551577. https://doi.org/10.1016/j.pragma.2004.06.008CrossRefGoogle Scholar
Karaca, K. (2012). Philosophical reflections on diagram models and diagrammatic representation. Journal of Experimental & Theoretical Artificial Intelligence, 24(3), 365384. https://doi.org/10.1080/0952813X.2012.693665CrossRefGoogle Scholar
Kaschak, M. P., Madden, C. J., Therriault, D. J., Yaxley, R. H., Aveyard, M., Blanchard, A. A., & Zwaan, R. A. (2005). Perception of motion affects language processing. Cognition, 94(3), B79B89. https://doi.org/10.1016/j.cognition.2004.06e.005CrossRefGoogle ScholarPubMed
Kelleher, J., Sloan, C., & Mac Namee, B. (2009). An investigation into the semantics of English topological prepositions. Cognitive Processing, 10(S2), 233236. https://doi.org/10.1007/s10339-009-0289-4CrossRefGoogle ScholarPubMed
Koster, D., Cadierno, T., & Chiarandini, M. (2018). Mental simulation of object orientation and size: A conceptual replication with second language learners. Journal of the European Second Language Association, 2(1), 3848. https://doi.org/10.22599/jesla.39CrossRefGoogle Scholar
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13), 126. https://doi.org/10.18637/jss.v082.i13CrossRefGoogle Scholar
Lai, V. T., & Curran, T. (2013). ERP evidence for conceptual mappings and comparison processes during the comprehension of conventional and novel metaphors. Brain and Language, 127(3), 484496. https://doi.org/10.1016/j.bandl.2013.09.010Google ScholarPubMed
Lakoff, G. (1987). Women, fire, and dangerous things: What categories reveal about the mind. University of Chicago Press.CrossRefGoogle Scholar
Lakoff, G., & Johnson, M. (1980). Metaphors we live by. University of Chicago Press.Google Scholar
Langacker, R. W. (2013). Essentials of cognitive grammar. Oxford University Press.Google Scholar
Lenth, R. V., Buerkner, P., Giné-Vázquez, I., Herve, M., Jung, M., Love, J., Miguez, F., Riebl, H., & Singmann, H. (2023). Package ‘emmeans’ (Version 1.8.4-1) [Computer software]. https://cran.r-project.org/web/packages/emmeans/emmeans.pdfGoogle Scholar
Linck, J. A., & Cunnings, I. (2015). The utility and application of mixed-effects models in second language research: Mixed-effects models. Language Learning, 65(S1), 185207. https://doi.org/10.1111/lang.12117CrossRefGoogle Scholar
Lindsay, S. (2003). Visual priming of language comprehension. University of Sussex.Google Scholar
Lindstromberg, S. (2010). English prepositions explained. John Benjamins Publish Company.CrossRefGoogle Scholar
Littlemore, J., Chen, P. T., Koester, A., & Barnden, J. (2011). Difficulties in metaphor comprehension faced by international students whose first language is not English. Applied Linguistics, 32(4), 408429. https://doi.org/10.1093/applin/amr009CrossRefGoogle Scholar
Liu, N., & Bergen, B. K. (2016). When do language comprehenders mentally simulate locations? Cognitive Linguistics, 27(2), 181203. https://doi.org/10.1515/cog-2015-0123CrossRefGoogle Scholar
Mandler, J. M. (2004). The foundations of mind. Oxford University Press.Google Scholar
Mandler, J. M. (2005). How to build a baby: III. Image schemas and the transition to verbal thought. In Hampe, B. (Ed.), From perception to meaning (Vol. 29, pp. 137164). De Gruyter Mouton.CrossRefGoogle Scholar
Marshall, G. C., & Freitas, A. (2021). Measuring diagram quality through semiotic morphisms. Semiotica, 2021(239), 125145. https://doi.org/10.1515/sem-2019-0029CrossRefGoogle Scholar
Martin, C. D., Garcia, X., Breton, A., Thierry, G., & Costa, A. (2016). World knowledge integration during second language comprehension. Language, Cognition and Neuroscience, 31(2), 206216. https://doi.org/10.1080/23273798.2015.1084012CrossRefGoogle Scholar
Masson, M. E. J., Bub, D. N., & Newton-Taylor, M. (2008). Language-based access to gestural components of conceptual knowledge. Quarterly Journal of Experimental Psychology, 61(6), 869882. https://doi.org/10.1080/17470210701623829CrossRefGoogle ScholarPubMed
Michael, E. B., & Gollan, T. H. (2005). Being and becoming bilingual. In Kroll, Judith. F. & de Groot, A. M. B. (Eds.), Handbook of bilingualism: Psycholinguistic approaches (pp. 389407). Oxford University Press.Google Scholar
Monaco, E., Jost, L. B., Gygax, P. M., & Annoni, J.-M. (2019). Embodied semantics in a second language: Critical review and clinical implications. Frontiers in Human Neuroscience, 13, 110. https://doi.org/10.3389/fnhum.2019.00110CrossRefGoogle Scholar
Narayan, S., Bergen, B. K., & Weinberg, Z. (2004). Embodied verbal semantics: Evidence from a lexical matching task. Proceedings of Berkeley Linguistics Society, 30, 493504. https://doi.org/10.3765/bls.v30i1.936CrossRefGoogle Scholar
Norman, T., & Peleg, O. (2022). The reduced embodiment of a second language. Bilingualism: Language and Cognition, 25(3), 406416. https://doi.org/10.1017/S1366728921001115CrossRefGoogle Scholar
Perry, A., Stiso, J., Chang, E. F., Lin, J. J., Parvizi, J., & Knight, R. T. (2018). Mirroring in the human brain: Deciphering the spatial-temporal patterns of the human mirror neuron system. Cerebral Cortex, 28(3), 10391048. https://doi.org/10.1093/cercor/bhx013CrossRefGoogle ScholarPubMed
Plonsky, L., & Oswald, F. L. (2014). How big Is “big”? Interpreting effect sizes in L2 research: Effect sizes in L2 research. Language Learning, 64(4), 878912. https://doi.org/10.1111/lang.12079CrossRefGoogle Scholar
R Core Team. (2020). R: A language and environment for statistical computing. [Computer software]. R Foundation for Statistical Computing, Vienna. https://www.R-project.org/Google Scholar
Richardson, D., Spivey, M. J., Barsalou, L. W., & McRae, K. (2003). Spatial representations activated during real-time comprehension of verbs. Cognitive Science, 27(5), 767780. https://doi.org/10.1207/s15516709cog2705_4Google Scholar
Richardson, D., Spivey, M. J., Edelman, S., & Naples, A. J. (2001). “Language is spatial”: Experimental evidence for image schemas of concrete and abstract verbs. Proceedings of the Annual Meeting of the Cognitive Science Society, 23. https://escholarship.org/uc/item/9vs820bx.Google Scholar
Romero-Rivas, C., Corey, J. D., Garcia, X., Thierry, G., Martin, C. D., & Costa, A. (2017). World knowledge and novel information integration during L2 speech comprehension. Bilingualism: Language and Cognition, 20(3), 576587. https://doi.org/10.1017/S1366728915000905CrossRefGoogle Scholar
Rommers, J., Meyer, A. S., & Huettig, F. (2013). Object shape and orientation do not routinely influence performance during language processing. Psychological Science, 24(11), 22182225. https://doi.org/10.1177/0956797613490746CrossRefGoogle Scholar
Schütt, E., Dudschig, C., Bergen, B. K., & Kaup, B. (2023). Sentence-based mental simulations: Evidence from behavioral experiments using garden-path sentences. Memory & Cognition, 51(4), 952965. https://doi.org/10.3758/s13421-022-01367-2CrossRefGoogle ScholarPubMed
Shaki, S., & Fischer, M. H. (2023). How does language affect spatial attention? Deconstructing the prime-target relationship. Memory & Cognition, 51(5), 11151124. https://doi.org/10.3758/s13421-022-01390-3CrossRefGoogle ScholarPubMed
Shi, J., Peng, G., & Li, D. (2023). Figurativeness matters in the second language processing of collocations: Evidence from a self‐paced reading experiment. Language Learning, 73(1), 4783. https://doi.org/10.1111/lang.12516CrossRefGoogle Scholar
Silverberg, S., & Samuel, A. G. (2004). The effect of age of second language acquisition on the representation and processing of second language words. Journal of Memory and Language, 51(3), 381398. https://doi.org/10.1016/j.jml.2004.05.003CrossRefGoogle Scholar
Sonbul, S. (2015). Fatal mistake, awful mistake, or extreme mistake? Frequency effects on off-line/on-line collocational processing. Bilingualism: Language and Cognition, 18(3), 419437. https://doi.org/10.1017/S1366728914000674CrossRefGoogle Scholar
Spivey, M. J., Richardson, D. C., & Gonzalez-Marquez, M. (2005). On the perceptual-motor and image-schematic infrastructure of language. In Pecher, D. & Zwaan, R. A. (Eds.), Grounding cognition (1st ed., pp. 246281). Cambridge University Press.CrossRefGoogle Scholar
Stanfield, R. A., & Zwaan, R. A. (2001). The effect of implied orientation derived from verbal context on picture recognition. Psychological Science, 12(2), 153156. https://doi.org/10.1111/1467-9280.00326CrossRefGoogle ScholarPubMed
Stoet, G. (2010). PsyToolkit: A software package for programming psychological experiments using Linux. Behavior Research Methods, 42(4), 10961104. https://doi.org/10.3758/BRM.42.4.1096CrossRefGoogle ScholarPubMed
Stoet, G. (2017). PsyToolkit: A novel web-based method for running online questionnaires and reaction-time experiments. Teaching of Psychology, 44(1), 2431. https://doi.org/10.1177/0098628316677643CrossRefGoogle Scholar
Takimoto, M. (2021). A comparative study of animation versus static effects in the spatial concept-based metaphor awareness-raising approach on EFL learners’ cognitive processing of request strategies. Language and Cognition, 13(2), 191226. https://doi.org/10.1017/langcog.2020.34CrossRefGoogle Scholar
Tomczak, E., & Ewert, A. (2015). Real and fictive motion processing in polish L2 users of English and monolinguals: Evidence for different conceptual representations. Modern Language Journal, 99(S1), 4965. https://doi.org/10.1111/j.1540-4781.2015.12178.xCrossRefGoogle Scholar
Tversky, B. (2015). On abstraction and ambiguity. In Gero, J. S. (Ed.), Studying visual and spatial reasoning for design creativity (pp. 215223). Springer Netherlands.CrossRefGoogle Scholar
Tversky, B., Gao, J., Corter, J. E., Tanaka, Y., & Nickerson, J. V. (2016). People, place, and time: Inferences from diagrams. In Jamnik, M., Uesaka, Y., & Schwartz, S. Elzer (Eds.), Diagrammatic representation and inference (Vol. 9781, pp. 258264). Springer International Publishing.CrossRefGoogle Scholar
Tversky, B., Zacks, J., Lee, P., & Heiser, J. (2000). Lines, blobs, crosses and arrows: Diagrammatic communication with schematic figures. In Anderson, M., Cheng, P., & Haarslev, V. (Eds.), Theory and application of diagrams (Vol. 1889, pp. 221230). Springer.CrossRefGoogle Scholar
Tylén, K., Fusaroli, R., Bundgaard, P. F., & Østergaard, S. (2013). Making sense together: A dynamical account of linguistic meaning-making. Semiotica, 2013(194), 3962. https://doi.org/10.1515/sem-2013-0021CrossRefGoogle Scholar
Tyler, A. (2012). Cognitive linguistics and second language learning: Theoretical basics and experimental evidence. Routledge.CrossRefGoogle Scholar
Tyler, A., & Evans, V. (2003). The semantics of English prepositions: Spatial scenes, embodied meaning and cognition. Cambridge University Press.CrossRefGoogle Scholar
Tyler, A., Mueller, C., & Ho, V. (2011). Applying cognitive linguistics to learning the semantics of English prepositions to, for, and at: An experimental investigation. Vigo International Journal of Applied Linguistics, 8, 181205.Google Scholar
Vukovic, N., & Williams, J. N. (2014). Automatic perceptual simulation of first language meanings during second language sentence processing in bilinguals. Acta Psychologica, 145, 98103. https://doi.org/10.1016/j.actpsy.2013.11.002CrossRefGoogle ScholarPubMed
Wang, M., & Zhao, H. (Under Review). Perceptual representations in L1 and L2 processing: An innovative sentence-diagram verification task. Frontiers in Human Neuroscience.Google Scholar
Wheeler, K. B., & Stojanovic, D. (2006). Non-native language processing engages mental imagery. Proceedings of the Annual Meeting of the Cognitive Science Society, 28. https://escholarship.org/uc/item/59r7r8zt.Google Scholar
Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer. https://ggplot2.tidyverse.orgCrossRefGoogle Scholar
Wolter, B., & Yamashita, J. (2018). Word frequency, collocational frequency, L1 congruency, and proficiency in L2 collocational processing: What accounts for L2 performance? Studies in Second Language Acquisition, 40(2), 395416. https://doi.org/10.1017/S0272263117000237CrossRefGoogle Scholar
Wong, M. H. I., Zhao, H., & MacWhinney, B. (2018). A cognitive linguistics application for second language pedagogy: The English preposition tutor. Language Learning, 68(2), 438468. https://doi.org/10.1111/lang.12278CrossRefGoogle Scholar
Wu, S.-L. (2016). Listening for imagery by native speakers and L2 learners. Languages, 1(10), 118. https://doi.org/10.3390/languages1020010CrossRefGoogle Scholar
Zhao, H., Huang, S., Zhou, Y., & Wang, R. (2020). Schematic diagrams in second language learning of English prepositions: A behavioral and event-related potential study. Studies in Second Language Acquisition, 42(4), 721748. https://doi.org/10.1017/S027226311900069XCrossRefGoogle Scholar
Zhou, H., Rossi, S., & Chen, B. (2017). Effects of working memory capacity and tasks in processing L2 complex sentence: Evidence from Chinese-English bilinguals. Frontiers in Psychology, 8, 595. https://doi.org/10.3389/fpsyg.2017.00595CrossRefGoogle ScholarPubMed
Zwaan, R. A. (2014). Embodiment and language comprehension: Reframing the discussion. Trends in Cognitive Sciences, 18(5), 229234. https://doi.org/10.1016/j.tics.2014.02.008CrossRefGoogle ScholarPubMed
Zwaan, R. A., Stanfield, R. A., & Yaxley, R. H. (2002). Language comprehenders mentally represent the shapes of objects. Psychological Science, 13(2), 168171. https://doi.org/10.1111/1467-9280.00430CrossRefGoogle ScholarPubMed
Figure 0

Table 1. Target prepositions, schematic diagrams, and senses (Tyler & Evans, 2003)

Figure 1

Figure 1. Sample stimuli of the diagram–picture matching task for over (a) and in (b).

Figure 2

Figure 2. A related trial of the semantic priming task (over diagram – over phrase).

Figure 3

Figure 3. An unrelated trial of the semantic priming task (in diagram – over phrase).

Figure 4

Table 2. L2 learners’ response time results of linear mixed effects modeling

Figure 5

Figure 4. Response times of related and unrelated trials by preposition. Black bars represent medians, and green dots represent the individual response times per participant per trial.

Figure 6

Figure 5. Response times of related and unrelated trials by stimulus onset asynchrony. Black bars represent medians, and blue dots represent the individual response times per participant per trial.