Hostname: page-component-586b7cd67f-rcrh6 Total loading time: 0 Render date: 2024-11-24T08:51:36.611Z Has data issue: false hasContentIssue false

Dynamic semantic networks for exploration of creative thinking

Published online by Cambridge University Press:  12 November 2024

Danko D. Georgiev
Affiliation:
Institute for Advanced Study, Varna, Bulgaria
Georgi V. Georgiev*
Affiliation:
Center for Ubiquitous Computing, Faculty of Information Technology and Electrical Engineering, University of Oulu, Oulu, Finland
*
Corresponding author: Georgi V. Georgiev; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Human creativity originates from brain cortical networks that are specialized in idea generation, processing, and evaluation. The concurrent verbalization of our inner thoughts during the execution of a design task enables the use of dynamic semantic networks as a tool for investigating, evaluating, and monitoring creative thought. The primary advantage of using lexical databases such as WordNet for reproducible information-theoretic quantification of convergence or divergence of design ideas in creative problem solving is the simultaneous handling of both words and meanings, which enables interpretation of the constructed dynamic semantic networks in terms of underlying functionally active brain cortical regions involved in concept comprehension and production. In this study, the quantitative dynamics of semantic measures computed with a moving time window is investigated empirically in the DTRS10 dataset with design review conversations and detected divergent thinking is shown to predict success of design ideas. Thus, dynamic semantic networks present an opportunity for real-time computer-assisted detection of critical events during creative problem solving, with the goal of employing this knowledge to artificially augment human creativity.

Type
Position Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press

Introduction

Creativity is the capacity to use imagination and inventiveness to bring into existence original ideas, solutions, or products. Ideas and products are judged as creative to the extent that they provide both an original and valuable solution to the problem at hand (Srinivasan and Chakrabarti, Reference Srinivasan and Chakrabarti2010; Moldovan et al., Reference Moldovan, Goldenberg and Chattopadhyay2011; Casakin and Georgiev, Reference Casakin and Georgiev2021), and the problem is heuristic rather than algorithmic (Amabile, Reference Amabile1983a, Reference Amabile1983b). If an idea is not new but already known, then we could just algorithmically copy the known idea investing no creative efforts. Alternatively, if a new solution of a posed problem is not useful, we would consider bringing it into existence as a waste of available resources. Therefore, originality (novelty) and usefulness (value) of ideas interweave together as parts of a single creativity construct (Stein, Reference Stein1953; Wang, Reference Wang2013; Taura, Reference Taura2016; Lee et al., Reference Lee, Ostwald and Gu2020).

The most creative ideas are both novel and useful, and the most creative people excel at both creativity dimensions. Descriptive psychological accounts of creativity (Guilford, Reference Guilford1957; Hudson, Reference Hudson1974; Runco, Reference Runco2004, Reference Runco2007; Runco and Pritzker, Reference Runco and Pritzker2020) postulate that novelty is generated through divergent (associative, analogical) thinking that breaks assumptions and rules, while usefulness is enhanced through convergent (analytical) thinking that adheres to needs, boundaries, and constraints (Miron-Spektor and Erez, Reference Miron-Spektor, Erez, Smith, Lewis, Jarzabkowski and Langley2017). Both convergent or divergent paths are instructed in the design process (Goldschmidt, Reference Goldschmidt2016, Reference Goldschmidt2019). The process of innovative abduction is also related to both divergent and convergent thinking (Dong et al., Reference Dong, Garbuio and Lovallo2016). Such descriptive accounts could be implemented in artificial intelligence machines only if translated into quantitatively precise procedures.

Cognitive processes and structures that underpin creative thinking and help produce creative acts and results are referred to as creative cognition (Finke et al., Reference Finke, Smith and Ward1992). While the dynamic cognitive processes in creative contexts have been in the focus of research (Wilkenfeld and Ward, Reference Wilkenfeld and Ward2001; Sonalkar et al., Reference Sonalkar, Mabogunje, Leifer and Roth2016), objective tools for evaluation of creative cognition have only recently been developed (Georgiev and Georgiev, Reference Georgiev and Georgiev2018, Reference Georgiev and Georgiev2019; Georgiev and Casakin, Reference Georgiev and Casakin2019; Han et al., Reference Han, Hua, Park, Wang and Childs2020, Reference Han, Sarica, Shi and Luo2021, Reference Han, Sarica, Shi and Luo2022; Chiu et al., Reference Chiu, Lim and Silva2023). For example, dynamic semantic networks have been used to analyze the datasets from the 10th Design Thinking Research Symposium (DTRS 10) with design review conversations between design students and real clients recorded in educational settings (Adams and Siddiqui, Reference Adams and Siddiqui2013, Reference Adams and Siddiqui2015; Georgiev and Georgiev, Reference Georgiev and Georgiev2018) and from the 12th Design Thinking Research Symposium (DTRS 12) with design discussions in a company context (Christiaans, Reference Christiaans and Christiaans2018; Georgiev and Georgiev, Reference Georgiev and Georgiev2023). Several semantic measures were found to be useful for quantitative evaluation of convergence/divergence in creative thinking and supported the role of divergent thinking for the success of generated design ideas (Georgiev and Georgiev, Reference Georgiev and Georgiev2018, Reference Georgiev and Georgiev2023). Procedures for real-time application of dynamic semantic networks in creative problem solving, however, are currently lacking. This necessitates the development of a detailed workflow for real-time application of dynamic semantic networks for monitoring and potential support of design creativity.

Main hypothesis and aims of the study

The main hypothesis of this work is that dynamic semantic networks could be used in real time to predict the success of creative design ideas. To test this hypothesis, first we aimed to develop a complete workflow for real-time application of dynamic semantic networks, using a moving time window for monitoring the cognitive processes during creative design. Second, we aimed to identify particular quantitative measures of information content and semantic similarity that highly correlate with human evaluation of word similarity in order to employ those measures in the constructed dynamic semantic networks for modeling creative cognition. Third, we aimed to evaluate the actual performance of the developed workflow through back-testing the dynamic semantic networks for predicting the success of design ideas using transcribed design review conversations from the DTRS 10 dataset.

Aim 1: Workflow for monitoring of creative cognition

The inner privacy of consciousness poses unique challenges to understanding cognitive processes (Georgiev, Reference Georgiev2017, Reference Georgiev2020a, Reference Georgiev2020b). Experiences, feelings, emotions, thoughts, and beliefs constitute one’s consciousness, but the phenomenological qualia of these conscious states are not directly accessible by external observers (Nagel, Reference Nagel1974). Instead, the individual conscious states need to be externalized through expression into classical bits of Shannon information (Shannon, Reference Shannon1948) such as words, gestures, or images. Because verbalized reports of the individual stream of consciousness concurrent with the execution of a given cognitive task can be used reliably as data (Ericsson and Simon, Reference Ericsson and Simon1980) for subsequent analysis with natural language processing scripts (Bird et al., Reference Bird, Klein and Loper2009), we have developed a workflow for monitoring of creative cognition based on design conversations that occur during the development of a particular design product.

The overall structure of the workflow consists of two stages: natural language processing and semantic network processing (Figure 1). The first stage consists of five steps, all of which can be automated with the use of available Python libraries and scripts:

  1. (1) audio recording of the design conversation,

  2. (2) speech-to-text conversion,

  3. (3) part-of-speech tagging for detection of nouns (concepts),

  4. (4) construction of a moving time window and

  5. (5) removal of duplicates.

Figure 1. Workflow for monitoring of creative cognition with dynamic semantic networks.

The second stage consists of 2 steps:

  1. (6) construction of semantic networks based on WordNet, including computation of quantitative semantic measures from graphs, and

  2. (7) dynamic statistical fit of trendlines for the selected semantic measures for detection of convergence or divergence in creative thinking.

In this work, we consider that the engineering solutions for natural language processing needed to accomplish stage 1 of the workflow are readily available (Bird et al., Reference Bird, Klein and Loper2009; Loria, Reference Loria2016; Lee, Reference Lee2024). Therefore, we will dedicate our efforts to providing detailed procedures for executing the semantic network processing using WordNet in stage 2 of the workflow. We will also elaborate on different graph theoretical alternatives for computation of information content, which is a measure of the surprisal due to the occurrence of a particular concept, and semantic similarity, which is a measure of how close the meanings of two concepts are. The dynamic trendlines for information content and semantic similarity obtained from recorded design conversations will then be backtested for correlation with the eventual success of developed design products using the DTRS10 dataset and we will propose potential application of dynamic semantic networks for real-time support or enhancement of design creativity.

Aim 2: Modeling creative cognition with dynamic semantic networks

Replacement of subjective expert coding with objective computer algorithms

Creativity could be modeled computationally (Sosa and van Dijck, Reference Sosa and van Dijck2021). The theoretical foundations of automated problem solving are based on automated reasoning systems based on heuristic search techniques such as general problem solver (Newell et al., Reference Newell, Shaw and Simon1959). Early theories of problem-solving use computer simulations to predict human performance, explain the underlying processes and mechanisms, account for incidental phenomena, show how performance changes under different conditions, and explain how problem-solving skills are learned (Simon and Newell, Reference Simon and Newell1971). The challenges of computational problem-solving in design revolve around encoding, representation, constraint analysis, and reduction of the effective solution space (Perkowski, Reference Perkowski2022).

Protocol analysis of concurrent verbalization from professional design teams allows for the identification of reasoning patterns in idea generation (Cramer-Petersen et al., Reference Cramer-Petersen, Christensen and Ahmed-Kristensen2019), dynamic process patterns in design communication (Cash et al., Reference Cash, Dekoninck and Ahmed-Kristensen2020), success of design ideation (Maccioni and Borgianni, Reference Maccioni, Borgianni, Boujut, Cascini, Ahmed-Kristensen, Georgiev and Iivari2020) or evaluation of external sources of inspiration (Borgianni et al., Reference Borgianni, Maccioni, Fiorineschi and Rotini2020). Typically, the protocol analysis requires protocol coding performed by an expert, which invariably introduces a level of subjectivity that limits reproducibility by independent research teams. Furthermore, significant attention has been focused on the subjective nature of evaluating creativity of design ideas using metrics (Fiorineschi and Rotini, Reference Fiorineschi and Rotini2023). The subjective judgments required in metrics can significantly impact the evaluation of design creativity, owing to differences in how evaluators perceive and prioritize different aspects of the design ideas (Fiorineschi et al., Reference Fiorineschi, Frillici and Rotini2022). Hence, further research is needed to address these challenges and develop more robust methods for evaluating design ideas (Borgianni et al., Reference Borgianni, Maccioni, Fiorineschi and Rotini2020). To dispense with the reliance on experts and ensure maximal reproducibility, here we describe an objective method for textual processing and construction of semantic networks that is fully automated by applied computer algorithms.

Semantic networks as graphs

Semantic networks represent knowledge in the form of graphs that consist of vertices and edges (Diestel, Reference Diestel2017). Vertices indicate individual concepts, whereas edges between pairs of vertices indicate specific semantic connections (Boden, Reference Boden2004). Graphs could be divided into directed or undirected depending on the type of edges contained in the graph. The lack of loops, which are edges that connect vertices to themselves, and multiple edges with the same source and target vertices generates a simple directed graph. The lack of directed cycles generates a directed acyclic graph (DAG). The underlying undirected graph of a DAG, however, could contain cycles (Figure 2). Further introduction of a root vertex such that all edges of the directed graph are directed either away from or towards the root generates a rooted directed acyclic graph. Assigning different weights to the edges of directed or undirected graphs generates weighted graphs.

Figure 2. The subgraph of meanings M in WordNet3.1 does not have directed cycles when the edges remain directed, however, when all edges are converted into undirected edges using the graph operator U, the graph U (M) becomes cyclic. One consequence of the structure of M is that even for two monosemous words such as “workspace” and “yellow” the shortest path in the undirected graph U(M) may not pass through the lowest common subsumer, which in this case is the root meaning vertex M00001740. In this example, the length of the shortest path between “workspace” and “yellow” is 12, whereas the distance between “workspace” and “yellow” through their lowest common subsumer M00001740 is 14. To avoid clutter in the image, we have added only a single word for each meaning vertex, however, many of the meaning vertices are subsumed by synsets of words. For example, both words “yellow” and “yellowness” subsume the meaning vertex M04972838.

WordNet is a lexical database, which represents human knowledge in a graph form (Miller et al., Reference Miller, Beckwith, Fellbaum, Gross and Miller1990; Miller, Reference Miller1995, Reference Miller and Fellbaum1998; Fellbaum, Reference Fellbaum1998a, Reference Fellbaum1998b). This is particularly suitable for imposing a distance function onto constructed semantic networks, which in turn enables quantitative exploration of dynamic cognitive processes such as creative cognition. Throughout this work, we employ WordNet 3.1, which is represented by a rooted directed acyclic graph of meanings (Figure 2). The constructed semantic networks are represented by undirected cyclic weighted graphs (Figure 3).

Figure 3. Modeling the creative design process of an “Electric car” with a dynamic semantic network. The dynamics of the semantic network from the stage of idea generation to the stage of fully developed solution provides a useful, reproducible, and fast computational tool for exploration of creative thinking.

Semantic networks as models of the creative mind

Semantic networks constructed from conversation transcripts represent computational models of conceptual associations and structures (Georgiev et al., Reference Georgiev, Nagai and Taura2008, Reference Georgiev, Nagai and Taura2010; Han et al., Reference Han, Hua, Park, Wang and Childs2020, Reference Han, Sarica, Shi and Luo2021, Reference Han, Sarica, Shi and Luo2022). Individual concepts in the semantic network need not be only single words but could also be phrases combining different parts of speech (Esparza et al., Reference Esparza, Sosa and Connor2019), and can employ specific technical concepts (Sarica et al., Reference Sarica, Song, Luo and Wood2021). However, for ensuring reproducibility and objectiveness, the extraction of concepts for the semantic networks should be based on a set of rules that can be automated by a computer program without the need of human intervention. This problem is addressed satisfactorily by identifying individual concepts with single words that could be further narrowed down to a single lexical category (nouns) with the use of part-of-speech tagging performed by natural language processing software (Georgiev and Georgiev, Reference Georgiev and Georgiev2018).

Polysemy necessitates simultaneous use of words and meanings

Polysemy is essential for creative conceptual blending and associative thinking (Ravin and Leacock, Reference Ravin and Leacock2000; Fauconnier and Turner, Reference Fauconnier, Turner, Nerlich, Zazie, Vimala and Clarke2003; Nerlich et al., Reference Nerlich, Zazie, Vimala and Clarke2003; Georgiev and Taura, Reference Georgiev and Taura2014). In particular, designer’s mind could work simultaneously with several meanings of a word, similarly to how writers employ polysemy in humorous works (Boxman-Shabtai and Shifman, Reference Boxman-Shabtai and Shifman2014). Meaning is deemed an essential component of creativity (Sääksjärvi and Gonçalves, Reference Sääksjärvi and Gonçalves2018). To capture the possible impact of polysemous words in design thinking, we employ a computational method that does not compromise, reduce, or disambiguate between the multiple senses. The linguistic distinction between words and meanings could be approached in two different ways. Inclusion of a pre-processing step called disambiguation of senses could convert all words into senses. This would modify the verbal data through injection of interpretation even before the data analysis is started and would delete potentially useful information about underlying difficult-to-observe cognitive processes. For example, the nouns entering into the description of creative ideas may acquire different senses that disagree with those listed in a dictionary (Georgiev and Taura, Reference Georgiev and Taura2014;Taura and Nagai, Reference Taura and Nagai2013). Furthermore, the polysemy of nouns was found to be instrumental in the association of ideas, which were previously not considered to be related by the creative problem solver (Georgiev and Taura, Reference Georgiev and Taura2014). To avoid the latter shortcomings, we construct semantic networks from verbal data without any disambiguation of senses of extracted words (nouns) (Georgiev and Georgiev, Reference Georgiev and Georgiev2018), but preserving two types of vertices in the semantic network, word vertices, and meaning vertices. The benefit is that discovered functional relationships in the semantic network could be correlated to the neural activity in specialized brain cortical areas such as Broca’s area, which translates meanings into words, and Wernicke’s area, which translates words into meanings (Georgiev et al., Reference Georgiev, Georgieva, Gong, Nanjappan and Georgiev2021). The utility of semantic networks of nouns for studying creativity and reconstruction of difficult-to-observe cognitive processes in conceptual design was demonstrated previously by different research teams with the use of several experimental datasets (Georgiev et al., Reference Georgiev, Nagai and Taura2008, Reference Georgiev, Nagai and Taura2010; Yamamoto et al., Reference Yamamoto, Goka, Yusof, Taura, Nagai, Bergendahl, Grimheden, Leifer, Skogstad and Lindemann2009; Taura et al., Reference Taura, Yamamoto, Fasiha, Goka, Mukai, Nagai and Nakashima2012).

Creative cognition modeled as a dynamic semantic network

To assess the dynamic aspects of cognitive processes, the constructed semantic network should be allowed to evolve in time (Figure 3). One inefficient approach is to consider the cumulative growth of the semantic network as new concepts appearing in the transcribed conversations are added to those concepts that have already appeared. This cumulative approach creates large networks fast even for relatively short conversations, which produces a large sample size for statistical analyses. The disadvantage is that it retains in the semantic network concepts that may have been briefly considered but then discarded by the cognitive processes underlying the given task. An alternative, much more efficient approach is to coarse–grain the time into time intervals, each of which will encompass a corresponding part of the conversation (Georgiev and Georgiev, Reference Georgiev and Georgiev2018). The advantage of the noncumulative approach is that concepts are retained in the semantic network only if they repeatedly appear in the course of the problem solving conversation. The minimal time interval should be coarse–grained to contain a sufficient number of concepts to form a meaningful network. For real-time monitoring of verbal output, the semantic network dynamics could be tracked with a moving time window allowing for dynamic update of the semantic measures with each new word added in the conversation.

WordNet structure as a graph of meanings and words

Lexical categories in WordNet

WordNet 3.1 is publicly available (http://wordnet.princeton.edu), lexical database for English created under the direction of G. A. Miller and hosted at Princeton University (Miller et al., Reference Miller, Beckwith, Fellbaum, Gross and Miller1990; Miller, Reference Miller1995; Fellbaum, Reference Fellbaum1998a, Reference Fellbaum1998b). WordNet contains four subnets that correspond to four basic lexical categories: nouns, verbs, adjectives, and adverbs (Miller, Reference Miller and Fellbaum1998; Fellbaum, Reference Fellbaum1998b). Because there are only a few cross-subnet pointers (Fellbaum, Reference Fellbaum1998b), calculation of graph-theoretic distances between words could be achieved by confining the constructed semantic networks to a single subnet. The subnet of nouns provides the largest and deepest hierarchical taxonomy in WordNet, which can be efficiently used for the construction of semantic networks of nouns. In an experimental study using design review conversations, it was found that over 99.8% of all nouns used in the conversations are also listed in WordNet 3.1 (Georgiev and Georgiev, Reference Georgiev and Georgiev2018). Further motivation for working with nouns in research of creative problem solving is provided by developmental linguistics findings that the category corresponding to nouns is, at its core, conceptually simpler or more basic than those corresponding to verbs and other parts of speech, which is exemplified by infants early advantage for learning nouns over verbs (Gentner, Reference Gentner and Kuczaj1982; Waxman et al., Reference Waxman, Fu, Arunachalam, Leddon, Geraghty and Song2013). In addition, experimental creativity research has shown that networks of nouns stimulate the generation of creative ideas (Georgiev and Georgiev, Reference Georgiev and Georgiev2019), different combinations of nouns and relations between nouns are associated with a display of creative thought (Dong, Reference Dong2009), and dissimilarity of noun pairs produces a higher number of emergent features of creative ideas (Wilkenfeld and Ward, Reference Wilkenfeld and Ward2001).

Hypernym-hyponym hierarchy of nouns

The two basic semantic relations in WordNet are synonymy, where sets of word synonyms (synsets) form the basic building blocks of the lexical hierarchy and hyponymy (subordination of synsets) where if a hyponym (subordinate) X is subsumed by a hypernym (superordinate) Y then it follows that “An X is a kind of Y” (Miller et al., Reference Miller, Beckwith, Fellbaum, Gross and Miller1990; Miller, Reference Miller and Fellbaum1998). The hypernym-hyponym (is-a) relationship provides a taxonomy of nouns in WordNet as follows: the root synset {entity} subsumes directly three different synsets, which can be viewed as classifying entities into {abstract_entity, abstraction}, {thing} or {physical_entity}. Each of these synsets then subsumes directly other synsets, further classifying classes of entities into subclasses, and so on. The hypernym-hyponym (is-a) hierarchy of nouns in WordNet is conceptually clear if it is represented in the form of a graph with two types of vertices: meaning vertices and word vertices.

The distinction between words and meanings would not have been required if there were one-to-one relationship between words and meanings. In natural language, however, one word can have several meanings (polysemy) and one meaning can be expressed by several words. For example, the word “horse” has five meanings thereby entering into five synsets as follows: M02377103 with synset {Equus_caballus, horse}, M03543217 with synset {gymnastic_horse, horse}, M03629976 with synset {horse, knight}, M04147696 with synset {buck, horse, sawbuck, sawhorse}, or M08414813 with synset {cavalry, horse, horse_cavalry}. Here, ‘M’ stands for meaning, the subsequent digits indicate the number by which the particular synset is referred to in WordNet. Explicitly labeling the meaning vertices with a numeric code further clarifies the significance of the synset and avoids possible misunderstanding of the synset as a list of words—in fact, the synset stands only for the meaning that is in common for all words in the list. For example, the meaning of M02377103 with synset {Equus_caballus, horse} is “the particular animal species in the genus Equus,” whereas the meaning of M03543217 with synset {gymnastic_horse, horse} is “an artistic gymnastics apparatus.” The fact that the meaning is the essential ingredient of a synset can be illustrated with a somewhat rare example of a meaning, which is difficult to guess from the words in the synset alone: the meaning of M03629976 with synset {horse, knight} is actually “a chess piece shaped to resemble the head of a horse” – this has to be understood from a sample sentence used in WordNet to clarify the meaning.

Composition of word and meaning subgraphs

The lexical hierarchy of nouns in WordNet 3.1 is comprised of 158,441 word vertices and 82,192 meaning vertices. Word vertices and meaning vertices are organized in two subgraphs, subgraph M, composed of 84,505 meaning → meaning edges between hypernyms and hyponyms, and subgraph W, composed of 189,555 word → meaning edges. The subgraph M is a rooted directed acyclic graph, which has as a root the meaning vertex M00001740 corresponding to the single-word synset {entity}. To compute graph theoretic measures for words, however, the subgraph M should be expanded with edges extracted from the subgraph W. For example, if we are interested in any semantic measure characterizing the word “horse,” we will need to extract the five edges from W that connect “horse” to each of its five meaning vertices. Only in the composite graph containing both meanings and words, we are able to comprehend the content of transcribed conversation.

The reason for appending extracted word edges to the graph M, instead of working in the full graph $ M\cup W $ , is to achieve computational efficiency, namely, the subgraph W, which is effectively discarded, has twice as many edges as the subgraph M and finding shortest paths is much faster in smaller graphs. Additional flexibility is achieved by the possible application of directed graph operators such as R(G), whose action on the graph G is to reverse the direction of all edges, or U(G), whose action on the graph G is to remove the directionality of all edges (Georgiev and Georgiev, Reference Georgiev and Georgiev2018). As an example, W contains word → meaning edges, R(W) contains meaning → word edges, and U(W) contains meaning – word edges. In this way, it is possible to add words as subordinate vertices (subsumed by meanings) or as superordinate vertices (subsumers of meanings) depending on the semantic measures that need to be computed (Figure 4).

Figure 4. Graph composition of meaning vertices and word vertices for different WordNet 3.1 searches. (A) Adding words as subordinate vertices subsumed by meanings allows for computing the depth in the taxonomy and listing the meaning subsumers of words. (B) Adding words as subsumers of meanings allows for listing the meaning subvertices and leaves. (C) Adding two distinct words {w 1; w 2} as subordinate vertices subsumed by meanings allows for computing their lowest common subsumer $ \mathcal{K}\left({w}_1,{w}_2\right) $ . (D) Adding two distinct words {w 1; w 2} into an undirected graph allows for computing the shortest path distance between the two words, which may not pass through their lowest common subsumer. Different graph compositions are needed for path-based or information content-based quantification of word similarity.

Semantic measures based on WordNet

Graph-theoretic functions

Defining semantic measures for words from the graph structure of WordNet requires the introduction of several basic functions that take as arguments a graph, denoted with capital letter, and one or more vertices, denoted with small letters (Georgiev and Georgiev, Reference Georgiev and Georgiev2018, Reference Georgiev and Georgiev2023). Hereafter, general type of vertices, either meanings or words, will be denoted as v 1, v 2,…, vn, whereas word vertices will be specified as w 1, w 2,…, wn.

Functions in a general graph

  • $ \mathcal{J}\left(G,v\right) $ lists all edges that are incident to vertex v in the graph G.

  • $ \mathcal{A}\left(G,v\right) $ lists all vertices that are adjacent to vertex v in the graph G.

Functions in a directed graph

  • $ \mathcal{V}\left(G,v\right) $ lists all subvertices of vertex v in the graph G, where subvertices are all vertices with finite directed path from v.

  • $ \mathcal{S}\left(G,v\right) $ lists all subsumers of vertex v in the graph G, where subsumers are all vertices with finite directed path to v.

  • $ \mathcal{L}\left(G,v\right) $ lists the leaves of a vertex v in the graph G, where leaves are all subvertices of v with a vertex out-degree of zero.

  • $ \mathcal{D}\left(G,{v}_1,{v}_2\right) $ gives the shortest path distance measured in edges from vertex v 1 to vertex v 2 in the graph G, where the output is infinite ∞ if there is no path from v 1 to v 2.

Functions in a rooted directed graph

$ \mathcal{T}\left(G,v\right) $ gives the depth in the taxonomy of a vertex v measured as the number of vertices on the shortest path from the root vertex r to v in the graph G.

Set theoretic functions

|f (x)| counts the number of elements in the list f (x).

Semantic measures for single words

The level of abstraction, polysemy and information content are semantic measures applicable to single words. For a semantic network composed of n word vertices, the average for each of these semantic measures could be determined with n searches in WordNet.

Level of abstraction

The level of abstraction of a word is the complement to unity of the word concreteness. For nouns, both measures are related to the noun depth in the WordNet taxonomy such that nouns located higher in the hierarchy are more abstract and less concrete, whereas nouns located lower in the hierarchy are less abstract and more concrete (Meng et al., Reference Meng, Huang and Gu2013; Georgiev and Georgiev, Reference Georgiev and Georgiev2018). In graph theoretic notation, the level of abstraction of the word w is

(1) $$ \mathrm{Abstraction}(w)=1-\frac{\mathcal{T}(w)-1}{{\mathcal{T}}_{\mathrm{max}}-1} $$

where $ {\mathcal{T}}_{\mathrm{max}}=19 $ is the maximal depth of WordNet 3.1 taxonomy and $ \mathcal{T}(w) $ is the depth of the word w computed as the shortest path distance between the root meaning vertex M00001740 with synset {entity} and the word vertex w in the graph $ M\cup \mathcal{J}\left[R(W),w\right] $ .

Polysemy

The polysemy measures the number of different meanings possessed by a given word. About 44% of English words are polysemous, which means that they have more than one meaning (Britton, Reference Britton1978). The log transformed value of polysemy quantifies the missing bits of information required for correct understanding of the intended meaning of a given word. That missing information is usually extracted from the context of the conversation. For monosemous words, which have only one meaning, there is no ambiguity and no information is missing. In graph theoretic notation, the polysemy is

(2) $$ \mathrm{Polysemy}(w)=\mid \mathcal{A}\left(W,w\right)\mid $$

which counts the number of all meaning vertices that are adjacent to the word vertex w.

Information content

The intrinsic information content of words or meanings is measured in bits solely from the graph-theoretic structure of WordNet 3.1. For comparison of different formulas that measure the information content of words in WordNet 3.1, however, it is helpful to work with normalized values in the unit interval [0,1]. To write compactly the formulas for information content, we will need the following word functions.

For a given word w: $ \mathcal{S}(w) $ lists the meaning subsumers in the graph $ M\cup \mathcal{J}\left[R(W),w\right] $ , $ \mathcal{V}(w) $ lists the meaning subvertices in the graph $ M\cup \mathcal{J}\left(W,w\right) $ , $ \mathcal{H}(w) $ lists the meaning hyponyms in the set difference $ \mathcal{V}(w)\backslash \left[\mathcal{A}\left(W,w\right)\cup w\right],\mathcal{L}(w) $ lists the leaves in the graph $ M\cup \mathcal{J}\left(W,w\right) $ , and $ \mathcal{C}(w) $ computes the commonness $ {\sum}_{i\in \mathcal{L}\left(G,w\right)}\frac{1}{S\left(M,i\right)} $ in the graph $ M\cup \mathcal{J}\left(W,w\right) $ .

Several constants specific for WordNet 3.1 are also useful: $ {\mathcal{V}}_{\mathrm{max}}=82192 $ is the maximal number of meaning subvertices, $ {\mathcal{L}}_{\mathrm{max}}=65031 $ is the maximal number of leaves, $ {\mathcal{C}}_{\mathrm{min}}=1/35 $ is the minimal commonness, and $ {\mathcal{C}}_{\mathrm{max}}=6863.6 $ is the maximal commonness.

Below, we summarize seven information content formulas whose performance for the analysis of creativity in design review conversations has been tested previously (Georgiev and Georgiev, Reference Georgiev and Georgiev2018).

Information content by Blanchard et al. (Reference Blanchard, Harzallah, Kuntz, Ghallab, Spyropoulos, Fakotakis and Avouris2008):

(3) $$ IC(w)=1-\frac{\log \mid \mathcal{L}(w)\mid }{\log \left({\mathcal{L}}_{\mathrm{max}}\right)} $$

Information content by Meng et al. (Reference Meng, Gu and Zhou2012):

(4) $$ IC(w)=\frac{\log \left[\mathcal{T}(w)\right]}{\log \left({\mathcal{T}}_{\mathrm{max}}\right)}\left[1-\frac{\log \left[1-{\sum}_{i\in \mathcal{H}(w)}\frac{1}{\mathcal{T}(i)}\right]}{\log \left({\mathcal{V}}_{\mathrm{max}}\right)}\right] $$

Information content by Sánchez et al. (Reference Sánchez, Batet and Isern2011):

(5) $$ IC(w)=\frac{\log \left({\mathcal{L}}_{\mathrm{max}}\right)+\log \mid \mathcal{S}(w)\mid -\log \mid \mathcal{L}(w)\mid }{\log \left({\mathcal{L}}_{\mathrm{max}}\right)-\log \left({\mathcal{C}}_{\mathrm{min}}\right)} $$

Information content by Sánchez and Batet (Reference Sánchez and Batet2012):

(6) $$ IC(w)=\frac{\log \left({\mathcal{C}}_{\mathrm{max}}\right)-\log \left[\mathcal{C}(w)\right]}{\log \left({\mathcal{C}}_{\mathrm{max}}\right)-\log \left({\mathcal{C}}_{\mathrm{min}}\right)} $$

Information content by Seco et al. (Reference Seco, Veale and Hayes2004):

(7) $$ IC(w)=1-\frac{\log \mid \mathcal{V}(w)\mid }{\log \mid {\mathcal{V}}_{\mathrm{max}}\mid } $$

Information content by Yuan et al. (Reference Yuan, Yu and Wang2013):

$$ IC(w)=\frac{\log \left[\mathcal{T}(w)\right]}{\log \left({\mathcal{T}}_{\mathrm{max}}\right)}\left(1-\frac{\log \mid \mathcal{L}(w)\mid }{\log \left({\mathcal{L}}_{\mathrm{max}}\right)}\right)+\frac{\log \mid \mathcal{S}(w)\mid }{\log \left({\mathcal{V}}_{\mathrm{max}}\right)} $$

Information content by Zhou et al. (Reference Zhou, Wang and Gu2008a):

(9) $$ IC(w)=\frac{1}{2}\left[1-\frac{\log \mid \mathcal{V}(w)\mid }{\log \left({\mathcal{V}}_{\mathrm{max}}\right)}+\frac{\log \left[\mathcal{T}(w)\right]}{\log \left({T}_{\mathrm{max}}\right)}\right] $$

Semantic similarity for word pairs

For a semantic network composed of n word vertices, the average semantic similarity for all pairs of vertices could be determined with (n 2n)/2 searches in WordNet. Different definitions of semantic similarity between a pair of distinct words w 1 and w 2 rely on the lowest common subsumer of w 1 and w 2, the fraction of common subsumers, or the shortest path distance between w 1 and w 2 (Georgiev and Georgiev, Reference Georgiev and Georgiev2018).

The lowest common subsumer $ \mathcal{K}\left({w}_1,{w}_2\right) $ of a pair of distinct words $ {w}_1 $ and $ {w}_2 $ in the graph $ G=M\cup \mathcal{J}\left[R(W),\left\{{w}_1,{w}_2\right\}\right] $ is the deepest meaning vertex in the taxonomy among all vertices i whose sum $ \mathcal{D}\left[G,i,{w}_1\right]+\mathcal{D}\left[G,i,{w}_2\right] $ is minimal. If several common subsumers of w 1 and w 2 are at the same depth in the WordNet 3.1 taxonomy, the meaning vertex with lowest entry number is considered to be the unique $ \mathcal{K}\left({w}_1,{w}_2\right) $ . The depth $ \mathcal{T}\left[\mathcal{K}\left({w}_1,{w}_2\right)\right] $ of the lowest common subsumer $ \mathcal{K}\left({w}_1,{w}_2\right) $ is determined solely within the subgraph M.

The shortest path distance $ \mathcal{D}\left({w}_1,{w}_2\right) $ between a pair of distinct words w 1 and w 2 in the graph $ U(M)\cup \mathcal{J}\left[U(W),\left\{{w}_1,{w}_2\right\}\right] $ is the number of edges on the shortest path with subtracted two-edge contribution outside the subgraph U(M).

Path-based similarity measures

Semantic similarity by Al-Mubaid and Nguyen (Reference Al-Mubaid and Nguyen2006):

(10) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=1-\frac{\log \left[1+\mathcal{D}\left({w}_1,{w}_2\right)\left\{{\mathcal{T}}_{\mathrm{max}}-\mathcal{T}\left[\mathcal{K}\left({w}_1,{w}_2\right)\right]\right\}\right]}{\log \left[1+2{\left({\mathcal{T}}_{\mathrm{max}}-1\right)}^2\right]} $$

Semantic similarity by Leacock and Chodorow (Reference Leacock, Chodorow and Fellbaum1998):

(11) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=1-\frac{\log \left[\mathcal{D}\left({w}_1,{w}_2\right)+1\right]}{\log \left[2{\mathcal{T}}_{\mathrm{max}}-1\right]} $$

Semantic similarity by Li et al. (Reference Li, Bandar and McLean2003):

(12) $$ \mathrm{sim}\;\left({w}_1,{w}_2\right)={e}^{-0.2D\left({w}_1,{w}_2\right)}\frac{e^{1.2T\left[K\left({w}_1,{w}_2\right)\right]}-1}{e^{1.2T\left[K\left({w}_1,{w}_2\right)\right]}+1} $$

Semantic similarity by Rada et al. (Reference Rada, Mili, Bicknell and Blettner1989):

(13) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=1-\frac{\mathcal{D}\left({w}_1,{w}_2\right)}{2\left({\mathcal{T}}_{\mathrm{max}}-1\right)} $$

Semantic similarity by Wu and Palmer (Reference Wu and Palmer1994):

(14) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{2\left\{\mathcal{T}\left[\mathcal{K}\left({w}_1,{w}_2\right)\right]-1\right\}}{2\left\{\mathcal{T}\left[\mathcal{K}\left({w}_1,{w}_2\right)\right]-1\right\}+\mathcal{D}\left({w}_1,{w}_2\right)} $$

Subsumer-based similarity measures

Subsumer-based similarity measures reduce to path-based ones only for monosemous words, whereas for polysemous words they produce distinct results.

Semantic similarity by Jaccard (Reference Jaccard1912):

(15) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{\mid \mathcal{S}\left({w}_1\right)\cap \mathcal{S}\left({w}_2\right)\mid }{\mid \mathcal{S}\left({w}_1\right)\cup \mathcal{S}\left({w}_2\right)\mid } $$

Semantic similarity by Braun-Blanquet (Reference Braun-Blanquet1932):

(16) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{\mid \mathcal{S}\left({w}_1\right)\cap \mathcal{S}\left({w}_2\right)\mid }{\max \left[|\mathcal{S}\left({w}_1\right)|,|\mathcal{S}\left({w}_2\right)|\right]} $$

Semantic similarity by Dice (Reference Dice1945):

(17) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{2\mid \mathcal{S}\left({w}_1\right)\cap \mathcal{S}\left({w}_2\right)\mid }{\mid \mathcal{S}\left({w}_1\right)\mid +\mid \mathcal{S}\left({w}_2\right)\mid } $$

Semantic similarity by Otsuka (Reference Otsuka1936) and Ochiai (Reference Ochiai1957):

(18) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{\mid \mathcal{S}\left({w}_1\right)\cap \mathcal{S}\left({w}_2\right)\mid }{\sqrt{\mid \mathcal{S}\left({w}_1\right)\Big\Vert \mathcal{S}\left({w}_2\right)\mid }} $$

Semantic similarity by Kulczyński (Reference Kulczyński1927):

(19) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{\mid \mathcal{S}\left({w}_1\right)\cap \mathcal{S}\left({w}_2\right)\mid }{2}\left(\frac{1}{\mid \mathcal{S}\left({w}_1\right)\mid }+\frac{1}{\mid \mathcal{S}\left({w}_2\right)\mid}\right) $$

Semantic similarity by Simpson (Reference Simpson1960):

(20) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{\mid \mathcal{S}\left({w}_1\right)\cap \mathcal{S}\left({w}_2\right)\mid }{\min \left[|\mathcal{S}\left({w}_1\right)|,|\mathcal{S}\left({w}_2\right)|\right]} $$

Information content-based similarity measures

The following five information content-based similarity formulas could take as an input any of the seven information content formulas to give a total of 35 different information content-based similarity measures whose performance for the analysis of creativity in design review conversations has been tested previously (Georgiev and Georgiev, Reference Georgiev and Georgiev2018).

Semantic similarity by Jiang and Conrath (Reference Jiang and Conrath1997):

(21) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=1-\frac{1}{2}\left\{ IC\left({w}_1\right)+ IC\left({w}_2\right)-2 IC\left[\mathcal{K}\left({w}_1,{w}_2\right)\right]\right\} $$

Semantic similarity by Lin (Reference Lin1998):

(22) $$ \mathrm{sim}\left({w}_1,{w}_2\right)=\frac{2 IC\left[\mathcal{K}\left({w}_1,{w}_2\right)\right]}{IC\left({w}_1\right)+ IC\left({w}_2\right)} $$

Semantic similarity by Meng et al. (Reference Meng, Huang and Gu2014):

(23) $$ \mathrm{sim}\left({w}_1,{w}_2\right)={\left\{\frac{2 IC\left[\mathcal{K}\left({w}_1,{w}_2\right)\right]}{IC\left({w}_1\right)+ IC\left({w}_2\right)}\right\}}^{\frac{1-\exp \left[-0.08\mathcal{D}\left({w}_1,{w}_2\right)\right]}{\exp \left[-0.08\mathcal{D}\left({w}_1,{w}_2\right)\right]}} $$

Semantic similarity by Resnik (Reference Resnik1995):

(24) $$ \mathrm{sim}\left({w}_1,{w}_2\right)= IC\left[\mathcal{K}\left({w}_1,{w}_2\right)\right] $$

Semantic similarity by Zhou et al. (Reference Zhou, Wang and Gu2008b) is a weighted average of k× path-based Leacock–Chodorow similarity and (1 − k)× information content-based Jiang–Conrath similarity. Usually, the two weights are set to be equal k = 1 – k = $ \frac{1}{2} $ .

Correlation between subjective and objective evaluation of word similarity

Cluster analysis based on RG-65 dataset

The development of many semantic similarity measures based on the hypernym–hyponym hierarchy in WordNet was done by their authors using as a testbed the RG-65 dataset containing 65 noun–noun pairs, whose semantic similarity is evaluated from subjective reports collected from human subjects (Rubenstein and Goodenough, Reference Rubenstein and Goodenough1965). Consequently, we have also used the RG-65 dataset to assess the degree of correlation between subjective human evaluation of word similarity and all semantic similarity measures reported in this work. Pearson correlation analysis shows that subsumer-based similarity measures are least correlated with subjective human evaluation (average r = 0.67, P < 0.001), followed by path-based similarity measures (average r = 0.82, P < 0.001), and information content-based similarity measures (average r = 0.84, P < 0.001). Consequent hierarchical clustering segregates subsumer-based similarity measures into a single small cluster that is least correlated to human evaluation, but mixes path-based and information content-based similarity measures into another large cluster (Figure 5). Although any of the available path-based or information content-based similarity measures could be used for exploration of divergent or convergent thinking in conversational transcripts, previous experimental study of creative cognition by design students in real-world educational setting has found that information content-based similarity measures exhibit highest statistical power to differentiate between successful and unsuccessful ideas (Georgiev and Georgiev, Reference Georgiev and Georgiev2018). Here, we have identified the information content formula by Sánchez–Batet (6), as the one that exhibits highest correlation with human evaluation of word similarity, with an average r = 0.85 across all five information content-based semantic similarity formulas. Furthermore, the semantic similarity formula by Lin (22) has the highest r = 0.85 when used with Sánchez–Batet formula (6) among all purely information content-based semantic similarity formulas, which are computationally fast to execute in real-time application. Thus, the combination of formulas (6) and (22) ensures best correlation with human evaluation of similarity and optimizes computational speed for engineering applications.

Figure 5 Pearson correlation matrix map with hierarchical clustering (dendrogram) based on the Pearson correlation distance between subjective human evaluation (HE) of word similarity for noun–noun pairs in the RG-65 dataset and 46 objective semantic similarity measures computed with the use of WordNet 3.1. The similarity measures segregate into two clusters, a larger cluster composed from information content-based or path-based similarity measures, and a smaller cluster composed from subsumer-based similarity measures. Formulas for the similarity measures are provided in the main text. Abbreviations: AN: Al-Mubaid–Nguyen, B: Braun-Blanquet, BHK: Blanchard–Harzallah–Kuntz, D: Dice, J: Jaccard, JC: Jiang–Conrath, K: Kulczyński, L: Lin, LBM: Li–Bandar– McLean, LC: Leacock–Chodorow, MGZ: Meng–Gu–Zhou, MHG: Meng–Huang–Gu, OO: Otsuka–Ochiai, R: Resnik, RMBB: Rada–Mili–Bicknell–Blettner, S: Simpson, SB: Sánchez–Batet, SBI: Sánchez–Batet–Isern, SVH: Seco–Veale–Hayes, WP: Wu–Palmer, YYW: Yuan–Yu–Wang, ZWG: Zhou–Wang–Gu.

Aim 3: Back-testing dynamic semantic measures for success of design ideas

Construction of semantic networks from design conversations

The concurrent verbalization may not access all cognitive processes involved in design thinking. Nevertheless, the language provides an output channel of information that could be used to monitor the ongoing design process and an input channel of information that could be used by the designer to incorporate external feedback on the designed product. Thus, it is desirable to assess whether verbalization and language could be useful in aiding the design process, even though they do not exhaust everything that goes on in the designer’s mind. Next, we illustrate with a concrete empirical example how design review conversations could be analyzed using moving time window and demonstrate the utility of dynamic semantic measures to differentiate between successful and unsuccessful ideas.

Distinguishing successful ideas from unsuccessful ideas

For numerical analysis, we have employed the experimental dataset of complete transcripts with design review conversations provided as a part of the 10th design thinking research symposium (DTRS 10) including two subsets with students majoring in industrial design: a subset with seven junior students and a subset with five graduate students (Adams and Siddiqui, Reference Adams and Siddiqui2013, Reference Adams and Siddiqui2015). Each design project consisted of five stages: (1) task review, (2) concept review, (3) client review, (4) concept reduction review, and (5) final presentation. For each project, the students developed several possible design solutions (Figure 6), from which only the best one was selected to appear in the final presentation.

Figure 6. Part of design conversation (concept review) from DTRS 10 dataset.

This final design solution, which was selected after consultation with the client, is considered to be the successful idea because it has won the competition with other design solutions (ideas) that were not successful in regard to appearing in the final presentation. Here, our main motivation is to distinguish the best design solution from all the rest. Segmentation of the transcripts with regard to successful and unsuccessful ideas was performed based on the videos and the presentation slides. Overall, there were 12 successful ideas, and 41 unsuccessful ideas in the DTRS 10 dataset (Georgiev and Georgiev, Reference Georgiev and Georgiev2018).

Construction of moving time window

To test for possible relationship between attributes of divergent/convergent thinking and the success of design ideas, we have split the design conversation transcripts for each idea and have constructed dynamic semantic networks with a moving time window that contains six distinct nouns. This approach is different from bag-of-words, because it does not keep multiplicity of nouns. The first time window is constructed by removing repeated nouns in the conversation until there are collected six distinct nouns. Average (mean) information content of the six nouns in each time window was computed using the Sánchez–Batet formula (6), whereas average (mean) semantic similarity of the 15 noun pairs was computed with the Lin formula (22). These particular formulas have been also found to differentiate well between successful and unsuccessful ideas when the design conversations are split into three equal parts (Georgiev and Georgiev, Reference Georgiev and Georgiev2018).

The temporal duration for the development of each idea was normalized within the unit interval, so that time = 0 indicates the start and time = 1 indicates the end of the design review conversation that pertains to the idea under consideration. The shortest time step corresponds to the appearance of the next noun in the conversation. If the next noun is already repeated in the preceding time window, the time step is added but there is no dynamic change of the semantic network. Alternatively, if the next noun is not repeated in the preceding time window, then both the time step is added and there is a dynamic change of the semantic network.

Because different students had generated different numbers of unsuccessful ideas, to ensure equal weight of each project we have first averaged the dynamic trajectories per student and only then we have averaged the trajectories of unsuccessful ideas across students. Despite that the resulting average trajectories of semantic measures appear to be noisy, it is possible to extract smooth trendlines using linear best fit that minimizes the sum of squared residuals (Figure 7).

Figure 7. Dynamics of semantic measures for successful ideas (selected for final presentation) or unsuccessful ideas (not selected for final presentation) computed from design review conversations. (A) The information content increases in time for successful ideas, whereas it decreases for unsuccessful ideas. (B) The semantic similarity decreases in time for successful ideas, whereas it increases for unsuccessful ideas. Legend: s, average trajectory of successful ideas; u, average trajectory of unsuccessful ideas; L(s), linear best fit of successful ideas; L(u), linear best fit of unsuccessful ideas. The averages are based on n = 12 student projects in industrial design from DTRS10 dataset.

Statistical differences in the rates of change of semantic measures

To test whether the linear trendlines constructed with moving time window are able to extract faithfully the rates of change (slopes of the trendlines) for information content and semantic similarity reported in previous study where the design review conversations were divided into three equal parts (Georgiev and Georgiev, Reference Georgiev and Georgiev2018), we have employed one tailed paired t-tests. The statistical analysis indeed confirmed that the information content exhibited positive rate of change (k s = 0.024) for successful ideas, whereas it exhibited negative rate of change (k u = –0.012) for unsuccessful ideas, (t = 2.24, P = 0.024). Opposite dynamics was observed for semantic similarity, which exhibited negative rate of change (k s = –0.08) for successful ideas and positive rate of change (k u = 0.01) for unsuccessful ideas, (t = –2.05, P = 0.032). Computationally, divergent thinking is identified by negative rate of change of semantic similarity in time, whereas convergent thinking is identified by positive rate of change of semantic similarity in time. Thus, consistent with psychological theories linking creativity with divergent thinking (Guilford, Reference Guilford1957; Hudson, Reference Hudson1974; Runco, Reference Runco2004, Reference Runco2007; Runco and Pritzker, Reference Runco and Pritzker2020), we have found that successful ideas exhibit computational attributes of divergent thinking such as increasing information content (Figure 7a) and decreasing semantic similarity (Figure 7b) during the development of ideas in time, whereas unsuccessful ideas exhibit computational attributes of convergent thinking such as decreasing information content (Figure 7a) and increasing semantic similarity (Figure 7b). These results were obtained in retrospective fashion through analysis of human curated transcripts, which eliminated machine errors in speech-to-text conversion and also performed post-processing of nouns such as conversion of plural to singular and omission of nouns that are absent from WordNet 3.1. However, the numerical plots establish as a proof of principle that conversation analysis could be employed in real time and the trendlines for semantic measures could be provided to the designer as a future forecast of whether the design product is going to be successful.

Discussion

WordNet captures faithfully the distinction between words and meanings

Progress on difficult scientific problems usually requires the development and adoption of new research tools for investigation (Laudan, Reference Laudan1978; Marx, Reference Marx2013; Glocker et al., Reference Glocker, Musolesi, Richens and Uhler2021). Here, our goal was to lay the foundations of an objective methodology for approaching the problem of human creativity. While the idea of using WordNet’s hypernymy for the investigation of analogical concepts is not new (Geum and Park, Reference Geum and Park2016), here we have scrutinized the possible implementation of dynamic semantic networks based on WordNet 3.1 as a tool for the exploration of creative thinking through analysis of verbal data obtained concurrently with the act of problem solving. The graph theoretic representation of WordNet 3.1 as a composition of two directed subgraphs, respectively for words and meanings, is computationally powerful and neuroscientifically well-tailored to capture the two anatomically distinct language-related brain cortical areas specialized for functional processing of words and meanings (Georgiev and Georgiev, Reference Georgiev and Georgiev2018;Georgiev et al., Reference Georgiev, Georgieva, Gong, Nanjappan and Georgiev2021). This is to be contrasted with the prevalent natural language processing approaches whose main goal is to analyze the meanings extracted from the verbal data, while viewing the words only as labels for the intended meanings that need to be disambiguated by a special preprocessing step of the transcribed texts. By keeping both words and meanings, the presented approach captures more faithfully the complexity of human thinking and provides an inroad to virtually imaged concepts that were not verbalized but provide links in the WordNet 3.1 intrinsic hierarchy (Yamamoto et al., Reference Yamamoto, Goka, Yusof, Taura, Nagai, Bergendahl, Grimheden, Leifer, Skogstad and Lindemann2009; Georgiev et al., Reference Georgiev, Nagai and Taura2010).

Advantages of dynamic semantic networks

Having thoroughly discussed the theoretical and applied aspects of dynamic semantic networks, we could summarize their main advantages as follows:

  1. (1) reproducible objective analysis of verbal data,

  2. (2) extraction of both verbalized and virtually imaged concepts in creative problem solving,

  3. (3) minimal confounding injection of interpretation at the stage of data analysis due to dual use of words and meanings,

  4. (4) minimal disturbance of spontaneous creative cognition due to reliance on verbalization of naturally occurring inner monologue,

  5. (5) explicit acknowledgment of educational status and language proficiency of test subjects, and

  6. (6) possibility for real-time computer-assisted audio-visual feedback of the constructed dynamic semantic network for enhancement of human creativity.

Limitations of dynamic semantic networks

The main limitations of dynamic semantic networks include:

  1. (1) lower performance with certain types of creativity that rely on sensual inner stream of consciousness composed of visual images or sounds instead of words (e.g. painting of art or composing of music),

  2. (2) possibly inadequate testing of individuals with low educational status, low language-proficiency or neurological deficits leading to aphasia, and

  3. (3) lack of direct access to neural processes that remain outside of the contents of individual conscious experiences (Georgiev, Reference Georgiev2017, Reference Georgiev2020a, Reference Georgiev2020b).

These cons provide an effective definition of the domain of applicability of semantic networks. For supporting creative cognition, inference methodologies based on semantic distance can be employed (Sarica et al., Reference Sarica, Song, Luo and Wood2021). For studying creative cognition outside of the domain of individual conscious experiences, dynamic semantic networks could be complemented with mind-reading technologies that rely on reconstruction of mental images (e.g. visual images) from recorded electrical brain activity (Horikawa and Kamitani, Reference Horikawa and Kamitani2017; Roelfsema et al., Reference Roelfsema, Denys and Klink2018).

Outlook for future work

The range of creative activities that could benefit from dynamic analysis with semantic networks is quite extensive and includes much of problem solving in science, technology, engineering, and mathematics. Economically most important forms of creativity related to design and innovation of cutting-edge products, equipment, or services, is performed by highly trained, well-educated professionals with excellent language proficiency. Therefore, their creative performance is subject to verbalization, modeling, and improvement with dynamic semantic networks, which substantiates the need of future wider adoption of semantic networks in theoretical and applied cognitive science. This can be connected with an interdisciplinary approach to design thinking. Computer systems endowed with general artificial intelligence may dynamically monitor semantic measures of verbalized inner monologue, e.g. if the designer thinks aloud, and provide real-time feedback on creative problem solving. Human creativity could be then enhanced through suggestions that lead to divergence of semantic similarity of the developed solutions.

Conclusions

This work sought to develop a complete workflow for real-time application of dynamic semantic networks for monitoring cognitive processes during creative design, using specific measures on these networks that correlate with human evaluation. To achieve that objective, we back-tested the actual performance of the developed workflow evaluating ideas generated in design review conversations from an established dataset. This testing involved construction of semantic networks from design conversations, distinguishing successful ideas from unsuccessful ones, and construction of moving time window. The results demonstrate statistical differences in the rate of change of semantic measures for successful ideas and unsuccessful ideas. Overall, successful ideas exhibit computational attributes pertaining to divergent thinking, while unsuccessful ideas exhibit attributes of convergent thinking. This is seen as a proof of principle that dynamic analysis of conversations can be employed in real-time as a future forecast of the success of ideas and design products. This workflow allows for objective analysis of verbal data in design while preserving the spontaneity of creative cognition. This opens up the possibility of real-time AI tools that analyze and enhance human creativity as it occurs during the design process.

Competing interest

None declared.

Data availability statement

WordNet 3.1 is available online at: https://wordnet.princeton.edu/. The RG-65 dataset (Rubenstein and Goodenough, Reference Rubenstein and Goodenough1965) for human judgements of word similarity is available online at: https://doi.org/10.1145/365628.365657. The authors have signed Data-Use Agreements to Dr. Robin Adams (Purdue University) for accessing the Purdue DTRS Design Review Conversations Database, thereby agreeing not to reveal personal identifiers and not to create any commercial products.

References

Adams, RS and Siddiqui, JA (2013 ) Purdue DTRS – Design review conversations database. XRoads Technical Report TR-01-13. West Lafayette, Indiana: Purdue University.Google Scholar
Adams, RS and Siddiqui, JA (2015). Analyzing Design Review Conversations. West Lafayette, Indiana: Purdue University Press.Google Scholar
Amabile, TM (1983a) The Social Psychology of Creativity. Springer Series in Social Psychology. New York: Springer. https://doi.org/10.1007/978-1-4612-5533-8.CrossRefGoogle Scholar
Amabile, TM (1983b) The social psychology of creativity: a componential conceptualization. Journal of Personality and Social Psychology 45 (2), 357376. https://doi.org/10.1037/0022-3514.45.2.357.CrossRefGoogle Scholar
Bird, S, Klein, E, and Loper, E (2009) Natural Language Processing with Python. Sebastopol, California: O’Reilly Media.Google Scholar
Blanchard, E, Harzallah, M, and Kuntz, P (2008) A generic framework for comparing semantic similarities on a subsumption hierarchy. In ECAI 2008: 18th European Conference on Artificial Intelligence Including Prestigious Applications of Intelligent Eystems (PAIS 2008), Ghallab, M, Spyropoulos, CD, Fakotakis, N, and Avouris, N. (eds.), Frontiers in Artificial Intelligence and Applications. Patras, Greece: IOS Press, pp. 2024 https://doi.org/10.3233/978-1-58603-891-5-20.Google Scholar
Boden, MA( 2004 ) The Creative Mind: Myths and Mechanisms. 2nd edition. London: Routledge.CrossRefGoogle Scholar
Borgianni, Y, Maccioni, L, Fiorineschi, L, and Rotini, F (2020) Forms of stimuli and their effects on idea generation in terms of creativity metrics and non-obviousness. International Journal of Design Creativity and Innovation 8 (3), 147164. https://doi.org/10.1080/21650349.2020.1766379.CrossRefGoogle Scholar
Boxman-Shabtai, L and Shifman, L (2014) Evasive targets: deciphering polysemy in mediated humor. Journal of Communication 64 (5), 977998. https://doi.org/10.1111/jcom.12116.CrossRefGoogle Scholar
Braun-Blanquet, J (1932) Plant sociology: The Study of Plant Communities. New York: McGraw Hill.Google Scholar
Britton, BK (1978) Lexical ambiguity of words used in English text. Behavior Research Methods and Instrumentation 10 (1), 17. https://doi.org/10.3758/bf03205079.CrossRefGoogle Scholar
Casakin, H, and Georgiev, V (2021) Design creativity and the semantic analysis of conversations in the design studio. International Journal of Design Creativity and Innovation 9 (1), 6177. https://doi.org/10.1080/21650349.2020.1838331.CrossRefGoogle Scholar
Cash, P, Dekoninck, E, and Ahmed-Kristensen, S (2020) Work with the beat: how dynamic patterns in team processes affect shared understanding. Design Studies 69, 100943. https://doi.org/10.1016/j.destud.2020.04.003.CrossRefGoogle Scholar
Chiu, M, Lim, S, and Silva, A.. 2023. Visualizing design project team and individual progress using NLP: a comparison between latent semantic analysis and Word2Vector algorithms. Artificial Intelligence for Engineering Design, Analysis and Manufacturing 37, e18. https://doi.org/10.1017/S0890060423000094.CrossRefGoogle Scholar
Christiaans, H. 2018. Introduction to DTRS12 “Tech-centred Design Thinking: Perspectives from a Rising Asia”. In Christiaans, (ed.), 12th Design Thinking Research Symposium (DTRS12), November 15–16, 2018. Ulsan, South Korea: Ulsan National Institute of Science and Technology.Google Scholar
Cramer-Petersen, C. L., Christensen, B. T., and Ahmed-Kristensen, S.. 2019. Empirically analysing design reasoning patterns: abductive-deductive reasoning patterns dominate design idea generation. Design Studies 60, 3970. https://doi.org/10.1016/j.destud.2018.10.001.CrossRefGoogle Scholar
Dice, LR (1945) Measures of the amount of ecologic association between species. Ecology 26(3), 297302. https://doi.org/10.2307/1932409.CrossRefGoogle Scholar
Diestel, R( 2017) Graph Theory . 5th edition. Graduate Texts in Mathematics. Berlin: Springer. https://doi.org/10.1007/978-3-662-53622-3.CrossRefGoogle Scholar
Dong, A (2009 ) The Language of Design: Theory and Computation. London: Springer. https://doi.org/10.1007/978-1-84882-021-0.Google Scholar
Dong, A, Garbuio, M, and Lovallo, D (2016) Generative sensing in design evaluation. Design Studies 45, 6891. https://doi.org/10.1016/j.destud.2016.01.003.CrossRefGoogle Scholar
Ericsson, KA and Simon, HA (1980) Verbal reports as data. Psychological Review 87(3), 215251. https://doi.org/10.1037/0033-295X.87.3.215.CrossRefGoogle Scholar
Esparza, A, Sosa, R, and Connor, A (2019) Entrepreneurial ideation: effects of morphology and complexity. Proceedings of the Design Society: International Conference on Engineering Design 1(1), 29913000. https://doi.org/10.1017/dsi.2019.306.Google Scholar
Fauconnier, G and Turner, M (2003) Polysemy and conceptual blending. In Nerlich, B, Zazie, T, Vimala, H and Clarke, DD (eds.), Polysemy: Flexible Patterns of Meaning in Mind and Language, Trends in Linguistics. Studies and Monographs. Berlin: Walter de Gruyter, pp. 7994. https://doi.org/10.1515/9783110895698.79.CrossRefGoogle Scholar
Fellbaum, C (1998a ) A semantic network of English: the mother of all WordNets. Computers and the Humanities 32(2/3), 209220. https://doi.org/10.1023/a:1001181927857.CrossRefGoogle Scholar
Fellbaum, C (1998b) WordNet: An Electronic Lexical Database. Language, Speech, and Communication. Cambridge, Massachusetts: The MIT Press. https://doi.org/10.7551/mitpress/7287.001.0001.CrossRefGoogle Scholar
Finke, RA, Smith, SM, and Ward, TB (1992) Creative Cognition: Theory, Research, and Applications. Cambridge, Massachusetts: MIT Press.CrossRefGoogle Scholar
Fiorineschi, L, Frillici, FS, and Rotini, F(2022) Refined metric for a-posteriori novelty assessments. Journal of Engineering Design 33 (1), 3963. https://doi.org/10.1080/09544828.2021.1976397.CrossRefGoogle Scholar
Fiorineschi, L and Rotini, F (2023). Uses of the novelty metrics proposed by Shah et al.: what emerges from the literature? Design Science 9, e11. https://doi.org/10.1017/dsj.2023.9.CrossRefGoogle Scholar
Gentner, D (1982) Why nouns are learned before verbs: linguistic relativity versus natural partitioning. In Kuczaj, SA II (ed), Language Development. Vol 2: Language, Thought and Culture, pp. 301334. Hillsdale, New Jersey: Lawrence Erlbaum.Google Scholar
Georgiev, DD (2017) Quantum Information and Consciousness: A Gentle Introduction. Boca Raton: CRC Press. https://doi.org/10.1201/9780203732519.CrossRefGoogle Scholar
Georgiev, DD (2020a) Inner privacy of conscious experiences and quantum information. Biosystems 187:104051. https://doi.org/10.1016/j.biosystems.2019.104051.CrossRefGoogle ScholarPubMed
Georgiev, DD (2020b ) Quantum information theoretic approach to the mind–brain problem. Progress in Biophysics and Molecular Biology 158, 1632. https://doi.org/10.1016/j.pbiomolbio.2020.08.002.CrossRefGoogle Scholar
Georgiev, DD, Georgieva, I, Gong, Z, Nanjappan, V, and Georgiev, GV (2021) Virtual reality for neurorehabilitation and cognitive enhancement. Brain Sciences 11(2), 221. https://doi.org/10.3390/brainsci11020221.CrossRefGoogle ScholarPubMed
Georgiev, GV and Casakin, H (2019) Semantic measures for enhancing creativity in design education. Proceedings of the Design Society: International Conference on Engineering Design 1 (1), 369378. https://doi.org/10.1017/dsi.2019.40.Google Scholar
Georgiev, GV and Georgiev, DD (2018) Enhancing user creativity: semantic measures for idea generation. Knowledge-Based Systems 151, 115. https://doi.org/10.1016/j.knosys.2018.03.016.CrossRefGoogle Scholar
Georgiev, GV and Georgiev, DD (2019). Semantic analysis approach to studying design problem solving. Proceedings of the Design Society: International Conference on Engineering Design 1 (1), 18231832. https://doi.org/10.1017/dsi.2019.188.Google Scholar
Georgiev, GV and Georgiev, DD (2023). Quantitative dynamics of design thinking and creativity perspectives in company context. Technology in Society 74, 102292. https://doi.org/10.1016/j.techsoc.2023.102292.CrossRefGoogle Scholar
Georgiev, GV, Nagai, Y, and Taura, T (2008) Method of design evaluation focused on relations of meanings for a successful design. In Marjanovic D, Storga M, Pavkovic N, and Bojcetic N (eds), 10th International Design Conference, Design 2008, The Design Society, pp. 11491158. https://www.designsociety.org/publication/26746/.Google Scholar
Georgiev, GV, Nagai, Y, and Taura, T (2010) A method for the evaluation of meaning structures and its application in conceptual design. Journal of Design Research 8(3), 214234. https://doi.org/10.1504/jdr.2010.032607.CrossRefGoogle Scholar
Georgiev, GV, and Taura, T (2014). Polysemy in design review conversations. In 10th Design Thinking Research Symposium. Purdue University. http://docs.lib.purdue.edu/dtrs/2014/Identity/2/.Google Scholar
Geum, Y, and Park, Y (2016) How to generate creative ideas for innovation: a hybrid approach of WordNet and morphological analysis. Technological Forecasting and Social Change 111, 176187. https://doi.org/10.1016/j.techfore.2016.06.026.CrossRefGoogle Scholar
Glocker, B, Musolesi, M, Richens, J, and Uhler, C (2021) Causality in digital medicine. Nature Communications 12(1), 5471. https://doi.org/10.1038/s41467-021-25743-9.Google Scholar
Goldschmidt, G (2016) Linkographic evidence for concurrent divergent and convergent thinking in creative design. Creativity Research Journal 28 (2), 115122. https://doi.org/10.1080/10400419.2016.1162497.CrossRefGoogle Scholar
Goldschmidt, G (2019) Design creativity research: recent developments and future challenges. International Journal of Design Creativity and Innovation 7(4), 194195. https://doi.org/10.1080/21650349.2019.1646387.CrossRefGoogle Scholar
Guilford, JP (1957) Creative abilities in the arts. Psychological Review 64(2), 110118. https://doi.org/10.1037/h0048280.CrossRefGoogle ScholarPubMed
Han, J, Hua, M, Park, D, Wang, P and Childs, PRN (2020) Computational conceptual distances in combinational creativity. Proceedings of the Design Society: DESIGN Conference 1, 177186. https://doi.org/10.1017/dsd.2020.36.Google Scholar
Han, J, Sarica, S, Shi, F, and Luo, J (2021) Semantic networks for engineering design: a survey. Proceedings of the Design Society 1, 26212630. https://doi.org/10.1017/pds.2021.523.CrossRefGoogle Scholar
Han, J, Sarica, S, Shi, F, and Luo, J (2022) Semantic networks for engineering design: state of the art and future directions. Journal of Mechanical Design 144(2), 020802. https://doi.org/10.1115/1.4052148.Google Scholar
Horikawa, T, and Kamitani, Y (2017). Generic decoding of seen and imagined objects using hierarchical visual features. Nature Communications 8(1), 15037. https://doi.org/10.1038/ncomms15037.CrossRefGoogle ScholarPubMed
Hudson, L (1974 ) Contrary Imaginations: A Psychological Study of the English Schoolboy. Pelican Books. Harmondsworth: Penguin Books.Google Scholar
Jaccard, P (1912) The distribution of the flora in the alpine zone. New Phytologist 11(2), 3750. https://doi.org/10.1111/j.1469-8137.1912.tb05611.x.CrossRefGoogle Scholar
Jiang, JJ, and Conrath, DW (1997) Semantic similarity based on corpus statistics and lexical taxonomy. In International Conference on Research on Computational Linguistics (ROCLING X), pp. 1933. Taipei, Taiwan: Association for Computational Linguistics and Chinese Language Processing.Google Scholar
Kulczyński, S (1927) Die Pflanzenassoziationen der Pieninen. Bulletin International de l’Academie Polonaise des Sciences et des Lettres, Classe des Sciences Mathematiques et Naturelles, Série B, Supplément II, pp. 57203.Google Scholar
Laudan, L (1978) Progress and its Problems: Towards a Theory of Scientific Growth. Berkeley: University of California Press.Google Scholar
Leacock, C and Chodorow, M (1998) Combining local context and WordNet similarity for word sense identification. In Fellbaum, C (ed.), WordNet: an electronic lexical database, pp. 265283. Cambridge, Massachusetts: MIT Press.CrossRefGoogle Scholar
Lee, JH, Ostwald, MJ, and Gu, N (2020 ) Design Thinking: Creativity, Collaboration and Culture. Cham, Switzerland: Springer. https://doi.org/10.1007/978-3-030-56558-9.CrossRefGoogle Scholar
Lee, RST (2024) Natural Language Processing: A Textbook With Python Implementation. Singapore: Springer. https://doi.org/10.1007/978-981-99-1999-4.CrossRefGoogle Scholar
Li, Y, Bandar, ZA, and McLean, D (2003) An approach for measuring semantic similarity between words using multiple information sources. IEEE Transactions on Knowledge and Data Engineering 15(4), 871882. https://doi.org/10.1109/tkde.2003.1209005.Google Scholar
Lin, D (1998) An information-theoretic definition of similarity. In ICML ’98 Proceedings of the 15th International Conference on Machine Learning, pp. 296304. San Francisco: Morgan Kaufmann Publishers.Google Scholar
Loria, S (2016) Textblob: Simplified Text Processing. Charlottesville, Virginia: Center for Open Science. https://textblob.readthedocs.io/en/dev/.Google Scholar
Maccioni, L and Borgianni, Y (2020) Success-oriented eco-ideation sessions: lessons learnt from the use often eco-design guidelines. In Boujut, JF, Cascini, G, Ahmed-Kristensen, S, Georgiev, GV, and Iivari, N (eds.), Proceedings of the sixth international conference on design creativity (ICDC 2020), pp. 125132. Glasgow, United Kingdom: The Design Society. https://doi.org/10.35199/icdc.2020.16.CrossRefGoogle Scholar
Marx, V (2013) The big challenges of big data. Nature 498 (7453), 255260. https://doi.org/10.1038/498255a.CrossRefGoogle ScholarPubMed
Meng, L, Gu, J, and Zhou, Z (2012) A new model of information content based on concept’s topology for measuring semantic similarity in WordNet. International Journal of Grid and Distributed Computing 5(3), 8194.Google Scholar
Meng, L, Huang, R, and Gu, J (2013) A review of semantic similarity measures in WordNet. International Journal of Hybrid Information Technology 6(1), 112.Google Scholar
Meng, L, Huang, R, and Gu, J (2014) Measuring semantic similarity of word pairs using path and information content. International Journal of Future Generation Communication and Networking 7(3), 183194. https://doi.org/10.14257/ijfgcn.2014.7.3.17.CrossRefGoogle Scholar
Miller, GA (1995) WordNet: a lexical database for English. Communications of the ACM 38(11), 3941. https://doi.org/10.1145/219717.219748.CrossRefGoogle Scholar
Miller, GA (1998) Nouns in WordNet. In Fellbaum, C., WordNet: an electronic lexical database, pp. 2346 Language, Speech, and Communication. Cambridge, Massachusetts: The MIT Press. https://doi.org/10.7551/mitpress/7287.003.0006.CrossRefGoogle Scholar
Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., and Miller, K.J.. 1990. Introduction to WordNet: an on-line lexical database. International Journal of Lexicography 3 (4), 235244. https://doi.org/10.1093/ijl/3.4.235.CrossRefGoogle Scholar
Miron-Spektor, E., and Erez, M.. 2017. Looking at creativity through a paradox lens: deeper understanding and new insights. In Smith, WK, Lewis, MW, Jarzabkowski, P, and Langley, A, (eds.) Handbook of Organizational Paradox: Approaches to Plurality, Tensions and Contradictions, pp. 434451, Chap. 22. Oxford: Oxford University Press. https://doi.org/10.1093/oxfordhb/9780198754428.013.22.Google Scholar
Moldovan, S, Goldenberg, J, and Chattopadhyay, A (2011) The different roles of product originality and usefulness in generating word-of-mouth. International Journal of Research in Marketing 28(2), 109119. https://doi.org/10.1016/j.ijresmar.2010.11.003.CrossRefGoogle Scholar
Al-Mubaid, H, and Nguyen, HA (2006) A cluster-based approach for semantic similarity in the biomedical domain. In 2006 International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 27132717. Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/iembs.2006.259235.CrossRefGoogle Scholar
Nagel, T (1974) What is it like to be a bat? The Philosophical Review 83(4), 435450. https://doi.org/10.2307/2183914.CrossRefGoogle Scholar
Nerlich, B, Zazie, T, Vimala, H, and Clarke, DD (2003). Polysemy: flexible patterns of meaning in mind and language. Vol. 142. Trends in Linguistics. Studies and Monographs. Berlin: Walter de Gruyter. https://doi.org/10.1515/9783110895698.CrossRefGoogle Scholar
Newell, A, Shaw, JC, and Simon, HA (1959) Report on a general problem-solving program. In Proceedings of the international conference on information processing, pp. 256264. Paris: UNESCO.Google Scholar
Ochiai, A (1957) Zoogeographical studies on the soleoid fishes found in Japan and its neighhouring regions-II. Bulletin of the Japanese Society of Scientific Fisheries 22(9), 526530. https://doi.org/10.2331/suisan.22.526.CrossRefGoogle Scholar
Otsuka, Y (1936). The faunal character of the Japanese pleistocene marine Mollusca, as evidence of the climate having become colder during the plestocene in Japan. Bulletin of the Biogeographical Society of Japan 6(16), 165170.Google Scholar
Perkowski, M (2022) Inverse problems, constraint satisfaction, reversible logic, invertible logic and Grover quantum oracles for practical problems. Science of Computer Programming 218, 102775. https://doi.org/10.1016/j.scico.2022.102775.CrossRefGoogle Scholar
Rada, R, Mili, H, Bicknell, E, and Blettner, M (1989). Development and application of a metric on semantic nets. IEEE Transactions on Systems, Man, and Cybernetics 19(1), 1730. https://doi.org/10.1109/21.24528.CrossRefGoogle Scholar
Ravin, Y, and Leacock, C (2000). Polysemy: Theoretical and Computational Approaches. Oxford: Oxford University Press.CrossRefGoogle Scholar
Resnik, P (1995) Using information content to evaluate semantic similarity in a taxonomy. In IJCAI’95 Proceedings of the 14th International Joint Conference on Artificial Intelligence, Vol. 1, pp. 448453. San Francisco: Morgan Kaufmann Publishers. http://dl.acm.org/citation.cfm?id=1625914.Google Scholar
Roelfsema, PR, Denys, D, and Klink, PC (2018). Mind reading and writing: the future of neurotechnology. Trends in Cognitive Sciences 22(7), 598610. https://doi.org/10.1016/j.tics.2018.04.001.CrossRefGoogle ScholarPubMed
Rubenstein, H and Goodenough, JB (1965) Contextual correlates of synonymy. Communications of the ACM 8(10), 627633. https://doi.org/10.1145/365628.365657.CrossRefGoogle Scholar
Runco, MA (2004) Creativity. Annual Review of Psychology 55(1), 657687. https://doi.org/10.1146/annurev.psych.55.090902.141502.CrossRefGoogle ScholarPubMed
Runco, MA (2007 ) Creativity: Theories and Themes: Research, Development, and Practice. Burlington, Massachusetts: Elsevier Academic Press.Google Scholar
Runco, MA and Pritzker, SR (2020). Encyclopedia of Creativity, Two-Volume Set. 3rd edition. Amsterdam: Academic Press.Google Scholar
Sääksjärvi, M and Gonçalves, M (2018) Creativity and meaning: including meaning as a component of creative solutions. Artificial Intelligence for Engineering Design, Analysis and Manufacturing 32(4), 365379. https://doi.org/10.1017/S0890060418000112.CrossRefGoogle Scholar
Sánchez, D., and Batet, M (2012) A new model to compute the information content of concepts from taxonomic knowledge. International Journal on Semantic Web and Information Systems 8(2), 3450. https://doi.org/10.4018/jswis.2012040102.CrossRefGoogle Scholar
Sánchez, D, Batet, M, and Isern, D (2011) Ontology-based information content computation. Knowledge-Based Systems 24 (2), 297303. https://doi.org/10.1016/j.knosys.2010.10.001.CrossRefGoogle Scholar
Sarica, S, Song, B, Luo, J, and Wood, KL (2021) Idea generation with technology semantic network. Artificial Intelligence for Engineering Design, Analysis and Manufacturing 35(3), 265283. https://doi.org/10.1017/S0890060421000020.CrossRefGoogle Scholar
Seco, N, Veale, T, and Hayes, J (2004) An intrinsic information content metric for semantic similarity in WordNet. In de Mántaras RL and Saitta L (eds), ECAI 2004: 16th European Conference on Artificial Intelligence Including Prestigious Applicants of Intelligent Systems (PAIS 2004), pp. 10891090. Valencia, Spain: IOS Press.Google Scholar
Shannon, CE. (1948) A mathematical theory of communication. The Bell System Technical Journal 27(3), 379423. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x.CrossRefGoogle Scholar
Simon, HA and Newell, A (1971). Human problem solving: the state of the theory in 1970. American Psychologist 26(2), 145159. https://doi.org/10.1037/h0030806.CrossRefGoogle Scholar
Simpson, GG (1960). Notes on the measurement of faunal resemblance. American Journal of Science 258 (Bradley Volume), 300311. http://earth.geology.yale.edu/~ajs/BradleyVol.html.Google Scholar
Sonalkar, N, Mabogunje, A, Leifer, L, and Roth, B (2016). Visualising professional vision interactions in design reviews. CoDesign 12(1-2), 7392. https://doi.org/10.1080/15710882.2015.1135245.CrossRefGoogle Scholar
Sosa, R, and van Dijck, (2021). A computational interrogation of “Big-C”’ and “little-c”’ creativity. Creativity Research Journal 34(3), 295307. https://doi.org/10.1080/10400419.2021.1992195.CrossRefGoogle Scholar
Srinivasan, V and Chakrabarti, A (2010). Investigating novelty–outcome relationships in engineering design. Artificial Intelligence for Engineering Design, Analysis and Manufacturing 24(2), 161178. https://doi.org/10.1017/S089006041000003X.CrossRefGoogle Scholar
Stein, MI (1953) Creativity and culture. Journal of Psychology 36(2), 311322. https://doi.org/10.1080/00223980.1953.9712897.CrossRefGoogle Scholar
Taura, T (2016) Creative Design Engineering: Introduction to an Interdisciplinary Approach. Amsterdam: Academic Press. https://doi.org/10.1016/c2015-0-01489-7.Google Scholar
Taura, T, and Nagai, Y (2013) Concept Generation for Design Creativity: A Systematized Theory and Methodology. London: Springer. https://doi.org/10.1007/978-1-4471-4081-8.CrossRefGoogle Scholar
Taura, T, Yamamoto, E, Fasiha, MYN, Goka, M, Mukai, F, Nagai, Y, and Nakashima, H (2012) Constructive simulation of creative concept generation process in design: a research method for difficult-to-observe design-thinking processes. Journal of Engineering Design 23(4), 297321. https://doi.org/10.1080/09544828. 2011.637191.CrossRefGoogle Scholar
Wang, Y (2013) In search of cognitive foundations of creativity. In Carayannis EG (ed.), Encyclopedia of Creativity, Invention, Innovation and Entrepreneurship, pp. 902913. New York: Springer. https://doi.org/10.1007/978-1-4614-3858-8_350.Google Scholar
Waxman, S, Fu, X, Arunachalam, S, Leddon, E, Geraghty, K, and Song, H-J (2013) Are nouns learned before verbs? Infants provide insight into a longstanding debate. Child Development Perspectives 7(3), 155159. https://doi.org/10.1111/cdep.12032.CrossRefGoogle ScholarPubMed
Wilkenfeld, MJ and Ward, TB (2001) Similarity and emergence in conceptual combination. Journal of Memory and Language 45(1), 2138. https://doi.org/10.1006/jmla.2000.2772.CrossRefGoogle Scholar
Wu, Z and Palmer, M (1994) Verbs semantics and lexical selection. In 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico: Association for Computational Linguistics, pp. 133138. https://doi.org/10.3115/981732.981751.CrossRefGoogle Scholar
Yamamoto, E, Goka, M, Yusof, NFM, Taura, T, and Nagai, Y (2009) Virtual modeling of concept generation process for understanding and enhancing the nature of design creativity. In Bergendahl, M. Norell, Grimheden, M., Leifer, L., Skogstad, P., and Lindemann, U. (eds) 17th International Conference on Engineering Design, vol. 2, Design Theory and Research Methodology, pp. 101112. The Design Society.Google Scholar
Yuan, Q, Yu, Z, and Wang, K (2013) A new model of information content for measuring the semantic similarity between concepts. In 2013 International Conference on Cloud Computing and Big Data, pp 141146. Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/cloudcom-asia.2013.25.CrossRefGoogle Scholar
Zhou, Z, Wang, Y, and Gu, J (2008a) A new model of information content for semantic similarity in wordnet. In FGCNS ’08 Second International Conference on Future Generation Communication and Networking Symposia, Vol. 3. Institute of Electrical and Electronics Engineers. pp. 8589. https://doi.org/10.1109/fgcns.2008.16.Google Scholar
Zhou, Z, Wang, Y, and Gu, J (2008b). New model of semantic similarity measuring in wordnet. In ISKE 2008, 3rd International Conference on Intelligent System and Knowledge Engineering, Vol. 1, Institute of Electrical and Electronics Engineers. pp. 256261. https://doi.org/10.1109/iske.2008.4730937.Google Scholar
Figure 0

Figure 1. Workflow for monitoring of creative cognition with dynamic semantic networks.

Figure 1

Figure 2. The subgraph of meanings M in WordNet3.1 does not have directed cycles when the edges remain directed, however, when all edges are converted into undirected edges using the graph operator U, the graph U (M) becomes cyclic. One consequence of the structure of M is that even for two monosemous words such as “workspace” and “yellow” the shortest path in the undirected graph U(M) may not pass through the lowest common subsumer, which in this case is the root meaning vertex M00001740. In this example, the length of the shortest path between “workspace” and “yellow” is 12, whereas the distance between “workspace” and “yellow” through their lowest common subsumer M00001740 is 14. To avoid clutter in the image, we have added only a single word for each meaning vertex, however, many of the meaning vertices are subsumed by synsets of words. For example, both words “yellow” and “yellowness” subsume the meaning vertex M04972838.

Figure 2

Figure 3. Modeling the creative design process of an “Electric car” with a dynamic semantic network. The dynamics of the semantic network from the stage of idea generation to the stage of fully developed solution provides a useful, reproducible, and fast computational tool for exploration of creative thinking.

Figure 3

Figure 4. Graph composition of meaning vertices and word vertices for different WordNet 3.1 searches. (A) Adding words as subordinate vertices subsumed by meanings allows for computing the depth in the taxonomy and listing the meaning subsumers of words. (B) Adding words as subsumers of meanings allows for listing the meaning subvertices and leaves. (C) Adding two distinct words {w1;w2} as subordinate vertices subsumed by meanings allows for computing their lowest common subsumer $ \mathcal{K}\left({w}_1,{w}_2\right) $. (D) Adding two distinct words {w1;w2} into an undirected graph allows for computing the shortest path distance between the two words, which may not pass through their lowest common subsumer. Different graph compositions are needed for path-based or information content-based quantification of word similarity.

Figure 4

Figure 5 Pearson correlation matrix map with hierarchical clustering (dendrogram) based on the Pearson correlation distance between subjective human evaluation (HE) of word similarity for noun–noun pairs in the RG-65 dataset and 46 objective semantic similarity measures computed with the use of WordNet 3.1. The similarity measures segregate into two clusters, a larger cluster composed from information content-based or path-based similarity measures, and a smaller cluster composed from subsumer-based similarity measures. Formulas for the similarity measures are provided in the main text. Abbreviations: AN: Al-Mubaid–Nguyen, B: Braun-Blanquet, BHK: Blanchard–Harzallah–Kuntz, D: Dice, J: Jaccard, JC: Jiang–Conrath, K: Kulczyński, L: Lin, LBM: Li–Bandar– McLean, LC: Leacock–Chodorow, MGZ: Meng–Gu–Zhou, MHG: Meng–Huang–Gu, OO: Otsuka–Ochiai, R: Resnik, RMBB: Rada–Mili–Bicknell–Blettner, S: Simpson, SB: Sánchez–Batet, SBI: Sánchez–Batet–Isern, SVH: Seco–Veale–Hayes, WP: Wu–Palmer, YYW: Yuan–Yu–Wang, ZWG: Zhou–Wang–Gu.

Figure 5

Figure 6. Part of design conversation (concept review) from DTRS 10 dataset.

Figure 6

Figure 7. Dynamics of semantic measures for successful ideas (selected for final presentation) or unsuccessful ideas (not selected for final presentation) computed from design review conversations. (A) The information content increases in time for successful ideas, whereas it decreases for unsuccessful ideas. (B) The semantic similarity decreases in time for successful ideas, whereas it increases for unsuccessful ideas. Legend: s, average trajectory of successful ideas; u, average trajectory of unsuccessful ideas; L(s), linear best fit of successful ideas; L(u), linear best fit of unsuccessful ideas. The averages are based on n = 12 student projects in industrial design from DTRS10 dataset.