No CrossRef data available.
Published online by Cambridge University Press: 26 May 2022
In this paper, we develop a design agent based on reinforcement learning to mimic human design behaviours. A data-driven reward mechanism based on the Markov chain model is introduced so that it can reinforce prominent and beneficial design patterns. The method is implemented on a set of data collected from a solar system design problem. The result indicates that the agent provides higher prediction accuracy than the baseline Markov chain model. Several design strategies are also identified that differentiate high-performing designers from low-performing designers.