The same information has different cognitive difficulty in different representation forms, especially in the field of interaction design. Thus, Scientists pay attention to the design effectiveness based on visual perception. This study focuses on two problems: 1) The relationship between textual comprehension, spatial understanding and cognitive accuracy of text information; 2) The transformation differences of cognitive elements from text information to 3D image information. First, we conduct an experiment to show the cognitive transformation difference of text elements and 3D image elements. Then, we take the design of "Logoup" 3D modeling software (This is programming driven 3D modeling software) as an example, and applies the experimental results in this study to the interface design of the software. By setting up horizontal and vertical reference planes in the real-time rendering area of the software, we can improve the cognitive efficiency and user experience of users and provide non-professional 3D modeling skill of users with an entrance to create 3D shapes.