Hostname: page-component-669899f699-chc8l Total loading time: 0 Render date: 2025-04-25T03:24:12.027Z Has data issue: false hasContentIssue false

Online robust self-learning terminal sliding mode control for balancing control of reaction wheel bicycle robots

Published online by Cambridge University Press:  19 December 2024

Xianjin Zhu
Affiliation:
School of Mechatronics Engineering, Harbin Institute of Technology, Harbin, China
Wenfu Xu
Affiliation:
School of Mechatronics Engineering and Automation, Harbin Institute of Technology, Shenzhen, China
Zhang Chen
Affiliation:
Department of Automation, Tsinghua University, Beijing, China
Yang Deng
Affiliation:
Department of Automation, Tsinghua University, Beijing, China
Qingyuan Zheng
Affiliation:
Department of Automation, Tsinghua University, Beijing, China
Bin Liang
Affiliation:
Department of Automation, Tsinghua University, Beijing, China
Yu Liu*
Affiliation:
School of Mechatronics Engineering, Harbin Institute of Technology, Harbin, China
*
Corresponding author: Yu Liu; Email: [email protected]

Abstract

This paper proposes an online robust self-learning terminal sliding mode control (RS-TSMC) with stability guarantee for balancing control of reaction wheel bicycle robots (RWBR) under model uncertainties and disturbances, which improves the balancing control performance of RWBR by optimising the constrained output of TSMC. The TSMC is designed for a second-order mathematical model of RWBR. Then robust adaptive dynamic programming based on an actor-critic algorithm is used to optimise the TSMC only by data sampled online. The system closed-loop stability and convergence of the neural network weights are guaranteed based on the Lyapunov analysis. The effectiveness of the proposed algorithm is demonstrated through simulations and experiments.

Type
Research Article
Copyright
© The Author(s), 2024. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Rubio, F., Valero, F. and Llopis-Albert, C., “A review of mobile robots: Concepts, methods, theoretical framework, and applications,” Int J Adv Robot Syst. 16(2), 122 (2019).CrossRefGoogle Scholar
Fadini, G., Kumar, S., Kumar, R., Flayols, T., Del Prete, A., Carpentier, J. and Souères, P., “Co-designing versatile quadruped robots for dynamic and energy-efficient motions,” Robotica 42(6), 20042025 (2024).CrossRefGoogle Scholar
Huang, Y., Liao, Q., Guo, L. and Wei, S., “Simple realization of balanced motions under different speeds for a mechanical regulator-free bicycle robot,” Robotica 33(9), 19581972 (2015).CrossRefGoogle Scholar
Huang, J., Zhang, M., Ri, S., Xiong, C., Li, Z. and Kang, Y., “High-order disturbance-observer-based sliding mode control for mobile wheeled inverted pendulum systems,” IEEE T Ind Electron. 67(3), 20302041 (2020).CrossRefGoogle Scholar
Beznos, A., Formal’sky, A., Gurfinkel, E., Jicharev, D., Lensky, A., Savitsky, K. and Tchesalin, L., “Control of autonomous motion of two-wheel bicycle with gyroscopic stabilisation,” In: IEEE International Conference on Robotics and Automation, Leuven, Belgium, (1998) pp. 26702675.Google Scholar
Chen, C.-K., Chu, T.-D. and Zhang, X.-D., “Modeling and control of an active stabilizing assistant system for a bicycle,” Sensors 19(2), 248 (2019).CrossRefGoogle ScholarPubMed
Keo, L. and Yamakita, M., “Controller design of an autonomous bicycle with both steering and balancer controls,” In: IEEE International Conference on Control Applications/International Symposium on Intelligent Control, St Petersburg, Russia (2009) pp. 12941299.Google Scholar
He, K., Deng, Y., Wang, G., Sun, X., Sun, Y. and Chen, Z., “Learning-based trajectory tracking and balance control for bicycle robots with a pendulum: A gaussian process approach,” IEEE-ASME T Mech. 27(2), 634644 (2022).CrossRefGoogle Scholar
Kanjanawanishkul, K., “LQR and MPC controller design and comparison for a stationary self-balancing bicycle robot with a reaction wheel,” Kybernetika 51(1), 173191 (2015).Google Scholar
Wang, S., Cui, L., Lai, J., Yang, S., Chen, X., Zheng, Y., Zhang, Z. and Jiang, Z.-P., “Gain scheduled controller design for balancing an autonomous bicycle,” In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Electr Network (2020-2021) pp. 75957600.CrossRefGoogle Scholar
Kim, H.-W., An, J.-W., Yoo, H.D and Lee, J.-M., “Balancing control of bicycle robot using pid control,” In: 13th International Conference on Control, Automation and Systems (ICCAS), Gwangju, South Korea (2013) pp. 145147.Google Scholar
Xiong, C., Huang, Z., Gu, W., Pan, Q., Liu, Y., Li, X. and Wang, E. X., “Static balancing of robotic bicycle through nonlinear modeling and control,” In: 3rd International Conference on Robotics and Automation Engineering (ICRAE), Guangzhou, China (2018) pp. 2428.Google Scholar
Owczarkowski, A., Horla, D. and Zietkiewicz, J., “Introduction of feedback linearization to robust lqr and lqi control - analysis of results from an unmanned bicycle robot with reaction wheel,” Asian J Control 21(2), 10281040 (2019).CrossRefGoogle Scholar
Jeong, S. and Chwa, D., “Sliding-mode-disturbance-observer-based robust tracking control for omnidirectional mobile robots with kinematic and dynamic uncertainties,” IEEE-ASME T Mech 26(2), 741752 (2021).CrossRefGoogle Scholar
Tuan, L. A. and Ha, Q. P., “Adaptive fractional-order integral fast terminal sliding mode and fault-tolerant control of dual-arm robots,” Robotica 42(5), 14761499 (2024).CrossRefGoogle Scholar
Song, J., Ho, D. W. C. and Niu, Y., “Model-based event-triggered sliding-mode control for multi-input systems: Performance analysis and optimisation,” IEEE T Cybernetics 52(5), 39023913 (2022).CrossRefGoogle Scholar
Behera, A., Bandyopadhyay, B., Cucuzzella, M., Ferrara, A. and Yu, X., “A survey on event-triggered sliding mode control,” IEEE Journal of Emerging and Selected Topics in Industrial Electronics 2(3), 206217 (2021).CrossRefGoogle Scholar
Guo, L., Liao, Q. and Wei, S., “Design of fuzzy sliding-mode controller for bicycle robot nonlinear system,” In: IEEE International Conference on Robotics and Biomimetics (ROBIO 2006), Kunming, China (2006) pp. 176180.Google Scholar
Alizadeh, M., Ramezani, A. and Saadatinezhad, H., “Fault tolerant control in an unmanned bicycle robot via sliding mode theory,” IET Cyber-syst Robot. 4(2), 139152 (2022).CrossRefGoogle Scholar
Chen, L., Yan, B., Wang, H., Shao, K., Kurniawan, E. and Wang, G., “Extreme-learning-machine-based robust integral terminal sliding mode control of bicycle robot,” Control Eng Pract. 121, 105064 (2022).CrossRefGoogle Scholar
Chen, L., Liu, J., Wang, H., Hu, Y., Zheng, X., Ye, M. and Zhang, J., “Robust control of reaction wheel bicycle robot via adaptive integral terminal sliding mode,” Nonlinear Dynam. 104(3), 22912302 (2021).CrossRefGoogle Scholar
Zhu, X., Deng, Y., Zheng, X., Zheng, Q., Liang, B. and Liu, Y., “Online reinforcement-learning-based adaptive terminal sliding mode control for disturbed bicycle robots on a curved pavement,” Electronics 11(21), 3495 (2022).CrossRefGoogle Scholar
Zhu, X., Deng, Y., Zheng, X., Zheng, Q., Chen, Z., Liang, B. and Liu, Y., “Online series-parallel reinforcement-learning- based balancing control for reaction wheel bicycle robots on a curved pavement,” IEEE Access 11, 6675666766 (2023).CrossRefGoogle Scholar
Huo, B., Yu, L., Liu, Y. and Sha, S., “Reinforcement learning based path tracking control method for unmanned bicycle on complex terrain,” In: IECON. 2023- 49th Annual Conference of the IEEE Industrial Electronics Society, Singapore, Singapore (2023) pp. 16.Google Scholar
Guo, L., Lin, H., Jiang, J., Song, Y. and Gan, D., “Combined control algorithm based on synchronous reinforcement learning for a self-balancing bicycle robot,” ISA T. 145, 479492 (2024).CrossRefGoogle ScholarPubMed
Ma, Q., Zhang, X., Xu, X., Yang, Y. and Wu, E. Q., “Self-learning sliding mode control based on adaptive dynamic programming for nonholonomic mobile robots,” ISA T. 142, 136147 (2023).CrossRefGoogle ScholarPubMed
Zhu, Y. and Zhao, D., “Comprehensive comparison of online adp algorithms for continuous-time optimal control,” Artif Intell Rev. 49(4), 531547 (2018).CrossRefGoogle Scholar
Vamvoudakis, K. G. and Lewis, F. L., “Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem,” Automatica 46(5), 878888 (2010).CrossRefGoogle Scholar
Liu, D., Xue, S., Zhao, B., Luo, B. and Wei, Q., “Adaptive dynamic programming for control: A survey and recent advances,” IEEE T Syst Man Cy-S. 51(1), 142160 (2021).CrossRefGoogle Scholar
Bhasin, S., Kamalapurkar, R., Johnson, M., Vamvoudakis, K. G., Lewis, F. L. and Dixon, W. E., “A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems,” Automatica 49(1), 8292 (2013).CrossRefGoogle Scholar
Vamvoudakis, K. G., Vrabie, D. and Lewis, F. L., “Online adaptive algorithm for optimal control with integral reinforcement learning,” Int J Robust Nonlin. 24(17), 26862710 (2014).CrossRefGoogle Scholar
Yu, S., Yu, X. and Zhihong, M., “Robust global terminal sliding mode control of SISO nonlinear uncertain systems,” In: Proceedings of the 39th IEEE Conference on Decision and Control (Cat 00CH37187), vol. 3 (2000) pp. 21982203.Google Scholar
Spong, M., Corke, P. and Lozano, R., “Nonlinear control of the reaction wheel pendulum,” Automatica 37(11), 18451851 (2001).CrossRefGoogle Scholar
Sutton RS, B. A.. Reinforcement Learning: An Introduction (MIT Press, United States, 2018).Google Scholar