1. Introduction
In recent years, there has been growing research interest in agile, high-speed mobile robots designed for rugged or narrow terrain [Reference Rubio, Valero and Llopis-Albert1–Reference Huang, Zhang, Ri, Xiong, Li and Kang4]. Among these, bicycle robots have emerged as a promising platform due to their ability to achieve high-speed locomotion and agile manoeuvres on varied terrains. Reaction wheel bicycle robots (RWBR) are bicycle robots that rely on reaction wheels as auxiliary balancing mechanisms. Compared with other auxiliary balancing mechanisms, such as control moment gyroscopes [Reference Beznos, Formal’sky, Gurfinkel, Jicharev, Lensky, Savitsky and Tchesalin5, Reference Chen, Chu and Zhang6] and mass pendulums [Reference Keo and Yamakita7, Reference He, Deng, Wang, Sun, Sun and Chen8], reaction wheels offer advantages such as simple mechanical design and rapid response [Reference Kanjanawanishkul9, Reference Wang, Cui, Lai, Yang, Chen, Zheng, Zhang and Jiang10].
Previous studies have investigated various strategies for RWBR balancing control. A proportional-integral-derivative (PID) controller was designed to stabilise the roll angle [Reference Kim, An, Yoo and Lee11]. A linear quadratic regulator (LQR) was used to achieve balancing control based on a linearisation around the equilibrium point [Reference Xiong, Huang, Gu, Pan, Liu, Li and Wang12]. The control of RWBR presents significant challenges, particularly in dealing with inherent uncertainties and disturbances. Traditional control methods often struggle to address these complexities effectively, leading to suboptimal performance and limited adaptability. To address these problems, various robust control strategies have been proposed to balance the RWBR, such as robust LQR [Reference Owczarkowski, Horla and Zietkiewicz13] and disturbance observers [Reference Jeong and Chwa14]. Sliding mode control (SMC) has an excellent ability to deal with uncertainties [Reference Tuan and Ha15–Reference Behera, Bandyopadhyay, Cucuzzella, Ferrara and Yu17] and has been developed for balancing control of RWBR [Reference Guo, Liao and Wei18–Reference Chen, Yan, Wang, Shao, Kurniawan and Wang20]. However, the robustness of the sliding mode controller to uncertainties typically comes at the cost of conservative control performance. This trade-off between robustness and control performance remains an open problem.
Many researchers have been striving to combine SMC with other methods to tackle this challenge, such as fuzzy control [Reference Guo, Liao and Wei18], adaptive control [Reference Chen, Liu, Wang, Hu, Zheng, Ye and Zhang21] and reinforcement learning [Reference Zhu, Deng, Zheng, Zheng, Liang and Liu22–Reference Huo, Yu, Liu and Sha24]. A fuzzy sliding mode controller was designed to deal with impulse disturbance and system uncertainty in [Reference Guo, Liao and Wei18], but the determination of fuzzy rules was rather complicated. In [Reference Chen, Liu, Wang, Hu, Zheng, Ye and Zhang21], an adaptive sliding mode controller was proposed, which dynamically adjusts the parameters of the sliding mode controller to optimise the control performance. However, this approach only makes monotonic adjustments in certain scenarios, which may lead to excessively high gains and more severe chattering. Our previous work confirmed that reinforcement learning can improve the control performance of SMC online [Reference Zhu, Deng, Zheng, Zheng, Liang and Liu22, Reference Zhu, Deng, Zheng, Zheng, Chen, Liang and Liu23], but this combination cannot provide a sufficient theoretical stability guarantee.
The adaptive dynamic programming (ADP) algorithm, a reinforcement learning technique, has been used to address various optimal control problems [Reference Guo, Lin, Jiang, Song and Gan25–Reference Liu, Xue, Zhao, Luo and Wei29]. It not only improves control performance while maintaining robustness but also provides a theoretical stability guarantee. A linear controller with an offline ADP algorithm was proposed to balance a bicycle robot in [Reference Guo, Lin, Jiang, Song and Gan25]. An online ADP algorithm was studied for the optimal control problem with known dynamics in [Reference Vamvoudakis and Lewis28]. Ref. [Reference Ma, Zhang, Xu, Yang and Wu26] proposed a method that uses ADP to adjust a sliding mode controller online to optimise the trajectory tracking of mobile robots. However, its online optimisation was based on predicting the states of the nominal model, which greatly limits its applicability under uncertainty. In order to utilise online data directly for ADP solutions, researchers have conducted a significant amount of work, which has led to the development of two main methods. One uses a model fitted from online data for online prediction [Reference Bhasin, Kamalapurkar, Johnson, Vamvoudakis, Lewis and Dixon30]. The other uses online data directly to optimise the controller, including integral reinforcement learning [Reference Vamvoudakis, Vrabie and Lewis31] and robust adaptive dynamic programming (RADP) [Reference Zhu and Zhao27].
To address the above problems, we introduce RADP to optimise terminal sliding mode control (TSMC) online for balancing control of the RWBR. First, the nonlinear dynamics of the RWBR with uncertainties and disturbances are established and the terminal sliding mode controller is designed. Then, the problem of optimising the TSMC with stability constraints is formulated. An online actor-critic-based RADP algorithm is proposed to solve the resulting optimal control problem. The stability and convergence of the proposed control strategy are proven. Algorithm comparisons in simulation demonstrate the advantages of the proposed control strategy, and prototype experiments further validate it. The main contributions of this paper are summarised as follows.
• An online actor-critic-based RADP algorithm with robust self-learning terminal sliding mode control (RS-TSMC) is proposed to optimise the control performance while maintaining the robustness of the balancing controller for the RWBR. The optimisation process is directly based on data collected online, without the need for system dynamics.
• The controller optimisation problem is transformed into solving the Hamilton–Jacobi–Bellman (HJB) equation, and the system output generated by ADP is constrained according to the range of TSMC parameters. Compared to [Reference Ma, Zhang, Xu, Yang and Wu26], this mechanism improves the conditions for solving the constrained HJB equation, providing a more flexible and adaptable strategy for designing control strategies.
• Experimental studies conducted on a simulation platform and on a prototype RWBR, compared with several recently proposed control strategies, show the effectiveness of the algorithm proposed in this paper.
The rest of the paper is organised as follows. The dynamics of the RWBR and the problem formulation are given in Section 2. The online self-learning sliding mode control strategy is proposed, with stability and convergence proofs, in Section 3. In Section 4, various simulation experiments are performed, and the experimental results for an RWBR prototype are presented. The conclusion is given in Section 5. Videos of the simulation and of the RWBR prototype experiments are available at https://github.com/ZhuXianjinGitHub/RSTSMC (accessed on 30 August 2024).
Throughout the paper, $\left \| \cdot \right \|$ denotes the Euclidean norm, $ \mathrm{diag}\left \{ \cdot \right \}$ represents a diagonal matrix, and $ \otimes$ denotes the Kronecker product.
2. Problem formulation
In this section, the dynamic model of RWBR with uncertainty and disturbance is derived. We also introduce the feedback transformation. In addition, a TSMC is designed. Furthermore, the online optimisation problem for this controller is presented.
2.1. Dynamics model of RWBR
Figure 1 presents the prototype of the RWBR, and Figure 2 shows the notation. As shown in Figure 2, the RWBR consists of five parts: a rear wheel, body frame, reaction wheel, handlebar and front wheel (denoted $R$ , $B$ , $W$ , $H$ and $F$ , respectively). The details of the notation are given in Table I.
Following [Reference Zhu, Deng, Zheng, Zheng, Chen, Liang and Liu23], the roll dynamics of the RWBR is presented as follows:
where $J=m_1l_{1}^{2}+m_2l_{2}^{2}+I_1+I_2$ , $M=m_1I_1+m_2I_2$ , $d_1$ and $d_2$ represent unmodelled dynamics and uncertainty.
To make full use of the known dynamics of the system, the dynamic parameters are divided into nominal and uncertain parts.
where $J_N$ , ${I_2}_N$ and $M_N$ are the nominal parameter values, $\overline{\varDelta J}$ , $\overline{\varDelta I_2}$ and $\overline{\varDelta M}$ are the upper bounds of the uncertainties $\varDelta J$ , $\varDelta I_2$ and $\varDelta M$ .
Further, equation (1) can be re-written as
where $d_{1N}=d_1+\varDelta Mg\sin\!(\varphi) -\varDelta J\ddot{\varphi }-\varDelta I_2\ddot{\theta }$ and $d_{2N}=d_2-\varDelta I_2\ddot{\varphi }-\varDelta I_2\ddot{\theta }$ .
2.2. Design of TSMC controller
For the controller design, we first define $\varphi _d$ as the reference roll angle. The signals $\varphi _d$ , $\dot{\varphi }_d$ and $\ddot{\varphi }_d$ can be obtained as shown in our previous work [Reference Zhu, Deng, Zheng, Zheng, Chen, Liang and Liu23]. Based on the Olfati–Saber transformation mentioned in [Reference Spong, Corke and Lozano33], the following state variables and feedback transformation are defined.
where $x_1=\varphi -\varphi _d$ , $x_2=\dot{\varphi }-\dot{\varphi }_d$ , $u=\frac{I_{2N}}{\left ( J_N-I_{2N} \right )}M_Ng\sin \left ( x_1 \right ) -\ddot{\varphi }_d-\frac{I_{2N}}{\left ( J_N-I_{2N} \right )}\tau$ and $d^*=\frac{I_{2N}}{\left ( J_N-I_{2N} \right )}\left ( d_{1N}-d_{2N} \right )$ .
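For illustration, the following minimal sketch shows how the error states and the inverse of this feedback transformation could be computed, assuming the nominal parameter values listed in Section 4.1 and $g=9.81\,\mathrm{m/s^2}$; it is a sketch under these assumptions, not the paper's implementation.

```python
import numpy as np

# Nominal parameter values from Section 4.1; g = 9.81 m/s^2 is assumed
J_N, I2_N, M_N, g = 0.0368, 0.0035, 0.2544, 9.81

def error_states(phi, dphi, phi_d, dphi_d):
    """Tracking-error states x1 = phi - phi_d and x2 = dphi - dphi_d."""
    return phi - phi_d, dphi - dphi_d

def virtual_input_to_torque(u, x1, ddphi_d):
    """Invert the feedback transformation: given the virtual input u, recover the
    reaction-wheel torque tau from u = a*M_N*g*sin(x1) - ddphi_d - a*tau,
    where a = I2_N / (J_N - I2_N)."""
    a = I2_N / (J_N - I2_N)
    return M_N * g * np.sin(x1) - (u + ddphi_d) / a
```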
Assumption 1. Assume that $d_1$ and $d_2$ are bounded. It follows that $d_{1N}$ and $d_{2N}$ are bounded, and hence it can easily be shown that $d^*$ is bounded. Consider $\left | d^* \right |\lt L$ , where $L$ is an unknown constant.
The sliding mode surface $s$ , the equivalent control $u_{eq}$ and the reaching control $u_r$ of the TSMC are designed according to [Reference Yu, Yu and Zhihong32]. The fractional-order terminal attractor replaces the sign term of the classical sliding mode controller, which helps attenuate chattering.
where $\alpha _i\gt 0$ , $\beta _i\gt 0$ , $q_i$ and $p_i$ $\left ( q_i\lt p_i \right )$ $\left ( i=0,1 \right )$ are positive odd integers.
By selecting appropriate gains, the system will converge to a sufficiently small neighbourhood of the equilibrium in finite time. According to [Reference Yu, Yu and Zhihong32], with $\beta _1=\frac{L}{\left | s^{q_1/p_1} \right |}+\gamma$ and $\gamma \gt 0$ , the sliding mode variable will reach the neighbourhood $\left | s \right |\lt \left ( \frac{L}{\beta _1} \right ) ^{p_1/q_1}$ of the equilibrium in finite time $t_s$ .
Then, defining $\xi _s=\left | \left ( \frac{L}{\beta _1} \right ) ^{p_1/q_1} \right |\lt L'$ , the system state $x_1$ will converge to the sufficiently small neighbourhood $\left | x_1 \right |\lt \left ( \frac{L' }{\beta _0} \right ) ^{p_0/q_0}$ of the system equilibrium in finite time $t_{x_1}$ , with $\beta _0=\frac{L' }{\left | x_{1}^{q_0/p_0} \right |}+\gamma '$ and $\gamma ' \gt 0$ .
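As an illustration of this control structure, a minimal sketch of a fast terminal sliding mode law in the spirit of [Reference Yu, Yu and Zhihong32] is given below; the exact surface and control expressions used in this paper are given by the equations above, so the structure, parameter names and gains here are assumptions for illustration only.

```python
import numpy as np

def frac_pow(x, q, p):
    """Sign-preserving fractional power x^(q/p) for odd integers q < p."""
    return np.sign(x) * np.abs(x) ** (q / p)

def tsmc(x1, x2, k, eps=1e-4):
    """Sketch of a fast terminal sliding mode law (assumed structure):
    surface s, equivalent control u_eq and fractional-order reaching control u_r."""
    s = x2 + k["alpha0"] * x1 + k["beta0"] * frac_pow(x1, k["q0"], k["p0"])
    # equivalent control: cancels the surface dynamics along x2; the fractional
    # derivative term is guarded near x1 = 0 to avoid the usual TSM singularity
    dfrac = (k["q0"] / k["p0"]) * np.abs(x1) ** (k["q0"] / k["p0"] - 1) if abs(x1) > eps else 0.0
    u_eq = -(k["alpha0"] + k["beta0"] * dfrac) * x2
    # reaching control: fractional-order terminal attractor instead of sign(s)
    u_r = -k["alpha1"] * s - k["beta1"] * frac_pow(s, k["q1"], k["p1"])
    return u_eq + u_r, s

# illustrative gains only (q/p with odd q < p)
gains = dict(alpha0=5.0, beta0=1.0, q0=5, p0=7, alpha1=10.0, beta1=2.0, q1=5, p1=7)
u_tsmc, s = tsmc(x1=0.05, x2=-0.1, k=gains)
```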
Remark 1. The parameters $\alpha _1$ and $\beta _1$ influence the reaching process of the sliding mode variable. Larger parameters reduce the convergence time and improve the robustness of the controller to uncertainties, but they increase the burden on the actuator and make the control performance more conservative. In this paper, RADP is introduced to tune the parameters $\alpha _1$ and $\beta _1$ of the TSMC controller (5) online through the constrained adjustments $\kappa =\left [ \varDelta \alpha _1,\varDelta \beta _1 \right ] ^T$ . The main motivation is to improve the control performance while maintaining stability and robustness.
Assumption 2. Assume $\kappa \in \mathcal{K} =\left \{ \kappa \mid \kappa _{i\min }\leqslant \kappa _{i}\leqslant \kappa _{i\max },\ i=1,2 \right \}$ . The set $\mathcal{K}$ is chosen to guarantee finite-time convergence. $\mathcal{K}$ and $L$ can generally be obtained through experiments, and the corresponding stability proof is given in [Reference Ma, Zhang, Xu, Yang and Wu26].
3. Online robust self-learning TSMC
In this section, an online robust self-learning TSMC for RWBR is proposed to improve the control performance and retain the robustness. First, the optimal control problems with stability constraints are formulated. Then, an online actor-critic-based RADP algorithm is designed to approximate the HJB solutions.
Define $u_{adp}$ as the self-learning part of the control; the output of the controller is as follows:
where $\zeta =\left [ s,s^{q_1/p_1} \right ] ^T$ .
Substituting (9) into (4), the system can be written as
where $X=\left [ \begin{array}{c}x_1\\[3pt] x_2\\[3pt] \end{array} \right ]$ , $A=\left [ \begin{matrix}0& 1\\[3pt] 0& 0\\[3pt] \end{matrix} \right ]$ , $B=\left [ \begin{array}{c}0\\[3pt] 1\\[3pt] \end{array} \right ]$ and $D=\left [ \begin{array}{c}0\\[3pt] d^*\\[3pt] \end{array} \right ]$ .
The optimal control problem is solved by minimising the value function $V_c$ to obtain the optimal policy function $u$ . $V_c$ is defined as
where $Q$ is a symmetric positive definite matrix and $r$ is a positive constant. Taking the derivative of (11) along the trajectory of (10), the following Hamiltonian function can be obtained
where $V_{cX}=\frac{\partial V_c}{\partial X}$ . Define $V_{c}^{*}=\underset{U\prime }{\min }\left ( V_c \right )$ to denote the optimal value function, which satisfies
where $V_{cX}^{*}=\frac{\partial V_{c}^{*}}{\partial X}$ . Assuming the minimum of (13) exists and is unique, the optimal control policy $u_{adp}^{*}=\underset{u_{adp}}{\arg \min }\left \{ H \right \}$ can be obtained from $\frac{\partial H}{\partial u_{adp}}=0$ , which is described as
Traditionally, it is difficult to solve (15) directly. The policy iteration algorithm [Reference Sutton RS34] is adopted in traditional ADP to solve it iteratively by the following two steps:
a) given $u^{\left ( i \right )}$ , solve for $V_{c}^{\left ( i \right )}$ using
b) update the control policy using
where $i=1,2,\cdots$ denotes the iteration index. As $i\rightarrow \infty$ , $V_{c}^{\left ( i \right )}\rightarrow V_{c}^{*}$ and $u_{adp}^{\left ( i \right )}\rightarrow u_{adp}^{*}$ .
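For intuition, the following minimal sketch illustrates these two steps for the linear-quadratic special case of (10)–(11), in the form of the classical Kleinman iteration; the weights $Q$ , $r$ and the initial gain are illustrative assumptions, not the values used in this paper.

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov

# Transformed error dynamics (10) and illustrative quadratic cost weights
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
r = 1.0

K = np.array([[1.0, 1.0]])                     # initial stabilising gain, u = -K X
for i in range(20):
    Acl = A - B @ K
    # policy evaluation, cf. step a): solve Acl^T P + P Acl = -(Q + r K^T K)
    P = solve_continuous_lyapunov(Acl.T, -(Q + r * K.T @ K))
    # policy improvement, cf. step b): u^(i+1) = -(1/r) B^T P X
    K_new = (1.0 / r) * B.T @ P
    if np.linalg.norm(K_new - K) < 1e-9:
        break
    K = K_new
```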
It can be seen that the system dynamics are needed in (16) to obtain $\dot{X}$ . When the nominal model deviates from the actual system, optimisation based on the nominal model may be degraded. In this paper, RADP [Reference Zhu and Zhao27] is used to solve the optimal control problem using only data sampled online.
Consider an arbitrary control input $u=u_{tsmc}+u_s$ and differentiate the value function $V_{c}^{\left ( i \right )}$ .
Integrating (18) over an arbitrary interval yields
The closed-loop stability of the system is ensured by (9). $V_{c}^{\left ( i \right )}$ and the improved policy $u_{adp}^{\left ( i+1 \right )}$ can be obtained in a single calculation, without knowledge of the system dynamics.
The value function and the policy function are approximated by neural networks (NNs),
Substituting these into (19) gives
Using the gradient descent method, the updating laws for the weights of the critic NN and the actor NN are as follows:
where $\eta \left ( t \right ) =2\int _{t-T}^t{\left ( \left ( ru_s \right ) \otimes \varphi \left ( x \right ) \right ) d\tau }-\int _{t-T}^t{\left ( \varphi \left ( x \right ) \otimes r \right ) \otimes \varphi \left ( x \right ) d\tau }\mathbf{vec}\left ( \hat{W}_{a}^{T} \right )$ and $m_s = \left(\phi(x_t) -\phi(x_{t-T})\right)^T \left ( \phi \left ( x_t \right ) -\phi \left ( x_{t-T} \right ) \right ) +\eta ^T\eta +1$ ; $m_s$ is used for normalisation.
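The following minimal sketch indicates how such a normalised gradient-descent step could be organised once the regressors have been assembled from data sampled on $[t-T,\ t]$; the learning rates and the exact residual definition are assumptions, not the paper's update laws.

```python
import numpy as np

def radp_update(Wc, Wa, phi_t, phi_tT, eta, residual, lam1=0.5, lam2=0.5):
    """One normalised gradient-descent step for the critic and actor weights.
    phi_t, phi_tT : critic activation vectors phi(x_t) and phi(x_{t-T})
    eta           : integral regressor eta(t) built from samples on [t-T, t]
    residual      : scalar residual of the integral equation for the current weights
    The stacked regressor mirrors rho(t) = [phi(x_t) - phi(x_{t-T}); eta(t)] used in
    the appendix, and m_s normalises the step size."""
    dphi = phi_t - phi_tT
    m_s = dphi @ dphi + eta @ eta + 1.0            # normalisation term m_s
    Wc_new = Wc - lam1 * (dphi / m_s) * residual   # critic step along the residual gradient
    Wa_new = Wa - lam2 * (eta / m_s) * residual    # actor step along the residual gradient
    return Wc_new, Wa_new
```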
Remark 2. The differences between the RS-TSMC proposed in this paper, the self-TSMC (S-TSMC) in [Reference Ma, Zhang, Xu, Yang and Wu26] and the robust-TSMC (R-TSMC) in [Reference Zhu and Zhao27] are as follows. First, the optimisation process in S-TSMC is based on state prediction with the nominal model, which is not conducive to online application of the algorithm. To address this problem, this paper employs an iterative form of RADP to optimise the TSMC using online data. Second, the optimisation in S-TSMC is performed directly on the sliding variable $s$ , which is not exactly equivalent to optimisation over the state $X$ , and the optimisation objective in R-TSMC considers only the $u_{adp}$ part rather than the overall controller output; in RS-TSMC, the optimisation is based directly on the system state and the controller output. Third, the optimisation in S-TSMC is performed provided that the constrained HJB equations have a solution, whereas R-TSMC does not consider the constraints; RS-TSMC first solves the unconstrained HJB problem and subsequently constrains the controller output.
The proposed control strategy schemes are illustrated in Algorithm 1 and Figure 3. The stability and the convergence of the proposed control strategy are given in the Appendix.
4. Simulations and experiments
4.1. Simulations
In order to demonstrate the effectiveness of the RS-TSMC controller proposed in this paper, two cases are built in the simulation platform shown in Figure 4, which was developed in our previous work [Reference Zhu, Deng, Zheng, Zheng, Chen, Liang and Liu23]. Two recently developed methods, S-TSMC [Reference Ma, Zhang, Xu, Yang and Wu26] and R-TSMC [Reference Zhu and Zhao27], are used for comparison. The other simulation factors are the same except for the distinctions mentioned in Remark 2. The RWBR is placed on a curved pavement with white noise. The nominal parameters of the RWBR are $J_N=0.0368$ , ${I_2}_N=0.0035$ and $M_N=0.2544$ . The true parameters used for simulation are $J=0.033$ , $I_2=0.0040$ and $M=0.2742$ . The control period of the controllers is 0.01 s. The other parameters are given as follows:
The activation functions of the critic NN and the actor NN are considered as
An overturning moment $d_2$ is added to the RWBR system. In Case 1,
where $j=\left [ 1, 3, 7, 11, 13, 15 \right ]$ . In Case 2,
To clearly demonstrate the superiority of the proposed method, the criterion $V_c$ defined in (11) is used to quantitatively evaluate the performance, as shown in Table II. As seen in this table, RS-TSMC reduces the criterion by $39.79\%$ in Case 1 and by $15.91\%$ in Case 2 compared with TSMC. Its criterion is also lower than those of the other two recently developed methods (R-TSMC, S-TSMC), which implies that the proposed method achieves better control performance with less control effort. Details of the simulations for the two cases are discussed below.
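For reference, a minimal sketch of how such a criterion could be accumulated from logged trajectories is given below, assuming the quadratic running cost implied by the weights $Q$ and $r$ in (11); the exact integrand and weight values are those of the paper and are not reproduced here.

```python
import numpy as np

def performance_criterion(X_log, u_log, dt, Q, r):
    """Accumulate a quadratic cost X^T Q X + r*u^2 over a logged trajectory
    sampled with period dt (assumed form of the criterion in (11))."""
    return sum((X @ Q @ X + r * u ** 2) * dt for X, u in zip(X_log, u_log))

# hypothetical usage: compare two controllers on the same scenario
# Vc_tsmc   = performance_criterion(X_tsmc, u_tsmc, 0.01, np.eye(2), 1.0)
# Vc_rstsmc = performance_criterion(X_rstsmc, u_rstsmc, 0.01, np.eye(2), 1.0)
# reduction = 100 * (Vc_tsmc - Vc_rstsmc) / Vc_tsmc   # percentage reduction, as in Table II
```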
The simulation results of Case 1 are demonstrated in Figures 5 and 6. Figure 5 gives the norms $\left \| \hat{W}_c \right \|$ and $\left \| \hat{W}_a \right \|$ with respect to time under RS-TSMC. As shown in Figure 5, $\left \| \hat{W}_c \right \|$ converges after 12 s, and $\left \| \hat{W}_a \right \|$ converges after 20 s. Figure 6 gives the states, the control output and $V_c$ of the four methods. As can be seen, the proposed method has the smallest value of $V_c$ among the four controllers. In summary, the control performance of the proposed method (RS-TSMC) outperforms the other three methods, which illustrates its superiority.
Figure 7 gives the norms $\left \| \hat{W}_c \right \|$ and $\left \| \hat{W}_a \right \|$ with respect to time under RS-TSMC in Case 2. The pulse perturbation has a significant effect on $\left \| \hat{W}_c \right \|$ at 10 s. $\left \| \hat{W}_a \right \|$ shows regular changes with the pulse disturbance, indicating the regulating effect of the online learning algorithm on the controller output. Figure 8 illustrates the simulation results of Case 2. Similarly, we can conclude that better control performance is achieved with less control effort by the proposed method in this case.
4.2. Experiments
In this subsection, the RWBR prototype is used to verify the effectiveness of the proposed controller. We present the experimental results of the proposed RS-TSMC controller, together with TSMC, R-TSMC and S-TSMC for performance comparison, as shown in Figure 9. In the experimental studies, the TSMC algorithm runs on an ESP32 control board at 50 Hz and the optimising algorithm runs on a PC at 25 Hz. Wireless data transmission between the ESP32 and the PC is achieved via the UDP communication protocol. Swinging the handlebars is used to generate disturbances for the roll angle control. The other settings are the same as in the simulations.
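As a rough sketch of this split architecture, the PC-side loop could be organised as below; the port number, packet layout and the helper optimiser_step are hypothetical, and only the 50 Hz/25 Hz division of labour over UDP is taken from the description above.

```python
import socket
import struct

def optimiser_step(x1, x2, s, tau):
    """Placeholder for one RADP iteration; returns the TSMC parameter
    adjustments (delta_alpha1, delta_beta1), to be clipped to the set K."""
    return 0.0, 0.0

# PC-side optimiser (about 25 Hz): the ESP32 applies the TSMC locally at 50 Hz,
# streams state packets over UDP, and receives constrained parameter adjustments.
sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
sock.bind(("0.0.0.0", 9000))
sock.settimeout(0.1)

while True:
    try:
        data, addr = sock.recvfrom(64)
    except socket.timeout:
        continue
    x1, x2, s, tau = struct.unpack("<4f", data)           # roll error, rate, sliding variable, torque
    d_alpha1, d_beta1 = optimiser_step(x1, x2, s, tau)
    sock.sendto(struct.pack("<2f", d_alpha1, d_beta1), addr)
```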
Figure 9 demonstrates the experimental results. Within the first 10 s, the $V_c$ of the three optimisation algorithms is slightly higher than that of TSMC, which can also be seen from the curves of $x_1$ , $x_2$ and $\tau$ . The reasons may be as follows: 1) experimental factors such as the initial roll angle and initial roll angular velocity of the RWBR are not completely consistent across experiments; 2) the processing power of the RWBR and the PC is limited. With the iterative optimisation of the controller, it is only after 15 s that the three optimisation algorithms gradually outperform TSMC; the main reason is that the control rate on the RWBR prototype is much lower than that in the simulation. In addition, it is not difficult to see that RS-TSMC outperforms the other two optimisation algorithms throughout almost the entire experiment. The proposed controller (RS-TSMC) reduces the criterion by 21.79 $\%$ relative to TSMC, while R-TSMC and S-TSMC reduce it by about 10 $\%$ . The experimental results further validate the effectiveness and feasibility of the proposed control strategy.
5. Conclusions
This paper proposes an online RS-TSMC with a stability guarantee for balancing control of RWBR under uncertainties, which improves the balancing control performance by optimising the constrained output of the TSMC. Robust adaptive dynamic programming (RADP) is used to optimise the TSMC based only on data sampled online, without the system dynamics. The constraint on the parameters of the sliding mode controller is used to derive the constraint on the control output at each time step, which maintains the stability of the closed-loop system. Experimental studies conducted on a simulation platform and on a prototype RWBR, in comparison with several recently proposed control strategies, show the effectiveness of the proposed algorithm.
Author contributions
Conceptualization and methodology, X.Z. (Xianjin Zhu); software, X.Z. (Xianjin Zhu); validation, W.X., Q.Z., Y.D.; writing – original draft preparation, X.Z. (Xianjin Zhu); writing – review and editing, W.X. and Z.C.; visualisation, Q.Z.; supervision, Y.L.; project administration, Y.L. and B.L.; funding acquisition, Z.C. and Y.D. All authors have read and agreed to the published version of the manuscript.
Financial support
This research was funded by the National Natural Science Foundation of China (62203252, 52205008).
Competing interests
The authors declare that no conflicts of interest exist.
Ethical approval
Not applicable.
Appendix
Define the errors $\tilde{W}_c=W_c-\hat{W}_c$ and $\tilde{W}_a=W_a-\hat{W}_a$ , where $W_c$ and $W_a$ represent the ideal coefficients of $V_{c}^{*}$ and $u_{adp}^{*}$ , and $\varepsilon _c$ and $\varepsilon _a$ are the approximation errors.
According to (19),
Then, substituting $\tilde{W}_c=W_c-\hat{W}_c$ and $\tilde{W}_a=W_a-\hat{W}_a$ into (22) gives
where $\varepsilon _{HJB}=-\left [ \varepsilon _c\left ( t \right ) -\varepsilon _c\left ( t-T \right ) \right ] -\int _{t-T}^t{\left ( 2r\varepsilon _a\left ( u_s-W_{a}^{T}\varphi \left ( X \right ) \right ) -r\varepsilon _{a}^{2} \right ) d\tau }$ .
Define the Lyapunov candidate $L_y=\frac{1}{2\lambda _1}\tilde{W}_{c}^{T}\tilde{W}_c+\frac{1}{2\lambda _2}\tilde{W}_{a}^{T}\tilde{W}_a$ ; its time derivative satisfies
where $\rho \left ( t \right ) =\left [ \phi ^T\left ( x_t \right ) -\phi ^T\left ( x_{t-T} \right ), \eta ^T\left ( t \right ) \right ] ^T$ and $\tilde{W}=\left [ \tilde{W}_{c}^{T},\tilde{W}_{a}^{T} \right ] ^T$ .
Therefore, $\dot{L}_y\leqslant 0$ if $\left \| \frac{\rho \left ( t \right )}{m_s\left ( t \right )}\tilde{W} \right \| \gt \left \| \frac{\varepsilon _{HJB}}{m_s\left ( t \right )} \right \|$ , since $\left \| m_s\left ( t \right ) \right \| \gt 1$ . This provides an effective practical bound on $\left \| \rho \left ( t \right ) \tilde{W} \right \|$ , since $L_y$ decreases. According to Lemma 2 in [Reference Vamvoudakis and Lewis28], $\tilde{W}_c$ and $\tilde{W}_a$ are uniformly ultimately bounded.