Hostname: page-component-78c5997874-xbtfd Total loading time: 0 Render date: 2024-11-09T14:48:29.943Z Has data issue: false hasContentIssue false

Adaptive integral terminal sliding mode control of unmanned bicycle via ELM and barrier function

Published online by Cambridge University Press:  12 September 2024

Long Chen
Affiliation:
School of Electronics and Information, Hangzhou Dianzi University, Hangzhou, China
Zhihui Jin
Affiliation:
School of Electronics and Information, Hangzhou Dianzi University, Hangzhou, China
Ke Shao*
Affiliation:
School of Civil Aviation, Northwestern Polytechnical University, Xi’an, China
Guangyi Wang
Affiliation:
School of Electronics and Information, Hangzhou Dianzi University, Hangzhou, China
Shuping He
Affiliation:
School of Electrical Engineering and Automation, Anhui University, Hefei, China
Vladimir Stojanovic
Affiliation:
Department of Automatic Control, Robotics and Fluid Technique, University of Kragujevac, Kragujevac, Serbia
Parisa Arabzadeh Bahri
Affiliation:
School of Engineering and Energy; Centre of Water, Energy & Waste, Murdoch University, Perth, Australia
Hai Wang*
Affiliation:
School of Engineering and Energy; Centre of Water, Energy & Waste, Murdoch University, Perth, Australia
*
Corresponding authors: Hai Wang; Email: [email protected]; Ke Shao; Email: [email protected]
Corresponding authors: Hai Wang; Email: [email protected]; Ke Shao; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

In this paper, an unmanned bicycle (UB) with a reaction wheel is designed, and a second-order mathematical model with uncertainty is established. In order to achieve excellent balancing performance of the UB system, an adaptive controller is designed, which is composed of nominal feedback control, compensating control using extreme learning machine observer and reaching control via integral terminal sliding mode (ITSM) and barrier function (BF)-based adaptive law. Owing to the features of BF-based ITSM (BFITSM), not only any uncertainty or disturbance upper bound is not needed any longer but also the finite-time convergence of the closed-loop system can be ensured with a predefined error bound. Moreover, the BF-based control gain can be adaptively adjusted according to the update of the lumped uncertainty such that the overestimation is removed. The stability analysis of the closed-loop system is given according to Lyapunov theory. Comparable experimental results on an actual UB are carried out to validate the superior balancing performance of the proposed controller.

Type
Research Article
Copyright
© The Author(s), 2024. Published by Cambridge University Press

1. Introduction

Unmanned bicycle (UB), which is well known for their ability to maintain balance when steering, has been becoming increasingly popular and have been widely used in rescue and other tasks within a narrow space due to its great flexibility and ability in intelligent transportation [Reference Shafiekhani, Mahjoob and Akraminia1]. However, to accomplish particular tasks, the major challenging technical issue of UB is to ensure satisfying self-balancing performance in different driving scenarios, especially at low or even zero speeds.

Accurate dynamic modelling is the basis of realizing excellent self-balance of the UB system. Recently, many researches have been carried out from the perspectives of both mathematical models and control approaches [Reference Shafiekhani, Mahjoob and Akraminia1Reference Nenner, Linker and Gutman5]. For example, in ref. [Reference Chen and Dao6], the dynamics of the UB using Lagrange’s equations for quasi-coordinates are developed, which considers pure rolling without slipping constraints between the ground and the two wheels. In ref. [Reference Astrom, Klein and Lennartsson7], the dynamics of bicycles such as self-stabilizing models and rear wheel steering models are considered and analysed from the perspective of control. In ref. [Reference Defoort and Murakami8], a dynamic model is presented, which considers the geometric-stabilization mechanisms due to bicycle trail. It is pointed out that the geometry is the most important factor that affects the stability of the bicycle. In ref. [Reference Xiong, Wang and Liu9], the dynamics of the Whipple bicycle is analysed, and a complete non-linear model is constructed, from which the equilibrium point of the bicycle in both straight and circular motions is determined. In ref. [Reference Tofigh, Mahjoob, Hanachi and Ayati10], the gyrostabilizer is used as the actuator to keep the UB balance, and the dynamic model of the UB together with the gyrostabilizer is developed using Newton–Euler method. Unfortunately, gyrostabilizers can dramatically increase the bicycle’s mass. Inspired by [Reference Grau, Kapitola, Weiss and Noack11], this work installs an independent reaction wheel (RW) in the UB. RW is usually used in spacecraft attitude control due to the advantages of simple structure, low cost and long duration. When the bicycle tilts, a high-performance motor-driven RW actively provides an anti-roll torque, which counteracts the tilting torque and balances the bicycle. Owing to the RW, the model of the UB is changed from a conventional single-input double-output system to a single-input single–output system, and as a result, the balancing control of the UB is greatly simplified.

On the other hand, the superior control strategy is also crucial for the UB self-balance. Many non-linear control strategies have been developed for non-linear systems, such as model-predictive control [Reference Buizza Avanzini, Zanchettin and Rocco12Reference Nfaileh, Alipour, Tarvirdizadeh and Hadi13], fuzzy control [Reference Dimeas, Sako, Moulianitis and Aspragathos14Reference Li, Zhang, Wu, Chen, Liu and Yao15], robust control [Reference Chocoteco, Morales, Feliu and Sira-Ramírez16], adaptive control [Reference Pierri, Muscio and Caccavale17] and sliding mode control (SMC) [Reference Sariyildiz, Mutlu and Yu18Reference Villanueva, Luque-Vega, González-Jiménez and Arellano-Muro20]. Among these methods, as a classical non-linear control algorithm, SMC has been widely used in a variety of mechatronic systems due to its fast response, high precision and strong robustness against external disturbances. In ref. [Reference Liu, Liu and Wu21], a novel adaptive sliding controller based on a robust disturbance observer is designed for non-linear uncertain robotic systems to deal with the tracking control problem. In ref. [Reference Qiao and Zhang22], an adaptive non-singular integral terminal SMC for AUVs is presented to eliminate the need for pre-known uncertainty bounds, which offers the improved tracking accuracy, rapid convergence and robustness against uncertainties and disturbances. In ref. [Reference Qiao and Zhang23], an adaptive fast non-singular integral terminal SMC, based on desired trajectory, is proposed for AUVs, achieving faster convergence rates in AUV dynamics. In ref. [Reference Lin24], an adaptive non-singular terminal sliding mode tracking controller is designed for robotic systems using fuzzy wavelet networks to improve the control performance. In ref. [Reference Ma, Liu, Xiong, Li, Liu and Sun25], a novel predefined-time barrier function (BF) adaptive SMC strategy for robust control of disturbed systems is designed. This approach ensures that, should an escape event occur at any given time instant, the system trajectory is guaranteed to return to its ultimate bound within a predefined time frame. SMC has also been applied to UB due to its superior performance and strong robustness. In ref. [Reference Chiu and Wu26], a gyroscopic balancer based on fuzzy SMC is used to control the bicycle, wherein the roll angle is controlled by a backstepping algorithm. In ref. [Reference Hwang, Wu and Shih27], fuzzy SMC is designed, wherein the large uncertainty caused by complex ground conditions is well tackled by the inherent under actuated control scheme. Moreover, in the recent years, many artificial intelligence technologies are applied to the control field. In ref. [Reference Yao, Xu, Jiang and Yao28], reinforcement learning is introduced for high-accuracy tracking control of 6-degree-of-freedom (6-DOF) hydraulic robotic manipulators, demonstrating its effectiveness in providing system-level performance guarantees through experimental validation on a 6-DOF platform.

Based on the above discussions, for the purpose of achieving excellent balancing control of the UB, a BF-based adaptive integral terminal sliding mode (BFITSM) control scheme combined with an extreme learning machine (ELM) observer is proposed in this paper. The main contributions of this article are summarized as follows:

  1. 1) An adaptive integral terminal sliding mode controller combining ITSM and BF techniques is designed such that the reaching phase can be efficiently eliminated to improve the convergence performance. Since the BF-based adaptive gain is updated according to the lumped uncertainty, overestimation can be effectively suppressed. Moreover, the size of the region that the roll angle tracking error converges to can be exactly predefined. However, the parameter selection of BF is one of the difficulties in the controller design.

  2. 2) To counteract the effects of uncertain disturbances and reduce the chattering phenomenon, an ELM observer is developed to estimate the lumped uncertainty. The output weights of the proposed ELM are adaptively adjusted to ensure the Lyapunov stability of the closed-loop control. As such, the estimation of the ELM observer is used as compensating control input to further improve the closed-loop control performance. To achieve better training effect, the value of the $\boldsymbol{O}_{i}$ should be appropriately chosen. In this work, the output weight $\boldsymbol{O}_{\boldsymbol{i}}$ is chosen according to the Lyapunov function to keep the control system stable and achieve the excellent estimate.

  3. 3) Combined with the BF adaptive gain and ELM techniques, a finite-time control strategy that without any disturbance upper bound information and has excellent convergence performance is realized and applied to the UB experimental to verify the effectiveness. Experimental results demonstrate the excellent control performance of the proposed controller.

The reminder of this paper is organized as follows. In Section 2, the uncertain dynamic model of the RW-based UB is presented. An adaptive ITSM controller based on ELM observer is designed in Section 3, and the closed-loop stability analysis is rigorously given in detail. In Section 4, experiment studies on an actual UB are carried out by comparison with conventional control schemes. Section 5 finally concludes the paper.

Figure 1. Bicycle structure. (a) Top view, and (b) Rear view.

2. Modeling of reaction wheel unmanned bicycle

Fig. 1(a) and 1(b) shows the top and rear views for the RW-based UB system, respectively. The dynamics of the UB can be described by the following second-order differential equation [Reference Astrom, Klein and Lennartsson7]:

(1) \begin{equation}J\ddot{\theta }=\frac{mfhv}{c}\dot{\delta }+\frac{mhv^{2}}{c}\delta +mgh\sin (\theta)+d-u\end{equation}

For a better illustration, the signification of the notations appeared in Fig. 1 and Eq. (1) are given in Table I.

Table I. Notations description.

Since the roll angle of the bicycle is kept within a relatively small range in the process of motion, $\sin (\theta )$ in (1) can be approximated as $\theta$ [Reference Chen, Yan, Wang, Shao, Kurniawan and Wang29]. For simplification, $\frac{mf}{c}$ and $\frac{mh}{c}$ are denoted as $N$ and $P$ , respectively, and $mgh$ is denoted as $M$ . Then, at a small roll angle $\theta$ , (1) can be rewritten as

(2) \begin{equation}J\ddot{\theta }=Nv\dot{\delta }+Pv^{2}\delta +M\theta +d-u\end{equation}

Since it is impossible to obtain accurate parameter values in a practical UB system, parametric uncertainties have to be considered in the physical model. The system parameters can be normally divided into a nominal part and an uncertain part as follows [Reference Sariyildiz, Mutlu and Yu18]:

(3) \begin{equation}J=J_{0}+\Delta J\end{equation}
(4) \begin{equation}N=N_{0}+\Delta N\end{equation}
(5) \begin{equation}P=P_{0}+\Delta P\end{equation}
(6) \begin{equation}M=M_{0}+\Delta M\end{equation}

where $J_{0}$ , $N_{0}$ , $P_{0}$ and $M_{0}$ are the nominal values determined from preliminary experiments, and $\Delta J$ , $\Delta N$ , $\Delta P$ and $\Delta M$ are the corresponding model parameter errors, respectively. Then, the dynamical Eq. (2) can be rewritten as

(7) \begin{equation}J_{0}\ddot{\theta }=N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +l-u\end{equation}

where $l$ is the lumped uncertainty, which is given by

(8) \begin{equation}l=d+\Delta Nv\dot{\delta }+\Delta Pv^{2}\delta +\Delta M\theta -\Delta J\ddot{\theta }\end{equation}

According to Appendix A, the closed-loop control signal is chosen to be upper bounded by the following polynomial function in this paper:

(9) \begin{equation}\left| u\right| \lt \zeta _{0}+\zeta _{1}\!\left| \theta \right| +\zeta _{2}|\dot{\theta |}\end{equation}

Then the lumped uncertainty is bounded by [Reference Hu and Wang30], the detailed derivation process is given in Appendix B:

(10) \begin{equation}\left| l\right| \lt \overline{l}\end{equation}

where $\overline{l}$ is the upper bound of $l$ , given by

(11) \begin{equation}\overline{l}=\mu _{0}+\mu _{1}\!\left| \theta \right| +\mu _{2}|\dot{\theta |}\end{equation}

where $\zeta _{0}$ , $\zeta _{1}$ , $\zeta _{2}$ , $\mu _{0}$ , $\mu _{1}$ and $\mu _{2}$ are unknown positive constants.

The control objective is to keep the roll angle of the UB within a small range under the impact of system uncertainty. To balance the UB, an appropriate reaction force $u$ can be produced such that the roll angle $\theta$ can be retained to be zero with high precision, fast response and strong robustness.

Remark 1. In practice, it is quite difficult to obtain the accurate bound information of the lumped uncertainty in (11). To deal with the effect of the lumped uncertainty, a conservatively large constant is usually selected in conventional SMC systems. However, the overly large bound may lead to severe control chattering and large control efforts. Although boundary layer technique can be normally used to alleviate this phenomenon, the control precision will be accordingly degraded. In this paper, in order to achieve better control performance, an ELM observer and BF-based adaptive ITSM controller will be designed such that not only the need for the prior uncertainty information can be eliminated wherein the control gain will be adaptively adjusted but also the control precision can be exactly ensured by tuning parameters.

3. Design of controller

In this section, we develop an ELM-based BFITSM (ELM-BFITSM) controller for the RW-based UB system. First, a feedback control component using ITSM is designed without considering the system uncertainty. Then, the compensating control component based on ELM and the reaching control component using BF-based adaptive law are designed to deal with the lumped uncertainty.

3.1. Design of ELM-BFITSM

In this work, the tracking error $e$ is defined as

(12) \begin{equation}e=\theta -\theta _{d}\end{equation}

where $\theta _{d}=0$ is the target roll angle which equals to zero under absolutely balanced state. Then, according to Eq. (7), the second derivative of error $e$ can be obtained as follows when $\theta _{d}=0$ :

(13) \begin{equation}\ddot{e}=\ddot{\theta }=\frac{1}{J_{0}}\left(N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +l-u\right)\end{equation}

To derive the controller, a novel integral terminal sliding mode variable $s$ is designed as [Reference Chen, Liu, Wang, Hu, Zheng, Ye and Zhang31Reference Ye and Wang32]:

(14) \begin{equation}s=\dot{e}+\alpha _{1}e+\alpha _{2}e_{I}\end{equation}
(15) \begin{equation}e_{I}=\begin{cases} e_{I}(0),\qquad t=0\\[8pt] \int _{0}^{t}e^{\dfrac{q_{1}}{q_{2}}}d\tau\quad t\gt 0 \end{cases}\end{equation}

where $\alpha _{1}\gt 0$ and $\alpha _{2}\gt 0$ are constants, $q_{1}$ and $q_{2}$ are odd integers satisfying $q_{2}\gt q_{1}\gt 0$ to keep $e^{q_{1}/q_{2}}$ a real number and ensure finite time convergence of $e$ [Reference Feng, Yu and Man33]. The initial value $e_{I}(0)$ is set as

(16) \begin{equation}e_{I}(0)=\frac{\dot{e}(0)+\alpha _{1}e(0)}{\alpha _{2}}\end{equation}

where $e(0)$ and $\dot{e}(0)$ are the initial values of the actual error status available for eliminating the reaching time. Ignoring the lumped uncertainty and combining $\dot{s}=0$ with (13), the equivalent control input of the proposed sliding mode controller can be designed as

(17) \begin{equation}u_{0}=N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +J_{0}\!\left(\alpha _{1}\dot{e}+\alpha _{2}\dot{e_{I}}\right)\end{equation}

Next, based on the ELM technique, the compensating control input $u_{1}$ of the closed-loop control system is designed as

(18) \begin{equation}u_{1}=\hat{l}\end{equation}

where $\hat{l}$ is the estimate value of the lumped uncertainty via ELM that will be given later.

Furthermore, in order to achieve that the system converges from arbitrary initial state to the preset sliding mode surface quickly, a reaching control input is introduced as follows:

(19) \begin{equation}u_{2}=J_{0}\hat{k}\text{sign}(s)\end{equation}

where $\hat{k}$ represents an adaptive control gain.

Figure 2. Schematic diagram of the proposed ELM-BFITSM controller for UB.

The schematic diagram of the proposed ELM-BFITSM is shown in Fig. 2. In this paper, we define a small positive number $\varepsilon$ by which to define the region of ( $-\varepsilon,\varepsilon$ ) that the sliding variable will converge to [Reference Shao, Zheng, Wang, Wang, Lu and Man34Reference Labbadi, Hashim, Eltoukhy and Djemai35]. If the initial value of the sliding mode variable $s$ is outside the region $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ , namely $\left| s(0)\right| \gt \frac{\varepsilon }{2}$ , it will converge to the region under a monotonically increasing gain. It defines $\overline{t}$ as the time instant when the initial state firstly converges into the above region. After the initial value of $s$ is in this region, namely $\left| s(0)\right| \leq \frac{\varepsilon }{2}$ , a BF-based gain will be activated. Explicitly, the proposed adaptive law is designed as

(20) \begin{equation}\hat{k}=\left\{\begin{array}{l@{\quad}c} \mu \int _{0}^{t}\exp\! (\sigma | s|)\left| s\right| d\tau, & \text{for }0\leq t\leq \overline{t}\\[10pt] f_{b}(s), & \textrm{for}\ t\gt \overline{t} \end{array}\right.\end{equation}

where the parameters $\sigma \geq 0$ , $\mu \gt 0$ , and the initial value of $\hat{k}(0)\gt 0$ . $\overline{t}$ is the time instant that the sliding variable converges to the region of $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ , i.e., $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}$ . $f_{b}(s)$ is the BF which is defined as

(21) \begin{equation}f_{b}(s)=\frac{\left| s\right| }{\varepsilon -\left| s\right| }\end{equation}

Remark 2. When $t\leq \overline{t}$ , under $\hat{k}=\mu \int _{0}^{t}\exp\! (\sigma | s| )| s| d\tau$ , the sliding variable $s$ will quickly converge to the region $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ . When $t\gt \overline{t}$ , $\hat{k}$ is updated by the BF function $f_{b}(s)$ , whose output changes with the variation of $s$ . The reason why BF can reduce the chattering effect compared to constant gain is that: if $| s|$ gradually increases in $[0,\varepsilon )$ , $f_{b}(s)$ will also accordingly increase to a large number to pull $| s|$ back to the predefined region; if $| s|$ reaches the sliding surface, $\hat{k}$ will reduce to a small number. Meanwhile, in this way, it can guarantee that the control gain of the ELM-BFITSM is not overestimated.

Remark 3. In conventional SMC, an overlarge constant $k$ is usually selected as the gain of reaching law to keep the system stable, which requires the upper bound of $l$ at advance and leads to a serious chattering problem. The reason is that when $s$ crosses 0 frequently, the control law changes discontinuously in two values of $\pm k$ . In this paper, because of the introduction of BF, the final $u_{2}$ becomes continuous. Since the sliding variable $s$ is changing continuously, the $f_{b}(s)$ will be continuous either. It can be seen from Fig. 3 that when $s$ crosses 0, the final $u_{2}$ is continuously changed despite the involved signum function. Therefore, the signal chattering in control input can be effectively reduced. Moreover, $l$ is compensated by ELM ( $u_{1}$ ) such that the control gain needs to be greater than the estimation error and does not need to be greater than the upper bound, which will effectively reduce the chattering problem. Moreover, the BF function will further reduce chattering as we discussed above.

Figure 3. The scheme of barrier function.

3.2. Design of ELM observer

The architecture of the ELM observer is shown in Fig. 4. The weight and bias of the input layer of ELM proposed by Huang [Reference Huang, Zhu and Siew36] are randomly generated without reverse adjustment, and the output layer weights are calculated directly through a least squares solution. Conversely, the radial basis function (RBF) necessitates the employment of iterative methodologies to ascertain the optimal values for the centres, widths, and weights of the network and require an iterative optimization process to adjust their parameters, which can be more time consuming. Therefore, compared with the conventional neural network algorithm, ELM has a lower algorithm complexity and a faster learning speed. The training process of ELM is mainly divided into two steps.

Figure 4. ELM observer architecture.

Step 1: Random generation of input weights $\boldsymbol{W}_{\boldsymbol{i}}$ and bias $\boldsymbol{b}_{\boldsymbol{i}}$

The output of the hidden layer is the input multiplied by the corresponding weight plus its deviation, and then it is summed by a non-linear function (activation function) of all the nodes. When the sample $X_{1}$ is input, the corresponding hidden layer output is as follows:

(22) \begin{equation}\boldsymbol{h}_{i}\!\left(X_{1}\right)=\boldsymbol{G}\!\left(W_{i}X_{1}+b_{i}\right) i=1,2,\cdots,L\end{equation}

where $\boldsymbol{h}_{i}(X_{1})$ is the output of the i-th hidden layer node when the input sample $X_{1}$ is input, and G (·) is the activation function. We can obtain the output matrix H (X) of the hidden layer:

(23) \begin{align}\boldsymbol{H} & =\left(\begin{array}{c@{\quad}c@{\quad}c} \boldsymbol{h}_{\textbf{1}}\!\left(X_{1}\right) & \cdots & {\boldsymbol{h}_{\boldsymbol{L}}\!\left(X_{1}\right)}\\ \cdots & \ddots & \cdots\\ \boldsymbol{h}_{\textbf{{1}}}\!\left(X_{U}\right) & \cdots & {\boldsymbol{h}_{\boldsymbol{L}}\!\left(X_{U}\right)}\end{array}\right)\nonumber\\[8pt]& =\left(\begin{array}{c@{\quad}c@{\quad}c} \boldsymbol{G}\!\left(X_{1}W_{1}+b_{1}\right) & \cdots & \boldsymbol{G}\!\left(X_{1}W_{L}+b_{L}\right)\\ \vdots & \ddots & \vdots \\ \boldsymbol{G}\!\left(X_{U}W_{1}+b_{1}\right) & \cdots & \boldsymbol{G}\!\left(X_{U}W_{L}+b_{L}\right) \end{array}\right)\in R^{U\times L}\end{align}

The network output of the feedforward neural network ELM with a single hidden layer is as follows:

(24) \begin{equation}\boldsymbol{HO}=\boldsymbol{T}\end{equation}

where $\boldsymbol{T}=\left[\boldsymbol{t}_{1}^{\textrm{T}},\boldsymbol{t}_{2}^{\textrm{T}},\cdots,\boldsymbol{t}_{U}^{\textrm{T}}\right]^{\textrm{T}}\in R^{U\times m}$ , and $\boldsymbol{O}$ is the output weight, which will be obtained in Step 2.

Step 2: Obtain the output weight $\boldsymbol{O}_{\boldsymbol{i}}$ by minimizing the loss function

The goal of single hidden layer neural network learning is to minimize the output error, which is the difference between the network output $\boldsymbol{T}$ and the objective function $\boldsymbol{Y}$ . In order to achieve better training effect, it needs to choose the value of the $\boldsymbol{O}_{i}$ . That is, the weight $\boldsymbol{O}_{i}$ that connects the hidden layer. The output layer is solved by minimizing the approximate square variance. The objective function is as follows:

(25) \begin{equation}\left\| \boldsymbol{T}^{\boldsymbol{*}}-\boldsymbol{Y}\right\| =\textbf{min } \left\| \boldsymbol{T}-\boldsymbol{Y}\right\|\end{equation}

The above equation can also be written in the following form:

(26) \begin{equation}\left\| \boldsymbol{H}\!\left(\boldsymbol{X},\boldsymbol{W},\boldsymbol{b}\right)\boldsymbol{O}^{\boldsymbol{*}}-\boldsymbol{Y}\right\| = \boldsymbol{\min\limits_{\boldsymbol{O}}} \!\left\| \boldsymbol{H}\!\left(\boldsymbol{X},\boldsymbol{W},\boldsymbol{b}\right)\boldsymbol{O}-\boldsymbol{Y}\right\|\end{equation}

where $\boldsymbol{O}^{\textrm{*}}$ is the optimal solution that requires, and it can be solved by the following equation:

(27) \begin{equation}\boldsymbol{O}^{\boldsymbol{*}}=\boldsymbol{H}^{\dagger }\boldsymbol{Y}\end{equation}

where $\boldsymbol{H}^{\dagger }$ is the generalized inverse of $\boldsymbol{H}$ .

Therefore, the estimated lumped uncertainty $\hat{l}$ in (18) can be obtained via the ELM mechanism modelled as

(28) \begin{equation}\hat{l}=\boldsymbol{H}\!\left(\boldsymbol{x},\boldsymbol{w},\boldsymbol{b}\right)\hat{\boldsymbol{O}}\end{equation}

where $X_{in}(0)=[\theta,\dot{\theta }]^{\textrm{T}}$ is input vector and $\hat{\boldsymbol{O}}$ is an design function. The output matrix of the ELM is also designed as a piecewise function as shown below:

(29) \begin{equation}\dot{\hat{\boldsymbol{O}}}^{T}=\left\{\begin{array}{l@{\quad}c} \rho s\boldsymbol{H}, & \text{for }0\leq t\leq \overline{t}\\[5pt] J_{0}\rho \!\left(\dfrac{s}{J_{0}}+\dfrac{\hat{k}}{J_{0}}\dfrac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\boldsymbol{H}, & \textrm{for}\ t\gt \overline{t} \end{array}\right.\end{equation}

where $\rho$ is a positive constant and $\hat{k}$ is the adaptive gain given in (20). It should be noted that the initial condition of $\dot{\hat{\boldsymbol{O}}}^{T}$ depends on the initial condition $X_{in}(0)=[\theta (0),\dot{\theta }(0)]^{\textrm{T}}$ . In real applications, to keep a balanced state, we often select the initial condition as $\theta (0)=\dot{\theta }(0)=0$ .

The estimate value of lumped uncertainty is approximated by

(30) \begin{equation}l=\boldsymbol{H}\boldsymbol{O}^{\boldsymbol{*}}+\zeta\end{equation}

where $l$ is the ideal output of single hidden-layer feedforward network, $\boldsymbol{O}^{*}$ is the optimal constant output weight vector, and $\zeta$ is the approximation error which is assumed to be bounded by

(31) \begin{equation}\max \!\left\{| {J_{0}}^{-1}\tilde{l}|, | {J_{0}}^{-1}\zeta | \right\}\lt k\end{equation}

where $\tilde{l}=l-\hat{l}$ , and $k$ denotes the bound of the disturbance that is not compensated.

Remark 4. In this article, the inputs of ELM are chosen as $X_{in}=[\theta,\dot{\theta }]^{\textrm{T}}$ . The concealed layer input weights and biases are generated randomly according to the Gaussian distribution, and the output weights is obtained from the generalized inverse matrix. Different from the conventional ELM algorithm proposed for classification problems [Reference Huang, Lu, Tao, Li, Tan and Liu37Reference Yan, Ji, Lu, Huang, Shen and Xue38], the proposed ELM algorithm estimates the lumped uncertainty and takes the estimate value as the compensating control input as shown in (18).

3.3. Stability analysis

Theorem 1: For system (2), if the sliding function is chosen as (14) and the controller is designed as

(32) \begin{equation}u=u_{0}+u_{1}+u_{2}\end{equation}

where $u_{0}$ is the equivalent control input (17), $u_{1}$ is the compensating control input (18) obtained by the ELM observer (28), and $u_{2}$ is the reaching control input (19). The proposed controller effectively avoids the singularity problem. Then, under the proposed controller (32), the sliding variable converges to ( $-\varepsilon,\varepsilon$ ) and the tracking error converges to $\left(-\dfrac{\varepsilon }{\alpha _{1}},\dfrac{\varepsilon }{\alpha _{1}}\right)$ , both in finite time.

Proof : According to (20), the proof will be accomplished by two steps, i.e., $0\leq t\leq \overline{t}$ and $t\gt \overline{t}$ .

Step 1. $\textbf{0}\leq \boldsymbol{t}\leq \overline{\boldsymbol{t}}$ for the control law (20)

Select a Lyapunov function $V_{1}$ as

(33) \begin{equation}V_{1}=L_{1}+\frac{\tilde{\boldsymbol{O}}^{T}\tilde{\boldsymbol{O}}}{2J_{0}\rho }\end{equation}

where $L_{1}=\frac{s^{2}}{2}+\frac{\tau }{2}\tilde{k}^{2}$ with $\tau \gt 1$ , $\tilde{\boldsymbol{O}}$ is the estimation output error of the ELM output layer given by:

(34) \begin{equation}\tilde{\boldsymbol{O}}=\boldsymbol{O}^{*}-\hat{\boldsymbol{O}}\end{equation}

and $\tilde{k}$ is the adaptive error $\tilde{k}=\hat{k}-k$ .

Taking the derivative of $V_{1}$ with respect to time, we obtain:

\begin{align*} \dot{V_{1}} & =s\dot{s}+\tau \tilde{k}\dot{\hat{k}}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}}\\ & =s\!\left(\ddot{e}+\alpha _{1}\dot{e}+\alpha _{2}\dot{e_{I}}\right)+\tau \tilde{k}\dot{\hat{k}}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}}\\ & =s\!\left(\frac{1}{J_{0}}\left(N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +l-u_{0}-u_{1}-u_{2}\right)+\alpha _{1}\dot{e}+\alpha _{2}\dot{e_{I}}\right)+\tau \tilde{k}\dot{\hat{k}}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}} \end{align*}

Next, substituting (17), (18) and (19) into the above equation and combining (28) with (30) yield:

(35) \begin{align}\dot{V_{1}} & =\frac{s}{J_{0}}\left(l-\hat{l}-J_{0}\hat{k}\text{sign}(s)\right)+\tau \tilde{k}\dot{\hat{k}}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}}\nonumber\\[4pt] & =\frac{s}{J_{0}}\left(\boldsymbol{H}(\boldsymbol{O}^{*}-\hat{\boldsymbol{O}})+\zeta -J_{0}\hat{k}\text{sign}(s)\right)+\tau \tilde{k}\dot{\hat{k}}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}}\nonumber\\[4pt] & =\frac{s}{J_{0}}\boldsymbol{H}\tilde{\boldsymbol{O}}+\frac{1}{J_{0}\rho }{\dot{\tilde{\boldsymbol{O}}}}{}^{T}\tilde{\boldsymbol{O}}+\frac{s}{J_{0}}(\zeta -J_{0}\hat{k}\text{sign}(s))+\tau \tilde{k}\dot{\hat{k}}\nonumber\\[4pt] & =\left(\frac{s}{J_{0}}\boldsymbol{H}+\frac{1}{J_{0}\rho }{\dot{\tilde{\boldsymbol{O}}}}{}^{T}\right)\tilde{\boldsymbol{O}}+\frac{s}{J_{0}}\left(\zeta -J_{0}\hat{k}\text{sign}(s)\right)+\tau \tilde{k}\dot{\hat{k}}\end{align}

If $\dot{\hat{\boldsymbol{O}}}^{T}$ in (28) is chosen as

(36) \begin{equation}\dot{\hat{\boldsymbol{O}}}^{T}=\rho s\boldsymbol{H}\end{equation}

The disturbance in UB system is slow time varying, so there exists a slow-changing optimal parameter $\boldsymbol{O}^{\textrm{*}}$ , whose derivative is a very small number such that $\dot{\tilde{\boldsymbol{O}}}^{T}=-\dot{\hat{\boldsymbol{O}}}^{T}=-\rho s\boldsymbol{H}$ . Then, considering (20), (35) can be rewritten as

\begin{align*} \dot{V_{1}} & =\frac{1}{J_{0}}\left(\zeta s-J_{0}s\hat{k}\text{sign}(s)\right)+\tau \tilde{k}\dot{\hat{k}}\\[4pt] & =\frac{1}{J_{0}}\big(\zeta s-J_{0}\hat{k}| s|\big)+\tau (\hat{k}-k)\mu \exp\! (\sigma | s|)\left| s\right| +\left(k-k\right)\left| s\right| \\[4pt]& \leq \frac{1}{J_{0}}\left(\left| \zeta \right| \left| s\right| -J_{0}\hat{k}\!\left| s\right| \right)+\tau (\hat{k}-k)\mu \exp\! (\sigma | s|)\left| s\right| +\left(k-k\right)\left| s\right| \\[4pt] & \leq \frac{1}{J_{0}}\left(\left| \zeta \right| \left| s\right| -J_{0}k\!\left| s\right| \right)+\tau (\hat{k}-k)\mu \exp\! (\sigma | s|)\left| s\right| -(\hat{k}-k)\left| s\right| \\[4pt] & =-\frac{1}{J_{0}}\left(J_{0}k\!\left| s\right| -\left| \zeta \right| \left| s\right| \right)+(\hat{k}-k)\left| s\right| \left(\tau \mu \exp\! (\sigma | s|)-1\right) \end{align*}

Till to now, it has been proved that for $0\leq t\leq \overline{t}$ the adaptive gain $\hat{k}=\eta \int _{0}^{t}\exp\! (\alpha | s| )| s| d\tau$ is bounded [Reference Shao, Zheng, Huang, Wang, Man and Fu39]. Then, defining $k$ as the upper bound of $\hat{k}$ that satisfies (31), we get $\hat{k}-k=-| k-\hat{k}|$ , and as a result,

(37) \begin{align}\dot{V_{1}} & \leq -\frac{1}{J_{0}}\left(J_{0}k\!\left| s\right| -\left| \zeta \right| \left| s\right| \right)-\big| k-\hat{k}\big| \left| s\right| \left(\tau \mu \exp\! (\sigma | s|)-1\right)\nonumber\\[5pt]& =-\left(k-\left| {J_{0}}^{-1}\zeta \right| \right)|s|-\left(\tau \mu \exp\! (\sigma | s|)-1\right)\left| s\right| \left| \tilde{k}\right|\end{align}

Recalling (31) and considering that for any $\mu$ and $\sigma$ there exists a positive number $\tau$ such that $\tau \mu \exp\! (\sigma | s| )\gt 1$ , we have

\begin{equation*}\dot{V_{1}}\lt 0\end{equation*}

for any $\left| s\!\left(t\right)\right| \gt \frac{\varepsilon }{2}$ , i.e., $\tilde{\boldsymbol{O}}\rightarrow 0$ which implies the convergence of the estimation. Particularly, for the Lyapunov function $L_{1}$ , by following a similar procedure, we can obtain:

\begin{equation*} \dot{L_{1}}\leq -\left(k-\left| {J_{0}}^{-1}\tilde{l}\right| \right)|s|-\left(\tau \mu \exp\! (\sigma | s|)-1\right)\left| s\right| \left| \tilde{k}\right| \end{equation*}

By defining the following symbols:

\begin{align*} \varrho _{1}& =k-\left| {J_{0}}^{-1}\tilde{l}\right| \nonumber\\ \varrho _{2} & =\left(\tau \mu \exp\! (\sigma | s|)-1\right)|s| \end{align*}

where $\varrho _{1}$ , $\varrho _{2}\gt 0$ , we further obtain:

(38) \begin{align} \dot{L_{1}} & \leq -\varrho _{1}\!\left| s\right| -\varrho _{2}\!\left| \tilde{k}\right| \nonumber\\ & =-\varrho _{1}\sqrt{2}\frac{\left| s\right| }{\sqrt{2}}-\varrho _{2}\sqrt{2}\frac{\left| \tilde{k}\right| }{\sqrt{2}}\nonumber\\ & \leq -\vartheta _{1}\!\left(\frac{\left| s\right| }{\sqrt{2}}+\frac{\left| \tilde{k}\right| }{\sqrt{2}}\right)\nonumber\\ & \leq -\vartheta _{1}{L_{1}}^{\frac{1}{2}} \end{align}

where $\vartheta _{1}=\min \{\sqrt{2}\varrho _{1},\sqrt{2}\varrho _{2}\}.$ Since $\vartheta _{1}\gt 0$ , there must exist a positive constant $T_{1}$ such that:

(39) \begin{equation}T_{1}\leq \vartheta _{1},\forall \left| s\!\left(t\right)\right| \gt \frac{\varepsilon }{2}\end{equation}

From (38), we have

(40) \begin{equation}\dot{L_{1}}\leq -T_{1}{L_{1}}^{\frac{1}{2}}\end{equation}

Therefore, the inequality (40) satisfies the finite time stability criterion in Appendix C, which shows that there exists a finite time $\overline{t}$ for the sliding variable to converge to $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ which is bounded by

(41) \begin{equation}\overline{t}\leq \frac{{L_{1}}^{\frac{1}{2}}(0)-{L_{1}}^{\frac{1}{2}}(\overline{t})}{0.5T_{1}}\end{equation}

Step 2: $\boldsymbol{t}\gt \overline{\boldsymbol{t}}$ for the control law (20).

To prove the stability, the following intermediate variable is introduced:

(42) \begin{equation}\varnothing =\varepsilon \frac{\left| \zeta \right| }{\left| \zeta \right| +J_{0}}\lt \varepsilon\end{equation}

In order to prove that the sliding variable $s$ will be retained in the region of $({-}\varepsilon,\varepsilon )$ thereafter, we will first prove that $| s(t)| \leq \varnothing \lt \varepsilon$ will be ensured in finite time $t_{s}$ from $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}\gt \varnothing $ . Considering the case of $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}\gt \varnothing $ , a Lyapunov function $V_{2}$ is defined as

(43) \begin{equation}V_{2}=L_{2}+\frac{\tilde{\boldsymbol{O}}^{T}\tilde{\boldsymbol{O}}}{2J_{0}\rho }\end{equation}

where $L_{2}=\frac{s^{2}}{2}+\frac{1}{2}\hat{k}^{2}$ . Taking the derivative of $V_{2}$ with respect to time yields.

\begin{equation*} \dot{V_{2}}=s\dot{s}+\hat{k}\dot{\hat{k}}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}} \end{equation*}

According to Eqs. (17)-(20) and following a similar procedure in Step 1), we obtain:

\begin{align*} \dot{V_{2}} & =\frac{s}{J_{0}}\left(l-J_{0}\hat{k}\text{sign}(s)-\hat{l}\right)+\hat{k}\dot{\hat{k}}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}}\\& =\frac{s}{J_{0}}\left(l-J_{0}\hat{k}\text{sign}(s)-\hat{l}\right)+\hat{k}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\dot{s}+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}}\\& =\left(\frac{s}{J_{0}}+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\left(l-J_{0}\hat{k}\text{sign}(s)-\hat{l}\right)+\frac{1}{J_{0}\rho }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}} \end{align*}

where the fact of $\dot{s}=\frac{1}{J_{0}}\left(l-J_{0}\hat{k}\text{sign}(s)-\hat{l}\right)$ as indicated by (35) has been used. Furthermore, combining (28) with (30), we have

(44) \begin{align}\dot{V_{2}} & =\left(\frac{s}{J_{0}}+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\left(\boldsymbol{H}\tilde{\boldsymbol{O}}+\zeta -J_{0}\hat{k}\text{sign}(s)\right)+\frac{1}{J_{0}\mu }\dot{\tilde{\boldsymbol{O}}}^{T}\tilde{\boldsymbol{O}}\nonumber\\[4pt]& =\left(\left(\frac{s}{J_{0}}+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\boldsymbol{H}+\frac{1}{J_{0}\rho }{\dot{\tilde{\boldsymbol{O}}}{}^{T}}\right)\tilde{\boldsymbol{O}}+\left(\frac{s}{J_{0}}+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\left(\zeta -J_{0}\hat{k}\text{sign}(s)\right)\end{align}

If $\dot{\hat{\boldsymbol{O}}}^{T}$ in (28) is chosen as

(45) \begin{equation}\dot{\hat{\boldsymbol{O}}}^{T}=J_{0}\rho \!\left(\frac{s}{J_{0}}+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\boldsymbol{H}\end{equation}

such that:

\begin{equation*}\dot{\tilde{\boldsymbol{O}}}^{T}=-\dot{\hat{\boldsymbol{O}}}^{T}=-J_{0}\rho \!\left(\frac{s}{J_{0}}+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\boldsymbol{H}\end{equation*}

Then, (43) can be expressed as

(46) \begin{align}\dot{V_{2}} & =\left(\frac{s}{J_{0}}+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\text{sign}(s)\right)\left(\zeta -J_{0}\hat{k}\text{sign}(s)\right)\nonumber\\[3pt]& =\frac{1}{J_{0}}\big(\zeta s-J_{0}\hat{k}| s|\big)+\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\left(\zeta \text{sign}(s)-J_{0}\hat{k}\right)\nonumber\\[3pt]& \leq -\frac{1}{J_{0}}\left(J_{0}\hat{k}\!\left| s\right| -\left| \zeta \right| \left| s\right| \right)-\frac{\hat{k}}{J_{0}}\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\left(J_{0}\hat{k}-\left| \zeta \right| \right)\nonumber\\[3pt]& =-\left(\hat{k}-\left| {J_{0}}^{-1}\zeta \right| \right)\left| s\right| -\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\left(\hat{k}-\left| {J_{0}}^{-1}\zeta \right| \right)\hat{k}\end{align}

According to (42), one can obtain that $\hat{k}(s)\gt \hat{k}(\varnothing )=| {J_{0}}^{-1}\zeta |$ . Then, we obtain:

\begin{equation*} \dot{V_{2}}\lt 0 \end{equation*}

for any $\varnothing \lt \left| s\!\left(t\right)\right| \leq \frac{\varepsilon }{2}$ which demonstrates that the estimation is converging. Moreover, by defining another intermediate variable $\varnothing^{\prime}=\varepsilon \frac{\left| \tilde{l}\right| }{\left| \tilde{l}\right| +J_{0}}$ and following a similar procedure, it can be obtained that:

\begin{equation*} \dot{L_{2}}\leq -\left(\hat{k}-\left| {J_{0}}^{-1}\tilde{l}\right| \right)\left| s\right| -\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\left(\hat{k}-\left| {J_{0}}^{-1}\tilde{l}\right| \right)\hat{k} \end{equation*}

Define the following symbols:

(47) \begin{equation}\varrho _{1}=\hat{k}-\left| {J_{0}}^{-1}\tilde{l}\right|\end{equation}
(48) \begin{equation}\varrho _{2}=\frac{\varepsilon }{\left(\varepsilon -\left| s\right| \right)^{2}}\left(\hat{k}-\left| {J_{0}}^{-1}\tilde{l}\right| \right)\end{equation}

which are both positive since $\hat{k}(s)\gt \hat{k}(\varnothing^{\prime})=| {J_{0}}^{-1}\tilde{l}|$ . Then, we have

(49) \begin{align}\dot{L_{2}} & \leq -\varrho _{1}\left| s\right| -\varrho _{2}\hat{k}\nonumber\\[3pt] & =-\varrho _{1}\sqrt{2}\frac{\left| s\right| }{\sqrt{2}}-\varrho _{2}\sqrt{2}\frac{1}{\sqrt{2}}\hat{k}\nonumber\\[3pt] & \leq -\vartheta _{2}\!\left(\frac{\left| s\right| }{\sqrt{2}}+\frac{1}{\sqrt{2}}\hat{k}\right) \nonumber\\[3pt] & \leq -\vartheta _{2}{L_{2}}^{\frac{1}{2}} \end{align}

where $\vartheta _{2}=\min \{\sqrt{2}\varrho _{1},\sqrt{2}\varrho _{2}\}$ . Since $\vartheta _{2}\gt 0$ , there must exist a positive constant $T_{2}$ such that:

(50) \begin{equation}T_{2}\leq \vartheta _{2},\forall \varnothing \lt \left| s\!\left(t\right)\right| \leq \frac{\varepsilon }{2}\end{equation}

From (49), we have

(51) \begin{equation}\dot{L_{2}}\leq -T_{2}{L_{2}}^{\frac{1}{2}}\end{equation}

Therefore, the inequality (51) satisfies the finite time stability criterion in Appendix C, which shows that there exists a finite time $\overline{t}$ for the sliding variable to converge to [- $\varnothing, \varnothing$ ] which is bounded by

(52) \begin{equation}t_{s}\leq \frac{{L_{2}}^{\frac{1}{2}}(\overline{t})-{L_{2}}^{\frac{1}{2}}\!\left(\overline{t}+t_{s}\right)}{0.5T_{2}}\end{equation}

According to [Reference Shao, Zheng, Wang, Wang, Lu and Man34Reference Labbadi, Hashim, Eltoukhy and Djemai35], the sliding variable converges to $| s(t)| \leq \varnothing \lt \varepsilon$ in the finite time of $t=\overline{t}+t_{s}$ and will remain in that region ( $-\varepsilon,\varepsilon$ ). Furthermore, according to Appendix D, because of the ITSM (14)-(15), the tracking error $e$ will also be bounded in $\left| e\right| \lt \frac{\varepsilon }{\alpha _{1}}$ .

Here completes the proof.

Remark 5. The convergence of the sliding variable $s$ is divided into two steps: $\overline{t}$ represents the convergence time from the initial state to $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}\gt \varnothing $ and $t_{s}$ represents the consumed time to further converge to $| s(t)| \leq \varnothing \lt \varepsilon$ . After the sliding variable converges, namely $| s| \leq \varnothing$ , the gain is finally bounded by $\hat{k}(s)\leq f_{b}( \varnothing )=| {J_{0}}^{-1}\zeta |$ as implied by (21). It can be seen from (42) that $ \varnothing $ accordingly increases (decreases) as the disturbance increases (decreases), and as a result, the tracking error can be limited as small as possible.

Remark 6. From (31), it can be concluded that when $s$ changes from 0 to $s=s_{2}\gt 0$ , $\hat{\boldsymbol{O}}$ in (30) needs to increase, so that $\zeta$ approaches 0. At the same time, $\hat{k}$ in (46) will also increase because of (20), therefore $\hat{\boldsymbol{O}}$ will increase. As a result, the disturbance compensation for the ELM observer can be more accurate.

Table II. Parameters of controllers.

3.4. Control parameter selection

The unmodelled dynamics, external disturbance and measurement noise in a practical system require that appropriate control parameters should be synthetically selected to balance the control speed, precision and signal smoothness. The parameters of ELM-BFITSM are shown in Table II.

  1. 1) Selections of $\alpha _{1}$ , $\alpha _{2}$ , $q_{1}$ , $q_{2} \,:\, \alpha _{1}$ affect the convergence rate of tracking error: the larger the value of $\alpha _{1}$ is, the faster the error will converge. However, an excessive $\alpha _{1}$ will increase the control amplitude as shown in (17). The integral term of the sliding variable is used to suppress the steady-state error, and the integral coefficient $\alpha _{2}$ is usually a small constant. $q_{1}/q_{2}$ will also affect the convergence rate of the error. Usually, $q_{1}$ and $q_{2}$ are odd integers, and $q_{1}/q_{2}$ is less than 1. In our experiment, we chose $\alpha _{1}$ = 2.4, $\alpha _{2}$ = 1, $q_{1}$ = 3, and $q_{2}$ = 5.

  2. 2) Selections of $\varepsilon$ , $\sigma$ , $\mu$ : The parameter $\varepsilon$ defines the convergence region of the tracking error. A smaller $\varepsilon$ implies a better control precision but too small $\varepsilon$ may cause control input saturation and lead to chattering problem. In the adaptive law (20), appropriate $\mu$ and $\sigma$ should be selected in order to make $s$ reach the region faster with an acceptable chattering. $\varepsilon$ = 0.05, $\mu$ = 1000, and $\sigma$ = 2.5 are selected.

  3. 3) Selections of $L$ , $\rho$ : The number of neural nodes $L$ in the hidden layer affects the estimation accuracy: the more the number of neural nodes in the hidden layer is, the more accurate the tracking result will be. However, considering that the calculation capacity of the STM32 microcontroller adopted in our bicycle is limited, the number of $L$ cannot be selected to be too large. The learning rate $\rho$ affects the estimation rate of uncertainty as shown in (36) and (45). A larger $\rho$ value can get a faster estimation result. However, too large $\rho$ value may also cause severe chattering. Finally, we chose $L$ = 5 and $\rho$ = 0.1.

4. Experimental verification

In this section, to demonstrate the superiority of the proposed controller, experiments compared with ITSM, AITSM and BFITSM controllers are carried out on a real UB system.

4.1. Experiment configurations

The UB system used in the experiment is shown in Fig. 5, in which Fig. 5(a) and 5(b) are its side and top views, respectively. The UB system mainly consists of a body frame, two wheels, and a RW. The RW is installed at the centre of the bicycle body, and its rotational direction is the same as the bicycle’s longitudinal direction. A servo motor is adopted to drive the RW with an equipped STM32 microcontroller. The UB is powered by an 11.1-volt lithium battery mounted at the rear bicycle. In addition, an encoder is used for measuring the speed of the RW, and an MPU-6050 motion sensor module is used to obtain the yaw angle of the bicycle.

Figure 5. Experimental platform of UB. (a) Side view and (b) top view.

The ITSM and AITSM controllers and the BFITSM controller in ref. [Reference Labbadi, Hashim, Eltoukhy and Djemai35] are used for comparison, presented as follows:

(53) \begin{equation}u_{ITSM}=N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +J_{0}\!\left(\alpha _{1}\dot{e}+\alpha _{2}\dot{e_{I}}\right)+k_{1}\text{sign}(s)\end{equation}
(54) \begin{equation}u_{\boldsymbol{AITSM}}=N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +J_{0}\!\left(\alpha _{1}\dot{e}+\alpha _{2}\dot{e_{I}}\right)+k_{2}\!\left| s\right| ^{\sigma }\text{sign}(s)\end{equation}
(55) \begin{equation}u_{\boldsymbol{BFITSM}}=N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +J_{0}\!\left(\alpha _{1}\dot{e}+\alpha _{2}\dot{e_{I}}\right)+J_{0}\hat{k}\text{sign}(s)\end{equation}

where the control parameters of above controller are chosen as the same as the proposed controller as given in Section 3.

4.2. Performance of stationary state (Case 1)

Ideally, we hope that the roll angle of the UB maintains at 0° under control. However, in practice, due to the inertial action of the UB and the dead-zone characteristics of the motor, the roll angle of the UB cannot absolutely be maintained at 0°. Therefore, the control goal is to reduce the roll angle as much as possible. In this case, the UB system is keeping static. The experimental results are shown in Fig. 6.

Figure 6. Control performance (Case 1). (a) Roll angle, (b) control input and (c) ELM.

Fig. 6(a) shows the roll angle of the UB under the action of four controllers. It can be seen that although the UB is kept within an acceptable range, the roll angle under the ITSM controller is the largest, which reaches 0.0084 rad. The proposed ELM-BFITSM controller can restrain the roll angle at the smallest amplitude, and its maximum roll angle is only 0.0023 rad, which is smaller than AITSM control (0.0062 rad) and BFITSM control (0.0049 rad). Fig. 6 (b) shows the control inputs of the four controllers. It can be seen that the chattering range of ELM-BFITSM control is 0.0033 Nm, which is significantly smaller than ITSM control (0.0116 Nm), AITSM control (0.0061 Nm) and BFITSM control (0.0055 Nm). The reason is that the ELM observer and BF-based adaptive algorithm can well compensate for effect of the lumped uncertainty on the closed-loop system.

Fig. 6(c) shows the lumped uncertainty estimation of the UB system. It can be seen that the estimated disturbance varies during the experiment. As shown in Fig. 6(a), the tracking error under ELM-BFITSM is ensured in the predefined bound of ± $\frac{0.05}{2.4}$ = ±0.021 rad, which implies the proposed ELM-BFITSM can exactly predefine the region of the roll angle tracking error.

Figure 7. Control performance (Case 2). (a) Roll angle, (b) control input and (c) ELM estimated value.

4.3. Performance of lateral force rejection (Case 2)

To verify the robustness of the ELM-BFITSM against external disturbances, one-tenth of the motor power is used to generate a lateral force. The lateral force is suddenly applied to the UB when it is at a stationary balanced state. The experimental results are shown in Fig. 7. It can be observed that the ITSM control has the largest maximum roll angle reaching 0.057 rad. In addition, the BFITSM control achieves a smaller maximum roll angle (0.042 rad) than AITSM control (0.047 rad). Compared with the above three controllers, the performance of ELM-BFITSM controller is significantly improved, whose maximum roll angle is 0.034 rad. The control signals are shown in Fig. 7(b), while the disturbance estimation in the proposed control by ELM observer is depicted in Fig. 7(c). The proposed ELM observer does not require any training process and can adaptively adjusted the output weights such that the estimated results can be changed in real time with the disturbance. It can be seen that due to the effective compensation of the disturbance by using ELM observer, not only the smallest value of the control amplitude for the proposed control is achieved but also the best control performance of the proposed control is obtained as clearly shown in Fig. 7(a).

Figure 8. Control performance (Case 3). (a) Roll angle, (b) control input and (c) ELM estimated value.

4.4. Performance of S-curve driving (Case 3)

When the bicycle steers, the parametric uncertainty is activated due to the front wheel’s motion. In order to further verify the strong robustness of the ELM-BFITSM controller, the UB system is required to follow an S-curve path in this case. The tracking performance of the controllers is shown in Fig. 8. It can be seen from Fig. 8(a) that ITSM has the worst control performance with a maximum roll angle reaching 0.11 rad. Due to the introduction of BF, the maximum roll angle of BFITSM control reduces to 0.064 rad, which is smaller than AITSM (0.079 rad). Moreover, the tracking error of ELM-BFITSM is 0.053 rad, which indicates that the ELM-BFITSM achieves the best control performance under the changing disturbance condition. As seen from Fig. 8(b), the control signal of ELM-BFITSM is smoother than other three controllers, which does not have severe chattering problem. The estimate value of the uncertainty for the ELM-BFITSM control is shown in Fig. 8(c).

It can be seen from the above experimental results that, compared with ITSM, AITSM and BFITSM controllers, the proposed ELM-BFITSM control exhibits the best tracking performance. The reasons are given as follows: (i) Compared with the conventional ITSM control, better control performance is achieved due to the introduction of BF algorithm. (ii) Compared with adaptive ITSM and BFITSM control schemes, due to the fact that the ELM observer can compensate for the impacts of lumped uncertainty to effectively improve the control precision, the proposed control obtains the most superior control performance and robustness.

4.5. Performance comparisons

For performance analysis, root mean square error (RMSE) and maximum error (MAXE) values of the sampled tracking error $e(i)$ are taken into consideration for comparisons, which are defined as

(56) \begin{equation}\text{MAXE}\!\left(e\right)=\max \!\left(\left| e(i)\right| \right)\end{equation}
(57) \begin{equation}\text{RMSE}\!\left(e\right)=\sqrt{\sum\nolimits _{i=1}^{G}\frac{e^{2}\left(i\right)}{G}}\end{equation}

where $G$ is the number of the sampled tracking error. Table III and Figs. 9(a) and 9(b) show the performance comparisons of four controllers in three cases. It can be seen that the proposed controller achieves the smallest RMSE and MAXE values, while the ITSM controller behaves with the worst performance, followed by the AITSM and BFITSM controllers. It can be concluded that compared with the ITSM and adaptive ITSM controllers, the proposed ELM-BFITSM controller can achieve excellent control performance of the UB system in practical applications.

Figure 9. Control performance improvement of ELM-BFITSM.

Table III. Control performance comparisons.

5. Conclusion

In this paper, an ELM-BFITSM controller has been developed for the balancing control of an UB system. It has been shown that the proposed control strategy comprises a BF-based adaptive ITSM for eliminating the reaching phase and an ELM observer for further compensating the lumped uncertainty. As a result, the disturbance can be estimated and be fed forward by the inherent ELM observer. Moreover, due to the BF-based adaptive law, the control gain is automatically updated according the disturbance variations, and the overestimation is effectively suppressed. In addition, the proposed controller ensures that the roll angle of the UB converges to an exactly predefined region in finite time. The experimental results have verified the excellent control performance of the proposed controller compared to the conventional ITSM, conventional adaptive ITSM and BF-based ITSM controllers. Compared to ITSM, AITSM and BFITSM controllers, the RMSE of ELM-BFITSM control is improved by 72.6%, 57.4%, 52.2% in Case1; 31.4%, 20.1%, 7.5% in Case2 and 55.8%, 52.9%, 35% in Case3, respectively. In addition, the MAXE of the ELM-BFITSM control is improved by 72.6%, 62.9%, 53.1% in Case1; 40.4%, 27.7%, 19% in Case2 and 51.8%, 32.9%, 17.2% in Case3, respectively.

In practical applications, considering that some failures may occur such as actuator and sensor failures, our future research work will focus on sliding mode-based fault tolerant control of UB system.

Author contributions

Long Chen and Zhihui Jin conceived and designed the study. Guangyi Wang and Shuping He designed the hardware. Vladimir Stojanovic and Parisa Arabzadeh Bahri data gathering, simulation, and analyses. Zhihui Jin wrote this article. Ke Shao and Hai Wang provided guidance and supervision.

Financial support

This paper is partially supported by National Natural Science Foundation of China [Grant No. 61771178].

Competing interests

The authors declare no competing interest exist.

Ethical approval

None.

Appendix A

In this paper, the control law is designed as

(58) \begin{equation}u=u_{0}+u_{1}+u_{2}\end{equation}

Then, we have

(59) \begin{align} \left| u\right| & \lt \left| u_{0}\right| +\left| u_{1}\right| +\left| u_{2}\right| \nonumber\\ & \lt \left| N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +J_{0}\!\left(\alpha _{1}\dot{e}+\alpha _{2}\dot{e}_{I}\right)\right| +\left| \hat{l}\right| +\left| J_{0}\hat{k}sign(s)\right| \nonumber\\ & \lt \left| N_{0}v\dot{\delta }+P_{0}v^{2}\delta +M_{0}\theta +J_{0}\!\left(\alpha _{1}\dot{\theta }+\alpha _{2}e^{\frac{q1}{q2}}\right)\right| +\left| \hat{l}\right| +\left| J_{0}\hat{k}\right| \nonumber\\ & \lt \left| N_{0}v\dot{\delta }+P_{0}v^{2}\delta \right| +\left| \hat{l}\right| +\left| J_{0}\hat{k}\right| +\left| M_{0}\theta +J_{0}\alpha _{2}\theta ^{q1/q2}\right| +\left| J_{0}\alpha _{1}\dot{\theta }\right|\nonumber\\& \lt \left| N_{0}v\dot{\delta }+P_{0}v^{2}\delta \right| +\left| \hat{l}\right| +\left| J_{0}\hat{k}\right| +{max } (M_{0},J_{0}\alpha _{2})\left| \theta \right| +J_{0}\alpha _{1}\!\left| \dot{\theta }\right|\end{align}

where $v$ is the forward speed of the UB, $\delta$ is the steering angle of the front wheel relative to the positive direction of UB, which are set as the small value and change continuously in the experiment. $\hat{l}$ is the estimated result of ELM observer, which is bounded. $\hat{k}$ is the proposed adaptive law. If $\hat{k}=\mu \int _{0}^{t}\exp\! (\sigma | s| )| s| d\tau$ , the $\hat{k}$ is bounded because of the finite-time accessibility. If $\hat{k}=f_{b}(s)$ , the $\hat{k}$ is bounded because the $\hat{k}$ will increase quickly to pull $| s|$ back when $| s|$ gradually increases, so that the $\hat{k}$ is a large but bounded value.

Therefore, when $\zeta _{0}\gt | N_{0}v\dot{\delta }+P_{0}v^{2}\delta | +| \hat{l}| +| J_{0}\hat{k}|$ , $\zeta _{1}={max } (M_{0},J_{0}\alpha _{2})$ and $\zeta _{2}=J_{0}\alpha _{1}$ , the following inequality holds:

(60) \begin{equation}\left| u\right| \lt \zeta _{0}+\zeta _{1}\left| \theta \right| +\zeta _{2}|\dot{\theta |}\end{equation}

Appendix B

Based on (7), the $\ddot{\theta }$ is expressed as

(61) \begin{equation}\ddot{\theta }=\frac{N_{0}v\dot{\delta }}{J_{0}}+\frac{P_{0}v^{2}\delta }{J_{0}}+\frac{M_{0}\theta }{J_{0}}+\frac{1}{J_{0}}\left(l-u\right)\end{equation}

The lumped uncertainty $l$ is given by

(62) \begin{equation}l=d+\Delta Nv\dot{\delta }+\Delta Pv^{2}\delta +\Delta M\theta -\Delta J\ddot{\theta }\end{equation}

Then, we obtain:

(63) \begin{equation}l=d+\Delta Nv\dot{\delta }+\Delta Pv^{2}\delta +\Delta M\theta -\Delta J\!\left(\frac{N_{0}v\dot{\delta }}{J_{0}}+\frac{P_{0}v^{2}\delta }{J_{0}}+\frac{M_{0}\theta }{J_{0}}+\frac{1}{J_{0}}\left(l-u\right)\right)\end{equation}

Rearranging the terms, we get:

(64) \begin{align}l & =\frac{\Delta J}{J_{0}\left(1+\frac{\Delta J}{J_{0}}\right)}u+\frac{d+\Delta Nv\dot{\delta }+\Delta Pv^{2}\delta }{\left(1+\frac{\Delta J}{J_{0}}\right)}+\frac{\Delta M\theta }{\left(1+\frac{\Delta J}{J_{0}}\right)}-\frac{\Delta JM_{0}\theta }{J_{0}\left(1+\frac{\Delta J}{J_{0}}\right)}-\frac{\Delta JN_{0}v\dot{\delta }+\Delta JP_{0}v^{2}\delta }{J_{0}\left(1+\frac{\Delta J}{J_{0}}\right)}\nonumber\\[4pt]& \left. =\frac{\Delta J}{J_{0}+\Delta J}u+\frac{\left(J_{0}-\Delta J\right)\Delta M\theta }{J_{0}+\Delta J}+\frac{J_{0}(d+\Delta Nv\dot{\delta }+\Delta Pv^{2}\delta )}{J_{0}+\Delta J}-\frac{\Delta JN_{0}v\dot{\delta }+\Delta JP_{0}v^{2}\delta }{J_{0}+\Delta J}\right)\end{align}

Therefore, we have

(65) \begin{equation}\left| l\right| \leq \left| \frac{\Delta J}{J_{0}+\Delta J}\right| \left| u\right| +\left| \frac{\left(J_{0}-\Delta J\right)\Delta M}{J_{0}+\Delta J}\right| \left| \theta \right| +\left| \frac{J_{0}\!\left(\left| d+\Delta Nv\dot{\delta }+\Delta Pv^{2}\delta \right| \right)}{J_{0}+\Delta J}\right| +\left| \frac{\Delta J(N_{0}v\dot{\delta }+P_{0}v^{2}\delta )}{J_{0}+\Delta J}\right|\end{equation}

In this work, the disturbance $d$ and the uncertain items $\Delta N$ , $\Delta P$ , $\Delta M$ and $\Delta J$ are considered to be bounded. In addition, the steering angle $\delta$ is continuously changed such that its derivative is bounded as well [Reference Chen, Yan, Wang, Shao, Kurniawan and Wang29]. Therefore, the following inequalities for the bounded property are given as

(66) \begin{equation}\left| d\right| \lt D, \left| \Delta Nv\dot{\delta }\right| \lt \partial _{1}, \left| \Delta Pv^{2}\delta \right| \lt \partial _{2}\end{equation}
(67) \begin{equation}D+\partial _{1}+\partial _{2}\lt \partial _{3}\end{equation}
(68) \begin{equation}\left| \frac{J_{0}\!\left(\left| d\right| +\left| \Delta Nv\dot{\delta }\right| +\left| \Delta Pv^{2}\delta \right| \right)}{J_{0}+\Delta J}\right| +\left| \frac{\Delta J(N_{0}v\dot{\delta }+P_{0}v^{2}\delta )}{J_{0}+\Delta J}\right| \lt \left| \frac{J_{0}\partial _{3}}{J_{0}+\Delta J}\right| +\left| \frac{\Delta J(N_{0}v\dot{\delta }+P_{0}v^{2}\delta )}{J_{0}+\Delta J}\right| \lt \beta _{0}\end{equation}
(69) \begin{equation}\left| \frac{\Delta J}{J_{0}+\Delta J}\right| \lt \beta _{1}\end{equation}
(70) \begin{equation}\left| \frac{\left(J_{0}-\Delta J\right)\Delta M}{J_{0}+\Delta J}\right| \lt \beta _{2}\end{equation}

Then, we yield:

\begin{equation*} \left| l\right| \leq \beta _{0}+\beta _{1} \left| u\right| +\beta _{2}\left| \theta \right| \end{equation*}

As can be seen that the upper bound of the lumped uncertainty $l$ is associated with the structure of the designed controller. From (9), we have

(71) \begin{equation}\left| l\right| \leq \beta _{0}+\beta _{1}\left(\zeta _{0}+\zeta _{1}\left| \theta \right| +\zeta _{2}|\dot{\theta |}\right)+\beta _{2}\left| \theta \right| =\beta _{0}+{\beta _{1}}\zeta _{0}+\beta _{1}\zeta _{1}\left| \theta \right| +\beta _{2}\left| \theta \right| +\beta _{1}\zeta _{2}|\dot{\theta |}\end{equation}

Thus, we obtain the expression:

(72) \begin{equation}\left| l\right| \lt \overline{l}=\mu _{0}+\mu _{1}\left| \theta \right| +\mu _{2}|\dot{\theta |}\end{equation}

where:

(73) \begin{equation}\mu _{0}=\beta _{0}+{\beta _{1}}\zeta _{0}\end{equation}
(74) \begin{equation}\mu _{1}=\beta _{1}\zeta _{1}+\beta _{2}\end{equation}
(75) \begin{equation}\mu _{2}=\beta _{1}\zeta _{2}\end{equation}

This completes the proof.

Appendix C

Suppose a first-order non-linear differential inequality given by

(76) \begin{equation}\dot{V}+\kappa V^{\eta }\leq 0\end{equation}

where $\kappa \gt 0$ , $0\lt \eta \lt 1$ . $V(x)$ is a positive Lyapunov function with respect to the state $x\in R$ . Then, the function $V(x)$ will converge from any given initial condition $V(x(0))=V(0)$ to the origin in the finite time given by

(77) \begin{equation}t_{r}\leq \frac{\textrm{V}^{1-\eta }(0)}{\kappa (1-\eta )}\end{equation}

The details can refer to ref. [Reference Moulay and Perruquetti40] and references therein.

Appendix D

The sliding function (14)-(15) can be rewritten as

(78) \begin{equation}\dot{e}+\left(\alpha _{1}-\frac{s}{e}\right)e+\alpha _{2}\int _{0}^{t}e^{q_{1}/q_{2}}=0\end{equation}

Since $| s| \leq \varepsilon$ , if $\left| e\right| \gt \frac{\varepsilon }{\alpha _{1}}$ , we have $\alpha _{1}-\frac{\varepsilon }{e}\,:\!=\, \alpha _{1}^{\prime}\gt 0$ . It is clear that $e$ is decreasing to zero in finite time according to (78) [Reference Sariyildiz, Mutlu and Yu18]. In other words, $e$ will be bounded by

(79) \begin{equation}\left| e\right| \leq \frac{\varepsilon }{\alpha _{1}}\end{equation}

in finite time.

References

Shafiekhani, A., Mahjoob, M. J. and Akraminia, M., “Design and implementation of an adaptive critic-based neuro-fuzzy controller on an unmanned bicycle,” Mechatronics 28, 115123 (2015).CrossRefGoogle Scholar
Keo, L., Pornsarayouth, S., Yamakita, M. and Ito, K., “Stabilization of an unmanned bicycle with flywheel balancer,” IFAC Proceed Vol 43(14), 475480 (2010).CrossRefGoogle Scholar
Ahn, B., Kim, H., Lee, S. and Ham, W., “Control of autonomous motion of unmanned electric bicycle,” IFAC Proceed Vol 2, 871876 (2003).Google Scholar
Park, I., Moon, J. and Ham, W., “Control algorithm for stabilization of tilt angle of unmanned electric bicycle,” IFAC Proceed Vol 34(17), 165170 (2001).CrossRefGoogle Scholar
Nenner, U., Linker, R. and Gutman, P.-O., “Robust feedback stabilization of an unmanned motorcycle,” Control Eng Pract 18(8), 970978 (2010).CrossRefGoogle Scholar
Chen, C.-K. and Dao, T.-S., “Fuzzy control for equilibrium and roll-angle tracking of an unmanned bicycle,” Multibody Syst Dyn 15(4), 321346 (2006).CrossRefGoogle Scholar
Astrom, K. J., Klein, R. E. and Lennartsson, A., “Bicycle dynamics and control: Adapted bicycles for education and research,” IEEE Control Syst Mag 25(4), 2647 (2005).Google Scholar
Defoort, M. and Murakami, T., “Sliding-mode control scheme for an intelligent bicycle,” IEEE Trans Ind Electron 56(9), 33573368 (2009).CrossRefGoogle Scholar
Xiong, J., Wang, N. and Liu, C., “Stability analysis for the Whipple bicycle dynamics,” Multibody Syst Dyn 48, 311335 (2020).CrossRefGoogle Scholar
Tofigh, M. A., Mahjoob, M. J., Hanachi, M. R. and Ayati, M., “Fractional sliding mode control for an autonomous two-wheeled vehicle equipped with an innovative gyroscopic actuator,” Robot Auton Syst 140, 103756 (2021).CrossRefGoogle Scholar
Grau, S., Kapitola, S., Weiss, S. and Noack, D., “Control of an over-actuated spacecraft using a combination of a fluid actuator and reaction wheels,” Acta Astronaut 178, 870880 (2021).CrossRefGoogle Scholar
Buizza Avanzini, G., Zanchettin, A. M. and Rocco, P., “Constrained model predictive control for mobile robotic manipulators,” Robotica 36(1), 1938 (2018).CrossRefGoogle Scholar
Nfaileh, N., Alipour, K., Tarvirdizadeh, B. and Hadi, A., “Formation control of multiple wheeled mobile robots based on model predictive control,” Robotica 40(9), 31783213 (2022).CrossRefGoogle Scholar
Dimeas, F., Sako, D. V., Moulianitis, V. C. and Aspragathos, N. A., “Design and fuzzy control of a robotic gripper for efficient strawberry harvesting,” Robotica 33(5), 10851098 (2015).CrossRefGoogle Scholar
Li, F., Zhang, Z., Wu, Y., Chen, Y., Liu, K. and Yao, J., “Improved fuzzy sliding mode control in flexible manipulator actuated by PMAs,” Robotica 40(8), 26832696 (2022).CrossRefGoogle Scholar
Chocoteco, J. A., Morales, R., Feliu, V. and Sira-Ramírez, H., “Robust output feedback control for the trajectory tracking of robotic wheelchairs,” Robotica 33(1), 4159 (2015).CrossRefGoogle Scholar
Pierri, F., Muscio, G. and Caccavale, F., “An adaptive hierarchical control for aerial manipulators,” Robotica 36(10), 15271550 (2018).CrossRefGoogle Scholar
Sariyildiz, E., Mutlu, R. and Yu, H., “A sliding mode force and position controller synthesis for series elastic actuators,” Robotica 38(1), 1528 (2020).CrossRefGoogle Scholar
Islam, M. R., Rahmani, M. and Rahman, M. H., “A novel exoskeleton with fractional sliding mode control for upper limb rehabilitation,” Robotica 38(11), 20992120 (2020).CrossRefGoogle Scholar
Villanueva, A., Luque-Vega, L. F., González-Jiménez, L. E. and Arellano-Muro, C. A., “Robust multimode flight framework based on sliding mode control for a rotary UAV,” Robotica 39(4), 699717 (2021).CrossRefGoogle Scholar
Liu, S., Liu, G. and Wu, S., “Adaptive sliding tracking control for nonlinear uncertain robotic systems with unknown actuator nonlinearities,” Robotica 40(8), 25272546 (2022).CrossRefGoogle Scholar
Qiao, L. and Zhang, W., “Adaptive non-singular integral terminal sliding mode tracking control for autonomous underwater vehicles,” IET Control Theory Appl 11(8), 12931306 (2017).CrossRefGoogle Scholar
Qiao, L. and Zhang, W., “Trajectory tracking control of AUVs via adaptive fast nonsingular integral terminal sliding mode control,” IEEE Trans Industr Inform 16(2), 12481258 (2020).CrossRefGoogle Scholar
Lin, C.-K., “Nonsingular terminal sliding mode control of robot manipulators using fuzzy wavelet networks,” IEEE Trans Fuzzy Syst 14(6), 849859 (2006).CrossRefGoogle Scholar
Ma, H., Liu, W., Xiong, Z., Li, Y., Liu, Z. and Sun, Y., “Predefined-time barrier function adaptive sliding-mode control and its application to piezoelectric actuators,” IEEE Trans Industr Inform 18(12), 86828691 (2022).CrossRefGoogle Scholar
Chiu, C. H. and Wu, C. Y., “Bicycle robot balance control based on a robust intelligent controller,” IEEE Access 8, 8483784849 (2020).CrossRefGoogle Scholar
Hwang, C-L., Wu, H-M. and Shih, C-L., “Fuzzy sliding-mode underactuated control for autonomous dynamic balance of an electrical bicycle,” IEEE Trans Control Syst Technol 17(3), 658670 (2009).CrossRefGoogle Scholar
Yao, Z., Xu, F., Jiang, G.-P. and Yao, J., “Data-driven control of hydraulic manipulators by reinforcement learning,” IEEE/ASME Trans Mechatro (2023).Google Scholar
Chen, L., Yan, B., Wang, H., Shao, K., Kurniawan, E. and Wang, G., “Extreme-learning-machine-based robust integral terminal sliding mode control of bicycle robot,” Control Eng Pract 121, 105064 (2022).CrossRefGoogle Scholar
Hu, Y. H. and Wang, H., “Robust tracking control for vehicle electronic throttle using adaptive dynamic sliding mode and extended state observer,” Mech Syst Signal Process 135, 106375 (2020).CrossRefGoogle Scholar
Chen, L., Liu, J., Wang, H., Hu, Y., Zheng, X., Ye, M. and Zhang, J., “Robust control of reaction wheel bicycle robot via adaptive integral terminal sliding mode,” Nonlinear Dyn 104(3), 22912302 (2021).CrossRefGoogle Scholar
Ye, M. and Wang, H., “Robust adaptive integral terminal sliding mode control for steer-by-wire systems based on extreme learning machine,” Comput Electr Eng 86, 106756 (2020).CrossRefGoogle Scholar
Feng, Y., Yu, X. and Man, Z., “Non-singular terminal sliding mode control of rigid manipulators,” Automatica 38(12), 21592167 (2002).CrossRefGoogle Scholar
Shao, K., Zheng, J., Wang, H., Wang, X., Lu, R. and Man, Z. H., “Tracking control of a linear motor positioner based on barrier function adaptive sliding mode,” IEEE Trans Industr Inform 17(11), 74797488 (2021).CrossRefGoogle Scholar
Labbadi, M., Hashim, H. A., Eltoukhy, A. E. E. and Djemai, M., “Barrier Function-Based Adaptive Nonsingular Fast Terminal Sliding Mode Control for Disturbed UAVs,” In: 2022 European Control Conference (ECC). IEEE, (2022) pp. 975980.Google Scholar
Huang, G.-B., Zhu, Q.-Y. and Siew, C.-K., “Extreme learning machine: Theory and applications,” Neurocomputing 70(1-3), 489501 (2006).CrossRefGoogle Scholar
Huang, F., Lu, J., Tao, J., Li, L., Tan, X. and Liu, P., “Research on optimization methods of elm classification algorithm for hyperspectral remote sensing images,” IEEE Access 7, 108070108089 (2019).CrossRefGoogle Scholar
Yan, K., Ji, Z., Lu, H., Huang, J., Shen, W. and Xue, Y., “Fast and accurate classification of time series data using extended ELM: Application in fault diagnosis of air handling units,” IEEE Trans Syst Man Cybern: Syst 49(7), 13491356 (2019).CrossRefGoogle Scholar
Shao, K., Zheng, J., Huang, K., Wang, H., Man, Z. and Fu, M., “Finite-time control of a linear motor positioner using adaptive recursive terminal sliding mode,” IEEE Trans Ind Electron 67(8), 66596668 (2020).CrossRefGoogle Scholar
Moulay, E. and Perruquetti, W., “Finite time stability and stabilization of a class of continuous systems,” J Math Anal Appl 323(2), 14301443 (2006).CrossRefGoogle Scholar
Figure 0

Figure 1. Bicycle structure. (a) Top view, and (b) Rear view.

Figure 1

Table I. Notations description.

Figure 2

Figure 2. Schematic diagram of the proposed ELM-BFITSM controller for UB.

Figure 3

Figure 3. The scheme of barrier function.

Figure 4

Figure 4. ELM observer architecture.

Figure 5

Table II. Parameters of controllers.

Figure 6

Figure 5. Experimental platform of UB. (a) Side view and (b) top view.

Figure 7

Figure 6. Control performance (Case 1). (a) Roll angle, (b) control input and (c) ELM.

Figure 8

Figure 7. Control performance (Case 2). (a) Roll angle, (b) control input and (c) ELM estimated value.

Figure 9

Figure 8. Control performance (Case 3). (a) Roll angle, (b) control input and (c) ELM estimated value.

Figure 10

Figure 9. Control performance improvement of ELM-BFITSM.

Figure 11

Table III. Control performance comparisons.