1. Introduction
In recent years, as rigid robots have been widely used in various fields [Reference Li, Du, Ma, Dong, Wang, Gao and Chen1–Reference Huang, He, Wang, Zhang and Fu4], flexible manipulators have also received extensive attention for its lighter weight, higher load capacity, and lower energy consumption [Reference He, Tang, Wang and Liu5–Reference Chang, Li and Tong7]. At present, control methods of flexible manipulators mainly include proportional-integral-derivative control [Reference Pradhan and Subudhi8, Reference Fadilah, Abdul, Zaharuddin and Ariffanan9], fuzzy control [Reference Sun, Su, Xia and Nguyen10–Reference Kong, He, Yang, Li and Sun12], adaptive control [Reference Li, Li, Li, Su, Kan and He13–Reference Kong, He, Liu, Yu and Silvestre15], sliding mode control [Reference Huang and Chen16, Reference Salgado and Chairez17], and compound control [Reference Liang, Song, Sun and Jin18]. Although effective, due to uncertain factors such as parameter perturbation and external environmental interference [Reference Feng, Shi, Cheng, Huang and Liu19], the ordinary differential equation model established for the robotic arm is an uncertainty model [Reference Feng, Yang and Cheng20]. Neural networks (NNs) are an effective way to deal with uncertainty of flexible manipulator for its excellent learning ability [Reference Kong, He, Chen, Zhang and Wang21–Reference Yang, Kiumarsi, Modares and Xu24]. However, in practical applications, the selected NNs nodes cannot cover all the input information, which affects the ability to approximate the unknown model. In order to further improve the learning ability of the NNs and reduce the complexity of the network structure, Chen and Liu [Reference Chen and Liu25] proposed the broad learning system (BLS) and proved its ability to approximate nonlinear functions [Reference Chen, Liu and Feng26]. Different from the deep learning system, the BLS includes input layer, enhancement layer, and output layer and retrains the model by expanding the width of the enhancement layer. Due to the simple structure and strong learning ability, BLS is widely used in pattern recognition, data classification, and model approximation. Combining radial basis function neural networks (RBFNNs) and BLS, Huang et al. [Reference Huang, Zhang, Yang and Chen27] proposed a novel NNs control for the rigid manipulator system, which realized the approximation of the system model. Peng et al. [Reference Peng, Chen, He and Yang28] proposed an improved broad neural networks (BNNs) control to interact with unknown environment for robotic arm with input dead-zone. Although effective, there is barely an adaptive NNs control strategy concerning BLS for the more complex flexible manipulator.
Actually, actuator faults often occur because of aging and wear, which will cause the deterioration of the system performance [Reference Ghaf-Ghanbari, Mazare and Taghizadeh29, Reference Liu, Han, Zhao and He30]. In general, actuator faults can be classified as: (1) effectiveness loss; (2) locked-in-place; (3) floating around trim, and (4) hard-cover. Fault-tolerant control (FTC), which is divided into passive fault-tolerant control (PFTC) and active fault-tolerant control (AFTC), is a suitable strategy against the aforementioned actuator faults [Reference Smaeilzadeh and Golestani31]. The AFTC approach compensates for the system component malfunctions based on the fault detection and diagnosis technologies [Reference Van, Ge and Ren32, Reference Shen, Yue, Goh and Wang33]. The PFTC strategy is simpler from a practical point of view because it deals with the faults by the robust or adaptive controller. Besides, PFTC method also can avoid the time delay caused by the fault detection and diagnosis in AFTC. Considering the aforementioned factors, a passive fault-tolerant (FT) controller will be designed in this paper for a two-link flexible manipulator with actuator faults.
In order to ensure the transient performance and steady-state performance of the flexible manipulator system, the influence of system output constraints on the system is usually considered in the process of controller design. At present, the commonly used output constraint methods include barrier Lyapunov function (BLF) method [Reference Kong, He, Yang, Li and Kaynak34–Reference Li, Zhao, He and Lu36], funnel control method [Reference Ilchmann, Ryan and Trenn37, Reference Wang, Ren, Jing and Zeng38], and prescribed performance control (PPC) method [Reference Bechlioulis and Rovithakis39–Reference Sun, Wu and Sun41]. In ref. [Reference Bechlioulis and Rovithakis39], PPC is proposed for the first time, and the main idea is to convert the error variable constrained by the inequality into an unconstrained variable and ensure that the system tracking error converges to a prescribed small area while ensuring that the convergence speed and overshoot meet the prescribed conditions. Subsequently, many scholars adopt this method to the robotic manipulator system such as rigid robotic manipulator [Reference Guo, Li, Cai, Feng and Shi42–Reference Guo, Zhang, Celler and Su44] and single-link flexible manipulator [Reference Chen, Wang and Zou45]. However, the convergence speed of the system depends on the decay function $\rho =(\rho _{0}-\rho _{\infty })e^{-l_{0}t}+\rho _{\infty }$ and the preset convergence time is not considered. The convergence time of the tracking error will decrease as $l_{0}$ increases, but at the expense of overshoot. And the convergence time of transient error cannot be determined. Although the prescribed time control method [Reference Zhou and Zhang46, Reference Espitia and Perruquetti47] proposed in recent years can solve the problem of transient error convergence time, it will ignore overshoot, steady-state error, and other factors. Therefore, it is necessary to design a unified predetermined performance control strategy considering overshoot, transient convergence time, transient error, and steady-state error. In this paper, we focus on improving the transient performance of a two-link flexible robotic manipulator. Different from the structure of a single-link flexible manipulator, the two-link flexible manipulator has a more complex nonlinear model and stronger mutual link coupling effect [Reference Zhang, Liu, Peng and He48]. Moreover, in practical applications, the elastic deformation and high-frequency vibration of the flexible connecting rod will also increase the complexity of real-time control. Therefore, it is difficult but valuable to improve the rapidity and steady-state accuracy of the two-link flexible manipulator system under the premise of ensuring the overall trajectory tracking and elastic vibration suppression.
Based on the above discussion, aiming at the fast tracking and vibration suppression problem of a two-link flexible robotic manipulator with uncertainties and actuator faults, this paper presents an adaptive NNs FTC strategy considering specified performance. The difficulty is how to force the tracking errors of two joints converge to the preset small area within the specified time with a small overshoot. Inspired by neural adaptive PPC proposed in ref. [Reference Huang, Song and Lai49], an asymmetric scaling function and a behavior-shaping function are introduced to constraint overshoot and steady-state errors, respectively. The unknown model dynamics of flexible robotic manipulator is approximated by the improved RBFNNs. Besides, the multiplicative factor of the actuator faults is estimated by an adaptive estimation method, and the additive part is compensated by the upper bound of the faults. Compared with existing flexible manipulator control strategies, the main contributions of this paper are summarized as follows:
-
1. Different from most of performance constraint for flexible manipulators [Reference Chen, Wang and Zou45, Reference Ahanda, Mbede, Melingui and Zobo50], this paper starts from the transient performance index, and the purpose of controlling overshoot, transient error, and steady-state error can be achieved by introducing two auxiliary functions. Then, a logarithmic BLF is utilized to constrain the transformed state variable.
-
2. Compared with the traditional PPC shown in refs. [Reference Bechlioulis and Rovithakis39, Reference Guo, Li, Cai, Feng and Shi42], the performance function applied in this paper cannot only make the tracking errors of two joints converge to a small residual set within the given time and finally stabilize in a smaller range but also ensure a small overshoot.
-
3. Different from the traditional RBFNNs control, this paper presents a new node selection rule combined with broad learning and incremental learning. The centers of the Gaussian function are self-adjusting, and the number of neural nodes is limited to reduce the complexity of the NNs.
The rest of this paper is organized as follows. The dynamic description and the problem formulation are given in Section 2. Adaptive NN FTC with prescribed performance (PP) based on broad learning is presented in Section 3. In Section 4, simulations are conducted to illustrate the effectiveness of the proposed control. Finally, the conclusions are given in Section 5.
2. Preliminaries and problem formulation
2.1. Dynamic description
A typical structure of two-link flexible robotic manipulator is shown in Fig. 1. On the premise of small deflection, the dynamic model of manipulator system obtained via assumed mode method and Lagrange’s equation can be written as ref. [Reference Gao, He, Zhou and Sun51]
where $q=[\theta,\phi ]^{T}$ , $\theta =[\theta _{1},\theta _{2}]^{T}$ is the vector of joint angular position, $\phi =[\phi _{11},\ldots,\phi _{1n_{1}},\phi _{21},\ldots,\phi _{2n_{2}}]^{T}$ represents the flexible generalized coordinate vector, and $n=n_{1}+n_{2}$ is the total number of flexible variables. $M(q)\in R^{(n+2)\times (n+2)}$ is the positive inertia matrix, $B(q,\dot{q})\in R^{(n+2)\times (n+2)}$ denotes the centripetal and Coriolis forces, and $K(q)\in R^{(n+2)}$ is the vector of gravity and elasticity forces. $u=[u_{1},u_{2},0,\cdots,0]^{T}\in R^{(n+2)}$ is the input torque vector.
The actuator fault is mathematically modeled as
where $\tau (t)\in R^{n+2}$ denotes the desired control torque, $D=\textrm{diag}\{l_{1},\cdots,l_{n+2}\}\in R^{(n+2)\times (n+2)}$ represents the actuator effectiveness matrix with $0\lt \eta _{i}\leq l_{i}\leq 1,i=1,2,\cdots,n+2$ , and $\bar{u}=[\bar{u}_{1},\cdots,\bar{u}_{n+2}]^{T}\in R^{n+2}$ is the additive actuator fault vector.
Remark. As shown in ( 2 ), the multiplicative factor $l_{i}$ represents the uncertain gain of the desired actuator while the additive factor $\bar{u}_{i}$ represents the disturbance or noise imposed on the actuator. Further, it indicates that the $i$ th actuator is complete failure when $l_{i}=0$ and $\bar{u}_{i}=0$ , and healthy when $l_{i}=1$ and $\bar{u}_{i}=0$ .
Assumption 1. [Reference Chen, Song and Lewis52] Assuming that the additive fault introduced in ( 2 ) satisfies $|\bar{u}_{i}|\leq u_{ci}$ with $u_{ci}$ being a positive constant.
Property 1. [Reference Gao, He, Zhou and Sun51] The matrix $M(q)$ is symmetric and positive definite.
Property 2. [Reference Gao, He, Zhou and Sun51] The matrix $(\dot{M}(q)-2B(q,\dot{q}))$ is skew symmetric.
2.2. Prescribed tracking performance
Let $x_{1}=q,x_{2}=\dot{q}$ , the dynamic model (1) can be rewritten as
Define the desired trajectory as $x_{d}=[x_{d1},\cdots,x_{d(n+2)}]^{T}$ , which is a continuously bounded and known variable. The tracking error vector can be expressed as $e_{1}=x_{1}-x_{d}=[e_{11},\cdots,e_{1(n+2)}]^{T}$ .
The given performance specifications of tracking error vector $e_{1}(t)$ are set as
where $T\gt t_{0}$ , $\varepsilon \gt 0$ , $\mu \gt 0$ , $\delta _{2}\gt |e_{1i}(t_{0})|\gt \delta _{1}\gt 0$ are the specific parameters to be set, $t_{0}\geq 0$ is the initial time. The above descriptions constrain the transient and steady-state performance of the tracking error, respectively. More specifically, $e_{1i}(t)$ converges to a small region $(\!-\varepsilon,\varepsilon )$ within finite time $T$ and finally shrinks into a smaller range $(\!-\mu,\mu )$ . By (5c), we know that $-\delta _{1}$ and $\delta _{2}$ are the upper and lower bounds of the error $e_{1i}(t)$ when $e_{1i}(t_{0})\geq 0$ , for all $t\geq t_{0}$ . The same analysis applies when $e_{1i}(t_{0})\lt 0$ in (5d). Take $e_{1i}(t_{0})\geq 0$ as an example, the PP curve is shown as Fig. 2.
2.3. Error transformation
It is difficult to ensure that the transient and steady-state performance meeting the above requirements if the error is directly controlled. Thus, we introduce two error transfer functions to control overshoot and steady-state error, respectively. For the overshoot, we define the function as
where $g$ is a small positive constant, $m$ is an odd integer with $m\geq \textrm{max}\{3,n+2\}$ , $\gamma =\gamma _{1}\gt 1$ for all $e_{1i}\geq 0$ and $\gamma =\gamma _{2}\gt 1$ for all $e_{1i}\lt 0$ . $\dot{f}(e_{1i})=gm\gamma e_{1i}^{m-1}/(e_{1i}^{2m}+g)^{\frac{3}{2}}\geq 0$ shows that $f(e_{1i})$ is monotonously increasing with respect to $e_{1i}$ . Since $f(e_{1i})$ is monotonous and
we can obtain $f(e_{1i})\in (\!-\gamma _{2},\gamma _{1})$ .
In terms of steady-state error, a behavior-shaping function is imported as
where $\kappa (t)\geq \kappa (t_{0})=1$ is a rate function with $\dot{\kappa }(t)\gt 0$ and $\ddot{\kappa }(t)\gt 0$ for all $t\geq t_{0}$ ; $k_{bi}\gt 1$ , $0\lt \xi \lt 1,0\lt \varepsilon \lt 1$ are parameters to be designed. It is different from the traditional PP function. $\xi$ and $\varepsilon$ represent the transient accuracy and steady-state accuracy of the error, respectively, and $\kappa (t)$ specifies the time of error convergence. More properties of function $Q(t)$ are described in Lemma 2.3.
2.4. Broad learning system
As shown in Fig. 3, different from the general NNs structure, the hidden layer of the BLS includes feature nodes and enhancement nodes. For the presented input vector $X$ , the feature mapping of $X$ can be extracted by following $r_{1}$ feature mappings
where $\phi _{xi}$ is the linear or nonlinear activation function, $W_{xi}$ and $\epsilon _{xi}$ are the weight and bias generated randomly and then remained constant during training. Define the feature vector as $F^{r_{1}}=[F_{x1},\cdots,F_{xr_{1}}]$ , and the $r_{2}$ enhancement nodes are obtained by following feature mappings
where $\phi _{fj}$ is the nonlinear activation function, $W_{fi}$ and $\epsilon _{fi}$ are the weight and bias generated randomly. Define the enhanced node vector as $H^{r_2}=[H_{f1},\cdots,H_{fr_{2}}]$ and define $A=[F^{r_{1}}|H^{r_{2}}]$ , then the output of BLS can be expressed as $Y=AW^{(r_{1}+r_{2})}$ with $W^{(r_{1}+r_{2})}$ is the weight from the hidden layer to the output layer.
2.5. RBFNNs with broad learning theory
In this paper, we use RBFNNs to approximate the uncertain dynamic system described as (1). Mathematically, RBFNNs function is expressed as
where $F(Z)\in R^{n+2}$ is the output of NNs, $Z\in R^{m_{z}}$ is the NN input vector, $W\in R^{l\times (n+2)}$ is the NNs weights matrix with $l$ being the number of NNs nodes, $\epsilon (Z)\in R^{n+2}$ is the approximation error vector, $R(Z)\in R^{l}$ is the basis function vector with $R_{i}(Z)$ is the Gaussian function, and $\varphi =[\varphi _{1},\cdots,\varphi _{l}]\in R^{m_{z}\times l}$ and $\eta _{i}$ denote the centers and widths.
Inspired by the ideas of BLS [Reference Chen and Liu25] and incremental learning theory [Reference Huang, Zhang, Yang and Chen27], an improved RBFNNs learning algorithm is presented in this section. In the traditional RBFNNs control, the designed NNs node space cannot guarantee to completely cover the input vector, so it is valuable to add nodes close to the input vector and discard nodes far away from the input vector.
Define the NNs node information as $[\varphi _{t_{0}},\eta _{t_{0}},W_{t_{0}}]$ when $t=t_{0}$ and the added NNs node information as $[\varphi _{\textrm{new}},\eta _{\textrm{new}},W_{\textrm{new}}]$ . Select $n_{p}$ center vectors close to the input vector $c_{\textrm{min}}=\{c_{1},\cdots,c_{n_{p}}\}\in \varphi$ and the mean is $a_v=\sum _{i=1}^{n_{p}}c_{i}/n_{p}$ , then the added node can be designed by following rule [Reference Huang, Zhang, Yang and Chen27]
where $k_{z}\in R^{m_{z}\times m_{z}}$ is a positive diagonal matrix, $\bar{c}_{\textrm{min}}\in R^{m_{z}}$ denotes the average distance vector to the closest centers $c_{\textrm{min}}$ , which is defined as $\bar{c}_{\textrm{min}}=[a_{v},\cdots,a_{v}]\in R^{m_{z}}$ . Thus, the NNs center is updated as
where $t_{s}$ is the sampling interval, $\varXi$ is the preset threshold, and $\bar{l}$ is the preset maximum number of neural nodes. According to the idea of broad learning, we choose the hidden layer of NNs as $G(t_{0}+t_{s})=[Z|R(Z)]\in R^{l+m_{z}}$ . Meanwhile, initialize the weight from the hidden layer to the output layer with $W(t_{0}+t_{s})=[W_{t_{0}}|W_{\textrm{new}}]\in R^{(l+m_{z})\times (n+2)}$ . Consequently, the output of BNNs can be expressed as
Remark. The design of $\varphi _{\textrm{new}}$ in ( 11 ) ensures that the subsequently updated node is always near the input vector, which can improve the estimation accuracy of the NNs for unknown nonlinear function. Moreover, the design of threshold $\varXi$ in ( 13 ) provides a convenient way to save the number of neural nodes, which helps reduce the computational burden of the system. It should be pointed out that in practical applications, a small $\varXi$ will lead to a large number of nodes. So when without prior knowledge, we can preset the maximum number of neural nodes $\bar{l}$ to reduce the computational complexity of the NNs.
Remark. In practice, the value of $\bar{l}$ is related to the input information of NN. If the fluctuation range of the input information is large, we need to set a higher value of $\bar{l}$ . For example, we can set a larger value for $\bar{l}$ at the initial time, such as $200$ . If it is observed that the unknown function is well fitted, the value of $\bar{l}$ can be reduced. If it is observed that the unknown function is poorly fitted, the value of $\bar{l}$ needs to be increased. If the fitting effect is not good when the value of $\bar{l}$ reaches $400$ , we need to consider modifying other parameter values.
2.6. Useful technical lemmas
Lemma 2.1. [Reference Zhang, Dong, Ouyang, Yin and Peng53] For $\forall x\in R$ , there is a positive constant $\zeta \in R$ satisfying that $|x|\leq |\zeta |$ with $\zeta$ being the constraint value, the following inequality holds:
Lemma 2.2. [Reference Smaeilzadeh and Golestani31] For any constants $a_{1},a_{2},\cdots,a_{n}$ and $0\leq x\leq 1$ , the following inequality holds:
Lemma 2.3. [Reference Huang, Song and Lai49] The behavior-shaping function $Q(t)$ shown in (8) possesses the following properties:
-
(1) ${Q}(t_{0})=1$ ;
-
(2) $\dot{Q}(t)\gt 0$ and $Q^{(i)}\in \ell _{\infty }$ for all $t\geq t_{0}$ ;
-
(3) $\lim \limits _{t\to \infty }Q^{(i)}=0$ , $i$ is a positive integer.
3. Control design
3.1. Control objectives
In this section, the backstepping method is performed to obtain an adaptive NNs FT controller.
-
(1) The desired reference angular vector $x_{d}$ can be tracked within the prescribed time. And the tracking errors of two flexible manipulator joints can satisfy the prescribed requirements in terms of overshoot, transient, and steady-state performance.
-
(2) The actuator faults described in (2) can be compensated by a PFTC scheme.
-
(3) All the close-loop error signals are uniformly ultimately bounded.
3.2. Adaptive NN PPC with actuator faults
To facilitate the controller design, we define the first error variable $z_{1}$ as
where $f(e_{1})=[f(e_{1i}),\cdots,f(e_{(n+2)i})]^{T}$ . If $|z_{1i}|\lt k_{bi}(i=1,2,\cdots,n+2)$ with $k_{bi}\lt \textrm{min}\{\delta _{1},\delta _{2}\}$ , we get
when $t\geq T$ , the $\kappa (T)$ is chosen as $(k_{b}-\xi \varepsilon )/(\varepsilon -\xi \varepsilon )$ , then
Considering that $\kappa (t)$ is monotonically increasing function with $\kappa (t_{0})=1$ , we get $\lim \limits _{t\to +\infty }\kappa ^{-1}(t)=0$ ; thus, we can further obtain
Next, we discuss the relationship between $e_{1}$ and $f(e_{1})$ . From (6), we can get the inverse function of $f(e_{1})$
Differentiating $e_{1i}(f)$ with respect to $f$ yields $\dot{e}_{1i}=f^{(\frac{1}{m}-1)}g^{\frac{1}{m}}\gamma ^{2}(\gamma ^{2}-f^{2})^{(\frac{1}{2m}-1)}/m(\gamma ^{2}-f^{2})^{\frac{1}{m}}\geq 0$ with $g\gt 0$ and $m$ is an odd integer which is not less than $3$ . Thus, $e_{1i}(f)$ is strictly increasing for $f\in (\!-\gamma _1,\gamma _2)$ . Selecting $g=(\gamma ^{2}-\varepsilon ^{2})\varepsilon ^{(2m-2)}$ and combining (19) and (21), we conclude
Substituting (20) into (21) yields
Thus, the constraints of $e_{1i}$ in (5aa) and (5ab) are guaranteed.
According to the properties of $Q(t)$ , we know $0\lt Q^{-1}(t)\lt 1$ and further get
where $k_{bi}\lt \textrm{min}\{\gamma _{1},\gamma _{2}\}$ .
when $e_{1i}(t_{0})\geq 0$ , $\gamma =\gamma _{1}$ , we can get from (21) and (24)
when $e_{1i}(t_{0})\lt 0$ , $\gamma =\gamma _{2}$ , similarly, we get
where $P_{u}, -P_{l}$ are the upper and lower bounds of the tracking error, respectively. More specifically,
-
(1) When $0\leq e_{1i}(t_{0}) \leq P_{u}$ , the size of overshoot is determined by $P_{l}$ . Considering the definitions of $P_{l}$ and $g$ , we find that once $k_{bi}$ and $\varepsilon$ are fixed, the overshoot is only determined by $\gamma$ . According to the definition of $\gamma$ , it is easy to know that the overshoot is only determined by $\gamma _{1}$ .
-
(2) When $-P_{l}\leq e_{1i}(t_{0}) \lt 0$ , the size of overshoot is determined by $P_{u}$ . Considering the definitions of $P_{u}$ and $g$ , we find that once $k_{bi}$ and $\varepsilon$ are fixed, the overshoot is only determined by $\gamma$ . Similarly, it is easy to know that the overshoot is only determined by $\gamma _{2}$ .
The above analysis hold under the condition $|z_{1i}|\lt k_{bi}$ ; the preset performance can be achieved by transforming the error $e_{1i}$ twice with (6) and (8). In the following controller design process, a BLF $\sum _{i=1}^{n+2}\textrm{ln}\frac{k_{bi}^{2}}{k_{bi}^{2}-z_{1i}^2}$ is considered to guarantee that $z_{1i}$ remain into the region $|z_{1i}| \lt k_{bi}$ [Reference Ren, Ge, Tee and Lee54].
Define the second error variable $z_{2}$ as
where $\alpha =[\alpha _{1},\cdots,\alpha _{n+2}]^{T}$ is the virtual error variable. Take the derivative of $z_{1}$ with respect to time, we have
where
Since the term $z_{2}^{T}Mz_{2}$ is positive and continuously differentiable, we choose the Lyapunov function candidate as
Taking the time derivative of $V_{1}$ along (4) and (28), we obtain
According to Property 2, the term $z_{2}^{T}(\frac{1}{2}\dot{M}(q)-B(q,\dot{q}))z_{2}=0$ . To proceed, we define virtual control law $\alpha$ as
with $K_{1}=\textrm{diag}\{K_{11},\cdots,K_{1(n+2)}\}$ is a positive-definite gain matrix and
Combining $L_{i}, M_{1i}$ , and $Q$ , we get
Substituting (31) and (32) into (30), one has
where $w=\textrm{diag}\{w_{1},\cdots,w_{i}\}$ with $w_{i}=\frac{1}{k_{bi}^{2}-z_{1i}^2}$ , $L=\textrm{diag}\{L_{1},\cdots,L_{i}\}$ .
Since uncertainties in $M(q), B(q,\dot{q}), K(q)$ are difficult to obtain, we cannot realize the model-based control actually. Therefore, BNNs are utilized to approximate the uncertainties of system model. Design the adaptive controller as
where $\hat{P}=\textrm{diag}\{\hat{P}_{1},\cdots,\hat{P}_{n+2}\}$ is the estimation matrix of $P$ with $P=D^{-1}$ , $\tau _{1}^{*}=[\tau _{11}^{*},\tau _{12}^{*},0,\cdots,0]^{T}\in R^{n+2}$ . $\hat{W}$ is the weight of NNs and $R(Z)$ is the basis function. The term $\hat{W}^{T}R(Z)$ denotes the estimation of ${W}^{*T}R(Z)$ , while ${W}^{*T}R(Z)$ represents the approximation of the uncertain item in the control (34) with
where $Z=[q^{T},\dot{q}^{T},\alpha ^{T},\dot{\alpha }^{T}]$ is the input of the basis function to the NNs and $\epsilon (Z)\in R^{n+2}$ is the approximation error. Define the NNs adaptive laws as
where $\Gamma _{i}\in R^{(n+2)(n+2)}$ is the constant positive-definite gain matrix; $\sigma _{i}$ is a small positive constant.
Let $P_{i}=\frac{1}{l_{i}}, (i=1,\cdots,n+2)$ , and the adaptive laws of $\hat{P}_{i}$ are designed as
Then, a theorem that guarantees prescribed tracking performance under actuation faults is proposed as the following.
Theorem 3.1. For the system described by (1), the controller ( 34 ) with the adaptive law ( 36 ) and ( 37 ), if the initial conditions $ (q(0),\dot{q}(0),\hat{W}_{i}(0),\hat{P}_{i}(0) )$ are bounded and $z_{1i}(0)\lt k_{bi}$ , $i=1,2,\cdots,n+2$ , the semi-global stability of the system can be achieved. The error signals $z_{1}, z_{2}, \tilde{W}$ , and $\tilde{P}_{i}$ will remain within $\Omega _{z_{1}},\Omega _{z_{2}},\Omega _{W}$ , and $\Omega _{P}$ , respectively, which are defined as
with $i=1,\ldots,n+2$ and $J=V_{2}(0)+\frac{l_{0}}{p_{0}}$ with $l_{0}$ and $p_{0}$ are two positive constants.
$Proof:$ The Lyapunov function candidate is considered as
where $\tilde{P_{i}}=\hat{P}_{i}-P_{i}$ , $\tilde{W_{i}}=\hat{W}_{i}-W_{i}^{*}$ , and $\tilde{W_{i}}$ , $\hat{W}_{i}$ , $W_{i}^{*}$ are the NNs weight error, estimate value, and actual value, respectively.
Differentiating $V_{2}$ with respect to time leads to
According to the control law (34), NNs (35) and (37), we get
Note that $l_{i}(\hat{P}_{i}-\tilde{P}_{i})=l_{i}P_{i}=1$ . Substituting (31), (34), and (40) into (39), we get
By applying Young’s inequality, the following inequalities $-\sigma _{i}\tilde{W}_{i}^{T}\hat{W}_{i}\leq -\frac{\sigma _{i}}{2}(\Vert \tilde{W}_{i}\Vert ^{2}-\Vert{W_{i}^{*}}\Vert ^{2}),$ $-\tilde{P}_{i}\hat P_{i}\leq -\frac{1}{2}(\tilde P_{i}^2-P_{i}^2), -z_{2}^{T}\epsilon (Z)\leq \frac{1}{2}z_{2}^{T}z_{2}+\frac{1}{2}\Vert \epsilon (Z)\Vert ^{2}$ hold. And considering Lemmas 2.1 and 2.2, (41) can be written as
where $I\in R^{(n+2)(n+2)}$ is an identity matrix, $l_{0}$ and $p_{0}$ are two constants defined as
with $\lambda _{\textrm{min}}(\!*\!)$ and $\lambda _{\textrm{max}}(\!*\!)$ refer to the minimum and maximum eigenvalues of the matrix $*$ , respectively. In order to ensure that $l_{0}\gt 0$ , the gains $K_{1}$ and $K_{2}$ should be satisfied
Multiplying (42) by $e^{l_{0}t}$ yields
Integrating $t$ on both sides of inequality (46), we have
Combining (38) and (47), we get $z_{1i}^{2}\leq k_{bi}^2\big (1-e^{-2(V_{2}(0)+\frac{l_{0}}{p_{0}}})\big )$ , $\Vert z_{2}\Vert ^{2}\leq 2\big ( V_{2}(0)+\frac{l_{0}}{p_{0}}\big )\lambda _{\textrm{min}}^{-1}(M)$ , $\Vert \tilde{W}\Vert ^{2}\leq 2\big ( V_{2}(0)+\frac{l_{0}}{p_{0}}\big )\lambda _{\textrm{min}}^{-1}(\Gamma ^{-1})$ , $\tilde{P}_{i}^{2}\leq 2\big ( V_{2}(0)+\frac{l_{0}}{p_{0}}\big )(\textrm{min}(l_{i}))^{-1}$ . Therefore, we can conclude that $z_{1},z_{2},\tilde{W}$ , and $\tilde{P}$ all converge into the compact set shown in Theorem 1. The proof has been finished.
According to Theorem 1, the error variable $z_{1}$ is uniformly ultimately bounded with respect to the set $\Omega _{z1}$ . Thus, the preset performance described in (5a) can be achieved. Moreover, the error signals $z_{2}$ , $\tilde{W_{i}}$ , and $\tilde{P_{i}}$ are uniformly ultimately bounded. Because $W^{*}_{i}$ and $P_{i}$ are bounded with $\tilde{W_{i}}=\hat{W}_{i}-W_{i}^{*}$ , $\tilde{P_{i}}=P_{i}-\hat{P}_{i}$ , $\hat{W}_{i}$ and $\hat{P}_{i}$ must be bounded.
Remark. In control law ( 34 ), $\tau _{1}$ will be unbounded if $z_{1i}=k_{bi}$ . However, we can restrict $z_{1i}$ within the range [- $k_{bi}$ , $k_{bi}$ ] by adjusting parameters $\xi, \gamma$ , and $\varepsilon$ at the initial moment $t_{0}$ . Therefore, the control law ( 34 ) will be bounded with no singular values.
Remark. Different from existing works [Reference Bechlioulis and Rovithakis39], [Reference Karayiannidis and Doulgeri55], the proposed control with PP in this paper cannot only ensure that the tracking error reaches the preset range within a finite time but also can finally enter a smaller residual set. Although the performance function $\rho (t)$ can constrain the error convergence process, the system converges in infinite time and the convergence speed is slow. Therefore, since the convergence time $T$ is set in advance, the proposed controller has a faster convergence rate, which will be proved in the simulation part.
4. Simulations
In this section, simulations of a two-link flexible manipulator are carried out to demonstrate the validity of proposed control. The simulation environment is based on MATLAB 2020a. The detailed platform parameters are shown in Table I. The reference trajectories of two joints are square waves which are chosen as
Our control objective is to force the tracking error of two joints to satisfy the given performance requirements in (5a)–(5d). The performance parameters are designed as $\varepsilon =0.01$ , $\mu =0.005$ , and $T=0.5\ \text{s}$ . The boundaries of two tracking errors are $-\delta _{1}=-0.01$ and $\delta _{2}=0.5$ . The initial conditions of states information $q(0)$ and $\dot{q}(0)$ are set to zero, which ensures that the initial errors of the system is within the constraint range.
In order to show the superiority of the proposed controller, three cases are considered in the following simulations. First, we examine the effect of proportional differential (PD) controller. Second, the method of combining PD and PP for the manipulator proposed in ref. [Reference Karayiannidis and Doulgeri55] is shown. Third, the simulation results of proposed control with PP are presented. Finally, the effectiveness of adaptive neural networks fault-tolerant (NNFT) controller is verified with actuator faults being taken into consideration.
4.1. Simulation results without actuator faults
First, we prove the effectiveness of the proposed PP controller. Considering the initial state of errors of first joint and second joint is negative, so we choose $\gamma =\gamma _{2}$ . For the proposed controller, the specific parameters are set as: $K_{1}=\textrm{diag}\{12,12,1,1,1,1\}$ , $K_{2}=\textrm{diag}\{11,11,1,1,1,1\}$ , $m=3$ , $\gamma _{1}=\gamma _{2}=2$ , $\xi =0.5$ , $k_{bi}=1.2(i=1,\cdots,6)$ .
For the RBFNNs with BLS, the initial number of nodes in the NNs is $2^{6}=64$ , the initial center parameters are chosen as $-1$ or $1$ , and the widths are set as $\eta =2$ . The initial weights are $\hat{W}_{i}(0)=0(i=1,\cdots,64)$ , $\Gamma _{i}=0.1(i=1,\cdots,6)$ , $\sigma _{i}=0.02(i=1,\cdots,6)$ , the threshold is $\varXi =1.2$ , and the maximum number of neural nodes is $\bar{l}=100$ . According the definition and properties of rate function $\kappa (t)$ , we choose the $\kappa (t)$ as $e^{9.21t}$ . The simulation results are shown in Fig. 4.
From the tracking performance of the first and second joints shown in Fig. 4 (black dotted line), we find the tracking error can converge into the prescribed small region within $0.5\ \text{s}$ and then shrink to a smaller range at $t=1\ \text{s}$ , but the overshoot is too large to satisfy the prescribed requirement in (5d). Then we select a more larger value of $\gamma$ . So we set $\gamma =5$ in the next step, and the simulation results are shown as the red lines. We can clearly find that the tracking errors of two joints meet the PP described in (5a)–(5d) with small overshoot. Figure 5 shows that the uncertain function in (35) is better approximated under the improved RBFNNs and the number of neural nodes is only $100$ . Therefore, the proposed RBFNNs algorithm with BLS is effective. The first subfigure in Fig. 6 shows that the control inputs of both joints are bounded and smooth. The second subfigure in Fig. 6 is the trajectory change process of two joints in three-dimensional space, which shows the vibration of the flexible rod is effectively suppressed.
Second, to further demonstrate the superiority of the proposed NNs PP controller, the traditional PD control and the method proposed in ref. [Reference Karayiannidis and Doulgeri55] which has combined PD and PP are used as comparisons.
Design the PD control as
where $e_{1}=x_{1}-x_{d}$ is the tracking error, and $K_{p}$ and $K_{d}$ are proportional and differential gains matrices. Considering the transient and steady-state performance of two joints, we get the optimal parameters with $K_{p}=10I_{6\times 6}$ and $K_{d}=10I_{6\times 6}$ .
For the model-free control with PP proposed in ref. [Reference Karayiannidis and Doulgeri55], the exponential performance function is set as
with $\rho _{0}=1.1$ , $\rho _{\infty }=0.01$ , and $l=2$ . Taking into account the characteristics of the desired trajectory, we set the performance function as $\rho (t-10)$ at $t=10\,\text{s}$ .
Then, designing the model-free PPC $\tau$ as
where $K_{p},K_{d},K_{\varepsilon }$ are positive diagonal gain matrices, $x=\frac{e(t)}{\rho (t)}$ , $\varepsilon (x)$ is transformation function. For better comparison, we also choose $K_{p}=10I_{6\times 6}$ , $K_{d}=10I_{6\times 6}$ , and $K_{\varepsilon }=2I_{6\times 6}$ . See ref. [Reference Karayiannidis and Doulgeri55] for some other specific parameters. The simulation results of three cases are shown as Figs. 7 and 8.
From Fig. 7, we notice that although PD control can keep the system stable, it is much worse in terms of speed. Under the same proportional and derivative gains, the controller in ref. [Reference Karayiannidis and Doulgeri55] can significantly improve the rapidity of the system and keep $e(t)$ staying between $\rho (t)$ and $-\rho (t)$ , but still cannot meet our preset convergence time $T$ . Observing the proposed controller in Fig. 7, it can be found that the proposed NNs PP controller not only has higher stability accuracy but also faster convergence speed, which meet our preset requirements and effectively improve the transient and steady-state performance of flexible robotic manipulator. Therefore, the superiority of the proposed adaptive NNs PPC has been proved.
4.2. Simulation results with actuator faults
This part is mainly to verify the performance of the proposed FT controller in the case of actuator faults. The actuator faults in (2) include multiplicative and additive faults which are described as follows:
Through the above actuator faults settings, it is noted that the actuator for the first joint and the second joint lose $40\%$ and $70\%$ of its effectiveness, respectively, and both experience an additive bias fault after $5\ \text{s}$ . For time-varying additive fault, the upper bound of the fault is $u_{ci}=0.5, i=1,2$ . For the NNFT PPC (34), the parameters are same as described in section $B$ . Simulation results are shown as Figs. 9 and 10.
From Fig. 9, the proposed adaptive FT control can still tracking the desired trajectories rapidly when actuators $\tau _{1}$ and $\tau _{2}$ occur faults. It is noted that the red line and blue line represent control input $u$ and control signal $\tau$ , respectively. Thus, we can conclude that the proposed NNFT control (34) possesses strong robustness against the actuator faults. In Fig. 10, when actuator faults occur at $5\ \text{s}$ , the proposed controller reacts rapidly to offset the adverse effects, so that the signal amplitude of the tracking error converges the prescribed range.
5. Conclusion
An adaptive NNFT controller with PP has been presented for a two-link flexible manipulator in this paper. The overshoot and tracking errors of two joints can satisfy the preset performance specifications by using a behavior-shaping function and an asymmetric scaling function. Based on the RBFNNs and broad learning theory, the uncertain dynamics of flexible robotic manipulator has been well approximated. Simulation results show the proposed control method cannot only ensure tracking errors meet the preset performance but also effectively suppress the vibration of the flexible rod. Comparative simulation results with other methods show the superiority of proposed control in terms of rapidity and stability, and vibration of the flexible rod can be suppressed. Besides, the proposed adaptive controller can solve the problem caused by actuator faults. In the future, we will continue to focus on the research of flexible manipulator by combining reinforcement learning control and optimization control.
Author contributions
Wei He, Linghuan Kong, and Wenkai Niu proposed the research project. Wenkai Niu designed the control algorithm, carried out the simulation test, and wrote the first draft. Yifan Wu and Haifeng Huang revised the manuscript. And Wei He and Linghuan Kong provided advice and supervision.
Funding statement
This work was supported in part by the National Natural Science Foundation of China under Grants 62225304, 62061160371, and U20A20225, by the National Key Research and Development Program of China under Grant 2019YFB1703600, in part by the Fundamental Research Funds for the China Central Universities of USTB under Grant FRF-MP-20-36, in part by the Beijing Natural Science Foundation under Grant JQ20026, and in part by the Beijing Top Discipline for Artificial Intelligence Science and Engineering, University of Science and Technology Beijing.
Conflicts of interest
The authors declare that there is no conflict of interest.