Nomenclature
- UAV
-
Unmanned Aerial Vehicle
- QUAV
-
Quadrotor Unmanned Aerial Vehicle
1.0 Introduction
Unmanned aerial vehicles (UAVs) have the advantages of low cost, high manoeuverability,and ease of operation [Reference Greenwood, Lynch and Zekkos1–Reference Aslan and Oktay4]. When a single UAV completes complex tasks, its application is increasingly limited due to insufficient payload and sensor performance. Instead, more and more UAVs are working together to complete tasks, and the application and research of UAVs are beginning to develop towards clusters. UAV formation can perform tasks such as border patrol, battlefield reconnaissance, pesticide spraying and forest fire detection [Reference Kartal, Subbarao and Gans5–Reference Oktay and Eraslan8], which are widely used in both military and civil fields. Therefore, studying the problem of multi-UAV formation has important practical application value.
Categorised by the form of information exchange between individuals, multi-UAVs formation can be classified into centralised, distributed and decentralised structures [Reference Chen, Tang and Lao9]. In distributed control method, all agents act as independent control units and communicate information with neighbours to achieve state consistency control. Due to its better reliability and flexibility, this paper chooses the distributed control method. At present, the mainstream cluster formation control algorithms mainly include leader-following method [Reference Loria, Dasdemir and Jarquin10], behaviour-based method [Reference Garcia-Aunon, Del Cerro and Barrientos11], and virtual structure method [Reference Askari, Mortazavi and Talebi12]. Among them, the leader-following method is the most widely used, with the advantages of simple structure, easy implementation and wide applicability. The leader-following method is usually combined with the consistency method to design the formation algorithm [Reference Yoshioka and Namerikawa13].
Consistency method was first applied to the distributed control of communication network nodes. This method can enable the state of multiple nodes in communication network to reach a consistent state within a certain period of time. Due to its feasibility, this method is widely used in the control of multi-agent systems. Jadbabaie and Lin [Reference Jadbabaie, Lin and Morse14] applied knowledge of algebraic graph theory to theoretically prove that when the communication topology of the Vicsek model [Reference Vicsek, Czirók and Ben-Jacob15] is an undirected connected graph, all agents in the system will converge to the same state eventually. Olfati-Saber [Reference Olfati-Saber and Murray16] studied the consensus problem of second-order continuous multi- agent system, extended the concept of algebraic connectivity in undirected graphs to directed graphs, and elucidated the important role of equilibrium directed graphs in solving the average consensus problem. Ni [Reference Ni and Cheng17] studied the second-order multi-agent consensus problem under switching topology and proved that when the concatenation set of all the communicating topologies contains the minimum generation, the system will converge to consensus eventually.
The artificial potential field method is a distance-dominated method [Reference Khatib18]. This method has the advantages of short computation time, real-time control, and safe and smooth planned trajectory. However, it also has disadvantages such as collision with obstacles, unreachable target and falling into local minimum. For these problems, many scholars have improved the artificial potential field method. Lawrence [Reference Lawrence, Frew and Pisano19] provided general a technique for constructing guidance fields for UAV, which combines Lyapunov stability properties to generate simple, globally stable vector fields in 3D. Goncalves [Reference Goncalves, Pimenta and Maia20] proposed a method for computing artificial vector fields, which allows a UAV to converge to a specified general curve in n-dimensional space and loop around it. Wilhelm [Reference Wilhelm and Clem21] focused on the optimisation of a gradient vector field UAV trajectory following and obstacle avoidance method, which minimise the deviation from an original trajectory. Hung [Reference Hung, Rego and Quintas22] conducted an in-depth review of trajectory following control strategies applicable to a wide range class of marine, ground, and aerial autonomous robotic vehicles.
UAV formation obstacle avoidance not only requires avoiding obstacles, but also ensuring a safe distance between each UAV and inter-UAV communication. Due to various reasons such as communication links and devices, the network often experiences communication and input delay during the actual communication processes [Reference Moreau23]. Therefore, the design and analysis of consistency method for multi-UAVs with time delay is more relevant.
The traditional consistency control can make the system state converge to the expected value when the time is infinite, but the finite-time consensus control can make the system state converge to the expected value within a finite time. Control algorithm in finite-time is proposed in Ref. (Reference Wang, Chen and Liu24). Adbessameud and Rtayebi [Reference Abdessameud and Tayebi25] established a formation protocol with time delay for a second-order dynamics model of a multi-UAV system, and proved the stability of the protocol based on Lyapunov-Krasovskii theory. Sun [Reference Sun and Guan26] studied the finite time consistency problem of multi-UAV systems with leaders under fixed and switched networks by applying graph theory and Lasalle invariance principle. Ma [Reference Ma, Sun and Ji27] proposed a new distributed control law to study the leader-following consensus problem for finite-time linear time-invariant multi-agent system by assigning different priorities to each agent in the system. Shang [Reference Shang28] studied a first-order multi- agent system and obtained sufficient conditions for the system to achieve finite-time consensus under an undirected stationary topology.
Based on the earlier discussion, this paper improves on the Ref. (Reference Liu, Qu and Wei29) and studies obstacle avoidance problem of QUAV formation with finite-time consensus and time delay.
The main innovations are as follows:
-
(1) The traditional finite-time consensus protocol is improved, and considered the existence of time delay to accelerate the convergence speed of the system.
-
(2) The traditional artificial potential field method is improved, and the problems of the traditional artificial potential field method is solved.
-
(3) The trajectory segmentation strategy is introduced to reduce the complexity of QUAV obstacle avoidance environment and improve the task execution efficiency.
The rest of the paper is organised as follows. In Section 2, graph theory is given. In Section 3, a mathematical model of a QUAV is established. In Section 4, an improved finite-time consistency method is given with considered the existence of time delay. In Section 5, the improved artificial potential field method is introduced. In Section 6, a finite-time consensus formation obstacle avoidance algorithm with time delay is proposed based on the leader-following method. In Section 7, MATLAB simulation results are given. In Section 8, conclusion is given.
2.0 Graph theory
QUAV formation consensus control covers multiple aspects such as formation reconstruction, formation maintenance, obstacle avoidance and so on, all of which are designed based on graph theory. In graph theory, a graph is composed of vertexes and edges, where the vertexes represent the QUAVs and the edges represent the communication between the QUAVs.
Assuming the total number of QUAVs is n. The graph $G = \left( {V,E} \right)$ consists of a vertex set $V = (1,2, \cdots, n)$ and an edge set $E \subseteq V \times V$ . If vertex $i$ and vertex $j$ are connected, then there’s an edge $({v_i},{v_j}) \in E$ , and vertex $j$ is called a neighbour of vertex $i$ . Graphs are divided into directed graphs and undirected graphs, if all the edges in a graph are undirected, it is called an undirected graph; otherwise, it is called a directed graph.
The relationship between vertex $i$ and vertex $j$ is representedby ${a_{ij}}$ . If vertex $i$ and vertex $j$ are connected by an edge, then ${a_{ij}} = 1$ ; otherwise, ${a_{ij}} = 0$ . The adjacency matrix $A = [{a_{ij}}](i,j \in \left\{ {1,2, \cdots, n} \right\})$ is a method used in graph theory to represent a graph as a two-dimensional array, where the rows and columns represent the vertexes in the graph, and the elements of the array represent whether there are edges between the vertices.
The degree matrix $D = diag({d_1},{d_2}, \ldots, {d_n})$ is diagonal arrays, and the elements on the diagonal represent the degrees of a single vertex. In an undirected graph, the degree $d({v_i})$ of a vertex is the number of edges of vertex $i$ . In a directed graph, degree is divided into two concepts: in-degree and out-degree. The in-degree is the number of directed edges that end at the vertex; the out-degree is the number of directed edges starting from the vertex.
The Laplace matrix of a graph is related to the degree matrix and the adjacency matrix by $L = D - A$ .
3.0 Mathematical models
The UAV model used in this paper is a QUAV, assuming that the safety radius of the centre of the QUAV is ${R_S}$ and the communication radius is ${R_C}$ . The distance between the centre of the QUAV and the surface of the obstacle needs to be greater than the safety radius; otherwise, a collision may occur, resulting in a crash. The distance between any two QUAV centres needs to be less than the communication radius; otherwise it will cause the QUAV to lose contact. The safety radius and communication radius are illustrated in Fig. 1.
The continuous-time multi-QUAVs linear model used in this paper is:
where, ${x_i}(t) \in R$ and ${u_i}(t) \in R$ denote the state and control input of QUAV $i$ at moment $t$ , respectively.
Due to the fact that QUAV is a linear model, it is treated as a mass point in the artificial potential field method, which contradicts the safety radius that the QUAV needs to maintain. It is necessary to attach the safety radius of the QUAV to the obstacle, which the QUAV is treated as a mass point and no longer requires a safety radius. Assuming the actual radius of the spherical obstacle is ${R_O}$ , the total radius $R = {R_O} + {R_S}$ . The radius of the obstacle is shown in Fig. 2.
4.0 Finite-time consensus with time delay
The consensus protocol with equal communication delay, input delay and undirected connected graph network topology is as follows:
where, $\tau $ denotes the time delay; when $ \tau \in [0,{{\;\pi } / {2{\lambda _n}}}) $ the system can achieve consistent; ${\lambda _n}$ is the maximum eigenvalue of the Laplace matrix of the system.
Definition 1 [Reference Xiao, Wu and Peng30] If there exists an expected time ${T_0} \in [0, + \infty )$ , such that for any initial state, the multi-QUAVs satisfy $\mathop {\lim }\limits_{t \to {T_0}} \left\| {{x_i}(t) - {x_0}(t)} \right\| = 0$ and ${x_i}(t) = {x_0}(t),{\rm{ }}\forall t \geqslant {T_0},{\rm{ }}i \in I$ , then it can be said that finite-time consensus control has been achieved.
In Ref. (Reference Sun and Zhu31), Sun and Guan proposed the following finite-time consensus protocol:
where $0 \lt \alpha \lt 1$ .
Definition 2 $si{g^\alpha }({x_i}) = {\left| {{x_i}} \right|^\alpha }{\mathop{\rm sgn}} \left( {{x_i}} \right)$ , ${\mathop{\rm sgn}} ()$ is a symbolic function, defined as $sgn(r) = \left\{ {\begin{array}{l@{\quad}l}{1,} & {}{r \gt 0}\\[3pt] {0,} & {}{r = 0}\\[3pt] { - 1,} & {}{r \lt 0}\end{array}} \right.$ .
Considering the existence of time delay based on protocol (3), protocol (4) is obtained:
In order to accelerate the convergence speed of protocol (4), this paper has made improvements and proposed protocol (5):
where, $\beta \gt 0$ , the larger $\beta $ is, the faster system converges.
Next, it will be proven that protocol (5) can achieve finite-time consensus and has a faster convergence speed than protocol (4). Before that, the following lemmas need to be introduced.
Lemma 1: [Reference Zheng, Chen and Wang32] Suppose that the function $\varphi \,:\,{R^2} \to R$ satisfies the condition $\varphi \left( {{x_i},{x_j}} \right) = - \varphi \left( {{x_j},{x_i}} \right)\!, \forall i,j \in I,i \ne j$ , and then $\mathop \sum \limits_{i = 1}^N \mathop \sum \limits_{j \in {N_i}} {a_{ij}}{y_i}\varphi \left( {{x_j},{x_i}} \right) = - \frac{1}{2}\mathop \sum \limits_{\left( {{v_i},{v_j}} \right) \in E} {a_{ij}}\left( {{y_j} - {y_i}} \right)\varphi \left( {{x_j},{x_i}} \right)$ is satisfied for any undirected graph and series ${y_1},{y_2}, \ldots, {y_n}$ .
Lemma 2: [Reference Ren and Beard33] If ${y_1},{y_2}, \ldots, {y_n} \gt 0$ and $0 \lt p \le 1$ , then $\mathop \sum \limits_{i = 1}^n y_i^p \geqslant {\left( {\mathop \sum \limits_{i = 1}^n {y_i}} \right)^p}$ .
Lemma 3: (Reference Wang and Xiao34) $L(A) = \left[ {{l_{ij}}} \right] \in {R^{n \times n}}$ denotes the Laplace matrix of graph $G(A)$ , which has the following properties:
-
(1) ${x^T}L(A)x = \frac{1}{2}\mathop \sum \limits_{i,j = 1}^n {a_{ij}}{\left( {{x_j} - {x_i}} \right)^2}$ , the semi positive characterisation of $L(A)$ indicates that all its eigenvalues are real numbers and not less than zero.
-
(2) If the graph $G(A)$ is connected, the second smallest eigenvalue of $L(A)$ is greater than zero, denoted by ${\lambda _2}\left( {{L_A}} \right)$ , which is called as the algebraic connectivity of the graph $G(A)$ .
-
(3) The algebraic connectivity of the graph $G(A)$ is equal to $\mathop {\min }\limits_{x \ne {0_n},1_n^Tx = 0} {{{x^T}L(A)x} / {{x^T}x}}$ , and if $1_n^Tx = 0$ , then ${x^T}L(A)x \geqslant {\lambda _2}\left( {{L_A}} \right){x^T}x$ .
-
(4) If graph $G(A)$ is an undirected connected graph, then $L(A) + Diag\left( {{b_1}, \ldots, {b_n}} \right)$ is positive definite, where ${b_i} \geqslant 0,\;\forall i \in I$ and there exists at least one ${b_i} \gt 0$ .
Lemma 4: [Reference Cortés35] Consider the non-Lipschitz continuous nonlinear systems $\dot x = f(x)$ and $f(0) = 0$ , suppose there is a differential function $V(x)$ defined in the neighbourhood of the origin, and there exist $c \gt 0$ and $0 \lt \alpha \lt 1$ such that $V(x)$ is positive definite and $\dot V(x) + c{V^\alpha } \le 0$ , and then the origin is considered locally finite-time stable. The finite-time $T$ depends on the initial state ${x_0}$ , and satisfy $T({x_0}) \le {{V{{({x_0})}^{1 - \alpha }}} / {c(1 - \alpha )}}$ .
Next, using the Lyapunov function method, it is shown that the control of multi-QUAVs through protocol (5) can achieve state consensus within a finite-time.
Proof: Define the position error
This is obtained through Equations (1) and (6):
The Lyapunov function ${V_1}(t) = \mathop \sum \limits_{i = 0}^n e_i^2(t)$ chosen according to Lemma 1 is given by Equation (7):
From Lemma 2, it can be concluded that:
Let $B = L(A) + Diag({a_{10}}, \ldots, {a_{n0}})$ , from Lemma 3, we can conclude that:
Combining Equations (9) and (10) yields ${\dot V_1}(t) \le - {\left[ {2{\lambda _2}\left( {{L_B}} \right)} \right]^{\frac{{1 + \alpha }}{2}}}V{(t)^{\frac{{1 + \alpha }}{2}}}$ , and by Lemma 4 $T(x(0)) \le \frac{{2{V_1}{{(0)}^{\frac{{1 - \alpha }}{2}}}}}{{(1 + \alpha ){{\left[ {2{\lambda _2}\left( {{L_B}} \right)} \right]}^{\frac{{1 + \alpha }}{2}}}}}$ . And that’s the end of the proof.
In the following, by choosing the same Lyapunov function ${V_2}(t) = \mathop \sum \limits_{i = 0}^n e_i^2(t)$ , it is shown that the convergence of the multi-QUAV formation under protocol (5) is faster than that under protocol (4) in the same way as described earlier.
From Equation (8), we can obtain ${\dot V_1}(t) = - \mathop \sum \limits_{i = 0}^n \mathop \sum \limits_{j = 0}^n {a_{ij}}{\left( {{e_j}(t) - {e_i}(t)} \right)^{1 + \alpha }} - \beta \mathop \sum \limits_{i = 0}^n \mathop \sum \limits_{j = 0}^n {a_{ij}}{\left( {{e_j}(t) - {e_i}(t)} \right)^2}$ , and from $\beta \gt 0$ easily know ${\dot V_2}(t) \lt {\dot V_1}(t)$ . Therefore, the convergence speed of protocol (5) is faster than that of protocol (4).
5.0 Artificial potential field method
5.1 Traditional artificial potential field method
The traditional artificial potential field method is created based on the principle of force in physics. It regards the controlled object as a particle and studies its motion in a virtual potential field. The QUAV moves towards the target position under the influence of potential field and avoids obstacles during this process. The whole potential field consists of two parts: gravitational potential field and repulsive potential field. By calculating the negative gradient of the gravitational potential field and repulsive potential field acting on the QUAV, gravity and repulsive force are obtained, where the force at the target point is gravity and the force at the obstacle is repulsive force. Under the combined action of gravity and repulsion, the QUAV can move toward the target point while avoiding obstacles, and a smooth and continuous motion trajectory can be generated during the whole movement process. The principle of artificial potential field method is shown in Fig. 3.
The traditional artificial potential field method has the following shortcomings [Reference Luan, Tan and You36]:
-
(1) Collision with obstacles. When the distance between the QUAV and the target point is too far, the attractive force on it will be much greater than the repulsive force, and it may collide with obstacles during flight.
-
(2) Target unreachability. When the target point is located near an obstacle and the QUAV approaches the target point, the repulsive force applied to the QUAV is too strong relative to the attractive force, causing the QUAV to hover or remain stationary in a certain area.
-
(3) Falling into a local minimum. When the QUAV moves to a non-target point location, it will fall into a local minimum due to force equilibrium, causing it to oscillate or come to rest repeatedly near the local minimum point.
5.2 Improved artificial potential field method
The attractive potential field function is improved to address the problem of excessive attractive force caused by the QUAV being too far away from the target. Set the attraction action threshold for segment to avoid the problem of excessive attraction force. The modified attractive potential field function is shown in Equation (12):
where, ${k_{att}}$ is the attractive coefficient, ${d_0}$ is the impact distance threshold of the target point, $q$ is the spatial position of the QUAV, and ${d_{goal}} = {\left\| {q - {q_{goal}}} \right\|_2}$ is the Euclidean distance between the QUAV and the target point.
The attraction force is the negative gradient of the attractive potential field, as shown in Equation (13):
For the target unreachability problem, an improved repulsive potential field function is introduced, which combines the distance between the target point and the QUAV to ensure that the potential field is globally minimised at the target point only. The improved repulsive potential field function is shown in Equation (14):
where, ${K_{rep}}$ is the repulsive coefficient, ${d_0}$ is the maximum impact distance of an obstacle, and ${d_{obs}} = {\left\| {q - {q_{obs}}} \right\|_2}$ is the Euclidean distance between the QUAV and the obstacle.
The repulsive force is the negative gradient of the repulsive potential field, as shown in Equation (15):
where, ${{\bf{n}}_1}$ and ${{\bf{n}}_2}$ denote the unit direction vectors along the directions ${F_{rep1}}$ and ${F_{rep2}}$ . The expressions for ${F_{rep1}}$ and ${F_{rep2}}$ are shown in Equation (16):
where ${F_{rep1}}$ and ${F_{rep2}}$ are the components of the repulsive force in two different directions, respectively.
The direction of vector points from the obstacle to the QUAV, which is the repulsive force component; the direction of vector points from the QUAV to the target point and is the attractive force component. As shown in Fig. 4.
It is important to note that the QUAVs in the formation also need to avoid collisions with each other. QUAVs can consider each other as dynamic obstacles, obtain the positions of other QUAVs, and avoid collisions through artificial potential field methods.
The total potential field consists of attractive potential field and repulsive potential field, as shown in Equation (17):
The combined force is equal to the negative gradient of the total potential field, as shown in Equation (18):
For the local minimum problem, introduce a random perturbation and set the minimum velocity as the condition for triggering the random perturbation. If the velocity components ${v_x},{v_y},{v_z}$ of the QUAV in all three directions of $x,y,z$ are less than the minimum velocity, the random perturbation is triggered, as shown in Equation (19):
where, is a random function, is the minimum velocity, $\lambda$ is a coefficient with a small absolute value, and ${\rm{2rand}}$ is a random number in the range of 0 to 1.
5.3 Constraint conditions
In order to avoid collision between QUAVs and obstacles due to excessive speed, a maximum speed constraint is introduced and the constraints are ${v_x},{v_y},{v_z}$ .
Taking the velocity component of the X-axis as an example,and calculate the velocity change at each sampling time, as shown in Equation (20):
where, ${v_x}(t)$ is the speed of the QUAV at the current sampling time, ${v_x}(t - 1)$ is the speed of the QUAV at the last sampling time, and $\Delta {v_x}(t)$ is the amount of change in the velocity component.
Then, the velocity component of the QUAV at the next sampling time is calculated, as shown in Equation (21):
where, ${v_{x\max }}$ is the maximum velocity component of the QUAV, ${a_x}$ is the acceleration component. The values of ${v_{x\max }}$ and ${a_x}$ need to meet the requirement that the QUAV can avoid collision by decelerating when encountering obstacles. ${v_{0x}}(t + 1)$ plays a transitional role in determining the ${v_x}(t + 1)$ of the velocity component of the QUAV at the next sampling time.
5.4 Trajectory segmentation strategy
Collision detection and avoidance are important components of the trajectory strategy, and in environments with long total paths or many obstacles, the artificial potential field method has some randomness in trajectory planning, and the result may not be the optimal trajectory. This section improves the artificial potential field method by segmenting the total path to reduce the complexity of the obstacle environment.
The obstacles encountered by QUAVs during actual flight are mostly irregular three-dimensional objects. If the repulsive potential field is calculated at each trajectory point of the obstacle boundary points, it will result in a strategic trajectory with frequent turning and easy to oscillate, which does not satisfy the QUAV shortest trajectory segment constraint. For irregular concave and convex body obstacles, the general solution is to create a complete envelope of an outer jointed sphere for them and use the outer jointed sphere as a new obstacle to be avoided.
The following is an example of a spherical obstacle to introduce the calculation method of trajectory segmentation strategy:
-
(1) If the starting point S $({x_S},{y_S},{z_S})$ and the ending point G $({x_G},{y_G},{z_G})$ are known, draw a line ${L_{SG}}$ through these two points. Let its direction vector be $({d_x},{d_y},{d_z})$ . The parametric equation of ${L_{SG}}$ is given in Equation (22):
(22) \begin{align} \left\{ {\begin{array}{*{20}{c}}{x = {x_S} + {d_x} \cdot t}\\[3pt] {y = {y_S} + {d_y} \cdot t}\\[3pt] {z = {z_S} + {d_z} \cdot t}\end{array}} \right. \end{align} -
(2) Judgement whether the line ${L_{SG}}$ intersects with an obstacle. If ${L_{SG}}$ does not intersect with any obstacle, ${L_{SG}}$ is the target trajectory; otherwise, judgement the spherical centre C $({x_C},{y_{C,}}{z_C})$ of the obstacle and its safe radius ${r_C}$ that intersects with ${L_{SG}}$ and is closest to the starting point. The spherical expression of the obstacle is shown in Equation (23):
(23) \begin{align}{\left( {x - {x_C}} \right)^2} + {\left( {y - {y_C}} \right)^2} + {\left( {z - {z_C}} \right)^2} = r_C^2 \end{align} -
(3) Find the value of the parameter $t$ by combining Equations (22) and (23), and substitute it into Equation (22) to obtain the two intersection points A and B between the line and the sphere. Calculate the coordinates of the midpoint M of the line segment AB.
-
(4) Connect the centre of the sphere C and the point M to obtain the line ${L_{CM}}$ . Calculate the other two intersections E and F of ${L_{CM}}$ of the sphere in the same way as above.
-
(5) Calculate and compare the distances between point E and point F from the starting point S respectively. Use the point closer to the starting point S as the new target point.
-
(6) Execute the formation consensus obstacle avoidance algorithm, when the leader reaches a new target position, use this position as a new starting point. Repeat steps (1)–(5) to find the next new target point until the QUAV reaches the desired target point.
The trajectory segmentation strategy can reduce the complexity of the QUAV flight environment. By calculating the sub-target points, the planned trajectory is guaranteed to be the optimal solution, which reduces the flight cost and improves the task execution efficiency.
6.0 Mutli-quavs formation consensus obstacle avoidance algorithm
The formation control adopts the leader-following method in distributed control, and its basic idea is that: the leader receives commands and controls the entire formation to perform the expected task. The followers follow the leader based on their expected positions relative to the leader. The leader-following method is illustrated in Fig. 5:
The control protocol of the leader is shown in Equation (24):
where $\Delta {x_{i0}}(t)$ is the relative position of the leader and the follower $i$ at moment $t$ , ${F_0}(t)$ is the combined force on the leader at moment $t$ .
The control protocol of the follower is shown in Equation (25):
where $\Delta {x_{ij}}(t)$ denotes the relative position of QUAV $i$ and QUAV $j$ at moment $t$ , and ${F_{rep(i)}}(t)$ the obstacle repulsive force on QUAV $i$ at moment $t$ .
After receiving the command signal, the leader QUAV sends control command to the followers through the formation consensus obstacle avoidance algorithm. The followers enter the range of influence of obstacles and avoid them autonomously, thus realising a consistent state throughout the formation. The QUAV formation control structure is shown in Fig. 6.
In order to realise the QUAV formation to maintain formation and avoid obstacles autonomously, adaptive inter-QUAV communication weights are considered in this paper. Add a weight to the algorithm ${\omega _{ij}}(t)$ that varies with the position of QUAV $i$ and QUAV $j$ , and the system reaches state consensus when $\left\| {{x_i}(t) - {x_j}(t)} \right\| \to 0$ . The designed multi-QUAVs finite-time consensus obstacle avoidance protocol is shown in Equation (26):
where ${\omega _{ij}}(t) = 2 - {e^{ - {{({x_j}(t - \tau ) - {x_i}(t - \tau ))}^2}}}$ , ${F_i}(t)$ denotes the combined force of the artificial potential field on the QUAV $i$ at moment $t$ .
7.0 Numerical simulation
The effectiveness of the algorithm in an environment without obstacles is verified by MATLAB simulation. QUAVs have a safety radius of ${R_S} = 0.25{\rm{m}}$ and a communication radius of ${R_C} = 3.5{\rm{m}}$ , and the safety radius between two QUAVs is $R = 2 \times {R_S} = 0.5{\rm{m}}$ . The desired positions in Table 1 indicate the position of each QUAV in the desired formation. Based on the position of the leader as a reference, the followers move towards the desired position to form the desired formation.
Each QUAV quickly forms the desired formation under the control of the finite-time consensus protocol. The leader autonomously plans the flight trajectory, and the followers maintain the desired formation and follow the movement of the leader under the control of the consensus agreement. The simulation results of QUAV formation consensus are shown in Fig. 7.
Then, the effectiveness of the algorithm(26) is verified by MATLAB simulation, and both the network topology and the desired formation of the QUAV formation are shown in Fig. 4. The radius of the spherical obstacle ${R_O} = 0.25{\rm{m}}$ . The safe radius between the QUAV and the obstacle $R = {R_O} + {R_S} = 0.5{\rm{m}}$ . The attraction gain coefficient ${K_{att}} = 2$ , repulsion gain coefficient ${K_{rep}} = 0.0002$ , target influence distance threshold ${d_0} = 2{\rm{m}}$ , maximum obstacle influence distance ${d_{\max }} = 1{\rm{m}}$ , and the coordinates of the target point are $(20,20,10)$ . Several spherical obstacles are randomly generated on the flight trajectory of QUAV formation, with the spherical centre coordinates are ${\rm{(6 8 3)}}$ , ${\rm{(8 6 3)}}$ , ${\rm{(9 8 4)}}$ , ${\rm{(9 10 5)}}$ , ${\rm{(11 8 5)}}$ , ${\rm{(11 9 5)}}$ , ${\rm{(11 11 6)}}$ , ${\rm{(12 11 6)}}$ , ${\rm{(12 13 7)}}$ , ${\rm{(13 11 7)}}$ , ${\rm{(13 14 8)}}$ . Maximum velocity constraint is ${v_{x\max }} = {v_{y\max }} = {v_{z\max }} = 3{\rm{m/s}}$ , ${v_{\max }} = 3\sqrt 3 {\rm{m/s}}$ , acceleration ${a_x} = {a_y} = {a_z} = 1{\rm{m/}}{{\rm{s}}^{\rm{2}}}$ . The sampling time is $\Delta t = 1{\rm{ms}}$ , time delay $\tau = 10{\rm{ms}}$ , finite-time coefficient $\alpha = 0.5$ , finite-time coefficient $\gamma = 4$ . The initial and desired position coordinates of each UAV are shown in Table 2.
The algorithm requires that multi-QUAVs to quickly assemble to form a desired formation, and the QUAV formation should avoid obstacles in the airspace and reach the target position while maintaining the desired formation. The simulation results are shown in Fig. 8. The spherical shape in Fig. 8 represents an obstacle, and its radius is equal to the sum of the actual radius of the obstacle and the safety radius. Multi-QUAVs fly to the target position under the guidance of the leader, and the end of the mission is signaled when the leader reaches the target position. From the four angles of Fig. 8, it can be clearly seen that the multi-QUAVs can quickly assemble to form the desired formation. The QUAV formation can autonomously avoid obstacles while flying to the target position, and the formation is well maintained.
Figure 9 shows the distance between the QUAVs and the obstacles. Observe the minimum value of the Y-axis in the figure. If it is greater than the total radius of the obstacle, it indicates that there is no collision with the obstacle. The distance between the centres of the 4 QUAVs and the centres of the 11 obstacles is always greater than 0.5m, indicating that each QUAV can avoid obstacles.
Figure 10 shows a comparative simulation of QUAV formation consistency obstacle avoidance under the action of traditional artificial potential field method, with the same other conditions. The results indicate that multi-QUAVs cannot maintain the desired formation during obstacle avoidance, demonstrating the effectiveness of the algorithm proposed in this paper.
Figure 11 shows the adjacent distance between any two QUAVs, where ${D_{12}}$ denotes the distance between follower 1 and follower 2; ${D_{10}}$ denotes the distance between follower 1 and the leader, and so on. The distance between any two QUAVs is greater than the safe radius and less than the communication radius. This indicates that QUAVs do not collide with each other and are always able to establish communication. After about 1.6s, the distance between any two QUAVs remains unchanged, indicating that the formation is well maintained.
To reflect the QUAV formation consensus motion under this algorithm more clearly, the convergence of the formation’s positions on X-axis, Y-axis and Z-axis are simulated by MATLAB. The simulation results are shown in Fig. 12. The leader arrives at the target position at about 11.3s. From Fig. 12(c), it can be seen that the multi-QUAVs complete desired formation at about 1.6s, and the heights of each QUAV on the Z-axis are always synchronised during the subsequent flight.
Simulation of position error curves of each follower and leader, the spacing between the follower and the leader in the desired formation on the axis is $\Delta x = [{-}2, -{\rm{1,0,0]}}$ , $\Delta y = [0, -1, -2{\rm{,0]}}$ , $\Delta z = [0,{\rm{0,0,0]}}$ , respectively, and the QUAV position error $\Delta = [{-}2, -\sqrt 2, -2,0]$ . In Fig. 13, the position error of each follower and leader is the same as the predetermined position, which indicates that the multi-QUAVs can form and maintain the desired formation.
Figure 14 shows a simulation comparison of the Z-axis position error without trajectory segmentation strategy and improved finite-time consensus control, with the same other conditions. The Z-axis position error are selected for comparison, as the positions of each QUAV in the required formation are different on the X-axis and Y-axis, while and the required heights for the Z-axis are the same, but the initial heights are different. The multi-QUAVs can only form the desired formation at about 8s, and the leader reaches the target position at about 12.3s. It is verified that the multi-QUAV finite-time consensus protocol enables multi-QUAVs to quickly assemble to form a desired formation, and the trajectory segmentation strategy can improve task execution efficiency.
Figure 15(a)–(c) show the velocity components of the QUAVs with the maximum constraint of $3{\rm{m/s}}$ , and Fig. 15(d) shows the velocity of the QUAVs with the maximum constraint of $3\sqrt 3 {\rm{m/s}}$ . None of the QUAVs exceeded the maximum velocity constraint and converged uniformly in approximately 1.6s. Afterwards, each QUAV has the same speed, indicating that the formation is well maintained.
8.0 Conclusion
A finite time consensus obstacle avoidance algorithm for QUAV formation with time delay was designed by combining an improved finite time consensus protocol with an improved artificial potential field method. The multi-QUAVs can be quickly assembled to form the desired formation through the improved finite-time consensus protocol. Under the improved artificial potential field method, the QUAV formation flies towards the target position and avoids obstacles. The trajectory segmentation strategy reduces the complexity of obstacle avoidance environments. The effectiveness of the proposed method is verified by MATLAB simulation. The state convergence time for multi-QUAVs is much faster than the traditional finite-time consensus control method. QUAV formation can successfully avoid obstacles and reach the target position faster than without trajectory segmentation strategy. Multi-QUAVs can form the desired formation and fly to the target position faster, maintain the formation during obstacle avoidance and improve task execution efficiency.
Acknowledgements
This work was supported by the National Natural Science Foundation of China (61976237, 62103456), the Science and Technology Innovation Team of Colleges and Universities in Henan Province (22IRTSTHN015), the Central Plains Thousand Talents Plan Top Talents in Central Plains (ZYQR201810162), the Natural Science Foundation of Henan Province (202300410511), the Key R&D and Promotion Projects (tackling of key scientific and technical problems) in Henan Province, China (232102111129), the Basic research business fees for provincial undergraduate universities of Zhongyuan University of Technology, China (K2022YY010), the Key Research and Development Program of Henan (241111210100), Discipline Strength Improvement Plan of Zhongyuan University of Technology, China (SD202210).
Data availability statement
Our manuscript has no associated data.
Competing interests
The authors declare that they have no conflict of interest.