Faced with complex working environments and task requirements, unmanned swarm systems increasingly need control algorithms with higher processing efficiency, better generalization, and greater adaptability. Artificial intelligence algorithms are increasingly being applied to unmanned swarm systems. At the same time, deep reinforcement learning (DRL) has entered a period of rapid development. By combining the powerful representation ability of deep learning with the effective policy search ability of reinforcement learning, DRL has become a promising learning paradigm for realizing general artificial intelligence.
Many studies in recent years have combined unmanned swarms with deep reinforcement learning. This paper introduces the concept of DRL in terms of its principles and characteristics and analyzes a variety of typical DRL algorithms. It then discusses the various control requirements of UAV swarms, focusing on the many results achieved by combining DRL with UAV swarm control. Finally, it offers views on the application prospects of this combined field and on the challenges of deploying it and transferring it into practice.
The concept of the unmanned swarm originated from the study of the behavior of biological groups. Many species of bees, ants, birds, fish, and other creatures exhibit complex group behavior. These groups consist of large numbers of independent individuals that follow certain aggregation rules to form coordinated, orderly group movement. Their behavior is distributed, coordinated, self-organized, and adaptive to the environment; the groups are structurally stable and can produce intelligence beyond that of the individuals in them. By analogy with biological groups, in the field of robotics and unmanned aerial vehicles (UAVs), unmanned swarm systems are swarm intelligence systems composed of large numbers of homogeneous or heterogeneous unmanned platforms that coordinate their behavior and jointly complete specific tasks through interactive information feedback and stimulus response.
In practical applications, an unmanned swarm system must cope with open environments, changeable situations, limited resources, and real-time response requirements. This requires the system to have many core collaborative capabilities, such as distributed collaborative perception, intelligent collaborative decision-making, and robust collaborative control. Distributed intelligent collaborative control methods based on deep reinforcement learning can meet the requirements for high intelligence and robustness in the control of unmanned swarm systems. In addition, by drawing on the powerful representation ability of deep learning and the search and optimization ability of reinforcement learning, and by using generative adversarial networks to address the shortage of data at a single node during online learning, such methods can achieve real-time collaborative control and decision-making at larger scale and in higher dimensions, and support the design of intelligent decision-making methods for complex and highly dynamic environments.