摘要
网络化战争中,机载雷达在实现对目标信息持续获取的同时保证载机安全生存是亟待解决的问题。对此,以多机协同作战安全转场任务为背景,提出基于深度强化学习算法的智能传感器管理方法。首先,综合考虑信号辐射量与目标威胁因素,计算目标运动过程中的实时威胁隶属度。其次,在强化学习框架下对雷达-目标分派问题建模,利用神经网络逼近动作-值函数,并根据时序差分算法进行参数更新。仿真结果表明,相比于传统调度方法,所提算法有效提升了任务成功率,缩短了任务完成用时。
In the networked war,it is urgent that airborne radar can continuously acquire target information while ensuring the safe survival.Focusing on this problem,in the context of safe transition tasks of multi-airborne cooperative operations,this paper proposes a intelligent sensor management method based on deep reinforcement learning.First,the real-time threat membership is calculated considering the signal radiation and several threat factors.Then,the radar-target assignment problem is modeled in a reinforcement learning framework.The neural network is used to approximate the action-value function,and the parameters are updated according to the temporal-difference algorithm.It can be seen from the simulation that the proposed algorithm improves the task success rate and shortens the time of task completion compared with the traditional scheduling methods.
作者
闫实
贺静
王跃东
孙自强
梁彦
YAN Shi;HE Jing;WANG Yuedong;SUN Ziqiang;LIANG Yan(School of Automation,Northwestern Polytechnical University,Xi’an 710072,China;Key Laboratory of Information Fusion,Ministry of Education,Xi’an 710072,China;Nanjing Institute of Electronic Technology,Nanjing 210039,China)
出处
《系统工程与电子技术》
EI
CSCD
北大核心
2020年第8期1726-1733,共8页
Systems Engineering and Electronics
基金
国家自然科学基金(61771399,61873205)资助课题。
关键词
传感器管理
强化学习
威胁隶属度
sensor management
reinforcement learning
threat membership