Cooperative Interception Guidance Law Based on Reinforcement Learning of Q-learning
WANG Jin-qiang (王金强), SU Ri-xin (苏日新), LIU Li (刘莉), LIU Yu-xiang (刘玉祥), LONG Yong-song (龙永松)
(Jiangnan Institute of Mechanical and Electrical Design, Guiyang 550025, China)
Abstract:
To enable multiple missiles to cooperatively intercept a maneuvering target and to improve interception effectiveness, a cooperative interception guidance law based on Q-learning reinforcement learning is proposed. First, a nonlinear multi-missile cooperative interception model is established on the basis of escape domain covering theory. Second, with the line-of-sight rate taken as the state and a reward function constructed from the miss distance, a reinforcement learning agent is generated by offline training. The agent is combined with conventional proportional navigation to form a guidance law with a variable navigation coefficient, which generates the guidance commands for cooperative interception in real time. Finally, numerical simulation verifies the effectiveness and superiority of the proposed algorithm.
Key words:  Cooperative interception  Reinforcement learning  Maneuvering target  Escape domain  Guidance law
DOI:
Funding: Key Basic Research Project of the Commission of Science, Technology and Industry for National Defense (2019-JCJQ-ZD-049
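
The abstract describes the scheme only in outline: the line-of-sight rate is the state, the reward depends on the miss distance, and the learned agent selects the navigation coefficient of a proportional navigation law. The Python sketch below illustrates that idea for a single interceptor using tabular Q-learning. It is not the authors' implementation. The bin edges, the candidate coefficients, the exponential reward shape, the learning parameters, and all function names are assumptions made for illustration, and the multi-missile escape domain coverage is not modeled.

```python
import bisect
import math

# Hypothetical discretisation of the line-of-sight (LOS) rate in rad/s.
LOS_RATE_EDGES = [-0.03, -0.01, 0.01, 0.03]
# Hypothetical candidate navigation coefficients (the agent's actions).
NAV_COEFFS = [2.0, 3.0, 4.0, 5.0]


def state_index(los_rate):
    """Map a LOS rate to a discrete state index in 0..len(LOS_RATE_EDGES)."""
    return bisect.bisect_right(LOS_RATE_EDGES, los_rate)


def terminal_reward(miss_distance, scale=10.0):
    """Assumed reward shape: approaches 1 as the miss distance (m) goes to 0."""
    return math.exp(-miss_distance / scale)


def new_q_table():
    """Q-table with one row per LOS-rate state and one column per coefficient."""
    return [[0.0] * len(NAV_COEFFS) for _ in range(len(LOS_RATE_EDGES) + 1)]


def q_update(q, s, a, r, s_next, alpha=0.1, gamma=0.95, done=False):
    """Standard tabular Q-learning update, applied in place."""
    target = r if done else r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])
    return q


def guidance_command(q, los_rate, closing_speed):
    """Proportional navigation command a = N * Vc * los_rate,
    with N chosen greedily from the Q-table for the current state."""
    row = q[state_index(los_rate)]
    n = NAV_COEFFS[row.index(max(row))]
    return n * closing_speed * los_rate
```

In this sketch, offline training would call `q_update` once per simulated guidance step, with `terminal_reward` supplied at the end of each engagement, and the trained table would then be queried online through `guidance_command`.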
