Publication date: 2024.05.21
Total pages: 21
Editor:
harlan
Abstract:
www.nature.com/scientificreports
A hierarchical reinforcement learning method for missile evasion and guidance
Mengda Yan1,2*, Rennong Yang1,2, Ying Zhang1,2, Longfei Yue1 & Dongyuan Hu1
This paper proposes a missile manoeuvring algorithm based on hierarchical proximal policy optimization (PPO) reinforcement learning, which enables a missile to guide itself to a target and evade an interceptor at the same time. Based on the idea of task hierarchy, the agent has a two-layer str...