数据新鲜度和节能无人机导航优化：一种深厚的加固学习方法

论文标题

数据新鲜度和节能无人机导航优化：一种深厚的加固学习方法

Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep Reinforcement Learning Approach

论文作者

Abedin, Sarder Fakhrul, Munir, Md. Shirajum, Tran, Nguyen H., Han, Zhu, Hong, Choong Seon

论文摘要

在本文中，我们为多个无人机（UAV）设计了一项导航政策，其中移动基站（BSS）被部署以改善与物联网（IoT）设备的数据新鲜度和连接性。首先，我们提出一个节能轨迹优化问题，其中目标是通过优化UAV-BS轨迹策略来最大化能源效率。我们还结合了不同的上下文信息，例如能源和信息时代（AOI）约束，以确保地面BS的数据新鲜度。其次，我们提出了一个具有经验重播模型的敏捷深度加强学习，以解决有关UAV-BB导航的上下文约束的制定问题。此外，提出的方法非常适合解决问题，因为问题的状态空间非常大，并且找到具有有用上下文特征的最佳轨迹策略对于UAV-BSS来说太复杂了。通过应用拟议的训练模型，无人机-BSS的有效实时轨迹政策随着时间的推移捕获了可观察到的网络状态。最后，模拟结果说明了所提出的方法比贪婪和基线深Q网络（DQN）方法的能源效率高3.6％和3.13％。

In this paper, we design a navigation policy for multiple unmanned aerial vehicles (UAVs) where mobile base stations (BSs) are deployed to improve the data freshness and connectivity to the Internet of Things (IoT) devices. First, we formulate an energy-efficient trajectory optimization problem in which the objective is to maximize the energy efficiency by optimizing the UAV-BS trajectory policy. We also incorporate different contextual information such as energy and age of information (AoI) constraints to ensure the data freshness at the ground BS. Second, we propose an agile deep reinforcement learning with experience replay model to solve the formulated problem concerning the contextual constraints for the UAV-BS navigation. Moreover, the proposed approach is well-suited for solving the problem, since the state space of the problem is extremely large and finding the best trajectory policy with useful contextual features is too complex for the UAV-BSs. By applying the proposed trained model, an effective real-time trajectory policy for the UAV-BSs captures the observable network states over time. Finally, the simulation results illustrate the proposed approach is 3.6% and 3.13% more energy efficient than those of the greedy and baseline deep Q Network (DQN) approaches.

下载PDF全文

下载文献需遵守相关版权规定

论文标题