先发制人以最大程度地减少传输延迟下不正确信息的年龄

论文标题

先发制人以最大程度地减少传输延迟下不正确信息的年龄

Preempting to Minimize Age of Incorrect Information under Transmission Delay

论文作者

Chen, Yutao, Ephremides, Anthony

论文摘要

我们研究了优化有能力的发射器决策的问题，以最大程度地减少通信渠道随机延迟时的不正确信息（AOII）。我们考虑了一个开槽的时间系统，在该系统中，发射器观察了马尔可夫源，并根据系统状态做出决定。在每个时间插槽中，发射器决定在频道繁忙时是否抢占还是跳过。当频道空闲时，发射器会决定是否发送新更新。远程接收器根据收到的更新估算了马尔可源的状态。我们考虑一个通用的传输延迟，并假设传输延迟是独立的，并且针对每个更新都相同分布。本文旨在在每个时间插槽中优化发射器的决定，以最大程度地减少AOII，并使用通用的时间惩罚函数。为此，我们首先使用马尔可夫决策过程来制定优化问题，并得出两种规范的先发制性策略实现的预期AOII的分析表达。然后，我们证明了最佳政策的存在，并提供了可行的价值迭代算法以近似最佳策略。但是，如果我们希望对近似值有很大的信心，则值迭代算法将在计算上很昂贵。因此，我们在两个规范延迟分布下分析了系统特征，并理论上使用策略改进定理获得了相应的最佳策略。最后，提出数值结果，以说明先发制能力带来的绩效提高。

We study the problem of optimizing the decisions of a preemptively capable transmitter to minimize the Age of Incorrect Information (AoII) when the communication channel has a random delay. We consider a slotted-time system where a transmitter observes a Markovian source and makes decisions based on the system status. In each time slot, the transmitter decides whether to preempt or skip when the channel is busy. When the channel is idle, the transmitter decides whether to send a new update. A remote receiver estimates the state of the Markovian source based on the update it receives. We consider a generic transmission delay and assume that the transmission delay is independent and identically distributed for each update. This paper aims to optimize the transmitter's decision in each time slot to minimize the AoII with generic time penalty functions. To this end, we first use the Markov decision process to formulate the optimization problem and derive the analytical expressions of the expected AoIIs achieved by two canonical preemptive policies. Then, we prove the existence of the optimal policy and provide a feasible value iteration algorithm to approximate the optimal policy. However, the value iteration algorithm will be computationally expensive if we want considerable confidence in the approximation. Therefore, we analyze the system characteristics under two canonical delay distributions and theoretically obtain the corresponding optimal policies using the policy improvement theorem. Finally, numerical results are presented to illustrate the performance improvements brought about by the preemption capability.

下载PDF全文

下载文献需遵守相关版权规定

论文标题