论文标题

具有风险敏感平均成本标准的马尔可夫连锁店的离散时间零和游戏

Discrete-time Zero-Sum Games for Markov chains with risk-sensitive average cost criterion

论文作者

Ghosh, Mrinal K., Golui, Subrata, Pal, Chandan, Pradhan, Somnath

论文摘要

我们研究了零和随机游戏,用于控制的离散时间马尔可夫链,具有风险敏感的平均成本标准,具有可计数的状态空间和Borel动作空间。收益功能是无负的,可能是无限的。在动力学上的某个Lyapunov稳定性假设下,我们确定了值和鞍点平衡的存在。此外,我们完全表征了固定马尔可夫策略类别中所有可能的马鞍点策略。最后,我们介绍并分析了一个说明性示例。

We study zero-sum stochastic games for controlled discrete time Markov chains with risk-sensitive average cost criterion with countable state space and Borel action spaces. The payoff function is nonnegative and possibly unbounded. Under a certain Lyapunov stability assumption on the dynamics, we establish the existence of a value and saddle point equilibrium. Further we completely characterize all possible saddle point strategies in the class of stationary Markov strategies. Finally, we present and analyze an illustrative example.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源