两阶段重复随机游戏的激励兼容性

论文标题

两阶段重复随机游戏的激励兼容性

Incentive Compatibility in Two-Stage Repeated Stochastic Games

论文作者

Satchidanandan, Bharadwaj, Dahleh, Munther A.

论文摘要

我们解决了两阶段重复的随机游戏的机理设计问题 - 一种新颖的设置，使用该设置，在下一代电力市场中，许多新兴问题都可以很容易地建模。反复的比赛为玩家提供了一大批策略，这些策略使玩家的行动适应了过去从那里获得的所有观察和推论。在其他设置（例如迭代拍卖或动态游戏）中，这种巨大的策略空间表现出来，通常对机制设计具有重要的影响：可能不可能将真相作为主要的策略平衡。因此，在这种情况下，即使纳什均衡是现实世界中的贫穷模型，也很常见的是使真相平衡或其变体的机制解决。这是由于每个玩家都必须对其他玩家的行为做出过分具体的假设，才能采用其NASH均衡策略，这可能不会做出。通常，在平衡中投机的负担越小，它对现实世界的行为进行建模越多。在这一格言的指导下，我们引入了一种新的平衡概念，称为主导策略，非破产平衡（DNBE），该策略要求玩家对其他玩家的行为几乎没有采用其平衡策略的假设。因此，这种机制可以使dnbe而不是纳什均衡的机制在沿着真实的线条塑造现实世界的行为方面非常有效。我们提出了一种两阶段重复随机游戏的机制，该机制使真相的统一策略不利。该机制还保证了个人合理性并最大化社会福利。最后，我们描述了该机制设计需求响应市场的应用。

We address the problem of mechanism design for two-stage repeated stochastic games -- a novel setting using which many emerging problems in next-generation electricity markets can be readily modeled. Repeated playing affords the players a large class of strategies that adapt a player's actions to all past observations and inferences obtained therefrom. In other settings such as iterative auctions or dynamic games where a large strategy space of this sort manifests, it typically has an important implication for mechanism design: It may be impossible to obtain truth-telling as a dominant strategy equilibrium. Consequently, in such scenarios, it is common to settle for mechanisms that render truth-telling only a Nash equilibrium, or variants thereof, even though Nash equilibria are known to be poor models of real-world behavior. This is owing to each player having to make overly specific assumptions about the behaviors of the other players to employ their Nash equilibrium strategy, which they may not make. In general, the lesser the burden of speculation in an equilibrium, the more plausible it is that it models real-world behavior. Guided by this maxim, we introduce a new notion of equilibrium called Dominant Strategy Non-Bankrupting Equilibrium (DNBE) which requires the players to make very little assumptions about the behavior of the other players to employ their equilibrium strategy. Consequently, a mechanism that renders truth-telling a DNBE as opposed to only a Nash equilibrium could be quite effective in molding real-world behavior along truthful lines. We present a mechanism for two-stage repeated stochastic games that renders truth-telling a Dominant Strategy Non-Bankrupting Equilibrium. The mechanism also guarantees individual rationality and maximizes social welfare. Finally, we describe an application of the mechanism to design demand response markets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题