论文标题
在确定和限定马尔可夫链聚集中的超级遗产数量
On Determining and Qualifying the Number of Superstates in Aggregation of Markov Chains
论文作者
论文摘要
许多涉及大型马尔可夫连锁店的研究需要确定较小的代表性(聚合)链。代表链中的每个{\ em Superstate}代表原始Markov链中的一个{\ em组态。通常,聚合链中超级遗产数量的选择是模棱两可的,并且基于有限的先前知识。在本文中,我们提出了一种结构化方法,即确定超级遗产数量的最佳候选者。我们通过比较不同大小的聚合链来实现这一目标。为了促进这种比较,我们开发并量化了{\ em边际回报}的概念。我们的概念捕获了{\ em相关}状态(即由同一超级巨星代表的状态)组中{\ em异质性}的减少,这是聚合链中超级巨星数量增加的。我们使用最大的熵原理来证明边际回报的概念以及我们对异质性的量化。通过对超级遗产数量的合成马尔可夫链的模拟,我们表明具有最大边缘回报的聚合链可以确定此数字。如果是马尔可夫链,我们表明,具有最大边缘回报的汇总模型标识了所建模的方案所特有的固有结构。因此,证实了我们提出的方法的功效。
Many studies involving large Markov chains require determining a smaller representative (aggregated) chains. Each {\em superstate} in the representative chain represents a {\em group of related} states in the original Markov chain. Typically, the choice of number of superstates in the aggregated chain is ambiguous, and based on the limited prior know-how. In this paper we present a structured methodology of determining the best candidate for the number of superstates. We achieve this by comparing aggregated chains of different sizes. To facilitate this comparison we develop and quantify a notion of {\em marginal return}. Our notion captures the decrease in the {\em heterogeneity} within the group of the {\em related} states (i.e., states represented by the same superstate) upon a unit increase in the number of superstates in the aggregated chain. We use Maximum Entropy Principle to justify the notion of marginal return, as well as our quantification of heterogeneity. Through simulations on synthetic Markov chains, where the number of superstates are known apriori, we show that the aggregated chain with the largest marginal return identifies this number. In case of Markov chains that model real-life scenarios we show that the aggregated model with the largest marginal return identifies an inherent structure unique to the scenario being modelled; thus, substantiating on the efficacy of our proposed methodology.