论文标题
马尔可夫链基本定理的各种证明
Various proofs of the Fundamental Theorem of Markov Chains
论文作者
论文摘要
本文是对Markov链的所谓{\ em基本定理的各种证据的调查}:每个Ergodic Markov链都有独特的正固定分布,并且该链在极限中达到了与链条启动的初始分布无关的极限分布。由于马尔可夫连锁店是随机过程,因此自然使用基于概率的参数进行证明是很自然的。同时,马尔可夫链的动力学完全被其初始分布(即矢量及其过渡概率矩阵)完全捕获。因此,也可以使用基于矩阵分析和线性代数的参数。下面讨论的证据使用了这两种类型的参数中的一种,除了在某种情况下参数为图理论。主要文本中给出了各种证据的适当学分。 我们的第一个证明完全是基本的,但是证明也很简单。证明还暗示了我们证明的混合时间绑定,但是在许多情况下,这并不是最佳绑定。证明基本定理的一种方法在两个部分中打破了证明:(i)表明存在不可约马尔可夫链的独特的正固定分布,并且(ii)假设一个千古链确实具有固定分布,表明该链将在该链中融合到该限制到该限制的限制,而与初始分布无关。对于(i),我们调查两个证据,一个使用概率参数,另一个使用图形理论参数。对于(ii),首先,我们提供基于耦合的证明(耦合是基于概率的技术),另一个使用矩阵分析。最后,我们仅使用线性代数概念提供了基本定理的证明。
This paper is a survey of various proofs of the so called {\em fundamental theorem of Markov chains}: every ergodic Markov chain has a unique positive stationary distribution and the chain attains this distribution in the limit independent of the initial distribution the chain started with. As Markov chains are stochastic processes, it is natural to use probability based arguments for proofs. At the same time, the dynamics of a Markov chain is completely captured by its initial distribution, which is a vector, and its transition probability matrix. Therefore, arguments based on matrix analysis and linear algebra can also be used. The proofs discussed below use one or the other of these two types of arguments, except in one case where the argument is graph theoretic. Appropriate credits to the various proofs are given in the main text. Our first proof is entirely elementary, and yet the proof is also quite simple. The proof also suggests a mixing time bound, which we prove, but this bound in many cases will not be the best bound. One approach in proving the fundamental theorem breaks the proof in two parts: (i) show the existence of a unique positive stationary distribution for irreducible Markov chains, and (ii) assuming that an ergodic chain does have a stationary distribution, show that the chain will converge in the limit to that distribution irrespective of the initial distribution. For (i), we survey two proofs, one uses probability arguments, and the other uses graph theoretic arguments. For (ii), first we give a coupling based proof (coupling is a probability based technique), the other uses matrix analysis. Finally, we give a proof of the fundamental theorem using only linear algebra concepts.