Mean Ergodic Theorem
The mean ergodic theorem is a fundamental result in the ergodic theory, and produces the von Neumann ergodic theorem as a corollary. The theorem is stated as follows (Ergodic Theory, Karl Peterson):
Theorem
Let
be a linear transform on the Hilbert space
, and satisfies the contraction property:
.
Define the invariance subspace of
by
.
Let
be the orthogonal projection from
to
. Then for any
,
in the sense of
.
This result states that the time average of can be used to approximate
, which is classical in the ergodic theory.
Proof Define to be the span of the subspace
, then
is the orthogonal complement of
. Now we compute the time average of
.
(1) If for some
, then it is direct to see
,
hence we have
.
(2) If , then there exists
in
such that
converges to
in
. For any
, we can choose
such that
, and thus
.
For this given , for sufficiently large
we have
.
Therefore we conclude that for sufficiently large ,
.
Then we obtain in the norm of
.
From the proof we know that, if is a closed set, then the convergence rate of the time average is explicitly given by
.
A natural consequence is the von Neumann’s mean ergodic theorem, which is the FIRST ergodic theorem in history.
Theorem
Let
be a
-finite measure space,
be a measure-preserving transformation, and
. Then there exists a function
for which
.
Here, means that
.
Example: Irreducible Markov Chain
Now we consider a special example: the irreducible Markov chain in the Euclidean space . We shall employ the mean ergodic theorem to prove convergence of the weak convergence in the irreducible Markov chain.
Theorem
Let
be a Markov chain in
with invariant distribution
. Suppose for any
, the transition probability
has a positive lower bound when
lives in a compact set. For given
, for almost every initial value
there holds
.
Here, the Hilbert space actually means
, where
is the Borel
-algebra in
. The “irreducible” corresponds to the positivity of the transition probability
. The theorem asserts that in the weak sense, the time average converges to the spatial average almost surely.
Proof Introduce the operator on
by
.
Now we prove two essential properties of .
(1) is a contraction in
.
This is actually a classical result from the Markov semigroup theory, (The geometry of Markov diffusion generators, Bakry) for example. Still, we present a short proof of this property below. It is direct to calculate for any ,
,
where is the transition probability distribution. Integrate both sides over
gives
Here, is a probability distribution in
with density
.
Clearly, the marginal distributions of in
are both
. Therefore,
and we conclude that is a contraction in
.
(2) The solution to for
is constant.
Clearly, if is a constant function, then
,
which implies is actually a solution. Next, if some non-constant
satisfies
, we can assume
has a nonzero maximum. Specifically, let
, and
is the maximum. Then
implies
.
Since has a positive lower bound when
lives in a compatc set, we claim that
holds true almost everywhere. This is a contradition.
Using the two properties above, we immediately obtain the result from the mean ergodic theorem.
Extension to Continuous Time Stochastic Process
A natural idea is to extend the mean ergodic theorem to the continuous time stochastic processes. There is an intuitive approach to complete this task: in the discrete time case, we have used to conclude
.
In the continuous case, suppose is the generator of the stochastic process, then we wish for some
there holds
.
Differentiating on the variable gives
.
Therefore, to study the time average corresponding to the function , we only need to solve the Poisson equation
(note that
is usually an elliptic operator).
The details of the method can be found in Mattingly’s prominent paper Convergence of numerical time-averaging and stationary measures via Poisson equations, which also provides the error analysis of numerical discretization methods.
Summary
We establish the mean ergodic theorem in the Hilbert space and apply the theorem to prove the convergence of the time average of a Markov chain.

留下评论