Journal of the Operations Research Society of China

• Stochastic Optimization •     Next Articles

Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria

  

  • Online:2014-12-30 Published:2014-12-30

Abstract:

This paper studies the policy iteration algorithm (PIA) for zero-sum stochastic differential games with the basic long-run average criterion, as well as with its more selective version, the so-called bias criterion. The system is assumed to be a nondegenerate diffusion. We use Lyapunov-like stability conditions that ensure the existence and boundedness of the solution to certain Poisson equation. We also ensure the convergence of a sequence of such solutions, of the corresponding sequence of policies, and, ultimately, of the PIA.

Key words: Ergodic payoff criterion,  Zero-sum stochastic differential games, Policy iteration algorithm,  Nondegenerate diffusions ,  Poisson equation  , Scha¨lconvergence , Bias game