Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria

Jose Daniel Lopez-Barrientos

doi:10.1007/s40305-014-0061-z

Journal of the Operations Research Society of China

2014, Vol. 2, Issue 4: 395


Stochastic Optimization


Online published: 2014-12-30


Abstract

This paper studies the policy iteration algorithm (PIA) for zero-sum stochastic differential games with the basic long-run average criterion, as well as with its more selective version, the so-called bias criterion. The system is assumed to be a nondegenerate diffusion. We use Lyapunov-like stability conditions that ensure the existence and boundedness of the solution to a certain Poisson equation. We also ensure the convergence of a sequence of such solutions, of the corresponding sequence of policies, and, ultimately, of the PIA.

Key words: Ergodic payoff criterion; Zero-sum stochastic differential games; Policy iteration algorithm; Nondegenerate diffusions; Poisson equation; Schäl convergence; Bias game

Cite this article

Jose Daniel Lopez-Barrientos. Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria[J]. Journal of the Operations Research Society of China, 2014, 2(4): 395. DOI: 10.1007/s40305-014-0061-z
