Article contents
Detecting optimal and non-optimal actions in average-cost Markov decision processes
Published online by Cambridge University Press: 14 July 2016
Abstract
We present two sufficient conditions for detection of optimal and non-optimal actions in (ergodic) average-cost MDPs. They are easily interpreted and can be implemented as detection tests in both policy iteration and linear programming methods. An efficient implementation of a recent new policy iteration scheme is discussed.
MSC classification
- Type
- Research Article
- Information
- Copyright
- Copyright © Applied Probability Trust 1994
References
- 4
- Cited by