Article contents
A new policy iteration scheme for Markov decision processes using Schweitzer's formula
Published online by Cambridge University Press: 14 July 2016
Abstract
Given a family of Markov chains with a single recurrent class, we present a potential application of Schweitzer's exact formula relating the steady-state probability and fundamental matrices of any two chains in the family. We propose a new policy iteration scheme for Markov decision processes where in contrast to policy iteration, the new criterion for selecting an action ensures the maximal one-step average cost improvement. Its computational complexity and storage requirement are analysed.
Keywords
MSC classification
- Type
- Short Communications
- Information
- Copyright
- Copyright © Applied Probability Trust 1994
References
- 4
- Cited by