
On a class of non-Markov decision processes

Published online by Cambridge University Press:  14 July 2016

K. D. Glazebrook*
Affiliation:
University of Newcastle upon Tyne

Abstract

The optimal strategy for a class of non-Markov decision processes is characterised; it has the property that changes of action may occur between successive transitions of the process. Results are given which enable the optimal strategy to be computed iteratively.
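The abstract's notion of computing an optimal strategy iteratively can be illustrated, in spirit only, by standard value iteration on a toy Markov decision process. This is a hedged sketch for intuition, not the paper's method: the paper treats non-Markov processes, and all names and numbers below (states, actions, rewards, transition probabilities, discount factor) are hypothetical.

```python
# Illustrative only: iterative computation of an optimal strategy via
# value iteration on a small, hypothetical Markov decision process.
# The paper's setting is non-Markov; this sketch shows the general
# flavour of iterating toward an optimal policy.

def value_iteration(P, R, gamma=0.9, tol=1e-8):
    """P[a][s][t]: probability of moving s -> t under action a.
    R[a][s]: expected one-step reward for action a in state s.
    Returns the (approximate) optimal value of each state."""
    n = len(R[0])                 # number of states
    V = [0.0] * n                 # initial value estimates
    while True:
        V_new = []
        for s in range(n):
            # Bellman update: best action's reward plus discounted
            # expected value of the successor state.
            V_new.append(max(
                R[a][s] + gamma * sum(P[a][s][t] * V[t] for t in range(n))
                for a in range(len(R))
            ))
        if max(abs(V_new[s] - V[s]) for s in range(n)) < tol:
            return V_new
        V = V_new

# Hypothetical two-state, two-action example.
P = [[[0.8, 0.2], [0.3, 0.7]],    # transitions under action 0
     [[0.5, 0.5], [0.9, 0.1]]]    # transitions under action 1
R = [[1.0, 0.0],                  # rewards under action 0
     [0.5, 2.0]]                  # rewards under action 1
V = value_iteration(P, R)
```

Each sweep of the loop improves the value estimates, and the iteration stops once successive estimates agree to within the tolerance; the greedy action at each state then gives the (approximately) optimal strategy.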

Type
Research Papers
Copyright
Copyright © Applied Probability Trust 1978 

