
Policy Improvement and the Newton-Raphson Algorithm

Published online by Cambridge University Press:  27 July 2009

P. Whittle
Affiliation:
Statistical Laboratory, University of Cambridge
N. Komarova
Affiliation:
All-Union Correspondence Polytechnic Institute, Moscow, USSR

Abstract

We show that the calculation of the infinite-horizon value function for a linear/quadratic Markov decision process by policy improvement is exactly equivalent to solution of the equilibrium Riccati equation by the Newton-Raphson method. The assertion extends to risk-sensitive and non-Markov formulations and thus shows, for example, that the Newton-Raphson method provides an iterative algorithm for the canonical factorization of operators that exhibits second-order convergence and has a variational basis.
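For concreteness, the asserted equivalence can be checked numerically in the simplest setting. The sketch below is not from the paper: it treats a scalar discrete-time linear/quadratic problem with dynamics x(t+1) = a x(t) + b u(t) and stage cost q x^2 + r u^2, and the parameter values a, b, q, r and the starting value P0 are illustrative assumptions. It compares one Newton-Raphson step on the equilibrium Riccati residual with one policy-improvement-plus-evaluation step; the two iterations produce identical sequences.

```python
# A minimal sketch (not from the paper) of the equivalence between policy
# improvement and Newton-Raphson for a scalar discrete-time LQ problem:
#   dynamics  x(t+1) = a*x(t) + b*u(t),   stage cost  q*x^2 + r*u^2.
# The parameter values and P0 below are illustrative assumptions; P0 should
# correspond to a stabilizing initial policy, as the theory requires.

a, b, q, r = 1.2, 1.0, 1.0, 0.5   # assumed example parameters
P0 = 10.0                          # value coefficient of an initial stabilizing policy

def riccati_residual(P):
    """f(P) = q + a^2 P - (a b P)^2 / (r + b^2 P) - P; the optimal P is its root."""
    return q + a**2 * P - (a * b * P)**2 / (r + b**2 * P) - P

def newton_step(P):
    """One Newton-Raphson step on f; here f'(P) = (a + b K)^2 - 1 with K the improved gain."""
    K = -(a * b * P) / (r + b**2 * P)
    fprime = (a + b * K)**2 - 1.0
    return P - riccati_residual(P) / fprime

def policy_improvement_step(P):
    """Improve the policy to u = K x, then evaluate it via the scalar Lyapunov equation."""
    K = -(a * b * P) / (r + b**2 * P)               # policy improvement
    return (q + r * K**2) / (1.0 - (a + b * K)**2)  # policy evaluation

P_newton, P_policy = P0, P0
for k in range(6):
    P_newton = newton_step(P_newton)
    P_policy = policy_improvement_step(P_policy)
    print(f"iter {k+1}:  Newton P = {P_newton:.12f}   policy iteration P = {P_policy:.12f}")
```

In this scalar example the two printed sequences agree to machine precision and converge quadratically, illustrating the second-order convergence referred to in the abstract.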

Type
Articles
Copyright
Copyright © Cambridge University Press 1988

