Optimal control of linear stochastic systems with an exponential-of-integral performance index

Alain Bensoussan

doi:10.1017/CBO9780511526503.004

3 - Optimal control of linear stochastic systems with an exponential-of-integral performance index

Published online by Cambridge University Press: 16 September 2009

Alain Bensoussan

Show author details

Alain Bensoussan: Affiliation:
Université de Paris IX (Paris-Dauphine)

Book contents

Get access

Summary

Introduction

In Chapter 2 (see comments) we have seen the concepts of certainty equivalence and the separation principle. Again the main idea is that an optimal control for a stochastic control problem with partial observation can be obtained by a feedback rule (deterministic of course) applied on the Kalman filter (the best estimate of the state). As we shall see in later chapters, this situation is by no means general. The ‘sufficient statistics’, to which one applies a feedback rule, are in general infinite dimensional (as will be seen it is the conditional probability distribution), and thus does not reduce to a single moment, the conditional mean. Some natural questions arise at this stage. Can we find examples in which the sufficient statistics are finite dimensional, for instance several moments (conditional mean and conditional variance)? Note that if this holds, then the dimension of the sufficient statistics, although finite, is larger than that of the state. In this chapter we will study a different situation. We shall meet a situation where the optimal control is given by feedback on sufficient statistics, which are finite dimensional, with a dimension which is the same as that of the state, but does not coincide with the conditional mean, the best estimate of the state. It is clearly a situation where the separation principle does not hold, but from a computational viewpoint offers the same simplicity as that when the separation principle does hold.

The situation that we consider here is naturally very specific again, and corresponds to linear dynamics, with linear observation, and a payoff which is the exponential of the standard quadratic functional.

Type: Chapter
Information: Stochastic Control of Partially Observable Systems , pp. 53 - 73

DOI: https://doi.org/10.1017/CBO9780511526503.004 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 1992

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

3 - Optimal control of linear stochastic systems with an exponential-of-integral performance index

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive