Hostname: page-component-745bb68f8f-hvd4g Total loading time: 0 Render date: 2025-01-10T22:42:57.386Z Has data issue: false hasContentIssue false

Dynamic programming and gambling models

Published online by Cambridge University Press:  01 July 2016

Sheldon M. Ross*
Affiliation:
University of California, Berkeley

Abstract

Dynamic programming is used to solve some simple gambling models. In particular we consider the situation where an individual may bet any integral amount not greater than his fortune and he will win this amount with probability p or lose it with probability 1 — p. It is shown that if p ≧ ½ then the timid strategy (always bet one dollar) both maximizes the probability of ever reaching any preassigned fortune, and also stochastically maximizes the time until the bettor becomes broke. Also, if p ≦ ½ then the timid strategy while not stochastically maximizing the playing time does maximize the expected playing time. We also consider the same model but with the additional structure that the bettor need not gamble but may instead elect to work for some period of time. His goal is to minimize the expected time until his fortune reaches some preassigned goal. We show that if p ≦ ½ then (i) always working is optimal, and (ii) among those strategies that only allow working when the bettor is broke it is the bold strategy that is optimal

Type
Research Article
Copyright
Copyright © Applied Probability Trust 1974 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

[1] Blackwell, D. (1967) Positive dynamic programming. Proc. Fifth Berkeley Symp. Math. Statist. Prob. 1, 415418. University of California Press.Google Scholar
[2] Blackwell, D. (1970) On stationary policies. J. R. Statist. Soc. A 133, 3337.Google Scholar
[3] Breiman, L. (1961) Optimal gambling systems for favorable games. Proc. Fourth Berkeley Symp. Math. Statist. Prob. 1. University of California Press.Google Scholar
[4] Breiman, L. (1968) Probability. Addison-Wesley, Reading, Mass. Google Scholar
[5] Derman, C. and Strauch, R. (1966) A note on memoryless rules for controlling sequential control processes. Ann. Math. Statist. 37, 276279.Google Scholar
[6] Dubins, L. and Savage, L. (1965) How to Gamble if you Must. McGraw-Hill, New York.Google Scholar
[7] Epstein, R. (1967) Theory of Gambling and Statistical Logic. Academic Press, New York.Google Scholar
[8] Freedman, D. (1967) Timid play is optimal. Ann. Math. Statist. 38, 12811284.Google Scholar
[9] Molenaar, W. and Van Der Velde, E. A. (1967) How to survive a fixed number of fair bets. Ann. Math. Statist. 38, 12781281.Google Scholar
[10] Ornstein, D. (1969) On the existence of stationary optimal strategies. Proc. Amer. Math. Soc. 20, 563569.Google Scholar
[11] Ross, S. (1970) Applied Probability Models with Optimization Applications. Holden-Day, San Francisco.Google Scholar
[12] Strauch, R. (1966) Negative dynamic programming. Ann. Math. Statist. 37, 171189.Google Scholar
[13] Thorp, E. (1969) Optimal gambling systems for favorable games. Rev. Inst. Internat. Statist. 37, No. 3.Google Scholar