Crossref Citations
This article has been cited by the following publications. This list is generated based on data provided by Crossref.
Bharath, B
and
Borkar, V S
1999.
Stochastic approximation algorithms: Overview and recent trends.
Sadhana,
Vol. 24,
Issue. 4-5,
p.
425.
Abounadi, J.
Bertsekas, D.
and
Borkar, V. S.
2001.
Learning Algorithms for Markov Decision Processes with Average Cost.
SIAM Journal on Control and Optimization,
Vol. 40,
Issue. 3,
p.
681.
Borkar, V. S.
2002.
Q-Learning for Risk-Sensitive Control.
Mathematics of Operations Research,
Vol. 27,
Issue. 2,
p.
294.
Ormoneit, D.
and
Glynn, P.
2002.
Kernel-based reinforcement learning in average-cost problems.
IEEE Transactions on Automatic Control,
Vol. 47,
Issue. 10,
p.
1624.
Van Roy, Benjamin
2002.
Handbook of Markov Decision Processes.
Vol. 40,
Issue. ,
p.
431.
Melo, Francisco S.
and
Ribeiro, M. Isabel
2007.
Learning Theory.
Vol. 4539,
Issue. ,
p.
308.
Malikopoulos, Andreas A.
Papalambros, Panos Y.
and
Assanis, Dennis N.
2009.
A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty.
Journal of Dynamic Systems, Measurement, and Control,
Vol. 131,
Issue. 4,
Malikopoulos, Andreas A.
2009.
Convergence Properties of a Computational Learning Model for Unknown Markov Chains.
Journal of Dynamic Systems, Measurement, and Control,
Vol. 131,
Issue. 4,
Malikopoulos, Andreas A.
Assanis, Dennis N.
and
Papalambros, Panos Y.
2009.
Real-Time Self-Learning Optimization of Diesel Engine Calibration.
Journal of Engineering for Gas Turbines and Power,
Vol. 131,
Issue. 2,