Skip to Main Content
Skip Nav Destination
ASME Press Select Proceedings
Intelligent Engineering Systems through Artificial Neural Networks, Volume 20Available to Purchase
By
Cihan H. Dagli
Cihan H. Dagli
Search for other works by this author on:
ISBN:
9780791859599
No. of Pages:
686
Publisher:
ASME Press
Publication date:
2010

Most reinforcement learning algorithms are of the model-free type in which the transition probabilities are not computed and the agent seeks to make decisions without building the transition probability model. We focus on the model-based, also called model-building, algorithms that attempt to build the model along with optimization of the decision-making process. Model-based algorithms have certain advantages over model-free algorithms in that their behavior is more stable and robust. Another aspect of robustness and stability of the algorithm has to do with the variability in the value of the performance measure returned by the algorithm. We will present a new model-building algorithm that builds the transition probability model simultaneously with the value function and a new variance-penalizing algorithm that exhibits robustness with respect to the performance measure.

Abstract
Introduction
A Literature Review
New Algorithms
Conclusions
References
This content is only available via PDF.
You do not currently have access to this chapter.

or Create an Account

Close Modal
Close Modal