The increasing complexity of engineering systems has motivated continuing research on computational learning methods towards making autonomous intelligent systems that can learn how to improve their performance over time while interacting with their environment. These systems need not only to be able to sense their environment, but should also integrate information from the environment into all decision making. The evolution of such systems is modeled as an unknown controlled Markov chain. In previous research, the predictive optimal decision-making (POD) model was developed that aims to learn in real time the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of POD to the stationary distribution of a Markov chain is proven, thus establishing POD as a robust model for making autonomous intelligent systems. The paper provides the conditions that POD can be valid, and an interpretation of its underlying structure.
Skip Nav Destination
ASME 2008 Dynamic Systems and Control Conference
October 20–22, 2008
Ann Arbor, Michigan, USA
Conference Sponsors:
- Dynamic Systems and Control Division
ISBN:
978-0-7918-4335-2
PROCEEDINGS PAPER
Convergence Properties of a Computational Learning Model for Unknown Markov Chains
Andreas A. Malikopoulos
Andreas A. Malikopoulos
University of Michigan, Ann Arbor, MI
Search for other works by this author on:
Andreas A. Malikopoulos
University of Michigan, Ann Arbor, MI
Paper No:
DSCC2008-2174, pp. 513-520; 8 pages
Published Online:
June 29, 2009
Citation
Malikopoulos, AA. "Convergence Properties of a Computational Learning Model for Unknown Markov Chains." Proceedings of the ASME 2008 Dynamic Systems and Control Conference. ASME 2008 Dynamic Systems and Control Conference, Parts A and B. Ann Arbor, Michigan, USA. October 20–22, 2008. pp. 513-520. ASME. https://doi.org/10.1115/DSCC2008-2174
Download citation file:
4
Views
Related Proceedings Papers
Related Articles
Convergence Properties of a Computational Learning Model for Unknown Markov Chains
J. Dyn. Sys., Meas., Control (July,2009)
An Approach for Testing Methods for Modeling Uncertainty
J. Mech. Des (September,2006)
Why Do Verification and Validation?
J. Verif. Valid. Uncert (March,2016)
Related Chapters
Model-Building for Robust Reinforcement Learning
Intelligent Engineering Systems through Artificial Neural Networks, Volume 20
Decision Making in Two-Dimensional Warranty Planning (PSAM-0186)
Proceedings of the Eighth International Conference on Probabilistic Safety Assessment & Management (PSAM)
An Approach for System Development Using Evolutionary Probabilistic Strategy and Grammar Rules
Intelligent Engineering Systems through Artificial Neural Networks, Volume 16