Intelligent agents are becoming increasingly important in our society in applications as diverse as house cleaning robots, computer-controlled opponents in video games, unmanned aerial combat vehicles, entertainment robots, and autonomous explorers in outer space. However, the broader adoption of intelligent agents is often hindered by their limited adaptability to new tasks; when conditions change slightly, agents may quickly become confused. Additionally, a substantial engineering effort is required to design an agent for each new task. This paper presents an adaptable, general purpose intelligent agent toolkit based on reinforcement learning (RL), an approach with strong mathematical foundations and intriguing biological implications. RL algorithms are powerful because of their generality: agents simply receive a scalar reward value representing success or failure, which greatly simplifies the agent design process. Furthermore, these algorithms can be combined with other techniques (e.g., planning from a learned internal model) to improve learning efficiency. The design and implementation of an open source RL toolkit is presented here as a step towards the goal of general purpose agents. Experimental results show learning performance on several tasks, including two physical control problems.
Skip Nav Destination
Close
Sign In or Register for Account
ASME 2006 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference
September 10–13, 2006
Philadelphia, Pennsylvania, USA
Conference Sponsors:
- Design Engineering Division and Computers and Information in Engineering Division
ISBN:
0-7918-4255-X
PROCEEDINGS PAPER
Verve: A General Purpose Open Source Reinforcement Learning Toolkit
Tyler Streeter,
Tyler Streeter
Iowa State University, Ames, IA
Search for other works by this author on:
James Oliver,
James Oliver
Iowa State University, Ames, IA
Search for other works by this author on:
Adrian Sannier
Adrian Sannier
Arizona State University, Tempe, AZ
Search for other works by this author on:
Tyler Streeter
Iowa State University, Ames, IA
James Oliver
Iowa State University, Ames, IA
Adrian Sannier
Arizona State University, Tempe, AZ
Paper No:
DETC2006-99651, pp. 359-369; 11 pages
Published Online:
June 3, 2008
Citation
Streeter, T, Oliver, J, & Sannier, A. "Verve: A General Purpose Open Source Reinforcement Learning Toolkit." Proceedings of the ASME 2006 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference. Volume 1: 32nd Design Automation Conference, Parts A and B. Philadelphia, Pennsylvania, USA. September 10–13, 2006. pp. 359-369. ASME. https://doi.org/10.1115/DETC2006-99651
Download citation file:
- Ris (Zotero)
- Reference Manager
- EasyBib
- Bookends
- Mendeley
- Papers
- EndNote
- RefWorks
- BibTex
- ProCite
- Medlars
Close
Sign In
4
Views
0
Citations
Related Proceedings Papers
Related Articles
Discussion of “A Review of Propulsion, Power, and Control Architectures for Insect-Scale Flapping Wing Vehicles” by E. F. Helbling and R. J. Wood (Helbling, E. F., and Wood, R. J., 2018, ASME Appl. Mech. Rev., 70(1), p. 010801)
Appl. Mech. Rev (January,2018)
On the Joint Velocity Jump for Redundant Robots in the Presence of Locked-Joint Failures
J. Mech. Des (October,2008)
TMFSLAM—Design Analysis Tool for Coated Structures
J. Eng. Gas Turbines Power (April,1992)
Related Chapters
Scalability of Abinit on BlueGene/L for Identifying the Band Structure for Nanotechnology Materials
International Conference on Advanced Computer Theory and Engineering (ICACTE 2009)
History Repeated Full Cycle into the Unfamiliar
The Code: An Authorized History of the ASME Boiler and Pressure Vessel Code
Pricing and Bidding Strategies
Natural Negotiation for Engineers and Technical Professionals