Path tracking control of non-holonomic car-like robot with reinforcement learning

Jacky Baltes, Yuming Lin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

This paper investigates the use of reinforcement learning in solving the path-tracking problem for car-like robots. The reinforcement learner uses a case-based function approximator, to extend the standard reinforcement learning paradigm to handle continuous states. The learned controller performs comparable to the best traditional control functions in both simulation and also in practical driving.

Original languageEnglish
Title of host publicationRoboCup-99
Subtitle of host publicationRobot Soccer World Cup III
EditorsManuela Veloso, Enrico Pagello, Hiroaki Kitano
PublisherSpringer Verlag
Pages162-173
Number of pages12
ISBN (Print)9783540410430
DOIs
Publication statusPublished - 2000 Jan 1
Event3rd Robot World Cup Soccer Games and Conferences, RoboCup 1999 - Stockholm, Sweden
Duration: 1999 Jul 271999 Aug 6

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume1856
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other3rd Robot World Cup Soccer Games and Conferences, RoboCup 1999
CountrySweden
CityStockholm
Period99/7/2799/8/6

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Path tracking control of non-holonomic car-like robot with reinforcement learning'. Together they form a unique fingerprint.

  • Cite this

    Baltes, J., & Lin, Y. (2000). Path tracking control of non-holonomic car-like robot with reinforcement learning. In M. Veloso, E. Pagello, & H. Kitano (Eds.), RoboCup-99: Robot Soccer World Cup III (pp. 162-173). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1856). Springer Verlag. https://doi.org/10.1007/3-540-45327-x_12