TY - GEN
T1 - A Deep Reinforcement Learning Algorithm for Objects Balance Control with Hexapod Robot
AU - Tsai, Yu Hao
AU - Saeedvand, Saeed
AU - Baltes, Jacky
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Legged robots have been a prominent focus of research for an extensive period, owing to their enhanced stability and maneuverability in challenging terrains compared to wheeled robots. In recent decades, the application of reinforcement learning (RL) to train legged robots has yielded excellent results. This approach has effectively addressed numerous challenges that traditional methods struggled to overcome, such as navigating through complex environments. The advancements in RL for legged robots have significantly improved the feasibility and success of demanding applications, including exploration and resource delivery in outdoor settings. This paper introduces a training architecture for a hexapod robot based on the implementation of proximal policy optimization (PPO) on a GPU. This architecture enables the hexapod to transport objects across uneven terrain surfaces while adhering to input control commands. Our investigation extends to assessing the performance limitations of the hexapod robot in both flat and uneven ground environments, with a particular focus on evaluating the impact of different activation functions.
AB - Legged robots have been a prominent focus of research for an extensive period, owing to their enhanced stability and maneuverability in challenging terrains compared to wheeled robots. In recent decades, the application of reinforcement learning (RL) to train legged robots has yielded excellent results. This approach has effectively addressed numerous challenges that traditional methods struggled to overcome, such as navigating through complex environments. The advancements in RL for legged robots have significantly improved the feasibility and success of demanding applications, including exploration and resource delivery in outdoor settings. This paper introduces a training architecture for a hexapod robot based on the implementation of proximal policy optimization (PPO) on a GPU. This architecture enables the hexapod to transport objects across uneven terrain surfaces while adhering to input control commands. Our investigation extends to assessing the performance limitations of the hexapod robot in both flat and uneven ground environments, with a particular focus on evaluating the impact of different activation functions.
KW - hexapod
KW - Isaac Gym
KW - objects carrying
KW - reinforcement learning
UR - https://www.scopus.com/pages/publications/105013678903
UR - https://www.scopus.com/pages/publications/105013678903#tab=citedBy
U2 - 10.1109/ICCRE65455.2025.11093566
DO - 10.1109/ICCRE65455.2025.11093566
M3 - Conference contribution
AN - SCOPUS:105013678903
T3 - 2025 10th International Conference on Control and Robotics Engineering, ICCRE 2025
SP - 34
EP - 39
BT - 2025 10th International Conference on Control and Robotics Engineering, ICCRE 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th International Conference on Control and Robotics Engineering, ICCRE 2025
Y2 - 9 May 2025 through 11 May 2025
ER -