Behavior policy learning: Learning multi-stage tasks via solution sketches and model-based controllers

Tsinganos, Konstantinos and Chatzilygeroudis, Konstantinos and Hadjivelichkov, Denis and Komninos, Theodoros and Dermatas, Evangelos and Kanoulas, Dimitrios (2022) Behavior policy learning: Learning multi-stage tasks via solution sketches and model-based controllers. Frontiers in Robotics and AI, 9. ISSN 2296-9144

[thumbnail of pubmed-zip/versions/2/package-entries/frobt-09-974537-r1/frobt-09-974537.pdf] Text
pubmed-zip/versions/2/package-entries/frobt-09-974537-r1/frobt-09-974537.pdf - Published Version

Download (2MB)

Abstract

Multi-stage tasks are a challenge for reinforcement learning methods, and require either specific task knowledge (e.g., task segmentation) or big amount of interaction times to be learned. In this paper, we propose Behavior Policy Learning (BPL) that effectively combines 1) only few solution sketches, that is demonstrations without the actions, but only the states, 2) model-based controllers, and 3) simulations to effectively solve multi-stage tasks without strong knowledge about the underlying task. Our main intuition is that solution sketches alone can provide strong data for learning a high-level trajectory by imitation, and model-based controllers can be used to follow this trajectory (we call it behavior) effectively. Finally, we utilize robotic simulations to further improve the policy and make it robust in a Sim2Real style. We evaluate our method in simulation with a robotic manipulator that has to perform two tasks with variations: 1) grasp a box and place it in a basket, and 2) re-place a book on a different level within a bookcase. We also validate the Sim2Real capabilities of our method by performing real-world experiments and realistic simulated experiments where the objects are tracked through an RGB-D camera for the first task.

Item Type: Article
Subjects: Apsci Archives > Mathematical Science
Depositing User: Unnamed user with email support@apsciarchives.com
Date Deposited: 24 Jun 2023 06:18
Last Modified: 16 Sep 2023 05:42
URI: http://eprints.go2submission.com/id/eprint/1389

Actions (login required)

View Item
View Item