Giuseppe De Giacomo, Marco Favorito, Luca Iocchi, Fabio Patrizi |
PosterID:
59
PDF
Slides
Poster
BibTeX
|
A common problem in Reinforcement Learning (RL) is that the reward function is hard to express. This can be overcome by resorting to Inverse Reinforcement Learning (IRL), which consists in first obtaining a reward function from a set of execution traces generated by the expert agent, and then making the learning agent learn the expert's behavior --this is known as Transfer Learning (TL). Typical IRL solutions rely on a numerical representation of the reward function, which raises problems related to the adopted optimization procedures. We describe a TL method where the execution traces generated by the expert agent, possibly via planning, are used to produce a logical (as opposed to numerical) specification of the reward function, to be incorporated in a device known as Restraining Bolt (RB). The RB can be attached to the learning agent to drive the learning process and ultimately make it imitate the expert. We show that TL can be applied to heterogeneous agents, with the expert, the learner and the RB using different representations of the environment's actions and states, without specifying mappings among their representations. |
Canb | 10/29/2020, 00:00 – 01:00 |
10/29/2020, 18:00 – 19:00 |
Paris | 10/28/2020, 14:00 – 15:00 |
10/29/2020, 08:00 – 09:00 |
NYC | 10/28/2020, 09:00 – 10:00 |
10/29/2020, 03:00 – 04:00 |
LA | 10/28/2020, 06:00 – 07:00 |
10/29/2020, 00:00 – 01:00 |
Learning Sequences of Approximations for Hierarchical Motion Planning
Martim Brandão, Ioannis Havoutis
Joint Inference of Reward Machines and Policies for Reinforcement Learning
Zhe Xu, Ivan Gavran, Yousef Ahmad, Rupak Majumdar, Daniel Neider, Ufuk Topcu, Bo Wu
Imitation Learning over Heterogeneous Agents with Restraining Bolts
Giuseppe De Giacomo, Marco Favorito, Luca Iocchi, Fabio Patrizi
Refining Process Descriptions from Execution Data in Hybrid Planning Domain Models
Alan Lindsay, Santiago Franco, Rubiya Reba, Thomas L. McCluskey