TLdR: Policy Summarization for Factored SSP Problems Using Temporal Abstractions

Sarath Sreedharan, Siddharth Srivastava, Subbarao Kambhampati

PosterID: 26

As more and more people are expected to work with complex AI-systems, it becomes more important than ever that such systems provide intuitive explanations for their decisions. A prerequisite for holding such explanatory dialogue is the ability of the systems to present their proposed decisions to the user in an easy-to-understand form. Unfortunately, such dialogues could become hard to facilitate in real-world problems where the system may be planning for multiple eventualities in stochastic environments. This means for the system to be effective, it needs to be able to present the policy at a high-level of abstraction and delve into details as required. Towards this end, we investigate the utility of temporal abstractions derived through analytically computed landmarks and their relative ordering to build a summarization of policies for {\em Stochastic Shortest Path Problems}. We formalize the concept of policy landmarks and show how it can be used to provide a high level overview of a given policy. Additionally, we establish the connections between the type of hierarchy we generate and previous works in temporal abstractions, specifically MaxQ hierarchies. Our approach is evaluated through user studies as well as empirical metrics that establish that people tend to choose landmarks facts as subgoals to summarize policies and demonstrates the performance of our approach on standard benchmarks.

Session Am1: Explainable Planning

Canb 10/28/2020, 03:00 – 04:00

10/30/2020, 10:00 – 11:00

Paris 10/27/2020, 17:00 – 18:00

10/30/2020, 00:00 – 01:00

NYC 10/27/2020, 12:00 – 13:00

10/29/2020, 19:00 – 20:00

LA 10/27/2020, 09:00 – 10:00

10/29/2020, 16:00 – 17:00

D3WA+ A Case Study of XAIP in a Model Acquisition Task for Dialogue Planning

Sarath Sreedharan, Tathagata Chakraborti, Christian Muise, Yasaman Khazaeni, Subbarao Kambhampati

TLdR: Policy Summarization for Factored SSP Problems Using Temporal Abstractions

Sarath Sreedharan, Siddharth Srivastava, Subbarao Kambhampati

Generating Explanations for Temporal Logic Planner Decisions

Daniel Kasenberg, Ravenna Thielstrom, Matthias Scheutz

RADAR: Automated Task Planning for Proactive Decision Support

Sachin Grover, Sailik Sengupta, Tathagata Chakraborti, Aditya Prakash Mishra, Subbarao Kambhampati