Robust Factored Markov Decision Processes

讲座简介


Over the last few decades, Markov Decision Processes (MDPs) have been used as the basic semantics for optimal planning for decision-theoretic agents in stochastic environments. Factored MDPs are an approach to represent large MDPs compactly by exploiting the internal structure. However, even when a large MDP can be represented compactly by using a factored representation, solving it exactly is still intractable. In this project, we propose a semi-infinite method to solve the factored MDPs. A central element of our method is a novel linearization technique, which reformulates an integer program (IP) to a provably equivalent, small-size mixed-integer linear program (MILP). Our numerical experiments so far indicate that we can solve problems multiple times larger than the state of the art. Besides, we take the uncertainty of transition kernel in factored MDPs into account and apply the same method to solve it. Numerical results show our proposed method still works for the robust factored MDPs.

时间


2021-05-12

上午 9:00 ~ 11:00

主讲人


Huikang Liu, Imperial College London

地点



腾讯会议:915 827 775

会议密码:123456

会议链接:

https://meeting.tencent.com/s/eke0cuP3fkOH