期刊文献+

A Fast Approximation Method for Partially Observable Markov Decision Processes 被引量:3

A Fast Approximation Method for Partially Observable Markov Decision Processes
原文传递
导出
摘要 This paper develops a new lower bound method for POMDPs that approximates the update of a belief by the update of its non-zero states.It uses the underlying MDP to explore the optimal reachable state space from initial belief and select actions during value iterations,which significantly accelerates the convergence speed.Also,an algorithm which collects and prunes belief points based on the upper and lower bounds is presented,and experimental results show that it outperforms some of the state-of-art point-based algorithms. This paper develops a new lower bound method for POMDPs that approximates the update of a belief by the update of its non-zero states. It uses the underlying MDP to explore the optimal reachable state space from initial belief and select actions during value iterations, which significantly accelerates the convergence speed. Also, an algorithm which collects and prunes belief points based on the upper and lower bounds is presented, and experimental results show that it outperforms some of the state-of-art point-based algorithms.
出处 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2018年第6期1423-1436,共14页 系统科学与复杂性学报(英文版)
基金 supported in part by the National High-tech R&D Program of China under Grant No.2014AA06A503 the National Natural Science Foundation of China under Grant Nos.61422307,61673361 and 61725304 the Scientific Research Starting Foundation for the Returned Overseas Chinese Scholars and Ministry of Education of China
关键词 LOWER BOUND point-based POMDP Lower bound point-based POMDP
  • 相关文献

同被引文献25

引证文献3

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部