A Fast Online Planning Under Partial Observability Using Information Entropy Rewards

Indexed by：

EI Scopus SCIE

Abstract：

Motion planning in an unknown environment is a common challenge because of the existing uncertainties. Representatively, the partially observable Markov decision process (POMDP) is a general mathematical framework for planning in uncertain environments. Recent POMDP solvers generally adopt the sparse reward scheme to solve the planning under uncertainty problem. Subsequently, the robot's exploration may be hindered without immediate rewards, resulting in excessively long planning time. In this article, a POMDP method, information entropy determinized sparse partially observable tree (IE-DESPOT), is proposed to explore a high-quality solution and efficient planning in unknown environments. First, a novel sample method integrating state distribution and Gaussian distribution is proposed to optimize the quality of the sampled states. Then, an information entropy based on sampled states is established for real-time reward calculation, resulting in the improvement of robot exploration efficiency. Moreover, the near-optimality and convergence of the proposed algorithm are analyzed. As a result, compared with general-purpose POMDP solvers, the proposed algorithm exhibits fast convergence to a near-optimal policy in many examples of interest. Furthermore, the IE-DESPOT's performance is verified in real mobile robot experiments.

Keyword：

Convergence efficiency; Entropy; Informatics; Information entropy; information entropy reward; Markov processes; mobile robot; Mobile robots; partially observable Markov decision process (POMDP); Planning; planning under uncertainty; Robots; Task analysis; Uncertainty; Upper bound

Affiliations：

[ 1 ] [Chen, Yanjie]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Peoples R China
[ 2 ] [Liu, Jiangjiang]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Peoples R China
[ 3 ] [Lan, Limin]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Peoples R China
[ 4 ] [Chen, Yanjie]Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Wales
[ 5 ] [Chen, Yanjie]Natl Engn Res Ctr Robot Visual Percept & Control T, Changsha 410082, Peoples R China
[ 6 ] [Zhang, Hui]Hunan Univ, Sch Robot, Changsha 410082, Peoples R China
[ 7 ] [Miao, Zhiqiang]Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China
[ 8 ] [Wang, Yaonan]Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China

Version：

A Fast Online Planning Under Partial Observability Using Information Entropy Rewards
2023，IEEE Transactions on Industrial Informatics

Related Articles：

Research on channel state prediction algorithm under multi-radio multi-channel environment in heterogeneous networks
2010，Journal of Electronics and Information Technology
Multiple Mobile Robots Planning Framework for Herding Non-Cooperative Target
2023，IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING
High-efficiency online planning using composite bounds search under partial observation
2022，APPLIED INTELLIGENCE
SET: Sampling-Enhanced Exploration Tree for Mobile Robot in Restricted Environments
2023，IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS

Source ：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS

ISSN： 1551-3203

Year： 2023

Issue： 12

Volume： 19

Page： 11596-11607

JCR@2023： 11.7

JCR Journal Grade：1

CAS Journal Grade：1

Cited Count：

WoS CC Cited Count： 1

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

School of Mechanical Engineering and Automation (data not explicitly attributed to a unit within this school/department)
