
Author:

Li, M. [1] | Zhu, W. [2] | Wang, S. [3]

Indexed by:

Scopus

Abstract:

Most reinforcement learning methods describe sequential decision-making as a Markov decision process, in which the effect of an action is determined only by the current state. However, this is reasonable only if the state is correctly defined and the state information is sufficiently observed. The learning efficiency of reinforcement learning methods based on the Markov decision process is therefore limited when the state information is insufficient. The partially observable Markov decision process and the history-based decision process have been proposed to describe sequential decision-making with insufficient state information, but both tend to ignore important information in the currently observed state, so the learning efficiency of reinforcement learning methods based on these two processes is likewise limited when the state information is insufficient. In this paper, we propose a multi-view reinforcement learning method to solve this problem. The motivation is that the interaction between the agent and its environment should be considered from the views of history, present, and future to overcome the insufficiency of state information. Based on these views, we construct a multi-view decision process to describe sequential decision-making with insufficient state information, and propose a multi-view reinforcement learning method by combining the multi-view decision process with the actor-critic framework. In the proposed method, multi-view clustering is performed to ensure that each type of sample can be sufficiently exploited. Experiments show that the proposed method is more effective than the state-of-the-art methods it is compared with. The source code can be downloaded from https://github.com/jamieliuestc/MVRL. © 2023, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
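The abstract's core idea can be illustrated with a minimal sketch. This is not the authors' implementation (see their repository for that); it is a hedged toy in plain NumPy showing how a "history" view, a "present" view, and a "future" view of the interaction might be concatenated into one state vector that a simple linear actor-critic consumes. All function names, the decay parameter, and the linear dynamics model are illustrative assumptions.

```python
import numpy as np

# Illustrative sketch (NOT the paper's code): build a multi-view state from
# three views of the agent-environment interaction, then feed it to a tiny
# linear actor-critic. Dimensions and the dynamics model are placeholders.

rng = np.random.default_rng(0)
OBS_DIM, N_ACTIONS = 4, 2

def history_view(obs_seq, decay=0.9):
    """Exponentially decayed summary of past observations (history view)."""
    h = np.zeros(OBS_DIM)
    for o in obs_seq:
        h = decay * h + (1 - decay) * o
    return h

def future_view(obs, W_model):
    """Predicted next observation from an assumed linear model (future view)."""
    return W_model @ obs

def multi_view_state(obs_seq, obs, W_model):
    """Concatenate the history, present, and future views into one state."""
    return np.concatenate([history_view(obs_seq), obs, future_view(obs, W_model)])

# Linear actor-critic over the concatenated state (3 views x OBS_DIM dims).
STATE_DIM = 3 * OBS_DIM
W_actor = rng.normal(scale=0.1, size=(N_ACTIONS, STATE_DIM))
w_critic = np.zeros(STATE_DIM)       # critic weights, untrained here
W_model = np.eye(OBS_DIM)            # placeholder one-step dynamics model

def policy(s):
    """Softmax policy over actions given the multi-view state."""
    logits = W_actor @ s
    p = np.exp(logits - logits.max())
    return p / p.sum()

obs_seq = [rng.normal(size=OBS_DIM) for _ in range(5)]
s = multi_view_state(obs_seq[:-1], obs_seq[-1], W_model)
probs = policy(s)
value = float(w_critic @ s)
print(s.shape, round(probs.sum(), 6), value)
```

In the paper the views would be learned and the samples grouped by multi-view clustering before being exploited by the actor-critic; here the point is only the structural idea of augmenting the possibly insufficient current observation with history and predicted-future information.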

Keyword:

Insufficient state information; Multi-view clustering; Multi-view decision process; Multi-view reinforcement learning

Community:

  • [ 1 ] [Li M.]Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
  • [ 2 ] [Zhu W.]Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
  • [ 3 ] [Wang S.]College of Computer and Data Science, Fuzhou University, Fuzhou, China

Source:

International Journal of Machine Learning and Cybernetics

ISSN: 1868-8071

Year: 2023

Issue: 4

Volume: 15

Page: 1533-1552

Impact Factor: 3.100 (JCR@2023)

JCR Journal Grade:2

CAS Journal Grade:3

ESI Highly Cited Papers on the List: 0
