Multi-view reinforcement learning for sequential decision-making with insufficient state information

Authors:

Li, Min ^[1] | Zhu, William ^[2] | Wang, Shiping ^[3] (Scholars: 王石平)

Indexed by：

EI Scopus SCIE

Abstract：

Most reinforcement learning methods describe sequential decision-making as a Markov decision process, in which the effect of an action is determined solely by the current state. This assumption is reasonable only if the state is correctly defined and the state information is sufficiently observed. Thus, the learning efficiency of reinforcement learning methods based on the Markov decision process is limited when the state information is insufficient. The partially observable Markov decision process and the history-based decision process have each been proposed to describe sequential decision-making with insufficient state information. However, these two processes tend to ignore important information from the currently observed state. Therefore, the learning efficiency of reinforcement learning methods based on these two processes is also limited when the state information is insufficient. In this paper, we propose a multi-view reinforcement learning method to solve this problem. The motivation is that the interaction information between the agent and its environment should be considered from the views of history, present, and future to overcome the insufficiency of state information. Based on these views, we construct a multi-view decision process to describe sequential decision-making with insufficient state information. A multi-view reinforcement learning method is proposed by combining the multi-view decision process and the actor-critic framework. In the proposed method, multi-view clustering is performed to ensure that each type of sample can be sufficiently exploited. Experiments illustrate that the proposed method is more effective than the compared state-of-the-art methods. The source code can be downloaded from https://github.com/jamieliuestc/MVRL.

Keywords:

Insufficient state information | Multi-view clustering | Multi-view decision process | Multi-view reinforcement learning

Affiliations:

[ 1 ] [Li, Min]Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Peoples R China
[ 2 ] [Zhu, William]Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Peoples R China
[ 3 ] [Wang, Shiping]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China

Reprint Address:

[Wang, Shiping]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China

Email：

shipingwangphd@163.com

Version：

Multi-view reinforcement learning for sequential decision-making with insufficient state information
2024，International Journal of Machine Learning and Cybernetics
Multi-view reinforcement learning for sequential decision-making with insufficient state information
2023，International Journal of Machine Learning and Cybernetics

Related Keywords：

Contrastive Consensus Graph Learning for Multi-View Clustering
2022，IEEE-CAA JOURNAL OF AUTOMATICA SINICA
An overview of recent multi-view clustering
2020，NEUROCOMPUTING
Cross-View Fusion for Multi-View Clustering
2025，IEEE SIGNAL PROCESSING LETTERS
Diversity embedding deep matrix factorization for multi-view clustering
2022，INFORMATION SCIENCES

Source ：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS

ISSN： 1868-8071

Year： 2023

Issue： 4

Volume： 15

Page： 1533-1552

JCR@2023: 3.1

JCR Journal Grade：2

CAS Journal Grade：3

ESI Highly Cited Papers on the List: 0

Affiliated Colleges：

College of Computer and Data Science / College of Software (data not explicitly attributed to a department within the college)
