基于多智能体近端策略优化的多信道动态频谱接入 - Details

author：

Chen, Ping-Ping (Chen, Ping-Ping.) ^[1] | Zhang, Xu (Zhang, Xu.) ^[2] | Xie, Zhao-Peng (Xie, Zhao-Peng.) ^[3] | Qiu, Yu-Ping (Qiu, Yu-Ping.) ^[4] | Fang, Yi (Fang, Yi.) ^[5]

Indexed by：

Abstract：

To　enhance　communication　efficiency　and　ensure　user　fairness　in　multi-user　multi-channel　communication　scenarios,　based　on　multi-agent　proximal　policy　optimization　(MAPPO)　for　the　application　of　dynamic　spectrum　access　(DSA)　technology,　this　paper　proposes　the　MAPPO-DSA　algorithm.　The　algorithm　addresses　the　issue　of　spectrum　waste　in　single-channel　access　when　multiple　channels　are　simultaneously　idle　by　using　multi-channel　access　as　a　solution.　However,　multi-channel　access　leads　to　an　exponential　increase　in　the　state　and　action　spaces,　resulting　in　high　computational　costs　and　learning　difficulties.　To　tackle　this,　the　paper　introduces　the　MAPPO　deep　reinforcement　learning　(DRL)　algorithm　to　efficiently　learn　and　optimize　access　strategies　in　complex　environments.　The　design　of　MAPPO　incorporates　reinforcement　learning　elements　such　as　observation　and　reward,　as　well　as　shared　network　parameters　to　ensure　user　fairness.　Experimental　results　in　different　scenarios　demonstrate　that　the　proposed　MAPPO-DSA　algorithm　can　learn　near-optimal　access　strategies,　and　approach　the　theoretical　throughput　limit　in　some　scenarios,　outperforming　the　existing　algorithms　significantly　and　effectively　ensuring　user　fairness.　©　2024　Chinese　Institute　of　Electronics.　All　rights　reserved.

Keyword：

Deep learning Multi agent systems Reinforcement learning

Community：

[ 1 ] [Chen, Ping-Ping]School of Advanced Manufacturing, Fuzhou University, Fujian, Jinjiang; 362251, China
[ 2 ] [Zhang, Xu]School of Advanced Manufacturing, Fuzhou University, Fujian, Jinjiang; 362251, China
[ 3 ] [Xie, Zhao-Peng]School of Advanced Manufacturing, Fuzhou University, Fujian, Jinjiang; 362251, China
[ 4 ] [Qiu, Yu-Ping]College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou; 350108, China
[ 5 ] [Fang, Yi]School of Information Engineering, Guangdong University of Technology, Guangdong, Guangzhou; 510006, China

Reprint 's Address：

Email：

Show more details

Related Keywords：

Active Distribution Network Reconfiguration with Renewable Energy Based on Multi-agent Deep Reinforcement Learning
2023，6th International Conference on Energy, Electrical and Power Engineering, CEEPE 2023
The Action Selector in the Deep Q-learning Applied in a Multi-agent Economic System
2021，2021 IEEE International Conference on Artificial Intelligence and Computer Applications, ICAICA 2021
Towards optimally decentralized multi-robot collision avoidance via deep reinforcement learning
2018，2018 IEEE International Conference on Robotics and Automation, ICRA 2018
Review of Power System Transient Stability Control Strategies Based on Deep Reinforcement Learning
2023，High Voltage Engineering
Review of Active Distribution Network Dynamic Reconfiguration Based on Deep Reinforcement Learning
2025，High Voltage Engineering

Source ：

电子学报

ISSN： 0372-2112

Year： 2024

Issue： 6

Volume： 52

Page： 1824-1831

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to