Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

Journal of Beijing University of Posts and Telecommunications ›› 2022, Vol. 45 ›› Issue (6): 126-130.

Previous Articles     Next Articles

Resource Allocation Based on Alternating Direction Multiplier Method and Deep Reinforcement Learning Algorithm

  

  • Received:2022-03-01 Revised:2022-07-06 Online:2022-12-28 Published:2022-11-24
  • Supported by:
    Open project of provincial and ministerial key laboratories;National Natural Science Foundation of China

Abstract: In order to optimize resource allocation of dense network under limited channel state information, a model-driven learning framework combined with alternating direction method of multipliers, as well as deep reinforcement learning algorithm, is proposed. This framework differs from data-driven ones, which enables one-to-one modeling of specific problems. The steps on how to model resource allocation include: alternately optimizing base station selection, power, and subcarrier allocation with alternating direction method of multipliers; using deep reinforcement learning algorithm to optimize weights, solve target functions and improve performance of the system; using effective channel state information instead of redundant information to reduce overhead on communication; adding constraints on users’ quality of service requirements to maximize cell spectral efficiency while ensuring user experience, which can maximize the spectral efficiency of the cell while ensuring users’ experience. The simulation results show that the model-driven learning framework can converge in a small number of iterations.

Key words: dense network, model-driven, resource allocation, deep reinforcement learning, alternating direction multiplier method

CLC Number: