Journal of Beijing University of Posts and Telecommunications

  • EI核心期刊

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2014, Vol. 37 ›› Issue (1): 80-84.doi: 10.13190/j.jbupt.2014.01.018

• Reports • Previous Articles     Next Articles

Cognitive Radio Resource Allocation by Clustering Multi-Agent Enforcement Learning

WU Chun1,2, JIANG Hong2, YI Ke-chu1   

  1. 1. State Key Laboratory of Integrated Service Networks, Xidian University, Xi'an 710071, China;
    2. School of National Defense Technology, Southwest University of Science and Technology, Sichuan Mianyang 621000, China
  • Received:2013-03-13 Online:2014-02-28 Published:2014-01-07

Abstract:

A multi-agent enforcement learning method based on user clustering as well as a variable learning rate was proposed for solving the problem of channel allocation and power control within multi cognitive radio users. Firstly, a hierarchy processing method was used to separate channel selection and power control. The channel allocation was implemented by fast optimal search combined with user-number balance. Secondly, stochastic game framework was adopted to model the multiuser power control issue. In subsequent multi-agent enforcement learning, K-means user clustering method was employed to reduce the user number in game and single user's environment complexity, and a variable learning rate scheme for Q learning and policy learning was proposed to promote the convergence of multiuser learning. Simulation shows that the method can make multiuser's power status and global reward converging effectively, moreover the whole performance can reach sub-optimal.

Key words: cognitive radio, multi-agent enforcement learning, clustering, power control, variable learning rate

CLC Number: