北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2010, Vol. 33 ›› Issue (6): 112-115.doi: 10.13190/jbupt.201006.112.lixy

• 研究报告 • 上一篇    下一篇

多Agent系统中信任预测的SRL模型

李小勇,周锋,杨旭东,倪晖   

  1. 北京邮电大学 智能通信软件与多媒体北京市重点实验室, 北京 100876
  • 收稿日期:2009-10-10 修回日期:2010-05-23 出版日期:2010-12-28 发布日期:2011-01-07
  • 通讯作者: 李小勇 E-mail:lixiaoyong@bupt.edu.cn
  • 基金资助:

    国家自然科学基金项目(61003281); 中央高校基本科研业务项目(BUPT 2009RC0201)

SRLBased Trust Predicting Model Used in MultiAgent Systems

LI Xiaoyong   

  • Received:2009-10-10 Revised:2010-05-23 Online:2010-12-28 Published:2011-01-07
  • Contact: LI Xiaoyong E-mail:lixiaoyong@bupt.edu.cn

摘要:

针对多Agent系统(MAS)中信任关系管理的需求,将Sarsa强化学习(SRL)理论应用于构建MAS中基于Agent行为的信任关系预测模型. 首先根据Agent之间交互的时间顺序,构建了基于时间戳的行为状态空间结构,然后应用SRL理论,建立了基于直接可信度和反馈可信度相融合的总体信任关系预测模型. 新模型充分利用SRL理论较强的动态适应能力,解决了传统预测模型对环境的动态变化适应能力不足的问题. 累计误差方面的实验结果表明,与已有模型相比,新模型能显著提高信任决策的准确性.

关键词: 多Agent系统, 信任模型, Sarsa强化学习

Abstract:

Focusing on the requirement of trust management in multiAgent systems, the sarsa reinforcement learning (SRL) theory is applied to construct trust prediction model for multiAgent systems based on Agent’s behavior. First, basic formal description is conducted for trust decision, and behavior statespace structure is constructed based on timestamp according the interaction time sequence between network Agents. With SRL algorithm, overall trust relationship predicting model based on direct trust degree and feedback trust degree is proposed. The model makes full use of the advantages of the strong dynamic adaptive capacity of the SRL algorithm, brakes away from the inadequate dynamic adaptive capacity in the traditional software trust modeling process. Simulation in cumulative errors shows that, compared to the existing models the new model has remarkable enhancements in the trust decision accuracy.

Key words: multiAgent systems, trust model, Sarsa reinforcement learning