[1] RUI L L, QIU X S, LI W J, et al. Autonomic management architecture and its application in mobile self-organizing network[J]. Journal of Beijing University of Posts and Telecommunications, 2011, 34(S1):63-67, 72. 芮兰兰, 邱雪松, 李文璟, 等. 面向移动自组织网络的自主管理架构及应用[J]. 北京邮电大学学报, 2011, 34(S1):63-67, 72.
[2] ZHAN G Z, CHEN L X, FANG W J, et al. Research on optimal routing of metro mesh system based on Dijkstra algorithm[J]. Telecom Engineering Technics and Standardization, 2020, 33(6):77-81. 湛广志, 陈銮雄, 方伟津, 等. 基于Dijkstra算法的城域mesh系统最佳路由研究[J]. 电信工程技术与标准化, 2020, 33(6):77-81.
[3] CAO X F, ZHANG J. Research on WSN node routing optimization strategy based on improved genetic algorithm[J]. Yinshan Academic Journal, 2018, 32(4):66-70. 操晓峰, 张健. 一种基于改进遗传算法的WSN节点路由优化策略[J]. 阴山学刊, 2018, 32(4):66-70.
[4] JIANG Z, ZHANG H K, ZHANG L Y. The application of genetic algorithm in multicast routing of multimedia stream[J]. Journal of Beijing University of Posts and Telecommunications, 2004, 27(2):39-43. 姜圳, 张宏科, 张礼勇. 基于遗传算法的流媒体组播路由选择方法[J]. 北京邮电大学学报, 2004, 27(2):39-43.
[5] SUTTON R S, BARTO A G. Reinforcement learning: an introduction[M]. Cambridge: MIT Press, 1998:225-228.
[6] MAMMERI Z. Reinforcement learning based routing in networks: review and classification of approaches[J]. IEEE Access, 2019, 7:55916-55950.
[7] LI Y, WANG F, JING D S, et al. A Q-learning based routing approach for wireless sensor network[J]. Computing Technology and Automation, 2017, 36(2):155-160. 李荥, 王芳, 景栋盛, 等. 一种基于Q学习的无线传感网络路由方法[J]. 计算技术与自动化, 2017, 36(2):155-160.
[8] ZHOU Y, CAO T, XIANG W. Anypath routing protocol design via Q-learning for underwater sensor networks[J]. IEEE Internet of Things Journal, 2021, 8(10):8173-8190.
[9] ZHANG Y, ZHANG Z M, CHEN L, et al. Reinforcement learning-based opportunistic routing protocol for underwater acoustic sensor networks[J]. IEEE Transactions on Vehicular Technology, 2021, 70(3):2750-2770.
[10] DHURANDHER S K, SINGH J, OBAIDAT M S, et al. Reinforcement learning-based routing protocol for opportunistic networks[C]//ICC 2020. Dublin: IEEE Press, 2020:1-6.
[11] DING R J, XU Y D, GAO F F, et al. Deep reinforcement learning for router selection in network with heavy traffic[J]. IEEE Access, 2019, 7:37109-37120.
[12] GUO X C, LIN H, LI Z Y, et al. Deep reinforcement learning based QoS-aware secure routing for SDN-IoT[J]. IEEE Internet of Things Journal, 2020, 7(7):6242-6251.