[1] Araghi S, Khosravi A, Creighton D. A review on computational intelligence methods for controlling traffic signal timing[J]. Expert Systems with Applications, 2015, 42(3):1538-1550. [2] Sims A G, Finlay A B. SCATS, splits and offsets simplified (SOS)[J]. Australian Road Research, 1984, 12(4):17-33. [3] Lo H K. A reliability framework for traffic signal control[J]. IEEE Transactions on Intelligent Transportation Systems, 2006, 7(2):250-260. [4] Abdulhai B, Pringle R, Karakoulas G J. Reinforcement learning for true adaptive traffic signal control[J]. Journal of Transportation Engineering, 2003, 129(3):278-285. [5] Genders W, Razavi S. Using a deep reinforcement learning agent for traffic signal control[EB/OL]. (2016-11-03)[2020-01-10]. https://arXiv.org/abs/1611.01142. [6] Richter S, Aberdeen D, Yu J. Natural actor-critic for road traffic optimisation[C]//NIPS 2006, Proceedings of the 19th International Conference on Neural Information Processing Systems. Vancouver:MIT Press, 2006:1169-1176. [7] Pang Hali, Gao Weilong. Deep deterministic policy gradient for traffic signal control of single intersection[C]//2019 Chinese Control and Decision Conference (CCDC). Nanchang:IEEE, 2019:5861-5866. [8] Casas N. Deep deterministic policy gradient for urban traffic light control[EB/OL]. (2017-08-02)[2020-01-10]. https://arXiv.org/abs/1703.09035v1. [9] Lillicrap T P, Hunt J J, Pritzel A, et al. Continuous control with deep reinforcement learning[C]//ICLR 2016:International Conference on Learning Representations. San Juan:[s.n.], 2016:1-14. [10] Schaul T, Quan J, Antonoglou I, et al. Prioritized experience replay[C]//ICLR 2016:International Conference on Learning Representations. San Juan:[s.n.], 2016:1-21. [11] Wang Xiaoqiang, Ke Liangjun, Qiao Zhimin, et al. Large-scale traffic signal control using a novel multi-agent reinforcement learning[J]. IEEE Transactions on Cybernetics, 2021, 51(1):174-187. [12] Yang Shantian, Yang Bo, Wong Hau-San, et al. Cooperative traffic signal control using multi-step return and off-policy asynchronous advantage actor-critic graph algorithm[J]. Knowledge-Based Systems, 2019, 183:1-19. [13] Wei Hua, Zheng Guanjie, Yao Huaxiu, et al. Intellilight:a reinforcement learning approach for intelligent traffic light control[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York:ACM, 2018:2496-2505. [14] Zhang Zhi, Yang Jiachen, Zha Hongyuan. Integrating independent and centralized multi-agent reinforcement learning for traffic signal network optimization[C]//AAMAS 2020:Proceedings of the Nineteenth International Conference on Autonomous Agents and Multi-Agent Systems. Auckland:Springer, 2020:2083-2085. |