[1] Bell N, Dalton S, Olson L N.Exposing fine-grained parallelism in algebraic multigrid methods[J].SIAM Journal on Scientific Computing, 2012, 34(4):123-152.
[2] Buluç A, Gilbert J R.The combinatorial BLAS:design, implementation, and applications[J].International Journal of High Performance Computing Applications, 2011, 25(4):496-509.
[3] Yuan Tao, Huang Zhibin.Shuffle reduction based sparse matrix-vector multiplication on Kepler GPU[J].International Journal of Grid and Distributed Computing, 2016, 9(10):99-106.
[4] Greathouse J L, Daga M.Efficient sparse matrix-vector multiplication on GPUs using the CSR storage format[C]//Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis.New York:IEEE Press, 2014:769-780.
[5] 菅立恒, 易卫东.使用GPU加速无线传感器网络信道仿真[J].北京邮电大学学报, 2013, 36(2):24-27.Jian Liheng, Yi Weidong.Acceleration of simulation of radio channel in wireless sensor networks using GPU[J].Journal of Beijing University of Posts and Telecommunications, 2013, 36(2):24-27.
[6] Liu Weifeng, Vinter B.An efficient GPU general sparse matrix-matrix multiplication for irregular data[C]//2014 IEEE 28th International Parallel and Distributed Processing Symposium.New York:IEEE Press, 2014:370-381.
[7] 黄智濒, 周锋, 马华东.自适应访问模式的缓存替换策略[J].北京邮电大学学报, 2016, 39(3):44-48.Huang Zhibin, Zhou Feng, Ma Huadong.A cache replacement policy adapting to the request access pattern[J].Journal of Beijing University of Posts and Telecommunications, 2016, 39(3):44-48.
[8] Liu Junhong, He Xin, Liu Weifeng, et al.Register-based implementation of the sparse general matrix-matrix multiplication on GPUs[J].ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming.New York:ACM, 2018:407-408.
[9] Anh P N Q, Fan Rui, Wen Yonggang.Balanced Hashing and efficient GPU sparse general matrix-matrix multiplication[C]//Proceedings of the 2016 International Conference on Supercomputing.New York:ACM, 2016:36.
[10] Dalton S, Bell N, Olson L, et al.CUSP:generic parallel algorithms for sparse matrix and graph computations:Version 0.5[EB/OL].(2015-03-13)[2018-05-30].https://cusplibrary.github.io.
[11] Batcher K E.Sorting networks and their applications[C]//Spring Joint Computer Conference.New York:ACM, 1968:307-314.
[12] Davis T A, Hu Yifan.The University of Florida sparse matrix collection[J].ACM Transactions on Mathematical Software, 2011, 38(1):1. |