[1] Hirschmuller H. Stereo processing by semiglobal match-ing and mutual information[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 30(2):328-341. [2] Furukawa Y, Ponce J. Accurate, dense, and robust multiview stereopsis[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 32(8):1362-1376. [3] Vu H-H, Labatut P, Pons J-P, et al. High accuracy and visibility-consistent dense multiview stereo[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 34(5):889-901. [4] Galliani S, Lasinger K, Schindler K. Massively parallel multiview stereopsis by surface normal diffusion[C]//IEEE International Conference on Computer Vision. Santiago:IEEE, 2015:873-881. [5] Schönberger J L, Zheng Enliang, Frahm J M, et al. Pixel-wise view selection for unstructured multi-view stereo[C]//European Conference on Computer Vision. Amsterdam:Springer, 2016:501-518. [6] Yao Yao, Luo Zixin, Li Shiwei, et al. MVSnet:depth inference for unstructured multi-view stereo[C]//European Conference on Computer Vision. Munich:Springer, 2018:767-783. [7] Yao Yao, Luo Zixin, Li Shiwei, et al. Recurrent MVSnet for high-resolution multi-view stereo depth inference[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach:IEEE, 2019:5525-5534. [8] Luo Keyang, Guan Tao, Ju Lili, et al. P-MVSnet:learning patch-wise matching confidence aggregation for multi-view stereo[C]//IEEE/CVF International Conference on Computer Vision. Long Beach:IEEE, 2019:10452-10461. [9] Aanæs H, Jensen R R, Vogiatzis G, et al. Large-scale data for multiple-view stereopsis[J]. International Journal of Computer Vision, 2016, 120(2):153-168. [10] Knapitsch A, Park J, Zhou Qianyi, et al. Tanks and temples:benchmarking large-scale scene reconstruction[J]. ACM Transactions on Graphics, 2017, 36(4):1-13. [11] Angelova A, Long P M. Benchmarking large-scale fine-grained categorization[C]//IEEE Winter Conference on Applications of Computer Vision. Steamboat Springs:IEEE, 2014:532-539. [12] Hartmann W, Galliani S, Havlena M, et al. Learned multi-patch similarity[C]//IEEE International Conference on Computer Vision. Venice:IEEE, 2017:1586-1594. [13] Ji Mengqi, Gall J, Zheng Haitian, et al. Surfacenet:an end-to-end 3D neural network for multiview stereopsis[C]//IEEE International Conference on Computer Vision. Venice:IEEE, 2017:2307-2315. [14] Choi S, Kim S, Park K, et al. Learning descriptor, confidence, and depth estimation in multi-view stereo[C]//IEEE Conference on Computer Vision and Pattern Recognition Workshops. Salt Lake City:IEEE, 2018:276-282. [15] Zhu Guangming, Zhang Liang, Shen Peiyi, et al. Multimodal gesture recognition using 3-D convolution and convolutional LSTM[J]. IEEE Access, 2017, 5:4517-4524. [16] Konolige K, Agrawal M. FrameSLAM:from bundle adjustment to real-time visual mapping[J]. IEEE Transactions on Robotics, 2008, 24(5):1066-1077. [17] Tola E, Strecha C, Fua P. Efficient large-scale multi-view stereo for ultra high-resolution image sets[J]. Machine Vision and Applications, 2012, 23(5):903-920. |