[1] Sanderson C, Paliwal K K. Identity verification using speech and face information[J]. Digital Signal Processing, 2004, 14(5):449-480.
[2] Kittler J, Hatef M, Duin R P W, et al. On combining classifiers[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998, 20(3):226-239.
[3] 韩崇昭, 朱洪艳, 段战胜等. 多源信息融合[M]. 北京:清华大学出版社, 2010.
[4] Kuncheva L I. A theoretical study on six classifier fusion strategies[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(2):281-286.
[5] Jain A, Nandakumar K, Ross A. Score normalization in multimodal biometric systems[J]. Pattern Recognition, 2005, 38(12):2270-2285.
[6] Nandakumar K, Chen Y, Dass S C, et al. Likelihood ratio-based biometric score fusion[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 30(2):342.
[7] Ma A J, Yuen P C, Lai J H. Linear dependency modeling for classifier fusion and feature combination[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(5):1135-1148.
[8] Shekhar S, Patel V M, Nasrabadi N M, et al. Joint sparse representation for robust multimodal biometrics recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(1):113-126.
[9] Yan J, Zheng W, Xu Q, et al. Sparse kernel reduced-rank regression for bimodal emotion recognition from facial expression and speech[J]. IEEE Transactions on Multimedia, 2016, 18(7):1319-1329.
[10] Bai S, Bai X. Sparse contextual activation for efficient visual re-ranking[J]. IEEE Transactions on Image Processing, 2016, 25(3):1056-1069.
[11] Zhang L, Zhang D. Visual understanding via multi-feature shared learning with global consistency[J]. IEEE Transactions on Multimedia, 2016, 18(2):247-259.
[12] Yu M, Liu L, Shao L. Structure-preserving binary representations for RGB-D action recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(8):1651-1664.
[13] Demirkus M, Precup D, Clark J, et al. Hierarchical spatio-temporal probabilistic graphical model with multiple feature fusion for binary facial attribute classification in real-world face videos[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(6):1185-1203.
[14] Hu J F, Zheng W S, Lai J, et al. Jointly learning heterogeneous features for RGB-D activity recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016(99):1-1.
[15] Feichtenhofer C, Pinz A, Zisserman A. Convolutional two-stream network fusion for video action recognition[C]//CVPR 2016. Las Vegas:IEEE Computer Society, 2016:1933-1941.
[16] Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos[J]. Advances in Neural Information Processing Systems, 2014, 1(4):568-576.
[17] Zhang X, Zhang H, Zhang Y, et al. Deep fusion of multiple semantic cues for complex event recognition[J]. IEEE Transactions on Image Processing, 2016, 25(3):1033-1046. |