Image Sentiment Analysis with Multimodal Discriminative Embedding Space

doi:10.13190/j.jbupt.2018-040

JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM ›› 2019, Vol. 42 ›› Issue (1): 61-67.doi: 10.13190/j.jbupt.2018-040

• Papers • Previous Articles Next Articles

Image Sentiment Analysis with Multimodal Discriminative Embedding Space

Lü Guang-rui, CAI Guo-yong, LIN Yu-ming

Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin 541004, China

Received:2018-03-20 Online:2019-02-28 Published:2019-03-08

Abstract

Abstract: In order to alleviate affective gap and large intra-class variance existing in visual sentiment analysis, firstly a new method is proposed, which exploits simultaneously not only deep latent correlations between visual and textual modalities, but also deep linear discrimination of visual modality and weak supervision of mid-level semantic features of images. The method uses multimodal deep network architecture to find a latent embedding space in which deep correlations between visual and textual modalities are maximized, and at the same time there is a deep discrimination on visual modality. In the latent space, the extracted semantic feature of texts can be transferred to the extracted discriminant visual feature of images. Secondly based on the usfulness of attention mechanism, an attention network is presented, which accepts the extracted features in the latent space as input and is trained as a sentiment classifier. Results of experiments conducted on real datasets show that the proposed approach achieves better sentiment classification accuracy than those state-of-the-art approaches.

Key words: sentiment analysis, latent correlation, linear discrimination, multimodal network, attention mechanism

CLC Number:

TP391

Lü Guang-rui, CAI Guo-yong, LIN Yu-ming. Image Sentiment Analysis with Multimodal Discriminative Embedding Space[J]. JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM, 2019, 42(1): 61-67.

References

[1] Weiss K, Khoshgoftaar T M, Wang D D. A survey of transfer learning[J]. Journal of Big Data, 2016, 3(1):9.
[2] Borth D, Ji R, Chen T, et al. Large-scale visual sentiment ontology and detectors using adjective noun pairs[C]//ACM International Conference on Multimedia. New York:ACM, 2013:223-232.
[3] Jou B, Chen T, Pappas N, et al. Visual affect around the world:A large-scale multilingual visual sentiment ontology[C]//ACM International Conference on Multimedia. New York:ACM, 2015:159-168.
[4] 李钊, 卢苇, 邢薇薇, 等. CNN视觉特征的图像检索[J]. 北京邮电大学学报, 2015, 38(s1):103-106. Li Zhao, Lu Wei, Xing Weiwei, et al. Image retrieval based on CNN visual features[J]. Journal of Beijing University of Posts and Telecommunications, 2015, 38(s1):103-106.
[5] You Q, Yang J, Yang J, et al. Robust image sentiment analysis using progressively trained and domain transferred deep networks[C]//29^th AAAI Conference on Artificial Intelligence. Menlo Park:AAAI, 2015:381-388.
[6] Campos V, Jou B, Giro-i-Nieto X. From pixels to sentiment:fine-tuning CNNs for visual sentiment prediction[J]. Image and Vision Computing, 2017(65):15-22.
[7] Islam J, Zhang Y. Visual sentiment analysis for social images using transfer learning approach[C]//IEEE International Conferences on Big Data and Cloud Computing. Piscataway:IEEE, 2016:124-130.
[8] Andrew G, Arora R, Bilmes J, et al. Deep canonical correlation analysis[C]//International Conference on Machine Learning. Atlanta:ICML, 2013:1247-1255.
[9] Dorfer M, Kelz R, Widmer G, et al. Deep linear discriminant analysis[C]//International Conference on Learning Representations. San Juan:ICLR, 2016:1-13.
[10] Katsurai M, Satoh S. Image sentiment analysis using latent correlations among visual, textual, and sentiment views[C]//IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway:IEEE, 2016:2837-2841.

Image Sentiment Analysis with Multimodal Discriminative Embedding Space

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 10

Recommended Articles

Metrics

Comments

[1]	BAO Qianhui, WEN Juan, SHI Shuzhen, DONG Mengping, LIU Xue. Aspect Category Classification Integrated in Syntactic Dependency and BERT-Att-BiLSTM [J]. Journal of Beijing University of Posts and Telecommunications, 2023, 46(4): 123-128.
[2]	. 3D Segmentation of Brain Tumor MRI Image based on RAPNet [J]. Journal of Beijing University of Posts and Telecommunications, 2023, 46(2): 91-97.
[3]	LIU Linlan, SONG Xiuyang, CHEN Yubin. Link Prediction in Opportunistic Networks Based on Network Representation Learning [J]. Journal of Beijing University of Posts and Telecommunications, 2022, 45(4): 64-69,103.
[4]	NING Jing, SHE Hongyan, ZHAO Dong, LUO Dan, WANG Lei. A Road-Level Traffic Accident Risk Prediction Method [J]. Journal of Beijing University of Posts and Telecommunications, 2022, 45(2): 72-78.
[5]	ZHAO Haiying, ZHU Hui, HOU Xiaogang. Traditional Custume Image Semantic Segmentation Based on Improved EMA Unit [J]. Journal of Beijing University of Posts and Telecommunications, 2022, 45(1): 69-74.
[6]	WANG Wen-zhu, XIAO Bo, CHEN Ke-hong. A Method for Targeted Sentiment Analysis [J]. Journal of Beijing University of Posts and Telecommunications, 2021, 44(5): 21-27.
[7]	LI Wei, CHEN Shu-dong, OUYANG Xiao-ye, DU Rong, WANG Rong. Distant Supervision Relation Extraction Method Based on Highway Multi-Kernel Network [J]. Journal of Beijing University of Posts and Telecommunications, 2020, 43(5): 71-76.
[8]	WANG Xiao-jie, BAI Zi-wei, LI Ke, YUAN Cai-xia. Survey on Machine Reading Comprehension [J]. Journal of Beijing University of Posts and Telecommunications, 2019, 42(6): 1-9.
[9]	ZHU Hao, TAN Yong-mei. English Textual Entailment Recognition Using Capsules [J]. JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM, 2019, 42(3): 21-28.
[10]	SU Fang, WANG Xiao-yu, ZHANG Zhi. Review Summarization Generation Based on Attention Mechanism [J]. JOURNAL OF BEIJING UNIVERSITY OF POSTS AND TELECOM, 2018, 41(3): 7-13.