Journal of Beijing University of Posts and Telecommunications

  • EI Core Journal

Journal of Beijing University of Posts and Telecommunications, 2024, Vol. 47, Issue (4): 36-43.


The Instruction Tuning of Large Language Models with Multi-Modal Recommendation Instruction

HAO Bowen1,3, LIU Yifei2, LI Liyao3, WANG Jie1, PENG Yan1   

  • Received: 2023-12-19 Revised: 2024-01-17 Online: 2024-08-28 Published: 2024-08-26

Abstract: Tuning large language models on multimodal instructions has proven effective in endowing them with the capability to address the corresponding multimodal tasks. To further enable large language models to handle multimodal zero-shot and few-shot recommendation tasks, a multimodal recommendation large language model is proposed. It is built on ChatGLM2-6B and trained on a multimodal recommendation dataset that includes both textual and image information. Multimodal user profiles and item attributes are constructed by using ChatGPT and GPT-4 to generate instructions, and instructions for zero-shot and few-shot recommendation are additionally formulated. The model is fine-tuned in a parameter-efficient manner with the P-tuning v2 method, requiring only a single A100 40 GB graphics processing unit. Experimental results demonstrate that the proposed model significantly outperforms existing baseline models.
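To make the fine-tuning setup described in the abstract concrete, the sketch below shows a parameter-efficient tuning pipeline in the spirit of P-tuning v2 (deep prefix tuning) applied to ChatGLM2-6B. It assumes the Hugging Face transformers and peft libraries; the example instruction, the prefix length, and the use of peft's PrefixTuningConfig are illustrative assumptions, not the authors' released code, and whether peft's prefix tuning hooks into ChatGLM2's custom attention without modification is itself an assumption (the official ChatGLM2-6B P-tuning v2 scripts are the reference implementation).

```python
# Minimal sketch, assuming the Hugging Face transformers/peft stack.
# The prompt template and hyperparameters below are hypothetical.
import torch
from transformers import AutoTokenizer, AutoModel
from peft import PrefixTuningConfig, TaskType, get_peft_model

BASE_MODEL = "THUDM/chatglm2-6b"  # backbone named in the abstract

# Hypothetical few-shot recommendation instruction: a textual user profile,
# liked items, and candidate items (image content assumed already verbalized
# into text by an instruction-generation step, as the abstract describes
# being done with ChatGPT/GPT-4).
instruction = (
    "User profile: enjoys minimalist home decor and Scandinavian furniture.\n"
    "Liked items: [oak side table], [linen floor lamp].\n"
    "Candidate items: [A: marble coffee table], [B: neon gaming chair].\n"
    "Which candidate should be recommended? Answer with the item letter."
)

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL, trust_remote_code=True)
model = AutoModel.from_pretrained(
    BASE_MODEL, trust_remote_code=True, torch_dtype=torch.float16
)

# P-tuning v2 is deep prefix tuning: trainable prefix vectors are prepended to
# the attention layers while the 6B backbone stays frozen, which is what keeps
# the memory footprint within a single 40 GB GPU.
peft_config = PrefixTuningConfig(
    task_type=TaskType.CAUSAL_LM,
    num_virtual_tokens=128,  # illustrative prefix length
)
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()  # only the prefix parameters are trainable

# Tokenized instructions like the one above would then be fed to a standard
# causal-language-modeling training loop over the recommendation dataset.
inputs = tokenizer(instruction, return_tensors="pt")
```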

Key words: multimodal recommendation instructions, large language model, instruction tuning
