AI technology can accurately capture and feedback user emotions in digital media interaction to realize precise interaction. In this paper, we design an AI emotion interactivity enhancement model based on multimodal fusion, and apply the neural network model of Bi-GRU and dual attention mechanism to fuse the long and short-term emotion classification results of the tested samples at the decision level to obtain the final emotion classification results. Then the weight coefficient vector of each sentiment category is calculated based on the sentiment classification confusion matrix of the classifier, which is used as the a priori knowledge for multimodal sentiment analysis for decision fusion. The performance is examined on the MOSI dataset and the AI-based interaction design strategy in digital media is proposed. Analyzing the interaction design effect, the interaction design applying the model of this paper has better user experience sense, emotional arousal, pleasure level, and emotional feedback effect in subjectivity evaluation than the control group, and 75% of the experimental subjects think that the feedback-adjusted digital media has a better pleasure level.