Abstract: To address the issue that traditional convolutional neural networks (CNNs) excel at capturing local text features but struggle with modeling long-range semantic dependencies, and that ...
Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities in automated front-end engineering, e.g., generating UI code from visual designs. However, existing front-end UI code ...
Abstract: The article constructs a hybrid fusion model for multimodal emotion recognition. First, facial expression features are extracted using MediaPipe and OpenCV, and then a convolutional neural ...