Weight of a Feeling: Temporal and Modal Contributions to Emotion from Music Videos
Published in 2025 IEEE International Conference on Big Data (BigData), 2025
This study investigates how audio and video modalities contribute to emotion perception in music videos, accounting for cognitive effects such as the primacy-recency effect. Using EfficientNetB0 for audio and transformers for video, with valence, arousal, and dominance as labels, weighted late fusion is applied to study modal influences.
View Full Abstract
This study investigates how audio and video modalities contribute to emotion perception in music videos, accounting for cognitive effects such as the primacy-recency effect. Using EfficientNetB0 for audio and transformers for video, with valence, arousal, and dominance as labels, weighted late fusion is applied to study modal influences. Results demonstrate that discounting insignificant parts of music videos statistically validates primacy-recency effects on emotion perception, with implications for affective computing and human-computer interaction.
Published in: 2025 IEEE International Conference on Big Data (BigData) Pages: 5187-5193 Date: December 2025
Citation: S. Masti, S. K. Sateesh, S. Vengatagiri, A. Raimon and B. Das, "Weight of a Feeling: Temporal and Modal Contributions to Emotion from Music Videos," 2025 IEEE International Conference on Big Data (BigData), Macau, China, 2025, pp. 5187-5193, doi: 10.1109/BigData66926.2025.11401735.
