Weight of a Feeling: Temporal and Modal Contributions to Emotion from Music Videos
Published in 2025 IEEE International Conference on Big Data (BigData), 2025
This study investigates how audio and video modalities contribute to emotion perception in music videos, accounting for cognitive effects such as the primacy-recency effect. Using EfficientNetB0 for audio and transformers for video, with valence, arousal, and dominance as labels, weighted late fusion is applied to study modal influences.
Citation: S. Masti, S. K. Sateesh, S. Vengatagiri, A. Raimon and B. Das, "Weight of a Feeling: Temporal and Modal Contributions to Emotion from Music Videos," 2025 IEEE International Conference on Big Data (BigData), Macau, China, 2025, pp. 5187-5193, doi: 10.1109/BigData66926.2025.11401735.
View on IEEE Xplore
