Audio-visual fusion for emotion recognition in the valence-arousal space using joint cross-attention
Praveen, R. Gnana, Cardinal, Patrick and Granger, Eric.
IEEE Transactions on Biometrics, Behavior, and Identity Science 2023