Audio-visual fusion for emotion recognition in the valence-arousal space using joint cross-attention

R. Gnana Praveen, Patrick Cardinal, and Eric Granger.

IEEE Transactions on Biometrics, Behavior, and Identity Science, 2023