Multimodal utterance-level affect analysis using visual, audio and text features
Published in Arxiv, 2018
Recommended citation: Deng, Didan, et al. "Multimodal utterance-level affect analysis using visual, audio and text features." arXiv preprint arXiv:1805.00625 (2018). https://arxiv.org/abs/1805.00625
This paper is about emotion recognition from visual and audio modalities.
Recommended citation: Deng, Didan, et al. "Multimodal utterance-level affect analysis using visual, audio and text features." arXiv preprint arXiv:1805.00625 (2018).