Multimodal Emotion Recognition in Multi-Cultural Conditions

doi:10.13328/j.cnki.jos.005412


2018, Vol. 29, No. 4, pp. 1060-1070

Authors:

CHEN Shi-Zhe
School of Information, Renmin University of China, Beijing 100872, China
WANG Shuai
School of Information, Renmin University of China, Beijing 100872, China
JIN Qin
School of Information, Renmin University of China, Beijing 100872, China

Author biographies: CHEN Shi-Zhe (1994-), female, from Shaoyang, Hunan, Ph.D. candidate, CCF student member; her main research area is multimedia semantic content analysis. JIN Qin (1972-), female, Ph.D., doctoral supervisor, CCF professional member; her main research area is multimedia computing. WANG Shuai (1993-), male, master's student, CCF student member; his main research area is multimodal affective computing.
Corresponding author: JIN Qin, E-mail: qjin@ruc.edu.cn

Fund Project: National Key Research and Development Program of China (2016YFB1001200)


Abstract:

Automatic emotion recognition is a challenging task with a wide range of applications. This paper addresses multimodal emotion recognition in multi-cultural conditions. Emotion features are extracted from the acoustic (speech) and facial expression modalities, including both traditional hand-crafted features and features learned automatically by deep neural networks. Multimodal fusion is used to combine the modalities, and the recognition performance of the individual unimodal features is compared with that of the fused multimodal features. Experiments are conducted on the CHEAVD Chinese multimodal emotion dataset and the AFEW English multimodal emotion dataset. Cross-culture emotion recognition experiments demonstrate that the culture factor has an important influence on emotion recognition. Three training strategies are then proposed to improve recognition performance in multi-cultural conditions: selecting the emotion model that corresponds to each culture, jointly training on multi-cultural datasets, and jointly training on multi-cultural datasets embedded into a common emotion space. The third strategy separates the cultural influence from the original features and can generate more discriminative emotion features, giving the best performance for both acoustic and multimodal emotion recognition.

Key words: emotion recognition; multi-cultural condition; acoustic emotion feature; facial expression feature; multimodal fusion; deep convolutional neural networks
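
The three training strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `NearestCentroid` toy classifier, the two-dimensional toy data, the culture labels `"zh"` and `"en"`, and all function names are assumptions made for illustration. In particular, the third strategy is replaced here by per-culture mean centering, a crude stand-in for the paper's common emotion space, whose actual construction the abstract does not describe. Feature-level fusion is likewise assumed to be simple concatenation.

```python
from collections import defaultdict
from statistics import fmean


def fuse(audio_vec, face_vec):
    """Assumed feature-level fusion: concatenate per-modality feature vectors."""
    return list(audio_vec) + list(face_vec)


class NearestCentroid:
    """Toy classifier standing in for whatever emotion model is actually used."""

    def fit(self, X, y):
        groups = defaultdict(list)
        for x, label in zip(X, y):
            groups[label].append(x)
        # One centroid (per-dimension mean) per emotion label.
        self.centroids = {
            label: [fmean(col) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict(self, x):
        def sq_dist(centroid):
            return sum((a - b) ** 2 for a, b in zip(x, centroid))
        return min(self.centroids, key=lambda lab: sq_dist(self.centroids[lab]))


def train_per_culture(data):
    """Strategy 1: one model per culture, selected by culture at test time."""
    return {c: NearestCentroid().fit(X, y) for c, (X, y) in data.items()}


def train_joint(data):
    """Strategy 2: pool the samples of all cultures into one training set."""
    X = [x for Xc, _ in data.values() for x in Xc]
    y = [label for _, yc in data.values() for label in yc]
    return NearestCentroid().fit(X, y)


def center(x, mean):
    """Subtract a culture's mean feature vector from one sample."""
    return [a - m for a, m in zip(x, mean)]


def train_joint_centered(data):
    """Strategy 3 stand-in (assumption): remove each culture's mean offset,
    then train jointly on the centered features."""
    means = {c: [fmean(col) for col in zip(*X)] for c, (X, _) in data.items()}
    centered = {c: ([center(x, means[c]) for x in X], y)
                for c, (X, y) in data.items()}
    return train_joint(centered), means


# Toy data: both cultures share the same emotion structure, but the
# hypothetical "en" features carry a constant offset of +5.
data = {
    "zh": ([fuse([1.0], [1.0]), fuse([-1.0], [-1.0])], ["happy", "sad"]),
    "en": ([fuse([6.0], [6.0]), fuse([4.0], [4.0])], ["happy", "sad"]),
}
sample = fuse([4.2], [4.1])  # a "sad" sample from the "en" culture

per_culture = train_per_culture(data)
joint = train_joint(data)
centered_model, means = train_joint_centered(data)

print(per_culture["en"].predict(sample))                   # sad
print(joint.predict(sample))                               # happy
print(centered_model.predict(center(sample, means["en"])))  # sad
```

On this toy data the pooled model mislabels the shifted sample, while the culture-specific model and the centered joint model both label it as sad. The example only shows how a culture-dependent offset can confound a jointly trained model; it says nothing about the results reported in the paper.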

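Cite this article

CHEN Shi-Zhe, WANG Shuai, JIN Qin. Multimodal emotion recognition in multi-cultural conditions. Journal of Software (软件学报), 2018, 29(4): 1060-1070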


History

Received: 2017-04-30
Last revised: 2017-06-26
Published online: 2017-11-29
