口语对话中的代词指代消解

doi:10.3724/SP.J.1001.2011.03720

微信服务号

微信订阅号

2025年6月1日 19:42 星期日

首页 > 过刊浏览>2011年第22卷第2期 >233-244. DOI:10.3724/SP.J.1001.2011.03720

PDF HTML阅读 XML下载导出引用引用提醒

口语对话中的代词指代消解
DOI:
                        10.3724/SP.J.1001.2011.03720
                    
CSTR:
                        
                    
作者:
                        费仲超费仲超
复旦大学 计算机科学技术学院,上海 200433;上海贝尔股份有限公司 产品线战略及技术领先部,上海 201206
在期刊界中查找
在百度中查找
在本站中查找
周雅倩周雅倩
复旦大学 计算机科学技术学院,上海 200433
在期刊界中查找
在百度中查找
在本站中查找
黄萱菁黄萱菁
复旦大学 计算机科学技术学院,上海 200433
在期刊界中查找
在百度中查找
在本站中查找
吴立德吴立德
复旦大学 计算机科学技术学院,上海 200433
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(60503070, 60673038); 上海市科委科研计划(08511500302)

Pronoun Resolution in Spoken Dialog

Author:

FEI Zhong-Chao
FEI Zhong-Chao
School of Computer Science, Fudan University, Shanghai 200433, China;Portfolio Strategy and Technology Leadership CTO Group, Alcatel-Lucent Shanghai Bell, Shanghai 200433, China
在期刊界中查找
在百度中查找
在本站中查找
ZHOU Ya-Qian
ZHOU Ya-Qian
School of Computer Science, Fudan University, Shanghai 200433, China
在期刊界中查找
在百度中查找
在本站中查找
HUANG Xuan-Jing
HUANG Xuan-Jing
School of Computer Science, Fudan University, Shanghai 200433, China
在期刊界中查找
在百度中查找
在本站中查找
WU Li-De
WU Li-De
School of Computer Science, Fudan University, Shanghai 200433, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

提出一套分为两步的代词指代消解算法,算法不需要人工清洗语料及预定义规则.算法第1 步采用一些新特征和机器学习算法对名词性指代代词和非名词性指代(non-anaphoric)代词分类,第2 步分别对两类代词进行消解.针对名词性代词指代消解,提出了适用于口语对话的特征抽取及表示方法,如代词和候选先行词的距离、语法、语义等的抽取和表示方法,然后通过综合这些特征来选择先行词.针对非名词性指代,将右边界规则(right frontier rule)改进为可以在口语对话中自动抽取的形式,并根据该规则选择先行项.在Byron 于2004 年发布的语料上测试,消解正确率达到77.0%,召回率达到66.0%.与Byron 的工作相比,该方法在保证系统能够自动完成的同时还提高了消解性能.

关键词:代词指代消解;口语对话理解;代词分类

Abstract:

This paper presents a two-stage pronoun resolution algorithm. It does not need to clean the testing corpus and predefine patterns manually. In the first stage of the algorithm, some new features and machine learning methods are used to classify pronouns into anaphoric and non-anaphoric ones. In the second stage, these two kinds of pronouns are resolved respectively. For the anaphoric ones, some methods are presented to extract distance, syntactic, and semantic features etc. For the non-anaphoric ones, the Right Frontier Rule is improved to do the resolution work. While testing the corpus published by Byron in 2004, this algorithm achieves a precision of 77.0% and a recall of 66.0%. Compared with the work of Byron, the algorithm is fully automatic, and the results are much better.

Key words:pronoun resolution; spoken dialog understanding; pronoun classification

引用本文

费仲超,周雅倩,黄萱菁,吴立德.口语对话中的代词指代消解.软件学报,2011,22(2):233-244

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2009-02-20
最后修改日期:2009-08-12
录用日期:
在线发布日期:
出版日期:

微信服务号

微信订阅号

引用本文

相关视频

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

相关视频

分享

微信扫一扫：分享

文章指标

历史

文章二维码