基于词汇支配度的汉语依存分析模型

微信服务号

微信订阅号

首页 > 过刊浏览>2006年第17卷第9期 >1876-1883

基于词汇支配度的汉语依存分析模型
DOI:
                        
CSTR:
                        
作者:
                        
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:Supported by the Key Project of National Natural Science Foundation of China under Grant No.60435020 (国家自然科学基金重点项目); the National Natural Science Foundation of China under Grant Nos.60575042, 60503072 (国家自然科学基金)

Chinese Dependency Parsing Model Based on Lexical Governing Degree

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

如何应用句法结构和词汇化是句法分析建模所面临的两个主要问题,汉语依存分析对这两方面做了初步的探索.首先通过对大规模依存树库的统计学习,获取其中的词汇依存信息,建立了一个词汇化的概率分析模型.然后引入词汇支配度的概念,以充分利用了句子中的结构信息.词汇化方法有效地弥补了以前工作中词性信息的粒度过粗问题.同时,词汇支配度增强了对句法结构的识别,有效地避免了非法结构的生成.在4 000句的测试集上,依存分析获得了约74%的正确率.

Abstract:

Use of structural information and lexicalization are two of the main challenges facing syntactic analysis, and they are investigated in this paper. First, the probabilities of lexical dependencies are obtained by training a large-scale dependency treebank and used to build the lexical model. Second, the governing degree of words is introduced to utilize the structure information. The lexical method overcomes the weakness of POS dependencies in the past work; meanwhile the governing degree of words is helpful to distinguish the syntactic structures so some ill-formed structures are avoided. Finally, the paper shows a good experimental result of around 74% accuracy on the test set that consists of 4000 sentences.

参考文献

相似文献

引证文献

引用本文

刘挺,马金山,李生.基于词汇支配度的汉语依存分析模型.软件学报,2006,17(9):1876-1883

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2005-04-28
最后修改日期:2005-10-10
录用日期:
在线发布日期:
出版日期:

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码