一个汉语短语自动界定模型<sup>*</sup>

微信服务号

微信订阅号

2025年4月24日 23:58 星期四

首页 > 过刊浏览>1996年第7卷第zk期 >315-322

一个汉语短语自动界定模型*
DOI:
                        
                    
CSTR:
                        
                    
作者:
                        周强周强
北京大学计算语言学研究所北京100871
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:本文研究得到国家自然科学基金资助.

A MODEL FOR AUTOMATIC PREDICTION OF CHINESE PHRASE BOUNDARY LOCATION

Author:

Zhou Qiang
Zhou Qiang

在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

本文提出了一个汉语短语自动界定模型，它通过基于统计的自动界定处理．利用通过错误驱动自动学习而得到的调整规则进行界定情况局部调整，利用人工总结的全局调整规则进行精调整等3个处理阶段，可以较好地确定一句经过正确切分和词性标注处理的汉语句子中不同短语的边界位置。从而为进一步的汉语短语自动划分和标注处理打下了良好的基础．对1000多句句子的实验结果表明，模型的界定正确事达到了96.33％(封闭测试)、94.54％(开放测试)．

关键词:汉语短语界定模型,短语划分,语料库自动标注．

Abstract:

Phrase boundary location provides an important information for bracketing and tagging the phrase automatically．This paper describes an experimental model for the automatic prediction of the phrase boundary location．It consists of three processing stages：first，automatically identify the phrase boundaries using statistics from treebank；then，post—tune the results using local tuning rules generated by an error—driven the machine learning method；at last，refine the results of the last two stages with the overall tuning rules summarized by man．Experimental results on a corpus of 1 434 sentences demonstrate a high rate of the success for predicting the phrase boundary(96.33％correct the prediction for the close testing and 94.54％correct the prediction for open testing)．

Key words:Predicting phrase boundary，phrase bracketing，corpus annotation.

引用本文

周强.一个汉语短语自动界定模型^*.软件学报,1996,7(zk):315-322

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:1995-09-14
最后修改日期:
录用日期:
在线发布日期:
出版日期:

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码