串匹配算法中模式串与文本之间关系的研究

微信服务号

微信订阅号

2025年4月9日 4:47 星期三

首页 > 过刊浏览>2010年第21卷第7期 >1503-1514

串匹配算法中模式串与文本之间关系的研究
DOI:
                        
                    
CSTR:
                        
                    
作者:
                        刘 萍刘 萍
中国科学院 计算技术研究所,北京 100190
在期刊界中查找
在百度中查找
在本站中查找
刘 萍刘 萍
信息内容安全技术国家工程实验室,北京 100190
在期刊界中查找
在百度中查找
在本站中查找
刘燕兵刘燕兵
中国科学院 计算技术研究所,北京 100190
在期刊界中查找
在百度中查找
在本站中查找
郭 莉郭 莉
中国科学院 计算技术研究所,北京 100190
在期刊界中查找
在百度中查找
在本站中查找
方滨兴方滨兴
中国科学院 计算技术研究所,北京 100190
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:Supported by the National Basic Research Program of China under Grant No.2007CB311100 (国家重点基础研究发展计划(973))

Research on Relationship Between Patterns and Text in String Matching Algorithms

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

经典的串匹配算法设计和分析中假设“字符互相独立并且等概率出现”,这与实际应用环境差异很大,导致出现很多问题.考虑了字符的概率分布和上下文的关联,同时兼顾应用的方便,提出了命中密度的概念.在给出基本定义和扩展定义后,通过对4种类型的代表性算法的理论和实验分析,给出了命中密度与算法性能之间的关系.同时,在对命中密度的分析中得出一些极具价值的结论.对命中密度概念的多角度理解以及对它与算法性能关系的深入剖析都说明,命中密度作为一个特征量,可以从一个侧面刻画模式串和文本之间的相关性,它对算法的设计和分析以及串匹配领域研究工作的扩展都具有指导意义.

关键词:串匹配;字符概率分布;字符串相关性

Abstract:

It was assumed that the pattern and text characters are independent and uniformly distributed over a finite alphabet in classical string matching algorithms, and this assumption differs from real applications and causes many problems. Considering the probability distributions, the contexts of the characters, and the convenience of applications, this paper gives a concept hit rate and four extended concepts about it. Then it gives the theory analysis and detailed experiments with hit rate on the four classical algorithms. The map of the relationships is obtained between the hit rate and the algorithms’ performance, and at the same time some valuable conclusions are made through above work. As a character variable, hit rate describes the relativity of patterns and text and can serve as guidelines in the algorithms design, analysis and some other extended research fields of the string matching.

Key words:string matching; probability distributions of character; relativity of strings

引用本文

刘萍,刘萍,刘燕兵,郭莉,方滨兴.串匹配算法中模式串与文本之间关系的研究.软件学报,2010,21(7):1503-1514

复制

文章指标

点击次数:4498
下载次数: 7028
HTML阅读次数: 0
引用次数: 0

历史

收稿日期:2008-09-25
最后修改日期:2009-03-31
录用日期:
在线发布日期:
出版日期:

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码