• Article
  • | |
  • Metrics
  • |
  • Reference [10]
  • |
  • Related [20]
  • |
  • Cited by [1]
  • | |
  • Comments
    Abstract:

    A method is presented to identify some pieces of specific information in multi-carrier data streams byfeature words and based on PinYin matching. An effective knowledge approximation method is used to judge therelation between feature words and context by statistics theory. The part of speech transfer-value as systemknowledge can be obtained by inductive learning of training corpus. When data streams are evaluated, theevaluation value can be gained according to the system knowledge by matching all feature words and based on theirPin Yin, which examines the comparability with context regular of part of speech between all feature words in datastreams and themselves in training corpus. Further more, if the evaluation value exceeds the threshold, the datastreams will be shielded. Experimental results show that the effect of the experiment system based on this method isefficient for identifying ill information and monitoring & controlling their spreading by multi-carrier data streams.

    Reference
    [1]Richard Hunter J. Performance considerations for information filtering systems using database technology[Ph.D. Thesis]. Florida Institute of Technology, 1998.
    [2]Huang XJ, Xia YJ, Wu LD. A text filtering system based on vector space model. In: Cao YQ, ed. Proceedings of the Conference of the 20th Anniversary of CIPSC Beijing: Tsinghua University Press, 2001.215~218 (in Chinese with English abstract).
    [3]Ali H A. Concept based retrieval and information filtering[Ph.D. Thesis]. University of Nebraska-Lincoln, 2001.
    [4]Niu Wei-Xia, Zhang Yong-Kui. Latent semantic indexing is applied in information filtering. Computer Engineering and Application, 2001,37(9):57~62 (in Chinese with English abstract).
    [5]Hanani U, Shapira B, Shoval P. Information filtering: Overview of issues, research and systems User Modeling and User-Adapted Interaction, 2001,11(3):203~259.
    [6]Zhang BT, Seo YW. Personalized web-document filtering using reinforcement learning Applied Artificial Intelligence,2001,15(7):665~685.
    [7]Wu LD. Large-Scale Chinese Text Processing. Shanghai: Fudan University Press, 1997 (in Chinese)
    [8]黄萱箐,夏迎炬.基于向量空间模型的广西过滤系统.见:辉煌20年——中国中文信息学会20周年学术会议论文集.2001.215~218.
    [9]牛伟霞,张永奎.潜在语义索引方法在信息过滤中的应用.计算机工程与应用,2001,37(9):57~62
    [10]吴立德.大规模中文文本处理.上海:复旦大学出版社,1997.
    Comments
    Comments
    分享到微博
    Submit
Get Citation

郑德权,胡熠,于浩,赵铁军,王青松.多载体数据流中的特定信息识别研究.软件学报,2003,14(9):1538-1543

Copy
Share
Article Metrics
  • Abstract:3845
  • PDF: 5236
  • HTML: 0
  • Cited by: 0
History
  • Received:June 24,2002
  • Revised:March 25,2003
You are the first2038637Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063