Extracting Web Entity Activities Based on SVM and Extended Conditional Random Fields
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    On the basis of the traditional methods extracting information, this paper defines the formal model ofentity activity based on case grammar and presents a method based on supported vector machine and extendedcondition random fields to extract Web entity activities accurately. First, in order to automatically train the machinelearning models, the study puts forward a heuristic method to transform the semantic role labeling training data intothe training data of entity activity extraction. Next, the study trains a support vector machine classifier and extendscondition random fields using the training data. Third, using the classifier, the study distinguishes the sentences thatcontain Web entity activities. The paper also proposes forward and extends condition random fields to model thefrequency and relationship feature. The traditional conditional random fields cannot model this while the new modelcan label the entity activity information in natural language sentences more accurately. Finally, the experimentalresults show that the method is effective in multidomains and can be applied to Web entity activity extraction.

    Reference
    Related
    Cited by
Get Citation

张传岩,洪晓光,彭朝晖,李庆忠.基于SVM和扩展条件随机场的Web实体活动抽取.软件学报,2012,23(10):2612-2627

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:August 15,2011
  • Revised:January 17,2012
  • Adopted:
  • Online: September 30,2012
  • Published:
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063