Extracting Web Entity Activities Based on SVM and Extended Conditional Random Fields

doi:10.3724/SP.J.1001.2012.04189

微信服务号

微信订阅号

2025-4-24- 14

Home > Archive>Volume 23, Issue 10, 2012 >2612-2627. DOI:10.3724/SP.J.1001.2012.04189

PDF HTML XML Export Cite reminder

Extracting Web Entity Activities Based on SVM and Extended Conditional Random Fields
DOI:
                        10.3724/SP.J.1001.2012.04189
                    
Author:
                        ZHANG Chuan-YanZHANG Chuan-Yan
School of Computer Science and Technology, Shandong University, Ji’nan 250101, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
HONG Xiao-GuangHONG Xiao-Guang
School of Computer Science and Technology, Shandong University, Ji’nan 250101, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
PENG Zhao-HuiPENG Zhao-Hui
School of Computer Science and Technology, Shandong University, Ji’nan 250101, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LI Qing-ZhongLI Qing-Zhong
School of Computer Science and Technology, Shandong University, Ji’nan 250101, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

On the basis of the traditional methods extracting information, this paper defines the formal model ofentity activity based on case grammar and presents a method based on supported vector machine and extendedcondition random fields to extract Web entity activities accurately. First, in order to automatically train the machinelearning models, the study puts forward a heuristic method to transform the semantic role labeling training data intothe training data of entity activity extraction. Next, the study trains a support vector machine classifier and extendscondition random fields using the training data. Third, using the classifier, the study distinguishes the sentences thatcontain Web entity activities. The paper also proposes forward and extends condition random fields to model thefrequency and relationship feature. The traditional conditional random fields cannot model this while the new modelcan label the entity activity information in natural language sentences more accurately. Finally, the experimentalresults show that the method is effective in multidomains and can be applied to Web entity activity extraction.

Key words:information extraction;case grammar;entity activity;support vector machine;extended conditionrandom fields

Get Citation

张传岩,洪晓光,彭朝晖,李庆忠.基于SVM和扩展条件随机场的Web实体活动抽取.软件学报,2012,23(10):2612-2627

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:August 15,2011
Revised:January 17,2012
Adopted:
Online: September 30,2012
Published:

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History