Abstract: The performance of supervised machine learning can be severely degraded by noise in the labeled data, as shown by existing, well-studied theories on learning with noisy data. However, these theories focus only on two-class classification problems. This paper studies the relation between noisy examples and their effects on structured learning. First, the paper finds that label noise is amplified in structured learning problems, leading to a higher noise rate during training than in the labeled data itself. Existing theories do not account for this noise amplification in structured learning, and thus underestimate the complexity of such learning problems. This paper provides a new theory on learning from noisy data with structured predictions. Based on this theory, the concept of the "effective size of training data" is proposed to describe the quality of noisy training data sets in practice. The paper also analyzes the situations in which structured learning models degenerate to lower-order ones in applications. Experimental results confirm the correctness of these theories as well as their practical value for cross-lingual projection and co-training.
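The noise-amplification claim can be illustrated with a minimal sketch under a simplifying independence assumption (not from the paper): if each token label in a structured example is corrupted independently with rate `eta`, the chance that a length-`L` structure is entirely clean shrinks as `(1 - eta)**L`, so the structure-level noise rate exceeds the token-level rate.

```python
def structure_noise_rate(eta: float, length: int) -> float:
    """Probability that a length-`length` structure contains at least one
    noisy token, assuming independent per-token noise at rate `eta`.
    (Illustrative assumption, not the paper's formal model.)"""
    return 1.0 - (1.0 - eta) ** length

# With 5% token-level noise, longer structures see much higher noise rates.
for L in (1, 5, 10, 20):
    print(L, round(structure_noise_rate(0.05, L), 3))
```

For `L = 1` the structure-level rate equals the token-level rate, and it grows monotonically with `L`, which is the sense in which training on structures faces more noise than the per-label annotation rate suggests.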