    Abstract:

    Exploiting structural information and lexicalization are two of the main challenges in syntactic analysis, and this paper investigates both. First, lexical dependency probabilities are estimated from a large-scale dependency treebank and used to build a lexical model. Second, the governing degree of words is introduced to exploit structural information. The lexical model overcomes the weakness of the POS-based dependencies used in earlier work, while the governing degree of words helps distinguish syntactic structures, so that some ill-formed structures are avoided. In experiments, the model achieves around 74% accuracy on a test set of 4000 sentences.
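
    As a rough illustration of how lexical dependency probabilities might be estimated from a treebank, the sketch below uses plain relative-frequency counts over a toy corpus. The data format, the function name `estimate_lexical_dependency_probs`, and the choice of estimating P(head | dependent) are all assumptions made for illustration; the abstract does not specify the paper's estimator or smoothing. The sketch covers only the lexical model.

```python
from collections import Counter

# Toy treebank (hypothetical data, not from the paper): each sentence is a
# list of (word, head_index) pairs; head_index is 0-based, -1 marks the root.
TOY_TREEBANK = [
    [("she", 1), ("reads", -1), ("books", 1)],
    [("he", 1), ("reads", -1), ("papers", 1)],
    [("she", 1), ("writes", -1), ("books", 1)],
]


def estimate_lexical_dependency_probs(treebank):
    """Relative-frequency estimate of P(head word | dependent word).

    Assumption: unsmoothed maximum-likelihood counts, used here only to
    show the idea of a lexicalized dependency model.
    """
    pair_counts = Counter()  # (dependent, head) -> count
    dep_counts = Counter()   # dependent -> count
    for sentence in treebank:
        for word, head_index in sentence:
            if head_index < 0:  # the root word has no head
                continue
            head = sentence[head_index][0]
            pair_counts[(word, head)] += 1
            dep_counts[word] += 1
    return {
        (dep, head): count / dep_counts[dep]
        for (dep, head), count in pair_counts.items()
    }


probs = estimate_lexical_dependency_probs(TOY_TREEBANK)
for (dep, head), p in sorted(probs.items()):
    print(f"P(head={head} | dep={dep}) = {p:.2f}")
```

    With a real treebank, sparse word pairs would likely call for smoothing or back-off to POS tags, which this sketch omits.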

Get Citation

Liu T, Ma JS, Li S. A Chinese dependency parsing model based on lexical governing degree. Journal of Software, 2006, 17(9):1876-1883.

History
  • Received: April 28, 2005
  • Revised: October 10, 2005