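An E-Chunk Based Machine Translation Model
Authors: LI Mu, LÜ Xue-qiang, YAO Tian-shun
Funding:

National Natural Science Foundation of China (69985001); National Key Basic Research 973 Program of China (G19980305011); Doctoral Program Foundation of the Ministry of Education of China (1999014503)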

    Abstract:

    This paper proposes an E-Chunk based multi-engine machine translation model. The model builds on head-driven parsing and comprises three components: a head-driven lexicalized parser, a word-similarity based E-Chunk matching engine, and a bilingual E-Chunk based transfer engine. E-Chunk match costs are computed from lexical similarity features, the optimal E-Chunk tiling (covering) is constructed bottom-up, and translation is then carried out with the E-Chunk as the basic translation unit. Preliminary experimental results show that the approach is effective for domain-oriented machine translation.
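    The abstract does not give the cost function or the search procedure, so the following is only a minimal sketch of what a bottom-up, minimum-cost chunk tiling could look like. Everything in it is an assumption made for illustration: the toy `CHUNK_TABLE`, the class-based `word_similarity`, the constants `CHUNK_PENALTY` and `UNKNOWN_COST`, and the use of flat word spans in place of the head-driven parse structures the model actually works on.

```python
from typing import Dict, List, Optional, Tuple

Chunk = Tuple[str, ...]

# Hypothetical toy chunk store (source words -> placeholder target); not from the paper.
CHUNK_TABLE: Dict[Chunk, str] = {
    ("open", "the", "file"): "<open-file>",
    ("the", "file"): "<file>",
    ("open",): "<open>",
    ("now",): "<now>",
}
# Hypothetical word classes standing in for a real lexical similarity measure.
WORD_CLASS = {"file": "object", "folder": "object",
              "open": "action", "close": "action"}

CHUNK_PENALTY = 1.0  # assumed fixed cost per chunk, favouring fewer, longer chunks
UNKNOWN_COST = 5.0   # assumed cost of leaving one word uncovered


def word_similarity(a: str, b: str) -> float:
    """1.0 for identical words, 0.5 for words in the same toy class, else 0.0."""
    if a == b:
        return 1.0
    class_a, class_b = WORD_CLASS.get(a), WORD_CLASS.get(b)
    return 0.5 if class_a is not None and class_a == class_b else 0.0


def match_cost(span: Chunk, chunk: Chunk) -> float:
    """Sum of (1 - similarity) over aligned words; infinite if any pair is unrelated."""
    if len(span) != len(chunk):
        return float("inf")
    total = 0.0
    for s, c in zip(span, chunk):
        sim = word_similarity(s, c)
        if sim == 0.0:
            return float("inf")
        total += 1.0 - sim
    return total


def best_tiling(tokens: List[str]) -> Tuple[float, List[Tuple[Chunk, str]]]:
    """Dynamic programme over prefixes: best[i] is the cheapest cover of tokens[:i]."""
    n = len(tokens)
    best = [float("inf")] * (n + 1)
    back: List[Optional[Tuple[int, Chunk, str]]] = [None] * (n + 1)
    best[0] = 0.0
    for end in range(1, n + 1):
        # Fallback: leave the last word uncovered at a high cost.
        best[end] = best[end - 1] + UNKNOWN_COST
        back[end] = (end - 1, (tokens[end - 1],), "<?>")
        # Try every stored chunk against every span ending at `end`.
        for start in range(end):
            span = tuple(tokens[start:end])
            for chunk, target in CHUNK_TABLE.items():
                cost = best[start] + CHUNK_PENALTY + match_cost(span, chunk)
                if cost < best[end]:
                    best[end] = cost
                    back[end] = (start, span, target)
    # Follow back-pointers to recover the chosen chunks in left-to-right order.
    tiling: List[Tuple[Chunk, str]] = []
    pos = n
    while pos > 0:
        start, span, target = back[pos]
        tiling.append((span, target))
        pos = start
    tiling.reverse()
    return best[n], tiling


cost, tiling = best_tiling(["open", "the", "folder", "now"])
```

    In this toy run the span "open the folder" is matched approximately against the stored chunk "open the file" through the shared word class, which is the kind of similarity-based matching the abstract refers to. A real system would replace the class lookup with a proper lexical similarity measure and tile parse constituents rather than flat spans.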

Cite this article

LI Mu, LÜ Xue-qiang, YAO Tian-shun. An E-Chunk based machine translation model. Journal of Software, 2002, 13(4): 669-676

History
  • Received: 2000-08-21
  • Last revised: 2000-12-19