• Article
  • | |
  • Metrics
  • |
  • Reference [11]
  • |
  • Related
  • |
  • Cited by [2]
  • | |
  • Comments
    Abstract:

    Topic link detection is a foundational research in the field of topic detection and tracking, which detects whether two random stories talk about the same topic. This paper proposes a method of applying semantic domain language model to link detection, based on the structure relation among contents and the semantic distribution in a story, and also verifies the influence of the strategy incorporating dependency parsing into semantic description. Evaluation on Chinese Corpus of TDT4 show that the semantic domain language model substantially improved the performance of current detection system, whose minimum DET cost is reduced by about 3 percent.

    Reference
    [1] Allan J. Topic Detection and Tracking: Event-Based Information Organization. Springer-Verlag, 2002. 1-16.
    [2] Yiming Y, Ault T, Pierce T, Lattimer CW. Improving text categorization methods for event tracking. In: ACM, ed. Proc. of the SIGIR 2000. Athens: Association for Computing Machinery Press, 2000. 65-72.
    [3] Luo WH, Liu Q, Chen XQ. Development and analysis of technology of topic detection and tracking. In: Sun MS, ed. Proc. of the JSCL-2003. Beijing: Tsinghua University Press, 2003. 560-566 (in Chinese with English abstract).
    [4] Yu MQ, Lou WH, Xu HB, Bai S. Research on hierarchical topic detection in topic detection and tracking. Journal of Computer Research and Development, 2006,43(3):489-495 (in Chinese with English abstract).
    [5] Kumaran G, Allan J. Text classification and named entities for new event detection. In: ACM, ed. Proc. of the SIGIR 2004. New York: Association for Computing Machinery Press, 2004. 297-304.
    [6] Farahat A, Chen F, Brants T. Optimizing story link detection is not equivalent to optimizing new event detection. In: Isahara H, ed. Proc. of the ACL-03. Sapporo: Association for Computational Linguistics Press, 2003. 232-239.
    [7] Allan J, Carbonell J, Doddington G, Yamron J, Yiming Y. Topic detection and tracking pilot study final report. In: Proc. of the Broadcast News Transcription and Understanding Workshop, Vol.2. 1998. 1-25.
    [8] Lavrenko V, Allan J, Deguzman E, Laflamme D, Pollard V, Thomas S. Relevance models for topic detection and tracking. In: Proc. of the 2nd Int’l Conf. on Human Language Technology Research. San Francisco: Morgan Kaufmann Publishers, 2002. 104-110.
    [9] Ponte J, Croft WB. Text segmentation by topic. In: Peters C, ed. Proc. of the European Conf. on Research and Advanced Technology for Digital Libraries. London: ECDL Press, 1997. 113-125.
    [10] Nallapati R. Semantic language models for topic detection and tracking. In: Hearst M, ed. Proc. of HLT-NAACL2003 Student Research Workshop. Edmonton: Association for Computational Linguistics, 2003. 1-6.
    [11] Chen SF, Goodman J. An empirical study of smoothing techniques for language modeling. Computer Speech and Language, 1999,13(4):310-318.
    Related
    Comments
    Comments
    分享到微博
    Submit
Get Citation

洪 宇,张 宇,范基礼,刘 挺,李 生.基于语义域语言模型的中文话题关联检测.软件学报,2008,19(9):2265-2275

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:July 14,2007
  • Revised:November 20,2007
You are the first2045202Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063