Subjectivity Ranking of Verbs and Adjectives with an Unlabeled Corpus
Author:
Affiliation:

  • Article
  • | |
  • Metrics
  • |
  • Reference [27]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    In this paper, aiming to automatically distinguish subjective words from objective ones in Chinese, the study performs a subjectivity ranking test on Chinese adjectives and verbs. The paper exploits subjectivity clues and the subjectivity of Chinese characters. The subjectivity clues are further divided into gradability clues and subject clues. The study then uses graph-based algorithms to calculate the subjectivity originated from subjectivity clues. The subject clues and subjectivity of Chinese characters are novel ideas in such tasks. Five annotators are asked to label subjectivity of 500 words, from which the gold standard is built upon and evaluates rankings in various settings. It is shown that when words to be ranked occur frequently, this approach can outperform or match some human annotators. Furthermore, although the study only an unlabeled corpus, more prior knowledge can be incorporated into the graph-based approach.

    Reference
    [1] Wiebe J, Wilson T, Cardie C. Annotating expressions of opinions and emotions in language. Language Resources and Evaluation, 2005,39(2-3):165-210. [doi: 10.1007/s10579-005-7880-9]
    [2] Ortony A, Turner TJ. What's basic about basic emotions! Psychological Review, 1990,97(3):315-331.
    [3] Wiebe J. Learning subjective adjectives from corpora. In: Kautz HA, ed. Proc. of the American Association for Artificial Intelligence. Cambridge: MIT Press, 2000. 735-740.
    [4] Riloff E, Wiebe J. Learning extraction patterns for subjective expressions. In: Proc. of the Conf. on Empirical Methods in Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2003. 105-112. [doi: 10.3115/1119355.1119369]
    [5] Kim SM, Hovy E. Determining the sentiment of opinions. In: Proc. of the Int'l Conf. on Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2004. Article No.1376. [doi: 10.3115/1220355.1220555]
    [6] Esuli A, Sebastiani F. Pageranking wordnet synsets: An application to opinion mining. In: Proc. of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2007. 424-431.
    [7] Mihalcea R, Banea C, Wiebe J. Learning multilingual subjective language via cross-lingual projections. In: Proc. of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2007. 976-983.
    [8] Goldberg AB, Fillmore N, Andrzejewski D, Xu ZT, Gibson B, Zhu XJ. May all your wishes come true: A study of wishes and how to recognize them. In: Proc. of the HLT-NAACL. Stroudsburg: Association for Computational Linguistics, 2009. 263-271.
    [9] Ku LW, Chen HH. Mining opinions from the Web: Beyond relevance retrieval. Journal of the American Society for Information Science and Technology, 2007,58(12):1838-1850. [doi: 10.1002/asi.20630]
    [10] Wang GZ. A Dictionary of Chinese Praise and Blame Words. Beijing: Sinolingua, 2001 (in Chinese).
    [11] Stone PJ. The General Inquirer: A Computer Approach to Content Analysis. Cambridge: The MIT Press, 1966.
    [12] Hatzivassiloglou V, McKeown K. Predicting the semantic orientation of adjectives. In: Proc. of the Joint ACL/EACL Conf. Stroudsburg: Association for Computational Linguistics, 1997. 174-181. [doi: 10.3115/976909.979640]
    [13] Hatzivassiloglou V, Wiebe J. Effects of adjective orientation and gradability on sentence subjectivity. In: Proc. of the Int'l Conf. on Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2000. 299-305. [doi: 10.3115/990820.9908 64]
    [14] Banea C, Mihalcea R, Wiebe J. A bootstrapping method for building subjectivity lexicons for languages with scarce resources. In: Proc. of the Int'l Conf. on Language Resources and Evaluation. Paris: European Language Resources Association, 2008. 2764-2767.
    [15] Gyamfi Y, Wiebe J, Mihalcea R, Akkaya G. Integrating knowledge for subjectivity sense labeling. In: Proc. of the HLT-NAACL. Stroudsburg: Association for Computational Linguistics, 2009. 10-18.
    [16] Jijkoun V, Hofmann K. Generating a non-English subjectivity lexicon: Relations that matter. In: Proc. of the 12th Conf. of the European Chapter of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2009. 398-405.
    [17] Athanasiadou A. On the subjectivity of intensifiers. Language Sciences, 2007,29(4):554-565. [doi: 10.1016/j.langsci.2007.01.009]
    [18] Zagibalov T, Carroll J. Automatic seed word selection for unsupervised sentiment classification of Chinese text. In: Proc. of the 22nd Int'l Conf. on Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2008. 1073-1080.
    [19] Zhao J. The study on the highest degree adverbs in mandarin [MS. Thesis]. Shanghai: Shanghai Normal University, 2006 (in Chinese).
    [20] Benveniste E. Subjectivity in language. In: Gables C, ed. Proc. of the Problems in General Linguistics, Vol.18. Miami: University of Miami Press, 1958. 223-230.
    [21] Ochs E, Schieffelin BB. Language has a heart. Text, 1989,9(1):7-25.
    [22] Wiebe JM, Riloff E. Creating subjective and objective sentence classifiers from unannotated texts. In: Proc. of the Conf. on Computational Linguistics and Intelligent Text Processing. Mexico: Springer-Verlag, 2005. 475-486. [doi: 10.1007/978-3-540-30586-6_53]
    [23] Kleinberg J. Authoritative sources in a hyperlinked environment. Journal of the ACM, 1999,46(5):604-632. [doi: 10.1145/324133. 324140]
    [24] Zhou D, Bousquet O, Lal TN, Weston J, Scholkopf B. Ranking on data manifolds. In: Proc. of the 18th Annual Conf. on Neural Information Processing System. Cambridge: MIT Press, 2003. 169-176.
    [25] Yu SW, Duan HM, Swen B, Chang BB. Specification for corpus processing at Peking university: Word segmentation, pos tagging and phonetic notation. Journal of Chinese Language and Computing, 2003,13(2):121-158 (in Chinese with English abstract).
    [26] Zhu DX. Lecture Notes on Chinese Grammar. Beijing: The Commercial Press, 1982 (in Chinese).
    [27] Tong H, He J, Li M, Zhang C, Ma W. Graph based multi-modality learning. In: Proc. of the 13th Annual ACM Int'l Conf. on Multimedia. New York: ACM Press, 2005. 862-871. [doi: 10.1145/1101149.1101337]
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

徐戈,蒙新泛,王厚峰.采用无标注语料的动词和形容词主观性评级.软件学报,2013,24(5):1036-1050

Copy
Share
Article Metrics
  • Abstract:3820
  • PDF: 3902
  • HTML: 0
  • Cited by: 0
History
  • Received:October 12,2011
  • Revised:July 03,2012
  • Online: May 07,2013
You are the first2035233Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063