2008 AHolisticLexiconBasedApprToOpinMin

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Opinion Mining, Sentiment Analysis, Context-dependent Word, Product Mention, Lexicon-based Algorithm.

Notes

Cited By

Quotes

Abstract

One of the important types of information on the Web is the opinions expressed in the user generated content, e.g., customer reviews of products, forum posts, and blogs. In this paper, we focus on customer reviews of products. In particular, we study the problem of determining the semantic orientations (positive, negative or neutral) of opinions expressed on product features in reviews. This problem has many applications, e.g., opinion mining, summarization and search. Most existing techniques utilize a list of opinion (bearing) words (also called opinion lexicon) for the purpose. Opinion words are words that express desirable (e.g., great, amazing, etc.) or undesirable (e.g., bad, poor, etc) states. These approaches, however, all have some major shortcomings. In this paper, we propose a holistic lexicon-based approach to solving the problem by exploiting external evidences and linguistic conventions of natural language expressions. This approach allows the system to handle opinion words that are context dependent, which cause major difficulties for existing algorithms. It also deals with many special words, phrases and language constructs which have impacts on opinions based on their linguistic patterns. It also has an effective function for aggregating multiple conflicting opinion words in a sentence. A system, called Opinion Observer, based on the proposed technique has been implemented. Experimental results using a benchmark product review data set and some additional reviews show that the proposed technique is highly effective. It outperforms existing methods significantly


References

  • 1 A. Andreevskaia and S. Bergler. Mining WordNet for Fuzzy Sentiment: Sentiment Tag Extraction from WordNet Glosses. In EACL'06, pp. 209--216, 2006.
  • 2 P. Beineke, Trevor Hastie, C. Manning, and S. Vaithyanathan. An Exploration of Sentiment Summarization. In: Proceedings of the AAAI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications, 2003.
  • 3 Giuseppe Carenini, Raymond T. Ng, Adam Pauls, Interactive multimedia summaries of evaluative text, Proceedings of the 11th International Conference on Intelligent user interfaces, January 29-February 01, 2006, Sydney, Australia  doi:10.1145/1111449.1111480
  • 4 S. Das, and M. Chen. Yahoo! for Amazon: Extracting market sentiment from stock message boards. APFA'01, 2001.
  • 5 Kushal Dave, Steve Lawrence, David M. Pennock, Mining the peanut gallery: opinion extraction and semantic classification of product reviews, Proceedings of the 12th International Conference on World Wide Web, May 20-24, 2003, Budapest, Hungary  doi:10.1145/775152.775226
  • 6 Xiaowen Ding, Bing Liu, The utility of linguistic rules in opinion mining, Proceedings of the 30th ACM SIGIR Conference retrieval, July 23-27, 2007, Amsterdam, The Netherlands  doi:10.1145/1277741.1277921
  • 7 A. Esuli and F. Sebastiani, EACL-06, (2006). Determining Term Subjectivity and Term Orientation for Opinion Mining, EACL-06, 2006.
  • 8 C. Fellbaum. WordNet: an Electronic Lexical Database, MIT Press, 1998.
  • 9 M. Gamon, A. Aue, S. Corston-Oliver, and E. K. Ringger. Pulse: Mining customer opinions from free text. IDA'2005.
  • 10 Vasileios Hatzivassiloglou, Janyce M. Wiebe, Effects of adjective orientation and gradability on sentence subjectivity, Proceedings of the 18th conference on Computational linguistics, p.299-305, July 31-August 04, 2000, Saarbrücken, Germany  doi:10.3115/990820.990864
  • 11 Vasileios Hatzivassiloglou, Kathleen R. McKeown, Predicting the semantic orientation of adjectives, Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics, p.174-181, July 07-12, 1997, Madrid, Spain
  • 12 Marti A. Hearst, Direction-based text interpretation as an information access refinement, Text-based intelligent systems: current research and practice in information extraction and retrieval, Lawrence Erlbaum Associates, Inc., Mahwah, NJ, 1992
  • 13 Minqing Hu, Bing Liu, Mining and summarizing customer reviews, Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 22-25, 2004, Seattle, WA, USA  doi:10.1145/1014052.1014073
  • 14 N. Jindal, and Bing Liu . Mining Comparative Sentences and Relations. In AAAI'06, 2006.
  • 15 Nobuhiro Kaji, Masaru Kitsuregawa, Automatic construction of polarity-tagged corpus from HTML documents, Proceedings of the COLING/ACL on Main conference poster sessions, p.452-459, July 17-18, 2006, Sydney, Australia
  • 16 H. Kanayama and T. Nasukawa. Fully Automatic Lexicon Expansion for Domain-Oriented Sentiment Analysis. EMNLP'06, 2006.
  • 17 Soo-Min Kim, Eduard Hovy, Determining the sentiment of opinions, Proceedings of the 20th International Conference on Computational Linguistics, p.1367-es, August 23-27, 2004, Geneva, Switzerland  doi:10.3115/1220355.1220555
  • 18 Soo-Min Kim, Eduard Hovy, Automatic identification of pro and con reasons in online reviews, Proceedings of the COLING/ACL on Main conference poster sessions, p.483-490, July 17-18, 2006, Sydney, Australia
  • 19 N. Kobayashi, R. Iida, K. Inui and Y. Matsumoto. Opinion Mining on the Web by Extracting Subject-Attribute-Value Relations. In: Proceedings of AAAI-CAAW'06, 2006.
  • 20 L.-W. Ku, Y.-T. Liang and H.-H. Chen. Opinion Extraction, Summarization and Tracking in News and Blog Corpora. In: Proceedings of the AAAI-CAAW'06, 2006.
  • 21 Bing Liu, Minqing Hu, Junsheng Cheng, Opinion observer: analyzing and comparing opinions on the Web, Proceedings of the 14th International Conference on World Wide Web, May 10-14, 2005, Chiba, Japan  doi:10.1145/1060745.1060797
  • 22 Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, Toshikazu Fukushima, Mining product reputations on the Web, Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, July 23-26, 2002, Edmonton, Alberta, Canada  doi:10.1145/775047.775098
  • 23 Tetsuya Nasukawa, Jeonghee Yi, Sentiment analysis: capturing favorability using natural language processing, Proceedings of the 2nd International Conference on Knowledge capture, October 23-25, 2003, Sanibel Island, FL, USA  doi:10.1145/945645.945658
  • 24 Vincent Ng, Sajib Dasgupta, S. M. Niaz Arifin, Examining the role of linguistic knowledge sources in the automatic identification and classification of reviews, Proceedings of the COLING/ACL on Main conference poster sessions, p.611-618, July 17-18, 2006, Sydney, Australia
  • 25 NLProcessor ¿ Text Analysis Toolkit. (2000). http://www.infogistics.com/textanalysis.html.
  • 26 Bo Pang, Lillian Lee, Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, p.115-124, June 25-30, 2005, Ann Arbor, Michigan  doi:10.3115/1219840.1219855
  • 27 Bo Pang, Lillian Lee, Shivakumar Vaithyanathan, Thumbs up?: Sentiment Classification Using Machine Learning Techniques, Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, p.79-86, July 06, 2002  doi:10.3115/1118693.1118704
  • 28 Ana-Maria Popescu, Oren Etzioni, Extracting product features and opinions from reviews, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.339-346, October 06-08, 2005, Vancouver, British Columbia, Canada  doi:10.3115/1220575.1220618
  • 29 Ellen Riloff, Janyce M. Wiebe, Learning extraction patterns for subjective expressions, Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, p.105-112, July 11, 2003  doi:10.3115/1119355.1119369
  • 30 V. Stoyanov and C. Cardie. Toward opinion summarization: Linking the sources. In: Proceedings of the Workshop on Sentiment and Subjectivity in Text, 2006.
  • 31 R. Tong. An Operational System for Detecting and Tracking Opinions in on-line discussion. SIGIR 2001 Workshop on Operational Text Classification, 2001.
  • 32 Peter D. Turney, Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, July 07-12, 2002, Philadelphia, Pennsylvania  doi:10.3115/1073083.1073153
  • 33 T. Wilson, Janyce M. Wiebe, and R. Hwa. Just how mad are you? Finding strong and weak opinion clauses. AAAI'04, 2004.
  • 34 Janyce M. Wiebe, Rada Mihalcea, Word sense and subjectivity, Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, p.1065-1072, July 17-18, 2006, Sydney, Australia  doi:10.3115/1220175.1220309
  • 35 Janyce M. Wiebe, and Ellen Riloff: Creating Subjective and Objective sentence classifiers from unannotated texts. CICLing, 2005.
  • 36 Hong Yu, Vasileios Hatzivassiloglou, Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences, Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, p.129-136, July 11, 2003  doi:10.3115/1119355.1119372
  • 37 Li Zhuang, Feng Jing, Xiao-Yan Zhu, Movie review mining and summarization, Proceedings of the 15th ACM International Conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA  doi:10.1145/1183614.1183625,


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2008 AHolisticLexiconBasedApprToOpinMinBing Liu
Xiaowen Ding
Philip S. Yu
A Holistic Lexicon-based Approach to Opinion MiningProceedings of the International Conference on Web Search and Web Data Mininghttp://www.wsdm2009.org/wsdm2008.org/WSDM2008-papers/p231.pdf10.1145/1341531.13415612008