Sentiment detection and visualization of Chinese micro-blog
Abstract
Micro-blogs are increasingly used by the public to express opinions and by organisations to detect public sentiment about social events. In contrast to the effort and progress made in English-language micro-blog analysis, research on Chinese micro-blogs has received relatively little attention. In this paper we examine and identify the key problems of this field, focusing particularly on three characteristics of Chinese “Weibo”: innovative words, emoticon elements and hierarchical structure. Based on this analysis we propose and develop associated theoretical and technological methods to address these problems. These include a new sentiment word mining method based on three wording standards and point-wise metrics, a rule set model for analyzing the sentiment features of different linguistic components, and a corresponding methodology for calculating sentiment at multiple granularities while taking emoticon elements into account. We use original Chinese tweets from a Sina Weibo dataset to test and evaluate our new word discovery and sentiment detection methods. Initial results show that our new-word dictionary can improve sentiment detection, and that our multi-level rule set method is more effective, giving 10.2% and 1.5% higher average accuracy than two existing methods for Chinese micro-blog sentiment analysis. In addition, we exploit visualisation techniques to study the relationships between online sentiment and real life, which can help depict the correlation between public emotions and events.
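The abstract does not specify which point-wise metrics are used for new sentiment word mining. As a rough illustration only, the sketch below assumes pointwise mutual information (PMI) computed from post-level co-occurrence between candidate words and a small set of known sentiment seed words. The function name `pmi_scores`, the seed set, and the toy tokens are hypothetical and are not taken from the paper; the posts are assumed to be already segmented into tokens.

```python
import math
from collections import Counter


def pmi_scores(posts, seed_words):
    """Average PMI between each candidate word and the seeds it co-occurs with.

    posts: list of token lists (already segmented posts; an assumption).
    seed_words: set of known sentiment words (hypothetical seed lexicon).
    """
    n = len(posts)
    word_df = Counter()  # number of posts containing each word
    pair_df = Counter()  # number of posts containing both candidate and seed
    for tokens in posts:
        unique = set(tokens)
        word_df.update(unique)
        for seed in seed_words & unique:
            for word in unique - seed_words:
                pair_df[(word, seed)] += 1

    per_word = {}
    for (word, seed), joint in pair_df.items():
        # PMI = log2( P(word, seed) / (P(word) * P(seed)) )
        pmi = math.log2(joint * n / (word_df[word] * word_df[seed]))
        per_word.setdefault(word, []).append(pmi)
    # Candidates that never co-occur with a seed get no score at all.
    return {word: sum(v) / len(v) for word, v in per_word.items()}


# Toy, ASCII-only stand-ins for segmented Weibo posts.
posts = [
    ["geili", "happy"],
    ["geili", "happy"],
    ["weather", "sad"],
    ["weather"],
]
scores = pmi_scores(posts, {"happy"})
```

In this toy input the candidate `geili` always appears together with the seed `happy`, so it receives a positive score, while `weather` never co-occurs with a seed and is left unscored. A real pipeline would also need frequency thresholds and the paper's wording standards, which are not described in the abstract.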