Semi-Supervised Target-Dependent Sentiment Classification for Micro-Blogs

  • Shadi I. Abudalfa Collage of Computer Science and Engineering, King Fahd University of Petroleum and Minerals, Dhahran, Kingdom of Saudi Arabia
  • Moataz A. Ahmed Collage of Computer Science and Engineering, King Fahd University of Petroleum and Minerals, Dhahran, Kingdom of Saudi Arabia
Keywords: Social opinions, Sentiment analysis, target-dependent, semi-supervised learning

Abstract

The wealth of opinions available in the social media motivated researchers to develop automatic opinion detection tools. Many such tools are currently available online for opinion mining in short text, known as micro-blogs, but their efficacies are still limited. Current tools focus on detecting sentiment polarity expressed in a micro-blog regardless of the topic (target) discussed. Little improved approaches have been proposed to detect sentiment towards a specific target, referred to as target-dependent sentiment classification. Our literature review has shown that all these target-dependent approaches use supervised learning techniques. Such techniques need a huge amount of labeled data for increasing classification accuracy. However, preparing labeled data from social media needs a lot of efforts. In this work, we address this issue by employing semisupervised learning techniques that have not been used before with target-dependent sentiment classification. To the best of our knowledge, our work is the first research that employs semisupervised learning techniques in this direction. Semi-supervised learning techniques have been known in the literature to improve classification accuracy in comparison with supervised learning techniques; however, they use same number of labeled samples plus many unlabelled ones. In this work, we propose a new semi-supervised learning technique that uses less number of labeled microblogs than that used with supervised learning techniques. Experiment results have shown that the proposed technique provides competitive accuracy.

Downloads

Download data is not yet available.

References

F. Pozzi, E. Fersini, E. Messina, and B. Liu, Sentiment Analysis in Social Networks, 1st ed, Elsevier Inc., 2017.

L. Jiang, M. Yu, M. Zhou, X. Liu, and T. Zhao, “Target-dependent Twitter Sentiment Classification,” Proc. the 49th Annual Meeting of the Association for Computational Linguistics, pp. 151–160, Portland, Oregon, June 19-24, 2011.

K. Veselovska, J. Hajic, Jr. and J. Sindlerova, "Creating Annotated Resources for Polarity Classification in Czech,” Proc. KONVENS 2012 (PATHOS 2012 workshop), Vienna, September 21, 2012.

I. Tetko, D. Livingstone, A. Luik, "Neural network studies. 1. Comparison of Overfitting and Overtraining," J. Chem. Inf. Comput. Sci., vol. 35, no. 5, pp. 826–833, 1995.

B. Liu, SENTIMENT ANAYSIS: Mining Opinions, Sentiments, and Emotions, Cambridge University Press, 2015.

R. Groot, “Data Mining for Tweet Sentiment Classification,” Master thesis, Utrecht University, July 23, 2012.

M. Thelwall, K. Buckley, G. Paltoglou, D. Cai, and A. Kappas, “Sentiment strength detection in short informal text,” Journal of the American

Society for Information Science and Technology, vol. 61, no. 12, pp. 2544-2558, 2010.

X. Zhu, “Semi-Supervised Learning Literature Survey,” Computer Sciences Technical Report 1530, University of Wisconsin–Madison, 2005.

M. Yang, W. Tu, Z. Lu, W. Yin, and K.-P. Chow, “LCCT: A Semi-supervised Model for Sentiment Classification,” Human Language Technologies: The 2015 Annual Conference of the North American Chapter of the ACL, pp. 546–555, Denver, Colorado, May 31 – June 5, 2015.

M. Loog, “Contrastive Pessimistic Likelihood Estimation for Semi-Supervised Classification,” IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 38, no. 3:1, 2015.

S. Basu, A. Banerjee, and R. Mooney, “Semisupervised Clustering by Seeding”, Proceedings of the 19th International Conference on Machine Learning (ICML-2002), pp. 19-26, Sydney, Australia, July 2002.

X. Zhu, and Z. Ghahramani, “Learning from labeled and unlabeled data with label propagation,” Technical Report CMU-CALD-02-107, Carnegie Mellon University, 2002.

F. Gieseke, A. Airolab, T. Pahikkalab, and O. Kramera, “Fast and Simple Gradient-Based Optimization for Semi-Supervised Support Vector Machines,” 2013.

X. Wang, C. Wang, and J. Shen, “Semi–supervised K-Means Clustering by Optimizing Initial Cluster Centers,” Web Information Systems and Mining, vol. 6988 of the series Lecture Notes in Computer Science pp. 178-187, 2011.

D. Zhou,O. Bousquet, T. Lal, J. Weston, and B. Schölkopf, “Learning with local and global consistency,” Advances in Neural Information Processing Systems 16, 2004.

F. Gieseke, A. Airola, T. Pahikkala, and O. Kramer, “Sparse Quasi-Newton Optimization for Semi-Supervised Support Vector Machines,” 2012.

C. Metz, "Basic principles of ROC analysis" (PDF). Semin Nucl Med., vol. 8, no. 4, pp. 283–98, October 1978.

S. Parambath, N. Usunier, and Y. Grandvalet, “Optimizing F-measures by cost- sensitive classification,” Proc. Neural Information Processing Systems (NIPS), pp. 2123–2131, 2014.

S. Abudalfa, and M. Ahmed, “Survey on Target Dependent Sentiment Analysis of Micro-Blogs in Social Media,” the 9th IEEE GCC Conference & Exhibition, Manama, Bahrain, 7-10 May 2017.

F. Khan, U. Qamar, and S. Bashir, “S WIMS: Semi-supervise d subjective feature weighting and intelligent model selection for sentiment analysis,” Knowledge-Based Systems, vol. 100, pp. 97-11, 2016.

A. Bakliwal, P. Arora, S. Madhappan, N. Kapre, M. Singh, and V. Varma, “Mining Sentiments from Tweets,” 3rd Workshop on Sentiment and Subjectivity Analysis (WASSA) in Conjunction with 50th annual meeting of Association for Computational Linguistics (ACL) 2012.

L. Dong, F. Wei, C. Tan, D. Tang, M. Zhou, and K. Xu, “Adaptive Recursive Neural Network for Target-dependent Twitter Sentiment Classification,” Proc. of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers), pages 49–54, Baltimore, Maryland, USA, June 23-25

C. Quan, and F. Ren, “Target Based Review Classification for Fine-Grained Sentiment Analysis,” International Journal of Innovative Computing, Information and Control, vol. 10, no. 1, February 2014.

D. Vo and Y. Zhang, “Target-Dependent Twitter Sentiment Classification with Rich Automatic Features,” the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI), 2015.

D. Tang, F. Wei, N. Yang, M. Zhou, T. Liu, and B. Qin, “Learning sentiment-specific word embedding for Twitter sentiment classification,” Proc. the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 1555–1565, 2014.

D. Tang, B. Qin, X. Feng, and T. Liu, “Effective LSTMs for Target-Dependent Sentiment Classification,” COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 3298–3307, Osaka, Japan, December 11-17 2016.

M. Zhang, Y. Zhang, and D. Vo, “Gated Neural Networks for Targeted Sentiment Analysis,” the 30th AAAI Conference on Artificial Intelligence, Phoenix, Arizona, USA. Association for the Advancement of Artificial Intelligence, 2016.

Y. Wang, H. Gao, and S. Geng "A TargetDependent Sentiment Analysis Method for Microblog Streams," 18th Asia-Pacific Web Conference, Web Technologies and Applications, APWeb 2016, Part{II}, pp. 30-42, Suzhou, China, September 23-25, 2016.

M. Jabreel and A. Moreno, “Target-dependent Sentiment Analysis of Tweets using a Bi-directional Gated Recurrent Unit,” Smart Innovation, Systems and Technologies, SIST, vol. 85, 2018.

M. Aly, “Survey on Multiclass Classification Methods,” Technical report, California Institute of Technology, November 2005.

S. Ravi, “Semi-supervised Learning in Support Vector Machines,” Project Report, Princeton University, 2014.

Published
2019-04-17
How to Cite
Abudalfa, S., & Ahmed, M. (2019). Semi-Supervised Target-Dependent Sentiment Classification for Micro-Blogs. Journal of Computer Science and Technology, 19(01), e06. https://doi.org/10.24215/16666038.19.e06
Section
Original Articles