ISSN: 1735-188X
- Call for Special Issue
- Important Announcements
- Online Submission
- Indexing/Abstracting
Volume 12, No 2, 2015
Techniques for text classification: Literature review and current trends
Rajni jindal, ruchika malhotra and abha jain.
Automated classification of text into predefined categories has always been considered as a vital method to manage and process a vast amount of documents in digital forms that are widespread and continuously increasing. This kind of web information, popularly known as the digital/electronic information is in the form of documents, conference material, publications, journals, editorials, web pages, e-mail etc. People largely access information from these online sources rather than being limited to archaic paper sources like books, magazines, newspapers etc. But the main problem is that this enormous information lacks organization which makes it difficult to manage. Text classification is recognized as one of the key techniques used for organizing such kind of digital data. In this paper we have studied the existing work in the area of text classification which will allow us to have a fair evaluation of the progress made in this field till date. We have investigated the papers to the best of our knowledge and have tried to summarize all existing information in a comprehensive and succinct manner. The studies have been summarized in a tabular form according to the publication year considering numerous key perspectives. The main emphasis is laid on various steps involved in text classification process viz. document representation methods, feature selection methods, data mining methods and the evaluation technique used by each study to carry out the results on a particular dataset.
Pages : 1-28
Keywords : Machine learning; Text classification; Feature selection; Bag-of-words; Vector space model
A Systematic Literature Review of Text Classification: Datasets and Methods
Ieee account.
- Change Username/Password
- Update Address
Purchase Details
- Payment Options
- Order History
- View Purchased Documents
Profile Information
- Communications Preferences
- Profession and Education
- Technical Interests
- US & Canada: +1 800 678 4333
- Worldwide: +1 732 981 0060
- Contact & Support
- About IEEE Xplore
- Accessibility
- Terms of Use
- Nondiscrimination Policy
- Privacy & Opting Out of Cookies
A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.
Review on sentiment analysis for text classification techniques from 2010 to 2021
- 1199: Computational Intelligence Revolution in Multimedia Data Analytics and Business Management
- Published: 01 December 2022
- Volume 82 , pages 8137–8193, ( 2023 )
Cite this article
- Arif Ullah ORCID: orcid.org/0000-0002-7740-2206 1 ,
- Sundas Naqeeb Khan 2 &
- Nazri Mohd Nawi 2
959 Accesses
Explore all metrics
Progression in the popularity of social media activities had provided huge amount of data in the form of text that can immeasurably augment its specialty. This textual data offers a platform for the reviewers to share their comments about any product, service or event on social media. These types of discussions among the reviewers boost the demand and supply in business and industry field. Furthermore, for every passing day the textual data is also increasing in amount which makes data mining especially sentiment analysis or opinion mining, a research hungry area. This is mainly because of data is represented in the form of calculations about reviewers’ comments, assessment, attitudes, behavior and emotions to individual issues, events, topics, services and attributes. Previously, researchers focus on systems to recognize and categorize sentiments from the written material where opinions are extremely unstructured, assorted and classified. In this paper, authors try to presents a meticulous survey on sentiment analysis with classification, in which one hundred and forty three articles were reviewed regarding important activities, approaches, applications with multilingual and cross domain jobs. This systematic survey considers published literature during 2010-2021, organized based on machine learning, lexicon and hybrid approaches with multilingual and cross domain knowledge.
This is a preview of subscription content, log in via an institution to check access.
Access this article
Price includes VAT (Russian Federation)
Instant access to the full article PDF.
Rent this article via DeepDyve
Institutional subscriptions
Similar content being viewed by others
A survey on sentiment analysis methods, applications, and challenges
Mayur Wankhade, Annavarapu Chandra Sekhara Rao & Chaitanya Kulkarni
A review on sentiment analysis and emotion detection from text
Pansy Nandwani & Rupali Verma
Sentiment Analysis in the Age of Generative AI
Jan Ole Krugmann & Jochen Hartmann
Data availability
No specific Data are used because it is review papers all those paper which are study and used in this paper are cited in the paper.
https://nlp.stanford.edu/software/tokenizer.shtml
https://opennlp.apache.org/documentation/manual/opennlp.html#tools .
http://ictclas.nlpir.org .
http://thulac.thunlp.org .
http://nlp.stanford.edu/software/segmenter.shtml .
Abbasi A, Chen H, Salem A (2008) Sentiment analysis in multiple languages: feature selection for opinion classification in web forums. ACM Trans Inf Syst (TOIS) 26(3):12
Article Google Scholar
Abbasi A, France S, Zhang Z, Chen H (2011) Selecting attributes for sentiment classification using feature relation networks. IEEE Trans Knowl Data Eng 23(3):447–462
Abdul-Mageed M, Diab M, Kübler S (2014) SAMAR: subjectivity and sentiment analysis for Arabic social media. Comput Speech Lang 28(1):20–37
Adeleke AO, Samsudin NA, Mustapha A, Nawi NM (2017) Comparative analysis of text classification algorithms for automated labelling of Quranic verses. Int. J. Adv. Sci. Eng. Inf. Technol 7(4):1419
Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau R (2011, June) Sentiment analysis of twitter data. In: Proceedings of the workshop on languages in social media (pp. 30-38). Association for Computational Linguistics
Ali F, Kwak KS, Kim YG (2016) Opinion mining based on fuzzy domain ontology and support vector machine: A proposal to automate online review classification. Appl Soft Comput 47:235–250
Archak, N, Ghose, A, Ipeirotis, PG (2007) Deriving the pricing power of product features by mining consumer reviews
Arif MH, Li J, Iqbal M, Liu K (2018) Sentiment analysis and spam detection in short informal text using learning classifier systems. Soft Comput 22(21):7281–7291
Atkinson, K (2006) Gnu aspell 0.60. 4
Baccianella, S, Esuli, A, Sebastiani, F (2010) Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In LREC (Vol. 10, No. 2010, pp. 2200-2204)
Bai X (2011) Predicting consumer sentiments from online text. Decis Support Syst 50(4):732–742
Balahur A, Hermida JM, Montoyo A (2012) Detecting implicit expressions of emotion in text: A comparative analysis. Decis Support Syst 53(4):742–753
Banea C, Mihalcea R, Wiebe J, Hassan S (2008) Multilingual subjectivity analysis using machine translation. In: Proceedings of the conference on empirical methods in natural language processing (pp. 127-135). Association for Computational Linguistics
Bao H, Li Q, Liao SS, Song S, Gao H (2013) A new temporal and social PMF-based method to predict users' interests in micro-blogging. Decis Support Syst 55(3):698–709
Basari ASH, Hussin B, Ananta IGP, Zeniarja J (2013) Opinion mining of movie review using hybrid method of support vector machine and particle swarm optimization. Procedia Eng 53:453–462
Bell D, Koulouri T, Lauria S, Macredie RD, Sutton J (2014) Micro-blogging as a mechanism for human–robot interaction. Knowl-Based Syst 69:64–77
Benamara, F, Cesarano, C, Picariello, A, Recupero, DR, Subrahmanian, VS (2007) Sentiment analysis: Adjectives and adverbs are better than adjectives alone. In ICWSM
Bhatia, P, Ji, Y, Eisenstein, J (2015) Better document-level sentiment analysis from RST discourse parsing. arXiv preprint arXiv:1509.01599
Bilianos D (2022) Experiments in text classification: Analyzing the sentiment of electronic product reviews in greek. J Quant Linguist 29(3):374–386
Bird, S, Klein, E, Loper, E (2009) Natural language processing with Python: analyzing text with the natural language toolkit. “O’Reilly Media, Inc."
Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(Jan):993–1022
MATH Google Scholar
Blitzer, J, Dredze, M, Pereira, F (2007) Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In Proceedings of the 45th annual meeting of the association of computational linguistics (pp. 440-447)
Boiy E, Moens MF (2009) A machine learning approach to sentiment analysis in multilingual web texts. Inf Retr 12(5):526–558
Boiy, E, Hens, P, Deschacht, K, Moens, MF (2007) Automatic Sentiment Analysis in On-line Text. In ELPUB (pp. 349-360)
Boldrini E, Balahur A, Martínez-Barco P, Montoyo A (2012) Using EmotiBlog to annotate and analyse subjectivity in the new textual genres. Data Min Knowl Disc 25(3):603–634
Bollegala D, Weir D, Carroll J (2011) Using multiple sources to construct a sentiment sensitive thesaurus for cross-domain sentiment classification. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies-volume 1 (pp. 132-141). Association for Computational Linguistics
Bollegala D, Weir D, Carroll J (2013) Cross-domain sentiment classification using a sentiment sensitive thesaurus. IEEE Trans Knowl Data Eng 25(8):1719–1731
Bollen J, Mao H, Zeng X (2011) Twitter mood predicts the stock market. J Comput Sci 2(1):1–8
Bonzanini, M (2012) A knowledge-based approach for summarising opinions. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (pp. 991-991). ACM
Bouazizi, M, Ohtsuki, T (2017) A Pattern-Based Approach for Multi-Class Sentiment Analysis in Twitter. IEEE Access
Brody S, Elhadad N (2010) An unsupervised aspect-sentiment model for online reviews. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (pp. 804-812). Association for Computational Linguistics
Bross, J, Ehrig, H (2013) Automatic construction of domain and aspect specific sentiment lexicons for customer review mining. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (pp. 1077-1086). ACM
Cambria E, White B (2014) Jumping NLP curves: A review of natural language processing research. IEEE Comput Intell Mag 9(2):48–57
Cambria E, Schuller B, Xia Y, Havasi C (2013) New avenues in opinion mining and sentiment analysis. IEEE Intell Syst 28(2):15–21
Cambria E, Gastaldo P, Bisio F, Zunino R (2015) An ELM-based model for affective analogical reasoning. Neuro-computing 149:443–455
Google Scholar
Cao Q, Duan W, Gan Q (2011) Exploring determinants of voting for the “helpfulness” of online user reviews: A text mining approach. Decis Support Syst 50(2):511–521
Chambers, N, Bowen, V, Genco, E, Tian, X, Young, E, Harihara, G, Yang, E (2015) Identifying Political Sentiment between Nation States with Social Media. In EMNLP (pp. 65-75)
Che, W, Li, Z, Liu, T (2010) Ltp: A Chinese language technology platform. In Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations (pp. 13-16). Assoc Comput Linguist
Chen CC, Tseng YD (2011) Quality evaluation of product reviews using an information quality framework. Decis Support Syst 50(4):755–768
Chen, WT, Lin, SC, Huang, SL, Chung, YS, Chen, KJ (2010) E-HowNet and automatic construction of a lexical ontology. In Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations (pp. 45-48). Assoc Comput Linguist
Chen LS, Liu CH, Chiu HJ (2011) A neural network based approach for sentiment classification in the blogosphere. J Inf 5(2):313–322
Chen X, Vorvoreanu M, Madhavan K (2014) Mining social media data for understanding students’ learning experiences. IEEE Trans Learn Technol 7(3):246–259
Chen, X, Qiu, X, Zhu, C, Huang, X (2015) Gated Recursive Neural Network for Chinese Word Segmentation. In ACL (1) (pp. 1744-1753)
Chen Q, Li W, Lei Y, Liu X, He Y (2015) Learning to adapt credible knowledge in cross-lingual sentiment analysis. In: Proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th international joint conference on natural language processing (volume 1: long papers) (Vol. 1, pp. 419-429)
Chenlo JM, Hogenboom A, Losada DE (2013) Sentiment-based ranking of blog posts using rhetorical structure theory. In international conference on application of natural language to information systems (pp. 13-24). Springer, Berlin, Heidelberg
Chklovski, T, Pantel, P (2004) Verbocean: Mining the web for fine-grained semantic verb relations. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing
Choi Y, Cardie C (2009) Adapting a polarity lexicon using integer linear programming for domain-specific sentiment classification. In: Proceedings of the 2009 conference on empirical methods in natural language processing: volume 2-volume 2 (pp. 590-598). Association for Computational Linguistics
Coussement K, Van den Poel D (2009) Improving customer attrition prediction by integrating emotions from client/company interaction emails and evaluating multiple classifiers. Expert Syst Appl 36(3):6127–6134
Crammer K, Singer Y (2003) Ultraconservative online algorithms for multiclass problems. J Mach Learn Res 3(Jan):951–991
Cruz FL, Troyano JA, Enríquez F, Ortega FJ, Vallejo CG (2013) ‘Long autonomy or long delay?’The importance of domain in opinion mining. Expert Syst Appl 40(8):3174–3184
Cui, H, Mittal, V, Datar, M (2006) Comparative experiments on sentiment classification for online product reviews. In AAAI (Vol. 6, pp. 1265-1270)
Dang Y, Zhang Y, Chen H (2010) A lexicon-enhanced method for sentiment classification: An experiment on online product reviews. IEEE Intell Syst 25(4):46–53
Das, S, Chen, M (2001) Yahoo! for Amazon: Extracting market sentiment from stock message boards. In: Proceedings of the Asia Pacific finance association annual conference (APFA) (Vol. 35, p. 43)
Dasgupta, S, Ng, V (2009) Mine the easy, classify the hard: a semi-supervised approach to automatic sentiment classification. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2 (pp. 701-709). Assoc Computat Linguist
Demirtas, E (2013) Cross-lingual sentiment analysis with machine translation
Deng ZH, Luo KH, Yu HL (2014) A study of supervised term weighting scheme for sentiment analysis. Expert Syst Appl 41(7):3506–3513
Derczynski, L, Ritter, A, Clark, S, Bontcheva, K (2013) Twitter part-of-speech tagging for all: Overcoming sparse and noisy data. In Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013 (pp. 198-206)
Deshmukh, JS, Tripathy, AK (2017) Entropy based classifier for cross-domain opinion mining. Appl Comput Inf
Di Caro L, Grella M (2013) Sentiment analysis via dependency parsing. Comput Stand Interfaces 35(5):442–453
Ding, X, Liu, B, Zhang, L (2009) Entity discovery and assignment for opinion mining applications. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 1125-1134). ACM
Du J, Xu H, Huang X (2014) Box office prediction based on micro-blog. Expert Syst Appl 41(4):1680–1689
Duric A, Song F (2012) Feature selection for sentiment analysis based on content and syntax models. Decis Support Syst 53(4):704–711
Eirinaki M, Pisal S, Singh J (2012) Feature-based opinion mining and ranking. J Comput Syst Sci 78(4):1175–1184
Article MathSciNet Google Scholar
Fan TK, Chang CH (2011) Blogger-centric contextual advertising. Expert Syst Appl 38(3):1777–1788
Fang Q, Xu C, Sang J, Hossain MS, Muhammad G (2015) Word-of-mouth understanding: entity-centric multimodal aspect-opinion mining in social media. IEEE Trans Multimed 17(12):2281–2296
Fauzi MA, Firmansyah N, Afirianto T (2018) Improving sentiment analysis of short informal Indonesian product reviews using synonym based feature expansion
Feldman R (2013) Techniques and applications for sentiment analysis. Commun ACM 56(4):82–89
Gao D, Wei F, Li W, Liu X, Zhou M (2015) Cross-lingual sentiment lexicon learning with bilingual word graph label propagation. Comput Linguist 41(1):21–40
Ghiassi M, Skinner J, Zimbra D (2013) Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network. Expert Syst Appl 40(16):6266–6282
Gimpel, K., Schneider, N., O'Connor, B., Das, D., Mills, D., Eisenstein, J., ..., Smith, N. A. (2011) Part-of-speech tagging for twitter: Annotation, features, and experiments. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers-Volume 2 (pp. 42-47). Assoc Comput Linguist
Gindl S, Weichselbraun A, Scharl A (2013) Rule-based opinion target and aspect extraction to acquire affective knowledge. In proceedings of the 22nd international conference on world wide web (pp. 557-564). ACM
Go, A, Bhayani, R, Huang, L (2009) Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1(12)
Gupta SK, Phung D, Adams B, Venkatesh S (2013) Regularized nonnegative shared subspace learning. Data Min Knowl Disc 26(1):57–97
Article MathSciNet MATH Google Scholar
Hagenau M, Liebmann M, Neumann D (2013) Automated news reading: stock price prediction based on financial news using context-capturing features. Decis Support Syst 55(3):685–697
He Y, Zhou D (2011) Self-training from labeled features for sentiment analysis. Inf Process Manag 47(4):606–616
He Y, Lin C, Alani H (2011) Automatically extracting polarity-bearing topics for cross-domain sentiment classification. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies-volume 1 (pp. 123-131). Association for Computational Linguistics
Heerschop, B, Goossen, F, Hogenboom, A, Frasincar, F, Kaymak, U, de Jong, F (2011) Polarity analysis of texts using discourse structure. In Proceedings of the 20th ACM international conference on Information and knowledge management (pp. 1061-1070). ACM
Hiroshi, K, Tetsuya, N, Hideo, W (2004) Deeper sentiment analysis using machine translation technology. In Proceedings of the 20th international conference on Computational Linguistics (p. 494). Assoc Comput Linguist
Howard J, Ruder S (2018). Universal language model fine-tuning for text classification. arXiv preprint arXiv:1801.06146
Hu Y, Li W (2011) Document sentiment classification by exploring description model of topical terms. Comput Speech Lang 25(2):386–403
Hu, M, Liu, B (2004) Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 168-177). ACM
Hu N, Bose I, Koh NS, Liu L (2012) Manipulation of online reviews: An analysis of ratings, readability, and sentiments. Decis Support Syst 52(3):674–684
Hu YH, Chen YL, Chou HL (2017) Opinion mining from online hotel reviews–A text summarization approach. Inf Process Manag 53(2):436–449
Hung C, Lin HK (2013) Using objective words in SentiWordNet to improve sentiment classification for word of mouth. IEEE Intell Syst 1
Ismail, S, Alsammak, A, Elshishtawy, T (2016) A Generic Approach for Extracting Aspects and Opinions of Arabic Reviews. In Proceedings of the 10th International Conference on Informatics and Systems (pp. 173-179). ACM
Jaderberg M, Simonyan K, Vedaldi A, Zisserman A (2016) Reading text in the wild with convolutional neural networks. Int J Comput Vis 116(1):1–20
Jiang L, Yu M, Zhou M, Liu X, Zhao T (2011) Target-dependent twitter sentiment classification. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies-volume 1 (pp. 151-160). Association for Computational Linguistics.
Jiang D, Luo X, Xuan J, Xu Z (2017) Sentiment computing for the news event based on the social media big data. IEEE Access 5:2373–2382
Jiao J, Zhou Y (2011) Sentiment polarity analysis based multi-dictionary. Phys Procedia 22:590–596
Jindal, N, Liu, B (2008) Opinion spam and analysis. In Proceedings of the 2008 International Conference on Web Search and Data Mining (pp. 219-230). ACM
Jo, Y, Oh, AH (2011) Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM international conference on Web search and data mining (pp. 815-824). ACM
Kamps J, Marx M, Mokken RJ, de Rijke M (2001) Words with attitude (pp, 332-341). Language and Computation (ILLC), University of Amsterdam, Institute for Logic
Kanayama, H, Nasukawa, T (2006) Fully automatic lexicon expansion for domain-oriented sentiment analysis. In Proceedings of the 2006 conference on empirical methods in natural language processing (pp. 355-363). Assoc Comput Linguist
Kang H, Yoo SJ, Han D (2012) Senti-lexicon and improved Naïve Bayes algorithms for sentiment analysis of restaurant reviews. Expert Syst Appl 39(5):6000–6010
Kang M, Ahn J, Lee K (2018) Opinion mining using ensemble text hidden Markov models for text classification. Expert Syst Appl 94:218–227
Kaufmann, M (2012) JMaxAlign: A Maximum Entropy Parallel Sentence Alignment Tool. In COLING (Demos) (pp. 277-288)
Kaur H, Pannu HS, Malhi AK (2019) A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput Surv (CSUR) 52(4):1–36
Kennedy A, Inkpen D (2006) Sentiment classification of movie reviews using contextual valence shifters. Comput Intell 22(2):110–125
Keshtkar F, Inkpen D (2013) A bootstrapping method for extracting paraphrases of emotion expressions from texts. Comput Intell 29(3):417–435
Khamparia A, Pandey B (2020) Association of learning styles with different e-learning problems: a systematic review and classification. Educ Inf Technol 25(2):1303–1331
Khan FH, Bashir S, Qamar U (2014) TOM: twitter opinion mining framework using hybrid classification scheme. Decis Support Syst 57:245–257
Khan SN, Nawi NM, Imrona M, Shahzad A, Ullah A, Rahman A (2018) Opinion mining summarization and automation process: a survey. Int J Adv Sci Eng Inf Technol 8(5):1836–1844
Kim, SM, Hovy, E (2004) Determining the sentiment of opinions. In Proceedings of the 20th international conference on Computational Linguistics (p. 1367). Assoc Computat Linguist
Kontopoulos E, Berberidis C, Dergiades T, Bassiliades N (2013) Ontology-based sentiment analysis of twitter posts. Expert Syst Appl 40(10):4065–4074
Kouloumpis E, Wilson T, Moore JD (2011) Twitter sentiment analysis: the good the bad and the omg! Icwsm 11(538-541):164
Ku LW, Chen HH (2007) Mining opinions from the web: beyond relevance retrieval. J Assoc Inf Sci Technol 58(12):1838–1850
Kumar A, Dabas V, Hooda P (2018) Text classification algorithms for mining unstructured data: a SWOT analysis. Int J Info Technol.:1–11
Lafferty, J, McCallum, A, Pereira, FC (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data
Lambert P (2015) Aspect-level cross-lingual sentiment classification with constrained SMT. In: Proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th international joint conference on natural language processing (volume 2: short papers) (Vol. 2, pp. 781-787)
Lambov D, Pais S, Dias G (2011) Merged agreement algorithms for domain independent sentiment analysis. Procedia Soc Behav Sci 27:248–257
Lane PC, Clarke D, Hender P (2012) On developing robust models for favourability analysis: model choice, feature sets and imbalanced data. Decis Support Syst 53(4):712–718
Lazaridou A, Titov I, Sporleder C (2013) A Bayesian model for joint unsupervised induction of sentiment, aspect and discourse representations. In proceedings of the 51st annual meeting of the Association for Computational Linguistics (volume 1: long papers) (Vol. 1, pp. 1630-1639).
Li YM, Li TY (2013) Deriving market intelligence from micro-blogs. Decis Support Syst 55(1):206–217
Li YM, Shiu YL (2012) A diffusion mechanism for social advertising over micro-blogs. Decis Support Syst 54(1):9–22
Li ST, Tsai FC (2013) A fuzzy conceptualization model for text mining with application in opinion polarity classification. Knowl-Based Syst 39:23–33
Li, F, Huang, M, Zhu, X (2010) Sentiment Analysis with Global Topics and Local Dependency. In AAAI (Vol. 10, pp. 1371-1376)
Li, S, Wang, Z, Zhou, G, Lee, SYM (2011) Semi-supervised learning for imbalanced sentiment classification. In IJCAI proceedings-international joint conference on artificial intelligence (Vol. 22, No. 3, p. 1826)
Li S, Xue Y, Wang Z, Zhou G (2013) Active learning for cross-domain sentiment classification. In IJCAI (pp. 2127-2133)
Li, S, Huang, L, Wang, J, Zhou, G (2015) Semi-stacking for semi-supervised sentiment classification. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers) (Vol. 2, pp. 27-31)
Lin, C, He, Y (2009) Joint sentiment/topic model for sentiment analysis. In Proceedings of the 18th ACM conference on Information and knowledge management (pp. 375-384). ACM
Lin C, He Y, Everson R, Ruger S (2012) Weakly supervised joint sentiment-topic detection from text. IEEE Trans Knowl Data Eng 24(6):1134–1145
Liu B (2012) Sentiment analysis and opinion mining. Synth Lect Human Lang Technol 5(1):1–167
Liu K, Zhao J (2009) Cross-domain sentiment classification using a two-stage method. In proceedings of the 18th ACM conference on information and knowledge management (pp. 1717-1720). ACM
Liu, B, Lee, WS, Yu, PS, Li, X (2002) Partially supervised classification of text documents. In ICML (Vol. 2, pp. 387-394)
Liu, B, Hu, M, Cheng, J (2005) Opinion observer: analyzing and comparing opinions on the web. In Proceedings of the 14th international conference on World Wide Web (pp. 342-351). ACM
Liu, Y, Huang, X, An, A, Yu, X (2007) ARSA: a sentiment-aware model for predicting sales performance using blogs. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (pp. 607-614). ACM
Liu K, Xu L, Zhao J (2015) Co-extracting opinion targets and opinion words from online reviews based on the word alignment model. IEEE Trans Knowl Data Eng 27(3):636–650
Lu CY, Lin SH, Liu JC, Cruz-Lara S, Hong JS (2010) Automatic event-level textual emotion sensing using mutual action histogram between entities. Expert Syst Appl 37(2):1643–1653
Lu Y, Kong X, Quan X, Liu W, Xu Y (2010) Exploring the sentiment strength of user reviews. In international conference on web-age information management (pp. 471-482). Springer, Berlin, Heidelberg.
Lu, Y, Castellanos, M, Dayal, U, Zhai, C (2011) Automatic construction of a context-aware sentiment lexicon: an optimization approach. In Proceedings of the 20th international conference on World Wide Web (pp. 347-356). ACM
Ma, J, Hinrichs, EW (2015) Accurate Linear-Time Chinese Word Segmentation via Embedding Matching. In ACL (1) (pp. 1733-1743)
Maas, AL, Daly, RE, Pham, PT, Huang, D, Ng, AY, Potts, C (2011) Learning word vectors for sentiment analysis. In Proceedings of the 49th annual meeting of the association for computational linguistics: Human language technologies-volume 1 (pp. 142-150). Assoc Computat Linguist
Maks I, Vossen P (2012) A lexicon model for deep sentiment analysis and opinion mining applications. Decis Support Syst 53(4):680–688
Manning, C, Surdeanu, M, Bauer, J, Finkel, J, Bethard, S, McClosky, D (2014) The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations (pp. 55-60)
Marrese-Taylor E, Velásquez JD, Bravo-Marquez F (2014) A novel deterministic approach for aspect-based opinion mining in tourism products reviews. Expert Syst Appl 41(17):7764–7775
Marstawi, A, Sharef, NM, Aris, TNM, Mustapha, A (2017) Ontology-based Aspect Extraction for an Improved Sentiment Analysis in Summarization of Product Reviews. In Proceedings of the 8th International Conference on Computer Modeling and Simulation (pp. 100-104). ACM
MartíN-Valdivia MT, MartíNez-CáMara E, Perea-Ortega JM, UreñA-LóPez LA (2013) Sentiment polarity detection in Spanish reviews combining supervised and unsupervised approaches. Expert Syst Appl 40(10):3934–3942
McDonald, R, Crammer, K, Pereira, F (2005) Online large-margin training of dependency parsers. In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 91-98). Assoc Comput Linguist
McDonald, R, Hannan, K, Neylon, T, Wells, M, Reynar, J (2007) Structured models for fine-to-coarse sentiment analysis. In Proceedings of the 45th annual meeting of the association of computational linguistics (pp. 432-439)
Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: A survey. Ain Shams Eng J 5(4):1093–1113
Meng X, Wei F, Liu X, Zhou M, Xu G, Wang H (2012) Cross-lingual mixture model for sentiment classification. In: Proceedings of the 50th annual meeting of the Association for Computational Linguistics: long papers-volume 1 (pp. 572-581). Association for Computational Linguistics.
Miao Q, Li Q, Dai R (2009) AMAZING: A sentiment mining and retrieval system. Expert Syst Appl 36(3):7192–7198
Mihalcea, R, Banea, C, Wiebe, J (2007) Learning multilingual subjective language via cross-lingual projections. In: Proceedings of the 45th annual meeting of the association of computational linguistics (pp. 976-983)
Min HJ, Park JC (2012) Identifying helpful reviews based on customer’s mentions about experiences. Expert Syst Appl 39(15):11830–11838
Mohammad SM (2012) From once upon a time to happily ever after: tracking emotions in mail and books. Decis Support Syst 53(4):730–741
Montoyo, A, MartíNez-Barco, P, Balahur, A (2012) Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments
Moraes R, Valiati JF, Neto WPG (2013) Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Syst Appl 40(2):621–633
Moreo A, Romero M, Castro JL, Zurita JM (2012) Lexicon-based comments-oriented news sentiment analyzer system. Expert Syst Appl 39(10):9166–9180
Mostafa MM (2013) More than words: social networks’ text mining for consumer brand sentiments. Expert Syst Appl 40(10):4241–4251
Mukherjee S, Joshi S (2014) Author-specific sentiment aggregation for polarity prediction of reviews. In LREC (pp. 3092-3099)
Mullen, T, Collier, N (2004) Sentiment analysis using support vector machines with diverse information sources. In Proceedings of the 2004 conference on empirical methods in natural language processing
Narayanan, R, Liu, B, Choudhary, A (2009) Sentiment analysis of conditional sentences. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1 (pp. 180-189). Assoc Comput Linguist
Nasukawa, T, Yi, J (2003) Sentiment analysis: Capturing favorability using natural language processing. In Proceedings of the 2nd international conference on Knowledge capture (pp. 70-77). ACM
Neviarouskaya, A, Prendinger, H, Ishizuka, M (2010) Recognition of affect, judgment, and appreciation in text. In Proceedings of the 23rd international conference on computational linguistics (pp. 806-814). Assoc Comput Linguist
Nopp, C, Hanbury, A (2015) Detecting Risks in the Banking System by Sentiment Analysis. In EMNLP (pp. 591-600)
O'Connor B, Balasubramanyan R, Routledge BR, Smith NA (2010) From tweets to polls: linking text sentiment to public opinion time series. ICWSM 11(122-129):1–2
Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in Facebook and its application to e-learning. Comput Hum Behav 31:527–541
Ortigosa-Hernández J, Rodríguez JD, Alzate L, Lucania M, Inza I, Lozano JA (2012) Approaching sentiment analysis by using semi-supervised learning of multi-dimensional classifiers. Neurocomputing 92:98–115
Ouhame S, Hadi Y, Ullah A (2021) An efficient forecasting approach for resource utilization in cloud data center using CNN-LSTM model. Neural Comput Applic:1–13
Owoputi, O O'Connor, B, Dyer, C, Gimpel, K, Schneider, N, Smith, NA (2013) Improved part-of-speech tagging for online conversational text with word clusters. Assoc Comput Linguist
Pai MY, Chu HC, Wang SC, Chen YM (2013) Electronic word of mouth analysis for service experience. Expert Syst Appl 40(6):1993–2006
Pak, A, Paroubek, P (2010) Twitter as a corpus for sentiment analysis and opinion mining. In LREc (Vol. 10, No. 2010)
Pan SJ, Ni X, Sun JT, Yang Q, Chen Z (2010) Cross-domain sentiment classification via spectral feature alignment. In proceedings of the 19th international conference on world wide web (pp. 751-760). ACM
Pang, B, Lee, L (2004) A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd annual meeting on Association for Computational Linguistics (p. 271). Assoc Comput Linguist
Pang, B, Lee, L (2005) Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 115-124). Assoc Comput Linguist
Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retriev 2(1–2):1–135
Penalver-Martinez I, Garcia-Sanchez F, Valencia-Garcia R, Rodriguez-Garcia MA, Moreno V, Fraga A, Sanchez-Cervantes JL (2014) Feature-based opinion mining through ontologies. Expert Syst Appl 41(13):5995–6008
Peng, F, Feng, F, McCallum, A (2004) Chinese segmentation and new word detection using conditional random fields. In Proceedings of the 20th international conference on Computational Linguistics (p. 562). Assoc Computat Linguist
Pennebaker, JW, Boyd, RL, Jordan, K, Blackburn, K (2015) The development and psychometric properties of LIWC2015
Pisal S, Singh J, Eirinaki M (2011) AskUs: An opinion search engine. In data mining workshops (ICDMW), 2011 IEEE 11th international conference on (pp. 1243-1246). IEEE.
Popescu O, Strapparava C (2014) Time corpora: epochs, opinions and changes. Knowl-Based Syst 69:3–13
Popescu AM, Nguyen B, Etzioni O (2005) OPINE: extracting product features and opinions from reviews. In: Proceedings of HLT/EMNLP on interactive demonstrations (pp. 32-33). Association for Computational Linguistics
Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowl-Based Syst 108:42–49
Prabowo R, Thelwall M (2009) Sentiment analysis: A combined approach. J Inf 3(2):143–157
Przepiórkowski A (2009) XML text interchange format in the National Corpus of polish. In the proceedings of practical applications in language and computers PALC 2009. Peter Lang, Frankfurt am Main
Ptaszynski M, Dokoshi H, Oyama S, Rzepka R, Kurihara M, Araki K, Momouchi Y (2013) Affect analysis in context of characters in narratives. Expert Syst Appl 40(1):168–176
Qazi A, Raj RG, Hardaker G, Standing C (2017) A systematic literature review on opinion types and sentiment analysis techniques: tasks and challenges. Int Res ( Elsiver)
Qiao, F, Wu, J, Li, J, Bashir, AK, Mumtaz, S, Tariq, U (2020) Trustworthy edge storage orchestration in intelligent transportation systems using reinforcement learning. IEEE Trans Intell Transp Syst
Qiu G, Liu B, Bu J, Chen C (2009) Expanding domain sentiment lexicon through double propagation. In IJCAI (Vol. 9, pp. 1199-1204)
Qiu G, He X, Zhang F, Shi Y, Bu J, Chen C (2010) DASA: dissatisfaction-oriented advertising based on sentiment analysis. Expert Syst Appl 37(9):6182–6191
Qiu, X, Zhang, Q, Huang, X (2013) Fudannlp: A toolkit for Chinese natural language processing. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations (pp. 49-54)
Qiu L, Rui H, Whinston A (2013) Social network-embedded prediction markets: the effects of information acquisition and communication on predictions. Decis Support Syst 55(4):978–987
Quan C, Ren F (2014) Unsupervised product feature extraction for feature-oriented opinion determination. Inf Sci 272:16–28
Rabelo JC, Prudêncio RB, Barros FA (2012) Using link structure to infer opinions in social networks. In systems, man, and cybernetics (SMC), 2012 IEEE international conference on (pp. 681-685). IEEE.
Racherla P, Friske W (2012) Perceived ‘usefulness’ of online consumer reviews: An exploratory investigation across three services categories. Electron Commer Res Appl 11(6):548–559
Rahman, MM, Wang, H (2016) Hidden topic sentiment model. In Proceedings of the 25th International Conference on World Wide Web (pp. 155-165). Int World Wide Web Conf Steering Committee
Raut, MY, Kulkarni, MA (2016) Polarity shift in opinion mining. In Advances in Electronics, Communication and Computer Technology (ICAECCT), 2016 IEEE International Conference on (pp. 333-337). IEEE
Rehurek, R, Sojka, P (2010) Software framework for topic modeling with large corpora. In In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks
Reyes A, Rosso P (2012) Making objective decisions from subjective data: detecting irony in customer reviews. Decis Support Syst 53(4):754–760
Rezaeinia SM, Rahmani R, Ghodsi A, Veisi H (2019) Sentiment analysis based on improved pre-trained word embeddings. Expert Syst Appl 117:139–147
Rill S, Reinel D, Scheidt J, Zicari RV (2014) Politwi: early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis. Knowl-Based Syst 69:24–33
Riloff, E, Wiebe, J (2003) Learning extraction patterns for subjective expressions. In Proceedings of the 2003 conference on Empirical methods in natural language processing (pp. 105-112). Assoc Comput Linguist
Riloff, E, Wiebe, J, Wilson, T (2003) Learning subjective nouns using extraction pattern bootstrapping. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4 (pp. 25-32). Assoc Comput Linguist
Roiger, RJ (2017) Data mining: a tutorial-based primer. CRC Press
Roussev V, Quates C (2013) File fragment encoding classification—An empirical approach. Digit Investig 10:S69–S77
Rui H, Liu Y, Whinston A (2013) Whose and what chatter matters? The effect of tweets on movie sales. Decis Support Syst 55(4):863–870
Saleh MR, Martín-Valdivia MT, Montejo-Ráez A, Ureña-López LA (2011) Experiments with SVM to classify opinions in different domains. Expert Syst Appl 38(12):14799–14804
Santosh DT, Babu KS, Prasad SD, Vivekananda A (2016) Opinion mining of online product reviews from traditional LDA topic clusters using feature ontology Tree and Sentiwordnet. IJEME 6:1–11
Seki Y, Kando N, Aono M (2009) Multilingual opinion holder identification using author and authority viewpoints. Inf Process Manag 45(2):189–199
Severyn A, Moschitti A, Uryupina O, Plank B, Filippova K (2016) Multi-lingual opinion mining on youtube. Inf Process Manag 52(1):46–60
Shah K, Patel H, Sanghvi D, Shah M (2020) A comparative analysis of logistic regression, random forest and KNN models for the text classification. Augmented Human Res 5(1):1–16
Sharma, R, Nigam, S, Jain, R (2014) Mining of product reviews at aspect level. arXiv preprint arXiv:1406.3714
Shawe-Taylor J, Sun S (2011) A review of optimization methodologies in support vector machines. Neurocomputing 74(17):3609–3618
Silva NFFD, Coletta LF, Hruschka ER (2016) A survey and comparative study of tweet sentiment analysis via semi-supervised learning. ACM Comput Surv (CSUR) 49(1):15
Sindhwani, V, Melville, P (2008) Document-word co-regularization for semi-supervised sentiment analysis. In Data Mining, 2008. ICDM'08. Eighth IEEE International Conference on (pp. 1025-1030). IEEE
Socher, R, Perelygin, A, Wu, J, Chuang, J, Manning, CD, Ng, A, Potts, C (2013) Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 conference on empirical methods in natural language processing (pp. 1631-1642)
Somasundaran, S, Namata, G, Wiebe, J, Getoor, L (2009) Supervised and unsupervised methods in employing discourse relations for improving opinion polarity classification. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1 (pp. 170-179). Assoc Comput Linguist
Sperberg-McQueen CM (1991) Text in the electronic age: Texual study and textual study and text encoding, with examples from medieval texts. Lit Linguist Comput 6(1):34–46
Spina D, Gonzalo J, Amigó E (2013) Discovering filter keywords for company name disambiguation in twitter. Expert Syst Appl 40(12):4986–5003
Steinberger J, Ebrahim M, Ehrmann M, Hurriyetoglu A, Kabadjov M, Lenkova P, … Zavarella V (2012) Creating sentiment dictionaries via triangulation. Decis Support Syst 53(4):689–694
Stone PJ, Hunt EB (1963) A computer approach to content analysis: studies using the general inquirer system. In: Proceedings of the May 21-23, 1963, spring joint computer conference (pp. 241-256). ACM.
Stone, PJ, Dunphy, DC, Smith, MS (1966) The general inquirer: A computer approach to content analysis
Sun, Y, Xu, J, Wu, H, Lin, G, Mumtaz, S (2021) Deep learning based semi-supervised control for vertical security of maglev vehicle with guaranteed bounded airgap. IEEE Trans Intell Transp Syst
Taboada M, Grieve J (2004, March) Analyzing appraisal automatically. In: Proceedings of AAAI spring symposium on exploring attitude and affect in text (AAAI technical re# port SS# 04# 07), Stanford University, CA, pp. 158q161. AAAI Press
Taboada, M, Anthony, C, Voll, K (2006) Methods for creating semantic orientation dictionaries. In Conference on Language Resources and Evaluation (LREC) (pp. 427-432)
Taboada M, Brooke J, Tofiloski M, Voll K, Stede M (2011) Lexicon-based methods for sentiment analysis. Computat Linguist 37(2):267–307
Täckström O, McDonald R (2011) Semi-supervised latent variable models for sentence-level sentiment analysis. In proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies: short papers-volume 2 (pp. 569-574). Association for Computational Linguistics.
Taddy M (2013) Measuring political sentiment on twitter: factor optimal design for multinomial inverse regression. Techno-metrics 55(4):415–425
Tan S, Wang Y (2011) Weighted SCL model for adaptation of sentiment classification. Expert Syst Appl 38(8):10524–10531
Tan S, Wu Q (2011) A random walk algorithm for automatic construction of domain-oriented sentiment lexicon. Expert Syst Appl 38(10):12094–12100
Tan S, Cheng X, Wang Y, Xu H (2009) Adapting naive bayes to domain adaptation for sentiment analysis. In European conference on information retrieval (pp. 337-349). Springer, Berlin, Heidelberg
Tan LKW, Na JC, Theng YL, Chang K (2012) Phrase-level sentiment polarity classification using rule-based typed dependencies and additional complex phrases consideration. J Comput Sci Technol 27(3):650–666
Tan S, Li Y, Sun H, Guan Z, Yan X, Bu J, … He X (2014) Interpreting the public sentiment variations on twitter. IEEE Trans Knowl Data Eng 26(5):1158–1170
Tang D, Qin B, Wei F, Dong L, Liu T, Zhou M (2015) A joint segmentation and classification framework for sentence level sentiment classification. IEEE/ACM Trans Audio, Speech, Lang Process 23(11):1750–1761
Thelen, M, Riloff, E (2002) A bootstrapping method for learning semantic lexicons using extraction pattern contexts. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10(pp. 214-221). Assoc Comput Linguist
Thelwall M, Buckley K (2013) Topic-based sentiment analysis for the social web: the role of mood and issue-related words. J Assoc Inf Sci Technol 64(8):1608–1617
Thelwall M, Buckley K, Paltoglou G, Cai D, Kappas A (2010) Sentiment strength detection in short informal text. J Assoc Inf Sci Technol 61(12):2544–2558
Thelwall M, Buckley K, Paltoglou G (2011) Sentiment in twitter events. J Assoc Inf Sci Technol 62(2):406–418
Thelwall M, Buckley K, Paltoglou G (2012) Sentiment strength detection for the social web. J Assoc Inf Sci Technol 63(1):163–173
Thet TT, Na JC, Khoo CS (2010) Aspect-based sentiment analysis of movie reviews on discussion boards. J Inf Sci 36(6):823–848
Tong, RM (2001) An operational system for detecting and tracking opinions in on-line discussion. In Working Notes of the ACM SIGIR 2001 Workshop on Operational Text Classification (Vol. 1, No. 6)
Trivedi, R, Eisenstein, J (2013) Discourse connectors for latent subjectivity in sentiment analysis. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 808-813)
Tsai ACR, Wu CE, Tsai RTH, Hsu JYJ (2013) Building a concept-level sentiment dictionary based on commonsense knowledge. IEEE Intell Syst 28(2):22–30
Tseng, H, Chang, P, Andrew, G, Jurafsky, D, Manning, C (2005) A conditional random field word segmenter for sighan bakeoff 2005. In Proceedings of the fourth SIGHAN workshop on Chinese language Processing (Vol. 171)
Tsytsarau M, Palpanas T (2012) Survey on mining subjective data on the web. Data Min Knowl Disc 24(3):478–514
Article MATH Google Scholar
Tumasjan A, Sprenger TO, Sandner PG, Welpe IM (2010) Predicting elections with twitter: what 140 characters reveal about political sentiment. Icwsm 10(1):178–185
Turney, PD (2002) Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 417-424). Assoc Comput Linguist
Ullah A, Nawi NM (2020) Enhancing the dynamic load balancing technique for cloud computing using HBATAABC algorithm. Int J Model, Simul, Sci Comp 11(05):2050041
Ullah A, Nawi NM, Khan MH (2020) BAT algorithm used for load balancing purpose in cloud computing: an overview. Int J High Perform Comput Network 16(1):43–54
Ullah A, Şahin CB, Dinler OB, Khan MH, Aznaoui H (2021) Heart disease prediction using various machines learning approach. J Cardiovasc. Dis. Res. 12(3):379–391. https://doi.org/10.31838/jcdr.2021.12.03.58
Usai A, Pironti M, Mital M, Mejri CA (2018) Knowledge discovery out of text data: a systematic review via text mining J Knowledge Manag
Van de Camp M, Van den Bosch A (2012) The socialist network. Decis Support Syst 53(4):761–769
Vinodhini G, Chandrasekaran RM (2014) Opinion mining using principal component analysis based ensemble model for e-commerce application. CSI Transact ICT 2(3):169–179
Walker MA, Anand P, Abbott R, Tree JEF, Martell C, King J (2012) That is your evidence?: classifying stance in online political debate. Decis Support Syst 53(4):719–729
Wan X (2008) Using bilingual knowledge and ensemble techniques for unsupervised Chinese sentiment analysis. In: Proceedings of the conference on empirical methods in natural language processing (pp. 553-561). Association for Computational Linguistics
Wan X (2009) Co-training for cross-lingual sentiment classification. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP: volume 1-volume 1 (pp. 235-243). Association for Computational Linguistics
Wang JH, Lee CC (2011) Unsupervised opinion phrase extraction and rating in Chinese blog posts. In: Privacy, security, risk and trust (PASSAT) and 2011 IEEE third international conference on social computing (SocialCom), 2011 IEEE third international conference on(pp. 820-823). IEEE
Wang, S, Manning, CD (2012) Baselines and bigrams: Simple, good sentiment and topic classification. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2 (pp. 90-94). Assoc Comput Linguist
Wang, H, Lu, Y, Zhai, C (2010) Latent aspect rating analysis on review text data: a rating regression approach. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 783-792). ACM
Wang S, Li D, Song X, Wei Y, Li H (2011) A feature selection method based on improved fisher’s discriminant ratio for text sentiment classification. Expert Syst Appl 38(7):8696–8702
Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decis Support Syst 57:77–93
Wang T, Cai Y, Leung HF, Lau RY, Li Q, Min H (2014) Product aspect extraction supervised with online domain knowledge. Knowl-Based Syst 71:86–100
Wang, R, Huang, W, Chen, W, Wang, T, Lei, K (2015) ASEM: mining aspects and sentiment of events from microblog. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (pp. 1923-1926). ACM
Wang JZ, Yan Z, Yang LT, Huang BX (2015) An approach to rank reviews by fusing and mining opinions based on review pertinence. Inf Fus 23:3–15
Wang L, Liu K, Cao Z, Zhao J, de Melo G (2015) Sentiment-aspect extraction based on restricted boltzmann machines. In proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th international joint conference on natural language processing (volume 1: long papers) (Vol. 1, pp. 616-625).
Watanabe H (1992) A similarity-driven transfer system. In proceedings of the 14th conference on computational linguistics-volume 2 (pp. 770-776). Association for computational linguistics.
Wei, B, Pal, C (2010) Cross lingual adaptation: an experiment on sentiment classifications. In Proceedings of the ACL 2010 conference short papers (pp. 258-262). Assoc Comput Linguist
Weichselbraun A, Gindl S, Scharl A (2013) Extracting and grounding contextualized sentiment lexicons. IEEE Intell Syst 28(2):39–46
Weichselbraun A, Gindl S, Scharl A (2014) Enriching semantic knowledge bases for opinion mining in big data applications. Knowl-Based Syst 69:78–85
Whitelaw, C, Garg, N, Argamon, S (2005) Using appraisal groups for sentiment analysis. In Proceedings of the 14th ACM international conference on Information and knowledge management (pp. 625-631). ACM
Wiebe J, Wilson T, Cardie C (2005) Annotating expressions of opinions and emotions in language. Lang Resour Eval 39(2-3):165–210
Wilson, T, Wiebe, J, Hoffmann, P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 347-354). Assoc Comput Linguist
Wilson, T, Hoffmann, P, Somasundaran, S, Kessler, J, Wiebe, J, Choi, Y ..., Patwardhan, S (2005) OpinionFinder: A system for subjectivity analysis. In Proceedings of hlt/emnlp on interactive demonstrations (pp. 34-35). Assoc Comput Linguist
Wu Q, Tan S (2011) A two-stage framework for cross-domain sentiment classification. Expert Syst Appl 38(11):14269–14275
Wu CE, Tsai RTH (2014) Using relation selection to improve value propagation in a conceptnet-based sentiment dictionary. Knowl-Based Syst 69:100–107
Xia R, Zong C, Li S (2011) Ensemble of feature sets and classification algorithms for sentiment classification. Inf Sci 181(6):1138–1152
Xianghua F, Guo L, Yanyan G, Zhiqiang W (2013) Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon. Knowl-Based Syst 37:186–195
Xu K, Liao SS, Li J, Song Y (2011) Mining comparative opinions from customer reviews for competitive intelligence. Decis Support Syst 50(4):743–754
Xu T, Peng Q, Cheng Y (2012) Identifying the semantic orientation of terms using S-HAL for sentiment analysis. Knowl-Based Syst 35:279–289
Xu, YC, Zhang, C, Xue, L (2013) WITHDRAWN: Measuring product susceptibility in online product review social network
Xu X, Cheng X, Tan S, Liu Y, Shen H (2013) Aspect-level opinion mining of online customer reviews. China Commun 10(3):25–41
Xu H, Zhang F, Wang W (2015) Implicit feature identification in Chinese reviews using explicit topic mining model. Knowl-Based Syst 76:166–175
Xuan HNT, Le AC, Nguyen LM (2012) Linguistic features for subjectivity classification. In Asian language processing (IALP), 2012 international conference on (pp. 17-20). IEEE
Yadav, SK, Pal, S (2012) Data mining: A prediction for performance improvement of engineering students using classification. arXiv preprint arXiv:1203.3832
Yan Z, Xing M, Zhang D, Ma B (2015) EXPRS: An extended pagerank method for product feature extraction from online consumer reviews. Inf Manag 52(7):850–858
Yang B, Cardie C (2014) Context-aware learning for sentence-level sentiment analysis with posterior regularization. In proceedings of the 52nd annual meeting of the Association for Computational Linguistics (volume 1: long papers) (Vol. 1, pp. 325-335)
Yang H, Wen J, Wu X, He L, Mumtaz S (2019) An efficient edge artificial intelligence multipedestrian tracking method with rank constraint. IEEE Trans Indust Inf 15(7):4178–4188
Yan-Yan Z, Bing Q, Ting L (2010) Integrating intra-and inter-document evidences for improving sentence sentiment classification. Acta Automat Sin 36(10):1417–1425
Ye, Q, Shi, W, Li, Y (2006) Sentiment classification for movie reviews in Chinese by improved semantic oriented approach. In System Sciences, 2006. HICSS'06. Proceedings of the 39th Annual Hawaii International Conference on (Vol. 3, pp. 53b-53b). IEEE
Yi, J, Nasukawa, T, Bunescu, R, Niblack, W (2003) Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Data Mining, 2003. ICDM 2003. Third IEEE International Conference on (pp. 427-434). IEEE
Yu H, Hatzivassiloglou V (2003) Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of the 2003 conference on empirical methods in natural language processing (pp. 129-136). Association for Computational Linguistics
Yu LC, Wu JL, Chang PC, Chu HS (2013) Using a contextual entropy model to expand emotion words and their intensity for the sentiment classification of stock market news. Knowl-Based Syst 41:89–97
Yu Y, Duan W, Cao Q (2013) The impact of social and conventional media on firm equity value: A sentiment analysis approach. Decis Support Syst 55(4):919–926
Zhai Z, Liu B, Xu H, Jia P (2011) Clustering product features for opinion mining. In proceedings of the fourth ACM international conference on web search and data mining (pp. 347-354). ACM
Zhan J, Loh HT, Liu Y (2009) Gather customer concerns from online product reviews–A text summarization approach. Expert Syst Appl 36(2):2107–2115
Zhang, Z (2008) Weighing stars: Aggregating online product reviews for intelligent e-commerce applications. IEEE Intell Syst. 23(5)
Zhang, Y, Clark, S (2008) Joint Word Segmentation and POS Tagging Using a Single Perceptron. In ACL (pp. 888-896)
Zhang, W, Skiena, S (2010) Trading Strategies to Exploit Blog and News Sentiment. In Icwsm
Zhang C, Zeng D, Li J, Wang FY, Zuo W (2009) Sentiment analysis of Chinese documents: from sentence to document level. J Assoc Inf Sci Technol 60(12):2474–2487
Zhang L, Liu B, Lim SH, O'Brien-Strain E (2010) Extracting and ranking product features in opinion documents. In proceedings of the 23rd international conference on computational linguistics: posters (pp. 1462-1470). Association for Computational Linguistics
Zhang Z, Ye Q, Zhang Z, Li Y (2011) Sentiment classification of internet restaurant reviews written in Cantonese. Expert Syst Appl 38(6):7674–7682
Zhang W, Xu H, Wan W (2012) Weakness finder: find product weakness from Chinese reviews by using aspects based sentiment analysis. Expert Syst Appl 39(11):10283–10291
Zhao, L, Huang, M, Sun, J, Luo, H, Yang, X, Zhu, X (2015) Sentiment extraction by leveraging aspect-opinion association structure. In Proceedings of the 24th ACM international on conference on information and knowledge management (pp. 343-352). ACM
Zheng, X, Chen, H, Xu, T (2013) Deep Learning for Chinese Word Segmentation and POS Tagging. In EMNLP(pp. 647-657)
Zhou L, Chaovalit P (2008) Ontology-supported polarity mining. J Assoc Inf Sci Technol 59(1):98–110
Zhou, L, Li, B, Gao, W, Wei, Z, Wong, KF (2011) Unsupervised discovery of discourse relations for eliminating intra-sentence polarity ambiguities. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (pp. 162-171). Assoc Comput Linguist
Zhou X, Wan X, Xiao J (2016) CMiner: opinion extraction and summarization for Chinese microblogs. IEEE Trans Knowl Data Eng 28(7):1650–1663
Zhu, J, Zhu, M, Wang, Q, Xiao, T (2015) Niuparser: A Chinese syntactic and semantic parsing toolkit. Proceedings of ACL-IJCNLP 2015 System Demonstrations, 145-150
Zirn, C, Niepert, M, Stuckenschmidt, H, Strube, M (2011) Fine-Grained Sentiment Analysis with Structural Features. In IJCNLP (pp. 336-344)
Download references
Author information
Authors and affiliations.
Department of Computing, Riphah International University, Faisalabad, Punjab, 44000, Faisalabad, Pakistan
Soft Computing and Data Mining Centre (SMC), Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia (UTHM), Parit Raja, Malaysia
Sundas Naqeeb Khan & Nazri Mohd Nawi
You can also search for this author in PubMed Google Scholar
Contributions
All authors are equality contribution for making paper.
Corresponding author
Correspondence to Arif Ullah .
Ethics declarations
Competing interest.
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Reprints and permissions
About this article
Ullah, A., Khan, S.N. & Nawi, N.M. Review on sentiment analysis for text classification techniques from 2010 to 2021. Multimed Tools Appl 82 , 8137–8193 (2023). https://doi.org/10.1007/s11042-022-14112-3
Download citation
Received : 01 October 2020
Revised : 20 August 2021
Accepted : 25 October 2022
Published : 01 December 2022
Issue Date : March 2023
DOI : https://doi.org/10.1007/s11042-022-14112-3
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
- Sentiment analysis
- Text classification
- Opinion mining
- Natural language
- Find a journal
- Publish with us
- Track your research
ISI Journals
- Informing Science: The International Journal of an Emerging Transdiscipline (InformingSciJ)
- Journal of Information Technology Education: Research (JITE:Research)
- Journal of Information Technology Education: Innovations in Practice (JITE:IIP)
- Journal of Information Technology Education: Discussion Cases (JITE: DC)
- Interdisciplinary Journal of e-Skills and Lifelong Learning (IJELL)
- Interdisciplinary Journal of Information, Knowledge, and Management (IJIKM)
- International Journal of Doctoral Studies (IJDS)
- Issues in Informing Science and Information Technology (IISIT)
- Journal for the Study of Postsecondary and Tertiary Education (JSPTE)
- Informing Faculty (IF)
Collaborative Journals
- Muma Case Review (MCR)
- Muma Business Review (MBR)
- International Journal of Community Development and Management Studies (IJCDMS)
- InSITE2024 : Jul 24 - 25 2024,
- All Conferences »
- Publications
- Journals
- Conferences
Text Classification Techniques: A Literature Review
Aim/Purpose The aim of this paper is to analyze various text classification techniques employed in practice, their strengths and weaknesses, to provide an improved awareness regarding various knowledge extraction possibilities in the field of data mining.
Background Artificial Intelligence is reshaping text classification techniques to better acquire knowledge. However, in spite of the growth and spread of AI in all fields of research, its role with respect to text mining is not well understood yet.
Methodology For this study, various articles written between 2010 and 2017 on “text classification techniques in AI”, selected from leading journals of computer science, were analyzed. Each article was completely read. The research problems related to text classification techniques in the field of AI were identified and techniques were grouped according to the algorithms involved. These algorithms were divided based on the learning procedure used. Finally, the findings were plotted as a tree structure for visualizing the relationship between learning procedures and algorithms.
Contribution This paper identifies the strengths, limitations, and current research trends in text classification in an advanced field like AI. This knowledge is crucial for data scientists. They could utilize the findings of this study to devise customized data models. It also helps the industry to understand the operational efficiency of text mining techniques. It further contributes to reducing the cost of the projects and supports effective decision making.
Findings It has been found more important to study and understand the nature of data before proceeding into mining. The automation of text classification process is required, with the increasing amount of data and need for accuracy. Another interesting research opportunity lies in building intricate text data models with deep learning systems. It has the ability to execute complex Natural Language Processing (NLP) tasks with semantic requirements.
Recommendations for Practitioners Frame analysis, deception detection, narrative science where data expresses a story, healthcare applications to diagnose illnesses and conversation analysis are some of the recommendations suggested for practitioners.
Recommendation for Researchers Developing simpler algorithms in terms of coding and implementation, better approaches for knowledge distillation, multilingual text refining, domain knowledge integration, subjectivity detection, and contrastive viewpoint summarization are some of the areas that could be explored by researchers.
Impact on Society Text classification forms the base of data analytics and acts as the engine behind knowledge discovery. It supports state-of-the-art decision making, for example, predicting an event before it actually occurs, classifying a transaction as ‘Fraudulent’ etc. The results of this study could be used for developing applications dedicated to assisting decision making processes. These informed decisions will help to optimize resources and maximize benefits to the mankind.
Future Research In the future, better methods for parameter optimization will be identified by selecting better parameters that reflects effective knowledge discovery. The role of streaming data processing is still rarely explored when it comes to text classification.
Back to Top ↑
- Become a Reviewer
- Privacy Policy
- Ethics Policy
- Legal Disclaimer
SEARCH PUBLICATIONS
Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser .
Enter the email address you signed up with and we'll email you a reset link.
- We're Hiring!
- Help Center
Techniques for text classification: Literature review and current trends
Automated classification of text into predefined categories has always been considered as a vital method to manage and process a vast amount of documents in digital forms that are widespread and continuously increasing. This kind of web information, popularly known as the digital/electronic information is in the form of documents, conference material, publications, journals, editorials, web pages, e-mail etc. People largely access information from these online sources rather than being limited to archaic paper sources like books, magazines, newspapers etc. But the main problem is that this enormous information lacks organization which makes it difficult to manage. Text classification is recognized as one of the key techniques used for organizing such kind of digital data. In this paper we have studied the existing work in the area of text classification which will allow us to have a fair evaluation of the progress made in this field till date. We have investigated the papers to the ...
Related Papers
The exponential growth of the internet has led to a great deal of interest in developing useful and efficient tools and software to assist users in searching the Web. Document retrieval, categorization, routing and filtering can all be formulated as classification problems. However, the complexity of natural languages and the extremely high dimensionality of the feature space of documents have made this classification problem very difficult. We investigate four different methods for document classification: the naive Bayes classifier, the nearest neighbour classifier, decision trees and a subspace method. These were applied to seven-class Yahoo news groups (business, entertainment, health, international, politics, sports and technology) individually and in combination. We studied three classifier combination approaches: simple voting, dynamic classifier selection and adaptive classifier combination. Our experimental results indicate that the naive Bayes classifier and the subspace method outperform the other two classifiers on our data sets. Combinations of multiple classifiers did not always improve the classification accuracy compared to the best individual classifier. Among the three different combination approaches, our adaptive classifier combination method introduced here performed the best. The best classification accuracy that we are able to achieve on this seven-class problem is approximately 83%, which is comparable to the performance of other similar studies. However, the classification problem considered here is more difficult because the pattern classes used in our experiments have a large overlap of words in their corresponding documents.
Khairullah Khan
IJFRCSCE Journal
–Text classification is used to classify the document of similar types. Text classification can be also performed under supervision i.e. it is an supervised leaning technique Text classification is a process in which documents are sorted spontaneously into different classes using predefined set. The main issue is that large scale of information lacks organization which makes it difficult to manage. Text classification is identified as one of the key methods used for recognizing such types of digital information. Text classification have various applications such as in information retrieval, natural language processing, automatic indexing, text filtering, image processing, etc. Text classification is also used to process the big data and it can also be used to predict the class labels for newly added data. Text classification is also being used in academic and industries to classify the unstructured data. There are various types of the text classification approaches such as decision tree, SVM, Naïve Bayes etc. In this survey paper, we have analysed the various text classification techniques such as decision tree, SVM, Naïve Bayes etc. These techniques have their individual set of advantages which make them suitable in almost all classification jobs. In this paper we have also analysed evaluation parameters such as F-measure, G-measure and accuracy used in various research works. .
IJRISE Journal
The printed transformation has seen a gigantic change in the accessibility of online data. Discovering data for pretty much any need has never been more programmed. Content arrangement (otherwise called content classification or point spotting) is the errand of naturally sorting an arrangement of archives into classifications from a predefined set. This assignment has a few applications, including computerized ordering of logical articles, recording licenses into patent indexes, particular spread of data to data purchasers, robotized populace of various leveled inventories of Web assets, spam separating, and recognizable proof of report class. Computerized content characterization is appealing in light of the fact that it liberates associations from the need of physically sorting out report bases, which can be excessively costly, or essentially not plausible since time is running short imperatives of the application or the quantity of records included. The exactness of present day content characterization frameworks equals that of prepared human experts, on account of a blend of information retrieval (IR) innovation and machine learning (ML) innovation. The point of this paper is to highlight the essential calculations that are utilized in content archives grouping, while in the meantime making familiarity with a portion of the fascinating difficulties that stay to be fathomed.
Malaysian Journal of Computer Science
Moe Htet Min
Due to the mass availability of textual data on Web, text classification (TC), classifying texts into predetermined sets becomes a spotlight for researchers. A number of TC applications have been proposed yet very few studies reported an overview of TC research area in a proper and systematic manner. This paper aims to provide an overview of TC research trends and gaps by structuring and analyzing research patterns, encountered problems and problem-solving methods in TC. In other words, this study highlights problem types, data sources, choice of language of text and types of applied techniques in TC. An intensive systematic study is conducted by applying guidelines proposed by Petersen and colleagues in 2007. In this paper, ninety-six literatures from five electronic databases from 2006 to 2017 were systematically reviewed and followed each and every step properly in accordance with systematic mapping study. Nine main problems in TC research area were identified and significant fin...
ACM Computing Surveys
Aidana Darkenova
— With the increasing availability of electronic documents and the rapid growth of the World Wide Web, the task of automatic categorization of documents became the key method for organizing the information and know-ledge discovery. Proper classification of e-documents, online news, blogs, e-mails and digital libraries need text mining, machine learning and natural language processing tech-niques to get meaningful knowledge. The aim of this paper is to highlight the important techniques and methodologies that are employed in text documents classification, while at the same time making awareness of some of the interesting challenges that remain to be solved, focused mainly on text representation and machine learning techniques. This paper provides a review of the theory and methods of document classification and text mining, focusing on the existing litera-ture.
International Journal of Computer Applications
Mukesh Zaveri
Web Intelligence
Xiaohui Tao
Text classification (a.k.a text categorisation) is an effective and efficient technology for information organisation and management. With the explosion of information resources on the Web and corporate intranets continues to increase, it has being become more and more important and has attracted wide attention from many different research fields. In the literature, many feature selection methods and classification algorithms have been proposed. It also has important applications in the real world. However, the dramatic increase in the availability of massive text data from various sources is creating a number of issues and challenges for text classification such as scalability issues. The purpose of this report is to give an overview of existing text classification technologies for building more reliable text classification applications, to propose a research direction for addressing the challenging problems in text mining.
IRJET Journal
RELATED PAPERS
Tom Odhiambo
Anales de la Facultad de Teología PUC-Chile
Sandra Arenas
Antonio Daniel Dalmasso
Darya Tsymbalyuk
Forensic Science International: Genetics Supplement Series
Raquel Herbstrith Carvalho
Phytochemistry Reviews
Justyna Makowska-Wąs
Procedia - Social and Behavioral Sciences
BMC Neuroscience
Theoden Netoff
Modern Economics
Olena Kravchenko
International Journal of Oral and Maxillofacial Surgery
L. Ardekian
Revista Brasileira de Parasitologia Veterinária
Eliane Mattos Piranda
BERITA BIOLOGI
Routledge Handbook of Jewish Ritual and Practice
ANAIS DO IV SEMINÁRIO DE ARTES DIGITAIS
Lab Front , Pablo Gobira
Carlos Somolinos
Stelios Assimakopoulos
Psychiatric Quarterly
Angelo Barbato
Matti Koivula
Applied Mathematics and Computation
Changbum Chun
Anaesthesia
kirsty forrest
bioRxiv (Cold Spring Harbor Laboratory)
Edward Avezov
Bulletin of the World Health Organization
Marcela Uhart
International Journal of Interactive Mobile Technologies (iJIM)
Dedi Kuswandi
Anne C. Dailey
arXiv (Cornell University)
See More Documents Like This
RELATED TOPICS
- We're Hiring!
- Help Center
- Find new research papers in:
- Health Sciences
- Earth Sciences
- Cognitive Science
- Mathematics
- Computer Science
- Academia ©2024
VIDEO
COMMENTS
Text classification is recognized as one of the key techniques used for organizing such kind of digital data. In this paper we have studied the existing work in the area of text classification ...
review if the paper describes research on text classification. This review does not describe all the text classification models and the techniques used to develop them in detail for practitioners. Our aim is to classify the papers with respect to their years, datasets, different feature selection
Techniques for text classification: Literature review and current trends. This paper has studied the existing work in the area of text classification and tried to summarize all existing information in a comprehensive and succinct manner to have a fair evaluation of the progress made in this field till date.
current trends and future research options in text classification techniques. L ITERATURE R EVIEW This article is a literature review of various stud ies related to text classification approac hes ...
Text Classification (TC), also known as Document Classification or Text Categorization, is the process of assigning several predefined categories to a set of texts, often based on its content (Jindal et al., 2015; Wang & Deng, 2017).With the advent of the era of big data, the enormous quantity and diversity of digital documents have made it challenging for TC.
The main emphasis is laid on various steps involved in text classification process viz. document representation methods, feature selection methods, data mining methods and the evaluation technique used by each study to carry out the results on a particular dataset. Pages: 1-28. Keywords: Machine learning; Text classification; Feature selection ...
Feature Selection (FS) methods alleviate key problems in classification procedures as they are used to improve classification accuracy, reduce data dimensionality, and remove irrelevant data. FS methods have received a great deal of attention from the text classification community. However, only a few literature surveys include them focusing on text classification, and the ones available are ...
Techniques for text classification: Literature review and current trends Rajni Jindal, Ruchika Malhotra, Abha Jain; Affiliations Rajni Jindal Department of Computer Science & Engineering, Delhi Technological University, Delhi, India. E-mail: rajnijindal (at) dce.ac.in Ruchika Malhotra Department of Computer Science & Engineering, Delhi ...
The Research Trends of Text Classification Studies (2000-2020): A Bibliometric Analysis ... Malhotra R., Jain A. (2015). Techniques for text classification: Literature review and current trends. ... Shaikh K. (2018). Prediction of cause of death from forensic autopsy reports using text classification techniques: A comparative study. Journal ...
We study the literature in major journals and conferences on the usage of shallow learning and deep learning methods for text classification. Shallow learning techniques such as Naive Bayes, Support Vector Machine, Random Forests were initially widely used to solve problems in text classification. however, these techniques generally require the presence of a precise feature extraction model ...
Due to the mass availability of textual data on Web, text classification (TC), classifying texts into predetermined sets becomes a spotlight for researchers.
Clinical text classification techniques have been employed in several types of free-text clinical reports, such as pathology reports, radiology reports, autopsy reports, death certificates, and biomedical documents. Overall, nine different types of clinical reports were identified from the literature as shown in Table 4. As shown here, majority ...
The strengths, limitations, and current research trends in text classification in an advanced field like AI are identified to provide an improved awareness regarding various knowledge extraction possibilities in the field of data mining. Aim/Purpose The aim of this paper is to analyze various text classification techniques employed in practice, their strengths and weaknesses, to provide an ...
role classification can be defined as an argument extraction task. The event classification is a multi-label text classification task to classify the type of each event. The role classification task is a multi-classification task based on word pairs, determining the role relationship between any pair of triggers and entities in a sentence.
This paper provides a comparison of the performance of well-known text classification techniques including genetic algorithm, k nearest neighbor, decision tree, support vector machine and Naïve Bayes. ... Literature review and current trends Rajni Jindal Department of Computer Science & Engineering, Delhi Technological University, Delhi, India ...
Progression in the popularity of social media activities had provided huge amount of data in the form of text that can immeasurably augment its specialty. This textual data offers a platform for the reviewers to share their comments about any product, service or event on social media. These types of discussions among the reviewers boost the demand and supply in business and industry field ...
The text classification techniques section elaborately describes various approaches. The findings section explains various results observed from the articles reviewed. The discussions section explains research gaps, and the conclusion section highlights some of the current trends and future research options in text classification techniques.
Text Classification Techniques: A Literature Review. Aim/Purpose The aim of this paper is to analyze various text classification techniques employed in practice, their strengths and weaknesses, to provide an improved awareness regarding various knowledge extraction possibilities in the field of data mining. Background Artificial Intelligence is ...
Webology, Volume 12, Number 2, December, 2015 Home Table of Contents Titles & Subject Index Authors Index Techniques for text classification: Literature review and current trends Rajni Jindal Department of Computer Science & Engineering, Delhi Technological University, Delhi, India.
Natural language processing (NLP) has significantly transformed in the last decade, especially in the field of language modeling. Large language models (LLMs) have achieved SOTA performances on natural language understanding (NLU) and natural language generation (NLG) tasks by learning language representation in self-supervised ways. This paper provides a comprehensive survey to capture the ...
Automated classification of text into predefined categories has always been considered as a vital method to manage and process a vast amount of documents in digital forms that are widespread and continuously increasing. This kind of web information, ... Techniques for text classification: Literature review and current trends ...
TRENDS AND PATTERNS OF TEXT CLASSIFICATION TECHNIQUES: A SYSTEMATIC MAPPING STUDY. Due to the mass availability of textual data on Web, text classification (TC), classifying texts into ...
Techniques for text classification: Literature review and current trends. Webology, Dec 2015 Rajni Jindal, Ruchika Malhotra, Abha ... Text classification is recognized as one of the key techniques used for organizing such kind of digital data. In this paper we have studied the existing work in the area of text classification which will allow us ...