ISSN: 1735-188X

Volume 12, No 2, 2015

Techniques for text classification: Literature review and current trends

Rajni Jindal, Ruchika Malhotra and Abha Jain

Automated classification of text into predefined categories has long been considered a vital method for managing and processing the vast and continuously growing number of documents in digital form. This digital (electronic) information takes the form of documents, conference material, publications, journals, editorials, web pages, e-mail and so on. People now largely access information from these online sources rather than being limited to paper sources such as books, magazines and newspapers. The main problem is that this enormous body of information lacks organization, which makes it difficult to manage. Text classification is recognized as one of the key techniques for organizing such digital data. In this paper we study the existing work in the area of text classification, which allows a fair evaluation of the progress made in the field to date. We have investigated the papers known to us and have tried to summarize the existing information in a comprehensive and succinct manner. The studies are summarized in tabular form by publication year, considering numerous key perspectives. The main emphasis is on the steps involved in the text classification process, viz. the document representation methods, feature selection methods, data mining methods and evaluation technique that each study used to obtain results on a particular dataset.

Pages : 1-28

Keywords : Machine learning; Text classification; Feature selection; Bag-of-words; Vector space model
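
The four steps named in the abstract above (document representation, feature selection, a data mining method, and evaluation) can be sketched end to end in a few lines. The example below is only an illustration under stated assumptions: it uses a bag-of-words representation, document-frequency thresholding for feature selection, a multinomial Naive Bayes classifier with add-one smoothing, and plain accuracy for evaluation. These particular choices, the toy documents, and all function names are hypothetical and are not taken from the reviewed paper.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace; a stand-in for real preprocessing."""
    return text.lower().split()


def top_k_by_document_frequency(docs, k):
    """Feature selection: keep the k terms that occur in the most documents."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    # Ties are broken alphabetically so the result is deterministic.
    ranked = sorted(df.items(), key=lambda item: (-item[1], item[0]))
    return {term for term, _ in ranked[:k]}


def to_bag_of_words(doc, vocabulary):
    """Document representation: term counts restricted to the vocabulary."""
    return Counter(t for t in tokenize(doc) if t in vocabulary)


class MultinomialNaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, bags, labels, vocabulary):
        self.classes = sorted(set(labels))
        terms = sorted(vocabulary)
        n = len(labels)
        self.log_prior = {c: math.log(labels.count(c) / n) for c in self.classes}
        self.log_likelihood = {}
        for c in self.classes:
            counts = Counter()
            for bag, label in zip(bags, labels):
                if label == c:
                    counts.update(bag)
            total = sum(counts.values()) + len(terms)
            self.log_likelihood[c] = {
                t: math.log((counts[t] + 1) / total) for t in terms
            }
        return self

    def predict(self, bag):
        def score(c):
            return self.log_prior[c] + sum(
                n * self.log_likelihood[c][t] for t, n in bag.items()
            )
        return max(self.classes, key=score)


def accuracy(predicted, actual):
    """Evaluation: fraction of documents labelled correctly."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Toy data, invented for this sketch.
train_docs = [
    "the match ended with a late goal",
    "the team won the league match",
    "the bank raised interest rates",
    "stock markets fell as rates rose",
]
train_labels = ["sport", "sport", "finance", "finance"]
test_docs = ["a late goal won the match", "the bank cut interest rates"]
test_labels = ["sport", "finance"]

vocabulary = top_k_by_document_frequency(train_docs, k=10)
train_bags = [to_bag_of_words(d, vocabulary) for d in train_docs]
model = MultinomialNaiveBayes().fit(train_bags, train_labels, vocabulary)
predictions = [model.predict(to_bag_of_words(d, vocabulary)) for d in test_docs]
print(predictions, accuracy(predictions, test_labels))
```

Each step is a separate function so that any one of them can be swapped out, for example replacing document-frequency thresholding with another selection criterion, without touching the rest of the pipeline.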

A Systematic Literature Review of Text Classification: Datasets and Methods

Review on sentiment analysis for text classification techniques from 2010 to 2021

  • 1199: Computational Intelligence Revolution in Multimedia Data Analytics and Business Management
  • Published: 01 December 2022
  • Volume 82, pages 8137–8193 (2023)

  • Arif Ullah (ORCID: 0000-0002-7740-2206),
  • Sundas Naqeeb Khan &
  • Nazri Mohd Nawi

The growing popularity of social media has produced a huge amount of textual data of considerable analytical value. This textual data offers reviewers a platform to share their comments about any product, service or event on social media. Such discussions among reviewers boost demand and supply in business and industry. Furthermore, the amount of textual data increases with every passing day, which makes data mining, and especially sentiment analysis or opinion mining, an area in great need of research. This is mainly because the data capture reviewers' comments, assessments, attitudes, behavior and emotions toward individual issues, events, topics, services and attributes. Previous research has focused on systems that recognize and categorize sentiment in written material, where opinions are highly unstructured and varied. This paper presents a detailed survey of sentiment analysis with classification, in which one hundred and forty-three articles were reviewed with regard to important activities, approaches and applications, including multilingual and cross-domain tasks. This systematic survey covers literature published during 2010-2021, organized by machine learning, lexicon-based and hybrid approaches with multilingual and cross-domain knowledge.
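
Of the three families of approaches named in the abstract above, the lexicon-based one is the simplest to illustrate. The sketch below is a minimal example under stated assumptions: the word lists are tiny and invented for the illustration, the function name `lexicon_polarity` is hypothetical, and negation handling is deliberately crude. It does not reproduce any specific system covered by the survey.

```python
# Tiny hand-made lexicons, invented for this sketch; real systems rely on
# much larger curated resources.
POSITIVE = {"good", "great", "excellent", "love"}
NEGATIVE = {"bad", "poor", "terrible", "hate"}
NEGATORS = {"not", "never", "no"}


def lexicon_polarity(text):
    """Label text as positive, negative or neutral by summing word scores.

    A negator flips only the word that immediately follows it, which is a
    known simplification ("not very good" is therefore scored as positive).
    """
    score = 0
    negate = False
    for token in text.lower().split():
        word = token.strip(".,!?;:")
        if word in NEGATORS:
            negate = True
            continue
        value = (word in POSITIVE) - (word in NEGATIVE)  # +1, -1 or 0
        if value:
            score += -value if negate else value
        negate = False
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


for review in [
    "The battery is great and I love it",
    "not good at all",
    "It arrived on Tuesday",
]:
    print(review, "->", lexicon_polarity(review))
```

A hybrid approach in the sense used above would typically feed such lexicon scores into a trained classifier as additional features instead of using them as the final decision.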

Data availability

No specific data are used because this is a review paper; all the papers studied and used in this paper are cited in the paper.

Abbasi A, Chen H, Salem A (2008) Sentiment analysis in multiple languages: feature selection for opinion classification in web forums. ACM Trans Inf Syst (TOIS) 26(3):12

Article   Google Scholar  

Abbasi A, France S, Zhang Z, Chen H (2011) Selecting attributes for sentiment classification using feature relation networks. IEEE Trans Knowl Data Eng 23(3):447–462

Abdul-Mageed M, Diab M, Kübler S (2014) SAMAR: subjectivity and sentiment analysis for Arabic social media. Comput Speech Lang 28(1):20–37

Adeleke AO, Samsudin NA, Mustapha A, Nawi NM (2017) Comparative analysis of text classification algorithms for automated labelling of Quranic verses. Int. J. Adv. Sci. Eng. Inf. Technol 7(4):1419

Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau R (2011, June) Sentiment analysis of twitter data. In: Proceedings of the workshop on languages in social media (pp. 30-38). Association for Computational Linguistics

Ali F, Kwak KS, Kim YG (2016) Opinion mining based on fuzzy domain ontology and support vector machine: A proposal to automate online review classification. Appl Soft Comput 47:235–250

Archak, N, Ghose, A, Ipeirotis, PG (2007) Deriving the pricing power of product features by mining consumer reviews

Arif MH, Li J, Iqbal M, Liu K (2018) Sentiment analysis and spam detection in short informal text using learning classifier systems. Soft Comput 22(21):7281–7291

Atkinson, K (2006) Gnu aspell 0.60. 4

Baccianella, S, Esuli, A, Sebastiani, F (2010) Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In LREC (Vol. 10, No. 2010, pp. 2200-2204)

Bai X (2011) Predicting consumer sentiments from online text. Decis Support Syst 50(4):732–742

Balahur A, Hermida JM, Montoyo A (2012) Detecting implicit expressions of emotion in text: A comparative analysis. Decis Support Syst 53(4):742–753

Banea C, Mihalcea R, Wiebe J, Hassan S (2008) Multilingual subjectivity analysis using machine translation. In: Proceedings of the conference on empirical methods in natural language processing (pp. 127-135). Association for Computational Linguistics

Bao H, Li Q, Liao SS, Song S, Gao H (2013) A new temporal and social PMF-based method to predict users' interests in micro-blogging. Decis Support Syst 55(3):698–709

Basari ASH, Hussin B, Ananta IGP, Zeniarja J (2013) Opinion mining of movie review using hybrid method of support vector machine and particle swarm optimization. Procedia Eng 53:453–462

Bell D, Koulouri T, Lauria S, Macredie RD, Sutton J (2014) Micro-blogging as a mechanism for human–robot interaction. Knowl-Based Syst 69:64–77

Benamara, F, Cesarano, C, Picariello, A, Recupero, DR, Subrahmanian, VS (2007) Sentiment analysis: Adjectives and adverbs are better than adjectives alone. In ICWSM

Bhatia, P, Ji, Y, Eisenstein, J (2015) Better document-level sentiment analysis from RST discourse parsing. arXiv preprint arXiv:1509.01599

Bilianos D (2022) Experiments in text classification: Analyzing the sentiment of electronic product reviews in greek. J Quant Linguist 29(3):374–386

Bird, S, Klein, E, Loper, E (2009) Natural language processing with Python: analyzing text with the natural language toolkit. “O’Reilly Media, Inc."

Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(Jan):993–1022

MATH   Google Scholar  

Blitzer, J, Dredze, M, Pereira, F (2007) Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In Proceedings of the 45th annual meeting of the association of computational linguistics (pp. 440-447)

Boiy E, Moens MF (2009) A machine learning approach to sentiment analysis in multilingual web texts. Inf Retr 12(5):526–558

Boiy, E, Hens, P, Deschacht, K, Moens, MF (2007) Automatic Sentiment Analysis in On-line Text. In ELPUB (pp. 349-360)

Boldrini E, Balahur A, Martínez-Barco P, Montoyo A (2012) Using EmotiBlog to annotate and analyse subjectivity in the new textual genres. Data Min Knowl Disc 25(3):603–634

Bollegala D, Weir D, Carroll J (2011) Using multiple sources to construct a sentiment sensitive thesaurus for cross-domain sentiment classification. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies-volume 1 (pp. 132-141). Association for Computational Linguistics

Bollegala D, Weir D, Carroll J (2013) Cross-domain sentiment classification using a sentiment sensitive thesaurus. IEEE Trans Knowl Data Eng 25(8):1719–1731

Bollen J, Mao H, Zeng X (2011) Twitter mood predicts the stock market. J Comput Sci 2(1):1–8

Bonzanini, M (2012) A knowledge-based approach for summarising opinions. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (pp. 991-991). ACM

Bouazizi, M, Ohtsuki, T (2017) A Pattern-Based Approach for Multi-Class Sentiment Analysis in Twitter. IEEE Access

Brody S, Elhadad N (2010) An unsupervised aspect-sentiment model for online reviews. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (pp. 804-812). Association for Computational Linguistics

Bross, J, Ehrig, H (2013) Automatic construction of domain and aspect specific sentiment lexicons for customer review mining. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (pp. 1077-1086). ACM

Cambria E, White B (2014) Jumping NLP curves: A review of natural language processing research. IEEE Comput Intell Mag 9(2):48–57

Cambria E, Schuller B, Xia Y, Havasi C (2013) New avenues in opinion mining and sentiment analysis. IEEE Intell Syst 28(2):15–21

Cambria E, Gastaldo P, Bisio F, Zunino R (2015) An ELM-based model for affective analogical reasoning. Neuro-computing 149:443–455

Google Scholar  

Cao Q, Duan W, Gan Q (2011) Exploring determinants of voting for the “helpfulness” of online user reviews: A text mining approach. Decis Support Syst 50(2):511–521

Chambers, N, Bowen, V, Genco, E, Tian, X, Young, E, Harihara, G, Yang, E (2015) Identifying Political Sentiment between Nation States with Social Media. In EMNLP (pp. 65-75)

Che, W, Li, Z, Liu, T (2010) Ltp: A Chinese language technology platform. In Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations (pp. 13-16). Assoc Comput Linguist

Chen CC, Tseng YD (2011) Quality evaluation of product reviews using an information quality framework. Decis Support Syst 50(4):755–768

Chen, WT, Lin, SC, Huang, SL, Chung, YS, Chen, KJ (2010) E-HowNet and automatic construction of a lexical ontology. In Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations (pp. 45-48). Assoc Comput Linguist

Chen LS, Liu CH, Chiu HJ (2011) A neural network based approach for sentiment classification in the blogosphere. J Inf 5(2):313–322

Chen X, Vorvoreanu M, Madhavan K (2014) Mining social media data for understanding students’ learning experiences. IEEE Trans Learn Technol 7(3):246–259

Chen, X, Qiu, X, Zhu, C, Huang, X (2015) Gated Recursive Neural Network for Chinese Word Segmentation. In ACL (1) (pp. 1744-1753)

Chen Q, Li W, Lei Y, Liu X, He Y (2015) Learning to adapt credible knowledge in cross-lingual sentiment analysis. In: Proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th international joint conference on natural language processing (volume 1: long papers) (Vol. 1, pp. 419-429)

Chenlo JM, Hogenboom A, Losada DE (2013) Sentiment-based ranking of blog posts using rhetorical structure theory. In international conference on application of natural language to information systems (pp. 13-24). Springer, Berlin, Heidelberg

Chklovski, T, Pantel, P (2004) Verbocean: Mining the web for fine-grained semantic verb relations. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing

Choi Y, Cardie C (2009) Adapting a polarity lexicon using integer linear programming for domain-specific sentiment classification. In: Proceedings of the 2009 conference on empirical methods in natural language processing: volume 2-volume 2 (pp. 590-598). Association for Computational Linguistics

Coussement K, Van den Poel D (2009) Improving customer attrition prediction by integrating emotions from client/company interaction emails and evaluating multiple classifiers. Expert Syst Appl 36(3):6127–6134

Crammer K, Singer Y (2003) Ultraconservative online algorithms for multiclass problems. J Mach Learn Res 3(Jan):951–991

Cruz FL, Troyano JA, Enríquez F, Ortega FJ, Vallejo CG (2013) ‘Long autonomy or long delay?’The importance of domain in opinion mining. Expert Syst Appl 40(8):3174–3184

Cui, H, Mittal, V, Datar, M (2006) Comparative experiments on sentiment classification for online product reviews. In AAAI (Vol. 6, pp. 1265-1270)

Dang Y, Zhang Y, Chen H (2010) A lexicon-enhanced method for sentiment classification: An experiment on online product reviews. IEEE Intell Syst 25(4):46–53

Das, S, Chen, M (2001) Yahoo! for Amazon: Extracting market sentiment from stock message boards. In: Proceedings of the Asia Pacific finance association annual conference (APFA) (Vol. 35, p. 43)

Dasgupta, S, Ng, V (2009) Mine the easy, classify the hard: a semi-supervised approach to automatic sentiment classification. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2 (pp. 701-709). Assoc Computat Linguist

Demirtas, E (2013) Cross-lingual sentiment analysis with machine translation

Deng ZH, Luo KH, Yu HL (2014) A study of supervised term weighting scheme for sentiment analysis. Expert Syst Appl 41(7):3506–3513

Derczynski, L, Ritter, A, Clark, S, Bontcheva, K (2013) Twitter part-of-speech tagging for all: Overcoming sparse and noisy data. In Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013 (pp. 198-206)

Deshmukh, JS, Tripathy, AK (2017) Entropy based classifier for cross-domain opinion mining. Appl Comput Inf

Di Caro L, Grella M (2013) Sentiment analysis via dependency parsing. Comput Stand Interfaces 35(5):442–453

Ding, X, Liu, B, Zhang, L (2009) Entity discovery and assignment for opinion mining applications. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 1125-1134). ACM

Du J, Xu H, Huang X (2014) Box office prediction based on micro-blog. Expert Syst Appl 41(4):1680–1689

Duric A, Song F (2012) Feature selection for sentiment analysis based on content and syntax models. Decis Support Syst 53(4):704–711

Eirinaki M, Pisal S, Singh J (2012) Feature-based opinion mining and ranking. J Comput Syst Sci 78(4):1175–1184

Article   MathSciNet   Google Scholar  

Fan TK, Chang CH (2011) Blogger-centric contextual advertising. Expert Syst Appl 38(3):1777–1788

Fang Q, Xu C, Sang J, Hossain MS, Muhammad G (2015) Word-of-mouth understanding: entity-centric multimodal aspect-opinion mining in social media. IEEE Trans Multimed 17(12):2281–2296

Fauzi MA, Firmansyah N, Afirianto T (2018) Improving sentiment analysis of short informal Indonesian product reviews using synonym based feature expansion

Feldman R (2013) Techniques and applications for sentiment analysis. Commun ACM 56(4):82–89

Gao D, Wei F, Li W, Liu X, Zhou M (2015) Cross-lingual sentiment lexicon learning with bilingual word graph label propagation. Comput Linguist 41(1):21–40

Ghiassi M, Skinner J, Zimbra D (2013) Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network. Expert Syst Appl 40(16):6266–6282

Gimpel, K., Schneider, N., O'Connor, B., Das, D., Mills, D., Eisenstein, J., ..., Smith, N. A. (2011) Part-of-speech tagging for twitter: Annotation, features, and experiments. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers-Volume 2 (pp. 42-47). Assoc Comput Linguist

Gindl S, Weichselbraun A, Scharl A (2013) Rule-based opinion target and aspect extraction to acquire affective knowledge. In proceedings of the 22nd international conference on world wide web (pp. 557-564). ACM

Go, A, Bhayani, R, Huang, L (2009) Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1(12)

Gupta SK, Phung D, Adams B, Venkatesh S (2013) Regularized nonnegative shared subspace learning. Data Min Knowl Disc 26(1):57–97

Article   MathSciNet   MATH   Google Scholar  

Hagenau M, Liebmann M, Neumann D (2013) Automated news reading: stock price prediction based on financial news using context-capturing features. Decis Support Syst 55(3):685–697

He Y, Zhou D (2011) Self-training from labeled features for sentiment analysis. Inf Process Manag 47(4):606–616

He Y, Lin C, Alani H (2011) Automatically extracting polarity-bearing topics for cross-domain sentiment classification. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies-volume 1 (pp. 123-131). Association for Computational Linguistics

Heerschop, B, Goossen, F, Hogenboom, A, Frasincar, F, Kaymak, U, de Jong, F (2011) Polarity analysis of texts using discourse structure. In Proceedings of the 20th ACM international conference on Information and knowledge management (pp. 1061-1070). ACM

Hiroshi, K, Tetsuya, N, Hideo, W (2004) Deeper sentiment analysis using machine translation technology. In Proceedings of the 20th international conference on Computational Linguistics (p. 494). Assoc Comput Linguist

Howard J, Ruder S (2018). Universal language model fine-tuning for text classification. arXiv preprint arXiv:1801.06146

Hu Y, Li W (2011) Document sentiment classification by exploring description model of topical terms. Comput Speech Lang 25(2):386–403

Hu, M, Liu, B (2004) Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 168-177). ACM

Hu N, Bose I, Koh NS, Liu L (2012) Manipulation of online reviews: An analysis of ratings, readability, and sentiments. Decis Support Syst 52(3):674–684

Hu YH, Chen YL, Chou HL (2017) Opinion mining from online hotel reviews–A text summarization approach. Inf Process Manag 53(2):436–449

Hung C, Lin HK (2013) Using objective words in SentiWordNet to improve sentiment classification for word of mouth. IEEE Intell Syst 1

Ismail, S, Alsammak, A, Elshishtawy, T (2016) A Generic Approach for Extracting Aspects and Opinions of Arabic Reviews. In Proceedings of the 10th International Conference on Informatics and Systems (pp. 173-179). ACM

Jaderberg M, Simonyan K, Vedaldi A, Zisserman A (2016) Reading text in the wild with convolutional neural networks. Int J Comput Vis 116(1):1–20

Jiang L, Yu M, Zhou M, Liu X, Zhao T (2011) Target-dependent twitter sentiment classification. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies-volume 1 (pp. 151-160). Association for Computational Linguistics.

Jiang D, Luo X, Xuan J, Xu Z (2017) Sentiment computing for the news event based on the social media big data. IEEE Access 5:2373–2382

Jiao J, Zhou Y (2011) Sentiment polarity analysis based multi-dictionary. Phys Procedia 22:590–596

Jindal, N, Liu, B (2008) Opinion spam and analysis. In Proceedings of the 2008 International Conference on Web Search and Data Mining (pp. 219-230). ACM

Jo, Y, Oh, AH (2011) Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM international conference on Web search and data mining (pp. 815-824). ACM

Kamps J, Marx M, Mokken RJ, de Rijke M (2001) Words with attitude (pp, 332-341). Language and Computation (ILLC), University of Amsterdam, Institute for Logic

Kanayama, H, Nasukawa, T (2006) Fully automatic lexicon expansion for domain-oriented sentiment analysis. In Proceedings of the 2006 conference on empirical methods in natural language processing (pp. 355-363). Assoc Comput Linguist

Kang H, Yoo SJ, Han D (2012) Senti-lexicon and improved Naïve Bayes algorithms for sentiment analysis of restaurant reviews. Expert Syst Appl 39(5):6000–6010

Kang M, Ahn J, Lee K (2018) Opinion mining using ensemble text hidden Markov models for text classification. Expert Syst Appl 94:218–227

Kaufmann, M (2012) JMaxAlign: A Maximum Entropy Parallel Sentence Alignment Tool. In COLING (Demos) (pp. 277-288)

Kaur H, Pannu HS, Malhi AK (2019) A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput Surv (CSUR) 52(4):1–36

Kennedy A, Inkpen D (2006) Sentiment classification of movie reviews using contextual valence shifters. Comput Intell 22(2):110–125

Keshtkar F, Inkpen D (2013) A bootstrapping method for extracting paraphrases of emotion expressions from texts. Comput Intell 29(3):417–435

Khamparia A, Pandey B (2020) Association of learning styles with different e-learning problems: a systematic review and classification. Educ Inf Technol 25(2):1303–1331

Khan FH, Bashir S, Qamar U (2014) TOM: twitter opinion mining framework using hybrid classification scheme. Decis Support Syst 57:245–257

Khan SN, Nawi NM, Imrona M, Shahzad A, Ullah A, Rahman A (2018) Opinion mining summarization and automation process: a survey. Int J Adv Sci Eng Inf Technol 8(5):1836–1844

Kim, SM, Hovy, E (2004) Determining the sentiment of opinions. In Proceedings of the 20th international conference on Computational Linguistics (p. 1367). Assoc Computat Linguist

Kontopoulos E, Berberidis C, Dergiades T, Bassiliades N (2013) Ontology-based sentiment analysis of twitter posts. Expert Syst Appl 40(10):4065–4074

Kouloumpis E, Wilson T, Moore JD (2011) Twitter sentiment analysis: the good the bad and the omg! Icwsm 11(538-541):164

Ku LW, Chen HH (2007) Mining opinions from the web: beyond relevance retrieval. J Assoc Inf Sci Technol 58(12):1838–1850

Kumar A, Dabas V, Hooda P (2018) Text classification algorithms for mining unstructured data: a SWOT analysis. Int J Info Technol.:1–11

Lafferty, J, McCallum, A, Pereira, FC (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data

Lambert P (2015) Aspect-level cross-lingual sentiment classification with constrained SMT. In: Proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th international joint conference on natural language processing (volume 2: short papers) (Vol. 2, pp. 781-787)

Lambov D, Pais S, Dias G (2011) Merged agreement algorithms for domain independent sentiment analysis. Procedia Soc Behav Sci 27:248–257

Lane PC, Clarke D, Hender P (2012) On developing robust models for favourability analysis: model choice, feature sets and imbalanced data. Decis Support Syst 53(4):712–718

Lazaridou A, Titov I, Sporleder C (2013) A Bayesian model for joint unsupervised induction of sentiment, aspect and discourse representations. In proceedings of the 51st annual meeting of the Association for Computational Linguistics (volume 1: long papers) (Vol. 1, pp. 1630-1639).

Li YM, Li TY (2013) Deriving market intelligence from micro-blogs. Decis Support Syst 55(1):206–217

Li YM, Shiu YL (2012) A diffusion mechanism for social advertising over micro-blogs. Decis Support Syst 54(1):9–22

Li ST, Tsai FC (2013) A fuzzy conceptualization model for text mining with application in opinion polarity classification. Knowl-Based Syst 39:23–33

Li, F, Huang, M, Zhu, X (2010) Sentiment Analysis with Global Topics and Local Dependency. In AAAI (Vol. 10, pp. 1371-1376)

Li, S, Wang, Z, Zhou, G, Lee, SYM (2011) Semi-supervised learning for imbalanced sentiment classification. In IJCAI proceedings-international joint conference on artificial intelligence (Vol. 22, No. 3, p. 1826)

Li S, Xue Y, Wang Z, Zhou G (2013) Active learning for cross-domain sentiment classification. In IJCAI (pp. 2127-2133)

Li, S, Huang, L, Wang, J, Zhou, G (2015) Semi-stacking for semi-supervised sentiment classification. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers) (Vol. 2, pp. 27-31)

Lin, C, He, Y (2009) Joint sentiment/topic model for sentiment analysis. In Proceedings of the 18th ACM conference on Information and knowledge management (pp. 375-384). ACM

Lin C, He Y, Everson R, Ruger S (2012) Weakly supervised joint sentiment-topic detection from text. IEEE Trans Knowl Data Eng 24(6):1134–1145

Liu B (2012) Sentiment analysis and opinion mining. Synth Lect Human Lang Technol 5(1):1–167

Liu K, Zhao J (2009) Cross-domain sentiment classification using a two-stage method. In proceedings of the 18th ACM conference on information and knowledge management (pp. 1717-1720). ACM

Liu, B, Lee, WS, Yu, PS, Li, X (2002) Partially supervised classification of text documents. In ICML (Vol. 2, pp. 387-394)

Liu, B, Hu, M, Cheng, J (2005) Opinion observer: analyzing and comparing opinions on the web. In Proceedings of the 14th international conference on World Wide Web (pp. 342-351). ACM

Liu, Y, Huang, X, An, A, Yu, X (2007) ARSA: a sentiment-aware model for predicting sales performance using blogs. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (pp. 607-614). ACM

Liu K, Xu L, Zhao J (2015) Co-extracting opinion targets and opinion words from online reviews based on the word alignment model. IEEE Trans Knowl Data Eng 27(3):636–650

Lu CY, Lin SH, Liu JC, Cruz-Lara S, Hong JS (2010) Automatic event-level textual emotion sensing using mutual action histogram between entities. Expert Syst Appl 37(2):1643–1653

Lu Y, Kong X, Quan X, Liu W, Xu Y (2010) Exploring the sentiment strength of user reviews. In international conference on web-age information management (pp. 471-482). Springer, Berlin, Heidelberg.

Lu, Y, Castellanos, M, Dayal, U, Zhai, C (2011) Automatic construction of a context-aware sentiment lexicon: an optimization approach. In Proceedings of the 20th international conference on World Wide Web (pp. 347-356). ACM

Ma, J, Hinrichs, EW (2015) Accurate Linear-Time Chinese Word Segmentation via Embedding Matching. In ACL (1) (pp. 1733-1743)

Maas, AL, Daly, RE, Pham, PT, Huang, D, Ng, AY, Potts, C (2011) Learning word vectors for sentiment analysis. In Proceedings of the 49th annual meeting of the association for computational linguistics: Human language technologies-volume 1 (pp. 142-150). Assoc Computat Linguist

Maks I, Vossen P (2012) A lexicon model for deep sentiment analysis and opinion mining applications. Decis Support Syst 53(4):680–688

Manning, C, Surdeanu, M, Bauer, J, Finkel, J, Bethard, S, McClosky, D (2014) The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations (pp. 55-60)

Marrese-Taylor E, Velásquez JD, Bravo-Marquez F (2014) A novel deterministic approach for aspect-based opinion mining in tourism products reviews. Expert Syst Appl 41(17):7764–7775

Marstawi, A, Sharef, NM, Aris, TNM, Mustapha, A (2017) Ontology-based Aspect Extraction for an Improved Sentiment Analysis in Summarization of Product Reviews. In Proceedings of the 8th International Conference on Computer Modeling and Simulation (pp. 100-104). ACM

MartíN-Valdivia MT, MartíNez-CáMara E, Perea-Ortega JM, UreñA-LóPez LA (2013) Sentiment polarity detection in Spanish reviews combining supervised and unsupervised approaches. Expert Syst Appl 40(10):3934–3942

McDonald, R, Crammer, K, Pereira, F (2005) Online large-margin training of dependency parsers. In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 91-98). Assoc Comput Linguist

McDonald, R, Hannan, K, Neylon, T, Wells, M, Reynar, J (2007) Structured models for fine-to-coarse sentiment analysis. In Proceedings of the 45th annual meeting of the association of computational linguistics (pp. 432-439)

Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: A survey. Ain Shams Eng J 5(4):1093–1113

Meng X, Wei F, Liu X, Zhou M, Xu G, Wang H (2012) Cross-lingual mixture model for sentiment classification. In: Proceedings of the 50th annual meeting of the Association for Computational Linguistics: long papers-volume 1 (pp. 572-581). Association for Computational Linguistics.

Miao Q, Li Q, Dai R (2009) AMAZING: A sentiment mining and retrieval system. Expert Syst Appl 36(3):7192–7198

Mihalcea, R, Banea, C, Wiebe, J (2007) Learning multilingual subjective language via cross-lingual projections. In: Proceedings of the 45th annual meeting of the association of computational linguistics (pp. 976-983)

Min HJ, Park JC (2012) Identifying helpful reviews based on customer’s mentions about experiences. Expert Syst Appl 39(15):11830–11838

Mohammad SM (2012) From once upon a time to happily ever after: tracking emotions in mail and books. Decis Support Syst 53(4):730–741

Montoyo, A, MartíNez-Barco, P, Balahur, A (2012) Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments

Moraes R, Valiati JF, Neto WPG (2013) Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Syst Appl 40(2):621–633

Moreo A, Romero M, Castro JL, Zurita JM (2012) Lexicon-based comments-oriented news sentiment analyzer system. Expert Syst Appl 39(10):9166–9180

Mostafa MM (2013) More than words: social networks’ text mining for consumer brand sentiments. Expert Syst Appl 40(10):4241–4251

Mukherjee S, Joshi S (2014) Author-specific sentiment aggregation for polarity prediction of reviews. In LREC (pp. 3092-3099)

Mullen, T, Collier, N (2004) Sentiment analysis using support vector machines with diverse information sources. In Proceedings of the 2004 conference on empirical methods in natural language processing

Narayanan, R, Liu, B, Choudhary, A (2009) Sentiment analysis of conditional sentences. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1 (pp. 180-189). Assoc Comput Linguist

Nasukawa, T, Yi, J (2003) Sentiment analysis: Capturing favorability using natural language processing. In Proceedings of the 2nd international conference on Knowledge capture (pp. 70-77). ACM

Neviarouskaya, A, Prendinger, H, Ishizuka, M (2010) Recognition of affect, judgment, and appreciation in text. In Proceedings of the 23rd international conference on computational linguistics (pp. 806-814). Assoc Comput Linguist

Nopp, C, Hanbury, A (2015) Detecting Risks in the Banking System by Sentiment Analysis. In EMNLP (pp. 591-600)

O'Connor B, Balasubramanyan R, Routledge BR, Smith NA (2010) From tweets to polls: linking text sentiment to public opinion time series. ICWSM 11(122-129):1–2

Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in Facebook and its application to e-learning. Comput Hum Behav 31:527–541

Ortigosa-Hernández J, Rodríguez JD, Alzate L, Lucania M, Inza I, Lozano JA (2012) Approaching sentiment analysis by using semi-supervised learning of multi-dimensional classifiers. Neurocomputing 92:98–115

Ouhame S, Hadi Y, Ullah A (2021) An efficient forecasting approach for resource utilization in cloud data center using CNN-LSTM model. Neural Comput Applic:1–13

Owoputi, O O'Connor, B, Dyer, C, Gimpel, K, Schneider, N, Smith, NA (2013) Improved part-of-speech tagging for online conversational text with word clusters. Assoc Comput Linguist

Pai MY, Chu HC, Wang SC, Chen YM (2013) Electronic word of mouth analysis for service experience. Expert Syst Appl 40(6):1993–2006

Pak, A, Paroubek, P (2010) Twitter as a corpus for sentiment analysis and opinion mining. In LREc (Vol. 10, No. 2010)

Pan SJ, Ni X, Sun JT, Yang Q, Chen Z (2010) Cross-domain sentiment classification via spectral feature alignment. In proceedings of the 19th international conference on world wide web (pp. 751-760). ACM

Pang, B, Lee, L (2004) A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd annual meeting on Association for Computational Linguistics (p. 271). Assoc Comput Linguist

Pang, B, Lee, L (2005) Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 115-124). Assoc Comput Linguist

Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retriev 2(1–2):1–135

Penalver-Martinez I, Garcia-Sanchez F, Valencia-Garcia R, Rodriguez-Garcia MA, Moreno V, Fraga A, Sanchez-Cervantes JL (2014) Feature-based opinion mining through ontologies. Expert Syst Appl 41(13):5995–6008

Peng, F, Feng, F, McCallum, A (2004) Chinese segmentation and new word detection using conditional random fields. In Proceedings of the 20th international conference on Computational Linguistics (p. 562). Assoc Computat Linguist

Pennebaker, JW, Boyd, RL, Jordan, K, Blackburn, K (2015) The development and psychometric properties of LIWC2015

Pisal S, Singh J, Eirinaki M (2011) AskUs: An opinion search engine. In data mining workshops (ICDMW), 2011 IEEE 11th international conference on (pp. 1243-1246). IEEE.

Popescu O, Strapparava C (2014) Time corpora: epochs, opinions and changes. Knowl-Based Syst 69:3–13

Popescu AM, Nguyen B, Etzioni O (2005) OPINE: extracting product features and opinions from reviews. In: Proceedings of HLT/EMNLP on interactive demonstrations (pp. 32-33). Association for Computational Linguistics

Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowl-Based Syst 108:42–49

Prabowo R, Thelwall M (2009) Sentiment analysis: A combined approach. J Inf 3(2):143–157

Przepiórkowski A (2009) XML text interchange format in the National Corpus of polish. In the proceedings of practical applications in language and computers PALC 2009. Peter Lang, Frankfurt am Main

Ptaszynski M, Dokoshi H, Oyama S, Rzepka R, Kurihara M, Araki K, Momouchi Y (2013) Affect analysis in context of characters in narratives. Expert Syst Appl 40(1):168–176

Qazi A, Raj RG, Hardaker G, Standing C (2017) A systematic literature review on opinion types and sentiment analysis techniques: tasks and challenges. Int Res ( Elsiver)

Qiao, F, Wu, J, Li, J, Bashir, AK, Mumtaz, S, Tariq, U (2020) Trustworthy edge storage orchestration in intelligent transportation systems using reinforcement learning. IEEE Trans Intell Transp Syst

Qiu G, Liu B, Bu J, Chen C (2009) Expanding domain sentiment lexicon through double propagation. In IJCAI (Vol. 9, pp. 1199-1204)

Qiu G, He X, Zhang F, Shi Y, Bu J, Chen C (2010) DASA: dissatisfaction-oriented advertising based on sentiment analysis. Expert Syst Appl 37(9):6182–6191

Qiu, X, Zhang, Q, Huang, X (2013) Fudannlp: A toolkit for Chinese natural language processing. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations (pp. 49-54)

Qiu L, Rui H, Whinston A (2013) Social network-embedded prediction markets: the effects of information acquisition and communication on predictions. Decis Support Syst 55(4):978–987

Quan C, Ren F (2014) Unsupervised product feature extraction for feature-oriented opinion determination. Inf Sci 272:16–28

Rabelo JC, Prudêncio RB, Barros FA (2012) Using link structure to infer opinions in social networks. In systems, man, and cybernetics (SMC), 2012 IEEE international conference on (pp. 681-685). IEEE.

Racherla P, Friske W (2012) Perceived ‘usefulness’ of online consumer reviews: An exploratory investigation across three services categories. Electron Commer Res Appl 11(6):548–559

Rahman, MM, Wang, H (2016) Hidden topic sentiment model. In Proceedings of the 25th International Conference on World Wide Web (pp. 155-165). Int World Wide Web Conf Steering Committee

Raut, MY, Kulkarni, MA (2016) Polarity shift in opinion mining. In Advances in Electronics, Communication and Computer Technology (ICAECCT), 2016 IEEE International Conference on (pp. 333-337). IEEE

Rehurek, R, Sojka, P (2010) Software framework for topic modeling with large corpora. In In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks

Reyes A, Rosso P (2012) Making objective decisions from subjective data: detecting irony in customer reviews. Decis Support Syst 53(4):754–760

Rezaeinia SM, Rahmani R, Ghodsi A, Veisi H (2019) Sentiment analysis based on improved pre-trained word embeddings. Expert Syst Appl 117:139–147

Rill S, Reinel D, Scheidt J, Zicari RV (2014) Politwi: early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis. Knowl-Based Syst 69:24–33

Riloff, E, Wiebe, J (2003) Learning extraction patterns for subjective expressions. In Proceedings of the 2003 conference on Empirical methods in natural language processing (pp. 105-112). Assoc Comput Linguist

Riloff, E, Wiebe, J, Wilson, T (2003) Learning subjective nouns using extraction pattern bootstrapping. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4 (pp. 25-32). Assoc Comput Linguist

Roiger, RJ (2017) Data mining: a tutorial-based primer. CRC Press

Roussev V, Quates C (2013) File fragment encoding classification—An empirical approach. Digit Investig 10:S69–S77

Rui H, Liu Y, Whinston A (2013) Whose and what chatter matters? The effect of tweets on movie sales. Decis Support Syst 55(4):863–870

Saleh MR, Martín-Valdivia MT, Montejo-Ráez A, Ureña-López LA (2011) Experiments with SVM to classify opinions in different domains. Expert Syst Appl 38(12):14799–14804

Santosh DT, Babu KS, Prasad SD, Vivekananda A (2016) Opinion mining of online product reviews from traditional LDA topic clusters using feature ontology Tree and Sentiwordnet. IJEME 6:1–11

Seki Y, Kando N, Aono M (2009) Multilingual opinion holder identification using author and authority viewpoints. Inf Process Manag 45(2):189–199

Severyn A, Moschitti A, Uryupina O, Plank B, Filippova K (2016) Multi-lingual opinion mining on youtube. Inf Process Manag 52(1):46–60

Shah K, Patel H, Sanghvi D, Shah M (2020) A comparative analysis of logistic regression, random forest and KNN models for the text classification. Augmented Human Res 5(1):1–16

Sharma, R, Nigam, S, Jain, R (2014) Mining of product reviews at aspect level. arXiv preprint arXiv:1406.3714

Shawe-Taylor J, Sun S (2011) A review of optimization methodologies in support vector machines. Neurocomputing 74(17):3609–3618

Silva NFFD, Coletta LF, Hruschka ER (2016) A survey and comparative study of tweet sentiment analysis via semi-supervised learning. ACM Comput Surv (CSUR) 49(1):15

Sindhwani, V, Melville, P (2008) Document-word co-regularization for semi-supervised sentiment analysis. In Data Mining, 2008. ICDM'08. Eighth IEEE International Conference on (pp. 1025-1030). IEEE

Socher, R, Perelygin, A, Wu, J, Chuang, J, Manning, CD, Ng, A, Potts, C (2013) Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 conference on empirical methods in natural language processing (pp. 1631-1642)

Somasundaran, S, Namata, G, Wiebe, J, Getoor, L (2009) Supervised and unsupervised methods in employing discourse relations for improving opinion polarity classification. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1 (pp. 170-179). Assoc Comput Linguist

Sperberg-McQueen CM (1991) Text in the electronic age: Texual study and textual study and text encoding, with examples from medieval texts. Lit Linguist Comput 6(1):34–46

Spina D, Gonzalo J, Amigó E (2013) Discovering filter keywords for company name disambiguation in twitter. Expert Syst Appl 40(12):4986–5003

Steinberger J, Ebrahim M, Ehrmann M, Hurriyetoglu A, Kabadjov M, Lenkova P, … Zavarella V (2012) Creating sentiment dictionaries via triangulation. Decis Support Syst 53(4):689–694

Stone PJ, Hunt EB (1963) A computer approach to content analysis: studies using the general inquirer system. In: Proceedings of the May 21-23, 1963, spring joint computer conference (pp. 241-256). ACM.

Stone, PJ, Dunphy, DC, Smith, MS (1966) The general inquirer: A computer approach to content analysis

Sun, Y, Xu, J, Wu, H, Lin, G, Mumtaz, S (2021) Deep learning based semi-supervised control for vertical security of maglev vehicle with guaranteed bounded airgap. IEEE Trans Intell Transp Syst

Taboada M, Grieve J (2004, March) Analyzing appraisal automatically. In: Proceedings of AAAI spring symposium on exploring attitude and affect in text (AAAI technical re# port SS# 04# 07), Stanford University, CA, pp. 158q161. AAAI Press

Taboada, M, Anthony, C, Voll, K (2006) Methods for creating semantic orientation dictionaries. In Conference on Language Resources and Evaluation (LREC) (pp. 427-432)

Taboada M, Brooke J, Tofiloski M, Voll K, Stede M (2011) Lexicon-based methods for sentiment analysis. Computat Linguist 37(2):267–307

Täckström O, McDonald R (2011) Semi-supervised latent variable models for sentence-level sentiment analysis. In proceedings of the 49th annual meeting of the Association for Computational Linguistics: human language technologies: short papers-volume 2 (pp. 569-574). Association for Computational Linguistics.

Taddy M (2013) Measuring political sentiment on twitter: factor optimal design for multinomial inverse regression. Techno-metrics 55(4):415–425

Tan S, Wang Y (2011) Weighted SCL model for adaptation of sentiment classification. Expert Syst Appl 38(8):10524–10531

Tan S, Wu Q (2011) A random walk algorithm for automatic construction of domain-oriented sentiment lexicon. Expert Syst Appl 38(10):12094–12100

Tan S, Cheng X, Wang Y, Xu H (2009) Adapting naive bayes to domain adaptation for sentiment analysis. In European conference on information retrieval (pp. 337-349). Springer, Berlin, Heidelberg

Tan LKW, Na JC, Theng YL, Chang K (2012) Phrase-level sentiment polarity classification using rule-based typed dependencies and additional complex phrases consideration. J Comput Sci Technol 27(3):650–666

Tan S, Li Y, Sun H, Guan Z, Yan X, Bu J, … He X (2014) Interpreting the public sentiment variations on twitter. IEEE Trans Knowl Data Eng 26(5):1158–1170

Tang D, Qin B, Wei F, Dong L, Liu T, Zhou M (2015) A joint segmentation and classification framework for sentence level sentiment classification. IEEE/ACM Trans Audio, Speech, Lang Process 23(11):1750–1761

Thelen, M, Riloff, E (2002) A bootstrapping method for learning semantic lexicons using extraction pattern contexts. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10(pp. 214-221). Assoc Comput Linguist

Thelwall M, Buckley K (2013) Topic-based sentiment analysis for the social web: the role of mood and issue-related words. J Assoc Inf Sci Technol 64(8):1608–1617

Thelwall M, Buckley K, Paltoglou G, Cai D, Kappas A (2010) Sentiment strength detection in short informal text. J Assoc Inf Sci Technol 61(12):2544–2558

Thelwall M, Buckley K, Paltoglou G (2011) Sentiment in twitter events. J Assoc Inf Sci Technol 62(2):406–418

Thelwall M, Buckley K, Paltoglou G (2012) Sentiment strength detection for the social web. J Assoc Inf Sci Technol 63(1):163–173

Thet TT, Na JC, Khoo CS (2010) Aspect-based sentiment analysis of movie reviews on discussion boards. J Inf Sci 36(6):823–848

Tong, RM (2001) An operational system for detecting and tracking opinions in on-line discussion. In Working Notes of the ACM SIGIR 2001 Workshop on Operational Text Classification (Vol. 1, No. 6)

Trivedi, R, Eisenstein, J (2013) Discourse connectors for latent subjectivity in sentiment analysis. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 808-813)

Tsai ACR, Wu CE, Tsai RTH, Hsu JYJ (2013) Building a concept-level sentiment dictionary based on commonsense knowledge. IEEE Intell Syst 28(2):22–30

Tseng, H, Chang, P, Andrew, G, Jurafsky, D, Manning, C (2005) A conditional random field word segmenter for sighan bakeoff 2005. In Proceedings of the fourth SIGHAN workshop on Chinese language Processing (Vol. 171)

Tsytsarau M, Palpanas T (2012) Survey on mining subjective data on the web. Data Min Knowl Disc 24(3):478–514

Article   MATH   Google Scholar  

Tumasjan A, Sprenger TO, Sandner PG, Welpe IM (2010) Predicting elections with twitter: what 140 characters reveal about political sentiment. Icwsm 10(1):178–185

Turney, PD (2002) Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 417-424). Assoc Comput Linguist

Ullah A, Nawi NM (2020) Enhancing the dynamic load balancing technique for cloud computing using HBATAABC algorithm. Int J Model, Simul, Sci Comp 11(05):2050041

Ullah A, Nawi NM, Khan MH (2020) BAT algorithm used for load balancing purpose in cloud computing: an overview. Int J High Perform Comput Network 16(1):43–54

Ullah A, Şahin CB, Dinler OB, Khan MH, Aznaoui H (2021) Heart disease prediction using various machines learning approach. J Cardiovasc. Dis. Res. 12(3):379–391. https://doi.org/10.31838/jcdr.2021.12.03.58

Usai A, Pironti M, Mital M, Mejri CA (2018) Knowledge discovery out of text data: a systematic review via text mining J Knowledge Manag

Van de Camp M, Van den Bosch A (2012) The socialist network. Decis Support Syst 53(4):761–769

Vinodhini G, Chandrasekaran RM (2014) Opinion mining using principal component analysis based ensemble model for e-commerce application. CSI Transact ICT 2(3):169–179

Walker MA, Anand P, Abbott R, Tree JEF, Martell C, King J (2012) That is your evidence?: classifying stance in online political debate. Decis Support Syst 53(4):719–729

Wan X (2008) Using bilingual knowledge and ensemble techniques for unsupervised Chinese sentiment analysis. In: Proceedings of the conference on empirical methods in natural language processing (pp. 553-561). Association for Computational Linguistics

Wan X (2009) Co-training for cross-lingual sentiment classification. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP: volume 1-volume 1 (pp. 235-243). Association for Computational Linguistics

Wang JH, Lee CC (2011) Unsupervised opinion phrase extraction and rating in Chinese blog posts. In: Privacy, security, risk and trust (PASSAT) and 2011 IEEE third international conference on social computing (SocialCom), 2011 IEEE third international conference on(pp. 820-823). IEEE

Wang, S, Manning, CD (2012) Baselines and bigrams: Simple, good sentiment and topic classification. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2 (pp. 90-94). Assoc Comput Linguist

Wang, H, Lu, Y, Zhai, C (2010) Latent aspect rating analysis on review text data: a rating regression approach. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 783-792). ACM

Wang S, Li D, Song X, Wei Y, Li H (2011) A feature selection method based on improved fisher’s discriminant ratio for text sentiment classification. Expert Syst Appl 38(7):8696–8702

Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decis Support Syst 57:77–93

Wang T, Cai Y, Leung HF, Lau RY, Li Q, Min H (2014) Product aspect extraction supervised with online domain knowledge. Knowl-Based Syst 71:86–100

Wang, R, Huang, W, Chen, W, Wang, T, Lei, K (2015) ASEM: mining aspects and sentiment of events from microblog. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (pp. 1923-1926). ACM

Wang JZ, Yan Z, Yang LT, Huang BX (2015) An approach to rank reviews by fusing and mining opinions based on review pertinence. Inf Fus 23:3–15

Wang L, Liu K, Cao Z, Zhao J, de Melo G (2015) Sentiment-aspect extraction based on restricted boltzmann machines. In proceedings of the 53rd annual meeting of the Association for Computational Linguistics and the 7th international joint conference on natural language processing (volume 1: long papers) (Vol. 1, pp. 616-625).

Watanabe H (1992) A similarity-driven transfer system. In proceedings of the 14th conference on computational linguistics-volume 2 (pp. 770-776). Association for computational linguistics.

Wei, B, Pal, C (2010) Cross lingual adaptation: an experiment on sentiment classifications. In Proceedings of the ACL 2010 conference short papers (pp. 258-262). Assoc Comput Linguist

Weichselbraun A, Gindl S, Scharl A (2013) Extracting and grounding contextualized sentiment lexicons. IEEE Intell Syst 28(2):39–46

Weichselbraun A, Gindl S, Scharl A (2014) Enriching semantic knowledge bases for opinion mining in big data applications. Knowl-Based Syst 69:78–85

Whitelaw, C, Garg, N, Argamon, S (2005) Using appraisal groups for sentiment analysis. In Proceedings of the 14th ACM international conference on Information and knowledge management (pp. 625-631). ACM

Wiebe J, Wilson T, Cardie C (2005) Annotating expressions of opinions and emotions in language. Lang Resour Eval 39(2-3):165–210

Wilson, T, Wiebe, J, Hoffmann, P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 347-354). Assoc Comput Linguist

Wilson, T, Hoffmann, P, Somasundaran, S, Kessler, J, Wiebe, J, Choi, Y ..., Patwardhan, S (2005) OpinionFinder: A system for subjectivity analysis. In Proceedings of hlt/emnlp on interactive demonstrations (pp. 34-35). Assoc Comput Linguist

Wu Q, Tan S (2011) A two-stage framework for cross-domain sentiment classification. Expert Syst Appl 38(11):14269–14275

Wu CE, Tsai RTH (2014) Using relation selection to improve value propagation in a conceptnet-based sentiment dictionary. Knowl-Based Syst 69:100–107

Xia R, Zong C, Li S (2011) Ensemble of feature sets and classification algorithms for sentiment classification. Inf Sci 181(6):1138–1152

Xianghua F, Guo L, Yanyan G, Zhiqiang W (2013) Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon. Knowl-Based Syst 37:186–195

Xu K, Liao SS, Li J, Song Y (2011) Mining comparative opinions from customer reviews for competitive intelligence. Decis Support Syst 50(4):743–754

Xu T, Peng Q, Cheng Y (2012) Identifying the semantic orientation of terms using S-HAL for sentiment analysis. Knowl-Based Syst 35:279–289

Xu, YC, Zhang, C, Xue, L (2013) WITHDRAWN: Measuring product susceptibility in online product review social network

Xu X, Cheng X, Tan S, Liu Y, Shen H (2013) Aspect-level opinion mining of online customer reviews. China Commun 10(3):25–41

Xu H, Zhang F, Wang W (2015) Implicit feature identification in Chinese reviews using explicit topic mining model. Knowl-Based Syst 76:166–175

Xuan HNT, Le AC, Nguyen LM (2012) Linguistic features for subjectivity classification. In Asian language processing (IALP), 2012 international conference on (pp. 17-20). IEEE

Yadav, SK, Pal, S (2012) Data mining: A prediction for performance improvement of engineering students using classification. arXiv preprint arXiv:1203.3832

Yan Z, Xing M, Zhang D, Ma B (2015) EXPRS: An extended pagerank method for product feature extraction from online consumer reviews. Inf Manag 52(7):850–858

Yang B, Cardie C (2014) Context-aware learning for sentence-level sentiment analysis with posterior regularization. In proceedings of the 52nd annual meeting of the Association for Computational Linguistics (volume 1: long papers) (Vol. 1, pp. 325-335)

Yang H, Wen J, Wu X, He L, Mumtaz S (2019) An efficient edge artificial intelligence multipedestrian tracking method with rank constraint. IEEE Trans Indust Inf 15(7):4178–4188

Yan-Yan Z, Bing Q, Ting L (2010) Integrating intra-and inter-document evidences for improving sentence sentiment classification. Acta Automat Sin 36(10):1417–1425

Ye, Q, Shi, W, Li, Y (2006) Sentiment classification for movie reviews in Chinese by improved semantic oriented approach. In System Sciences, 2006. HICSS'06. Proceedings of the 39th Annual Hawaii International Conference on (Vol. 3, pp. 53b-53b). IEEE

Yi, J, Nasukawa, T, Bunescu, R, Niblack, W (2003) Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Data Mining, 2003. ICDM 2003. Third IEEE International Conference on (pp. 427-434). IEEE

Yu H, Hatzivassiloglou V (2003) Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences. In: Proceedings of the 2003 conference on empirical methods in natural language processing (pp. 129-136). Association for Computational Linguistics

Yu LC, Wu JL, Chang PC, Chu HS (2013) Using a contextual entropy model to expand emotion words and their intensity for the sentiment classification of stock market news. Knowl-Based Syst 41:89–97

Yu Y, Duan W, Cao Q (2013) The impact of social and conventional media on firm equity value: A sentiment analysis approach. Decis Support Syst 55(4):919–926

Zhai Z, Liu B, Xu H, Jia P (2011) Clustering product features for opinion mining. In proceedings of the fourth ACM international conference on web search and data mining (pp. 347-354). ACM

Zhan J, Loh HT, Liu Y (2009) Gather customer concerns from online product reviews–A text summarization approach. Expert Syst Appl 36(2):2107–2115

Zhang, Z (2008) Weighing stars: Aggregating online product reviews for intelligent e-commerce applications. IEEE Intell Syst. 23(5)

Zhang, Y, Clark, S (2008) Joint Word Segmentation and POS Tagging Using a Single Perceptron. In ACL (pp. 888-896)

Zhang, W, Skiena, S (2010) Trading Strategies to Exploit Blog and News Sentiment. In Icwsm

Zhang C, Zeng D, Li J, Wang FY, Zuo W (2009) Sentiment analysis of Chinese documents: from sentence to document level. J Assoc Inf Sci Technol 60(12):2474–2487

Zhang L, Liu B, Lim SH, O'Brien-Strain E (2010) Extracting and ranking product features in opinion documents. In proceedings of the 23rd international conference on computational linguistics: posters (pp. 1462-1470). Association for Computational Linguistics

Zhang Z, Ye Q, Zhang Z, Li Y (2011) Sentiment classification of internet restaurant reviews written in Cantonese. Expert Syst Appl 38(6):7674–7682

Zhang W, Xu H, Wan W (2012) Weakness finder: find product weakness from Chinese reviews by using aspects based sentiment analysis. Expert Syst Appl 39(11):10283–10291

Zhao, L, Huang, M, Sun, J, Luo, H, Yang, X, Zhu, X (2015) Sentiment extraction by leveraging aspect-opinion association structure. In Proceedings of the 24th ACM international on conference on information and knowledge management (pp. 343-352). ACM

Zheng, X, Chen, H, Xu, T (2013) Deep Learning for Chinese Word Segmentation and POS Tagging. In EMNLP(pp. 647-657)

Zhou L, Chaovalit P (2008) Ontology-supported polarity mining. J Assoc Inf Sci Technol 59(1):98–110

Zhou, L, Li, B, Gao, W, Wei, Z, Wong, KF (2011) Unsupervised discovery of discourse relations for eliminating intra-sentence polarity ambiguities. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (pp. 162-171). Assoc Comput Linguist

Zhou X, Wan X, Xiao J (2016) CMiner: opinion extraction and summarization for Chinese microblogs. IEEE Trans Knowl Data Eng 28(7):1650–1663

Zhu, J, Zhu, M, Wang, Q, Xiao, T (2015) Niuparser: A Chinese syntactic and semantic parsing toolkit. Proceedings of ACL-IJCNLP 2015 System Demonstrations, 145-150

Zirn, C, Niepert, M, Stuckenschmidt, H, Strube, M (2011) Fine-Grained Sentiment Analysis with Structural Features. In IJCNLP (pp. 336-344)

Download references

Author information

Authors and affiliations.

Department of Computing, Riphah International University, Faisalabad, Punjab, 44000, Faisalabad, Pakistan

Soft Computing and Data Mining Centre (SMC), Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia (UTHM), Parit Raja, Malaysia

Sundas Naqeeb Khan & Nazri Mohd Nawi


Contributions

All authors contributed equally to this paper.

Corresponding author

Correspondence to Arif Ullah.

Ethics declarations

Competing interest.

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article

Ullah, A., Khan, S.N. & Nawi, N.M. Review on sentiment analysis for text classification techniques from 2010 to 2021. Multimed Tools Appl 82, 8137–8193 (2023). https://doi.org/10.1007/s11042-022-14112-3


Received: 01 October 2020

Revised: 20 August 2021

Accepted: 25 October 2022

Published: 01 December 2022

Issue Date: March 2023

DOI: https://doi.org/10.1007/s11042-022-14112-3


  • Sentiment analysis
  • Text classification
  • Opinion mining
  • Natural language

Informing Science Institute


Text Classification Techniques: A Literature Review

Aim/Purpose The aim of this paper is to analyze various text classification techniques employed in practice, their strengths and weaknesses, to provide an improved awareness regarding various knowledge extraction possibilities in the field of data mining.

Background Artificial Intelligence is reshaping text classification techniques to better acquire knowledge. However, in spite of the growth and spread of AI in all fields of research, its role with respect to text mining is not well understood yet.

Methodology For this study, articles on “text classification techniques in AI” written between 2010 and 2017 and selected from leading computer science journals were analyzed. Each article was read in full. The research problems related to text classification techniques in the field of AI were identified, and the techniques were grouped according to the algorithms involved. These algorithms were then divided according to the learning procedure used. Finally, the findings were plotted as a tree structure to visualize the relationship between learning procedures and algorithms.

Contribution This paper identifies the strengths, limitations, and current research trends in text classification in an advanced field like AI. This knowledge is crucial for data scientists. They could utilize the findings of this study to devise customized data models. It also helps the industry to understand the operational efficiency of text mining techniques. It further contributes to reducing the cost of the projects and supports effective decision making.

Findings Studying and understanding the nature of the data before mining was found to be especially important. With the increasing amount of data and the need for accuracy, automation of the text classification process is required. Another interesting research opportunity lies in building intricate text data models with deep learning systems, which can execute complex Natural Language Processing (NLP) tasks with semantic requirements.

Recommendations for Practitioners Recommended application areas for practitioners include frame analysis, deception detection, narrative science (where data expresses a story), healthcare applications for diagnosing illnesses, and conversation analysis.

Recommendation for Researchers Developing simpler algorithms in terms of coding and implementation, better approaches for knowledge distillation, multilingual text refining, domain knowledge integration, subjectivity detection, and contrastive viewpoint summarization are some of the areas that could be explored by researchers.

Impact on Society Text classification forms the base of data analytics and acts as the engine behind knowledge discovery. It supports state-of-the-art decision making, for example, predicting an event before it occurs or classifying a transaction as ‘Fraudulent’. The results of this study could be used for developing applications dedicated to assisting decision making processes. These informed decisions will help to optimize resources and maximize benefits to society.

Future Research In the future, better methods for parameter optimization will be identified by selecting parameters that reflect effective knowledge discovery. The role of streaming data processing is still rarely explored in text classification.


Techniques for text classification: Literature review and current trends


Automated classification of text into predefined categories has always been considered as a vital method to manage and process a vast amount of documents in digital forms that are widespread and continuously increasing. This kind of web information, popularly known as the digital/electronic information is in the form of documents, conference material, publications, journals, editorials, web pages, e-mail etc. People largely access information from these online sources rather than being limited to archaic paper sources like books, magazines, newspapers etc. But the main problem is that this enormous information lacks organization which makes it difficult to manage. Text classification is recognized as one of the key techniques used for organizing such kind of digital data. In this paper we have studied the existing work in the area of text classification which will allow us to have a fair evaluation of the progress made in this field till date. We have investigated the papers to the ...

Related Papers

The exponential growth of the internet has led to a great deal of interest in developing useful and efficient tools and software to assist users in searching the Web. Document retrieval, categorization, routing and filtering can all be formulated as classification problems. However, the complexity of natural languages and the extremely high dimensionality of the feature space of documents have made this classification problem very difficult. We investigate four different methods for document classification: the naive Bayes classifier, the nearest neighbour classifier, decision trees and a subspace method. These were applied to seven-class Yahoo news groups (business, entertainment, health, international, politics, sports and technology) individually and in combination. We studied three classifier combination approaches: simple voting, dynamic classifier selection and adaptive classifier combination. Our experimental results indicate that the naive Bayes classifier and the subspace method outperform the other two classifiers on our data sets. Combinations of multiple classifiers did not always improve the classification accuracy compared to the best individual classifier. Among the three different combination approaches, our adaptive classifier combination method introduced here performed the best. The best classification accuracy that we are able to achieve on this seven-class problem is approximately 83%, which is comparable to the performance of other similar studies. However, the classification problem considered here is more difficult because the pattern classes used in our experiments have a large overlap of words in their corresponding documents.
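
The "simple voting" combination mentioned in the abstract above can be illustrated with a minimal sketch. It assumes each classifier outputs one label per document and that ties go to the label that appears first in the list; the abstract does not say how ties were handled, and the function name and sample votes are hypothetical.

```python
from collections import Counter


def simple_vote(predictions):
    """Return the label chosen by the largest number of classifiers.

    `predictions` holds one label per classifier for a single document.
    Ties are broken in favour of the label that appears first in the
    list (an assumption made for this sketch).
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


# Hypothetical outputs of four classifiers for one news document.
votes = ["sports", "health", "sports", "politics"]
print(simple_vote(votes))
```

Dynamic classifier selection and adaptive combination, the other two approaches named in the abstract, need per-classifier reliability estimates and are not covered by this sketch.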


Khairullah Khan

IJFRCSCE Journal

Text classification is used to group documents of similar types. It can be performed under supervision, i.e. it is a supervised learning technique: documents are sorted automatically into different classes from a predefined set. The main issue is that large volumes of information lack organization, which makes them difficult to manage. Text classification is identified as one of the key methods for organizing such digital information. Text classification has various applications, such as information retrieval, natural language processing, automatic indexing, text filtering, image processing, etc. It is also used to process big data and to predict class labels for newly added data, and it is used in academia and industry to classify unstructured data. There are various text classification approaches, such as decision trees, SVM, Naïve Bayes etc. In this survey paper, we have analysed these techniques. Each has its own set of advantages, which makes it suitable for a wide range of classification jobs. In this paper we have also analysed evaluation parameters such as F-measure, G-measure and accuracy used in various research works.
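
The evaluation parameters named in the abstract above can be computed from a confusion matrix. The sketch below is an illustration for the binary case, not the surveyed papers' code. It assumes that "G-measure" means the geometric mean of precision and recall; some papers use the geometric mean of sensitivity and specificity instead. The function name and sample labels are hypothetical.

```python
import math


def binary_metrics(y_true, y_pred, positive):
    """Compute accuracy, F-measure and G-measure for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F-measure: harmonic mean of precision and recall.
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    # G-measure (assumed definition): geometric mean of precision and recall.
    g_measure = math.sqrt(precision * recall)
    return {"accuracy": accuracy, "f_measure": f_measure, "g_measure": g_measure}


# Hypothetical gold labels and predictions for four e-mails.
gold = ["spam", "spam", "ham", "ham"]
pred = ["spam", "ham", "ham", "ham"]
print(binary_metrics(gold, pred, positive="spam"))
```

With one of the two spam messages missed and no false alarms, precision is 1.0 and recall is 0.5, so the F-measure (about 0.67) and G-measure (about 0.71) both fall below the accuracy of 0.75.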

IJRISE Journal

The digital revolution has brought a huge change in the availability of online information. Finding information for almost any need has never been more automatic. Text categorization (also known as text classification or topic spotting) is the task of automatically sorting a set of documents into categories from a predefined set. This task has several applications, including automated indexing of scientific articles, filing patents into patent directories, selective dissemination of information to information consumers, automated population of hierarchical catalogues of Web resources, spam filtering, and identification of document genre. Automated text classification is attractive because it frees organizations from the need to organize document bases manually, which can be too expensive, or simply not feasible given the time constraints of the application or the number of documents involved. The accuracy of modern text classification systems rivals that of trained human professionals, thanks to a combination of information retrieval (IR) technology and machine learning (ML) technology. The aim of this paper is to highlight the essential algorithms that are used in text document classification, while at the same time raising awareness of some of the interesting challenges that remain to be solved.

Malaysian Journal of Computer Science

Moe Htet Min

Due to the mass availability of textual data on the Web, text classification (TC), the task of classifying texts into predetermined sets, has become a focus for researchers. A number of TC applications have been proposed, yet very few studies have reported an overview of the TC research area in a proper and systematic manner. This paper aims to provide an overview of TC research trends and gaps by structuring and analyzing research patterns, encountered problems and problem-solving methods in TC. In other words, this study highlights problem types, data sources, choice of language of text and types of applied techniques in TC. An intensive systematic study is conducted by applying guidelines proposed by Petersen and colleagues in 2007. In this paper, ninety-six studies from five electronic databases, published from 2006 to 2017, were systematically reviewed, following each step of a systematic mapping study. Nine main problems in TC research area were identified and significant fin...

ACM Computing Surveys

Aidana Darkenova

With the increasing availability of electronic documents and the rapid growth of the World Wide Web, the task of automatic categorization of documents has become the key method for organizing information and knowledge discovery. Proper classification of e-documents, online news, blogs, e-mails and digital libraries needs text mining, machine learning and natural language processing techniques to extract meaningful knowledge. The aim of this paper is to highlight the important techniques and methodologies that are employed in text document classification, while at the same time raising awareness of some of the interesting challenges that remain to be solved, focusing mainly on text representation and machine learning techniques. This paper provides a review of the theory and methods of document classification and text mining, focusing on the existing literature.

International Journal of Computer Applications

Mukesh Zaveri

Web Intelligence

Xiaohui Tao

Text classification (a.k.a. text categorisation) is an effective and efficient technology for information organisation and management. As the explosion of information resources on the Web and corporate intranets continues, it has become more and more important and has attracted wide attention from many different research fields. In the literature, many feature selection methods and classification algorithms have been proposed. It also has important applications in the real world. However, the dramatic increase in the availability of massive text data from various sources is creating a number of issues and challenges for text classification, such as scalability. The purpose of this report is to give an overview of existing text classification technologies for building more reliable text classification applications, and to propose a research direction for addressing the challenging problems in text mining.

IRJET Journal



COMMENTS

  1. Techniques for text classification: Literature review and current trends

    Text classification is recognized as one of the key techniques used for organizing such kind of digital data. In this paper we have studied the existing work in the area of text classification ...

  2. PDF Techniques for text classification: Literature review and current trends

    review if the paper describes research on text classification. This review does not describe all the text classification models and the techniques used to develop them in detail for practitioners. Our aim is to classify the papers with respect to their years, datasets, different feature selection

  3. Techniques for text classification: Literature review and current trends

    Techniques for text classification: Literature review and current trends. This paper has studied the existing work in the area of text classification and tried to summarize all existing information in a comprehensive and succinct manner to have a fair evaluation of the progress made in this field till date.

  4. Text Classification Techniques: A Literature Review

    current trends and future research options in text classification techniques. LITERATURE REVIEW This article is a literature review of various studies related to text classification approaches ...

  5. The Research Trends of Text Classification Studies (2000-2020): A

    Text Classification (TC), also known as Document Classification or Text Categorization, is the process of assigning several predefined categories to a set of texts, often based on its content (Jindal et al., 2015; Wang & Deng, 2017). With the advent of the era of big data, the enormous quantity and diversity of digital documents have made it challenging for TC.

  6. Techniques for text classification: Literature review and current trends

    The main emphasis is laid on various steps involved in text classification process viz. document representation methods, feature selection methods, data mining methods and the evaluation technique used by each study to carry out the results on a particular dataset. Pages: 1-28. Keywords: Machine learning; Text classification; Feature selection ...

  7. Feature selection methods for text classification: a systematic

    Feature Selection (FS) methods alleviate key problems in classification procedures as they are used to improve classification accuracy, reduce data dimensionality, and remove irrelevant data. FS methods have received a great deal of attention from the text classification community. However, only a few literature surveys include them focusing on text classification, and the ones available are ...

  8. Techniques for text classification: Literature review and current

    Techniques for text classification: Literature review and current trends Rajni Jindal, Ruchika Malhotra, Abha Jain; Affiliations Rajni Jindal Department of Computer Science & Engineering, Delhi Technological University, Delhi, India. E-mail: rajnijindal (at) dce.ac.in Ruchika Malhotra Department of Computer Science & Engineering, Delhi ...

  9. The Research Trends of Text Classification Studies (2000-2020): A

    The Research Trends of Text Classification Studies (2000-2020): A Bibliometric Analysis ... Malhotra R., Jain A. (2015). Techniques for text classification: Literature review and current trends. ... Shaikh K. (2018). Prediction of cause of death from forensic autopsy reports using text classification techniques: A comparative study. Journal ...

  10. A Systematic Literature Review of Text Classification: Datasets and

    We study the literature in major journals and conferences on the usage of shallow learning and deep learning methods for text classification. Shallow learning techniques such as Naive Bayes, Support Vector Machine, Random Forests were initially widely used to solve problems in text classification. However, these techniques generally require the presence of a precise feature extraction model ...

  11. (Pdf) Trends and Patterns of Text Classification Techniques: a

    Due to the mass availability of textual data on Web, text classification (TC), classifying texts into predetermined sets becomes a spotlight for researchers.

  12. Clinical text classification research trends: Systematic literature

    Clinical text classification techniques have been employed in several types of free-text clinical reports, such as pathology reports, radiology reports, autopsy reports, death certificates, and biomedical documents. Overall, nine different types of clinical reports were identified from the literature as shown in Table 4. As shown here, majority ...

  13. Text Classification Techniques: A Literature Review

    The strengths, limitations, and current research trends in text classification in an advanced field like AI are identified to provide an improved awareness regarding various knowledge extraction possibilities in the field of data mining. Aim/Purpose The aim of this paper is to analyze various text classification techniques employed in practice, their strengths and weaknesses, to provide an ...

  14. PDF Deep Learning Schema-based Event Extraction: Literature Review and

    role classification can be defined as an argument extraction task. The event classification is a multi-label text classification task to classify the type of each event. The role classification task is a multi-classification task based on word pairs, determining the role relationship between any pair of triggers and entities in a sentence.

  15. Techniques for text classification: Literature review and current trends

    This paper provides a comparison of the performance of well-known text classification techniques including genetic algorithm, k nearest neighbor, decision tree, support vector machine and Naïve Bayes. ... Literature review and current trends Rajni Jindal Department of Computer Science & Engineering, Delhi Technological University, Delhi, India ...

  16. Review on sentiment analysis for text classification techniques from

    Progression in the popularity of social media activities had provided huge amount of data in the form of text that can immeasurably augment its specialty. This textual data offers a platform for the reviewers to share their comments about any product, service or event on social media. These types of discussions among the reviewers boost the demand and supply in business and industry field ...

  17. PDF Text Classification Techniques: A Literature Review

    The text classification techniques section elaborately describes various approaches. The findings section explains various results observed from the articles reviewed. The discussions section explains research gaps, and the conclusion section highlights some of the current trends and future research options in text classification techniques.

  18. IJIKM

    Text Classification Techniques: A Literature Review. Aim/Purpose The aim of this paper is to analyze various text classification techniques employed in practice, their strengths and weaknesses, to provide an improved awareness regarding various knowledge extraction possibilities in the field of data mining. Background Artificial Intelligence is ...

  19. Techniques for text classification: Literature review and current trends

    Webology, Volume 12, Number 2, December, 2015 Home Table of Contents Titles & Subject Index Authors Index Techniques for text classification: Literature review and current trends Rajni Jindal Department of Computer Science & Engineering, Delhi Technological University, Delhi, India.

  20. A Review of Current Trends, Techniques, and Challenges in Large ...

    Natural language processing (NLP) has significantly transformed in the last decade, especially in the field of language modeling. Large language models (LLMs) have achieved SOTA performances on natural language understanding (NLU) and natural language generation (NLG) tasks by learning language representation in self-supervised ways. This paper provides a comprehensive survey to capture the ...

  21. Techniques for text classification: Literature review and current trends

    Automated classification of text into predefined categories has always been considered as a vital method to manage and process a vast amount of documents in digital forms that are widespread and continuously increasing. This kind of web information, ... Techniques for text classification: Literature review and current trends ...

  22. (Pdf) Trends and Patterns of Text Classification Techniques: a

    TRENDS AND PATTERNS OF TEXT CLASSIFICATION TECHNIQUES: A SYSTEMATIC MAPPING STUDY. Due to the mass availability of textual data on Web, text classification (TC), classifying texts into ...

  23. Techniques for text classification: Literature review and current

    Techniques for text classification: Literature review and current trends. Webology, Dec 2015 Rajni Jindal, Ruchika Malhotra, Abha ... Text classification is recognized as one of the key techniques used for organizing such kind of digital data. In this paper we have studied the existing work in the area of text classification which will allow us ...