Subscribe to the PwC Newsletter

Join the community, edit dataset, edit dataset tasks.

Some tasks are inferred based on the benchmarks list.

Add a Data Loader

Remove a data loader.

  • huggingface/datasets -
  • tensorflow/datasets -
  • pytorch/text -

Edit Dataset Modalities

Edit dataset languages, edit dataset variants.

The benchmarks section lists all benchmarks using a given dataset or any of its variants. We use variants to distinguish between results evaluated on slightly different versions of the same dataset. For example, ImageNet 32⨉32 and ImageNet 64⨉64 are variants of the ImageNet dataset.

Add a new evaluation result row

Imdb movie reviews.

sentiment analysis of imdb movie reviews

The IMDb Movie Reviews dataset is a binary sentiment analysis dataset consisting of 50,000 reviews from the Internet Movie Database (IMDb) labeled as positive or negative. The dataset contains an even number of positive and negative reviews. Only highly polarizing reviews are considered. A negative review has a score ≤ 4 out of 10, and a positive review has a score ≥ 7 out of 10. No more than 30 reviews are included per movie. The dataset contains additional unlabeled data.

Benchmarks Edit Add a new result Link an existing benchmark

Dataset loaders edit add remove.

sentiment analysis of imdb movie reviews

Similar Datasets

License edit, modalities edit, languages edit.

Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory

Ieee account.

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

Search code, repositories, users, issues, pull requests...

Provide feedback.

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly.

To see all available qualifiers, see our documentation .

  • Notifications

Explore sentiment analysis on the IMDB movie reviews dataset using Python. This Jupyter Notebook showcases text preprocessing, TF-IDF feature extraction, and model training (Multinomial Naive Bayes, Random Forest) for sentiment classification. Ideal for understanding NLP basics and applying ML to textual data.

qh21/Sentiment-Analysis-of-IMDB-Movie-Reviews

Folders and files, repository files navigation, sentiment-analysis-of-imdb-movie-reviews.

  • Jupyter Notebook 100.0%

1258. Crypto Market Sentiment | Data Bitcoin Analysis

  • Podcast Episode

Add a plot in your language

User reviews

  • September 18, 2023 (United Kingdom)
  • See more company credits at IMDbPro

Technical specs

  • Runtime 15 minutes

Related news

Contribute to this page.

  • IMDb Answers: Help fill gaps in our data
  • Learn more about contributing

More to explore

Production art

Recently viewed

Book cover

International Conference on Emerging Research in Computing, Information, Communication and Applications

ERCICA 2023: Advances in Computing and Information pp 107–129 Cite as

Sentiment Exploring on Feedback of E-commerce Data Using Machine Learning Algorithms

  • Amrithkala M. Shetty 39 ,
  • Mohammed Fadhel Aljunid 40 &
  • D. H. Manjaiah 39  
  • Conference paper
  • First Online: 16 December 2023

64 Accesses

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1104))

In today’s fast-growing Internet world, customer ratings and reviews play an essential role in online buying on e-commerce websites such as Amazon, Flipkart, and others. Sentiment analysis is crucial for increasing customer satisfaction on e-commerce sites since it contains a lot of consumer feedback. In this work, we have used Amazon Women's E-Commerce Clothing Reviews dataset. We have used CountVectorizer and TF-IDF and trained the data on five machine learning (ML) classifiers, namely logistic regression (LR), multinomial Naive Bayes (MNB), Bernoulli Naive Bayes (BNB), support vector machine (SVM), random forest (RF), and AdaBoosting (AB). When comparing the ML model’s accuracy scores concerning the CountVectorizer, it was discovered that MNB and LR models had the highest accuracy of 0.94, while RF had the lowest accuracy of 0.90. SVM achieved the maximum accuracy of 0.94 using the TF-IDF approach, and MNB achieved the lowest accuracy of 0.89. The accuracy, precision, recall, F1-score, and AUC-ROC curve help us to determine the performance of the ML algorithms. To examine the dataset’s attributes and comprehend the relationships between the variables, many statistical techniques were applied.

  • Sentiment analysis
  • CountVectorizer
  • Logistic regression
  • Multinomial Naive Bayes
  • Bernoulli Naive Bayes
  • Support vector machine
  • Random forest
  • AdaBoosting

Mohammed Fadhel Aljunid and D.H. Manjaiah: These authors contributed equally to this work.

This is a preview of subscription content, log in via an institution .

Buying options

  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Available as EPUB and PDF
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

https://www.learndatasci.com/glossary/tf-idf-term-frequency-inverse-document-frequency/ .

https://www.ibm.com/in-en/topics/logistic-regression .

https://www.geeksforgeeks.org/naive-bayes-classifiers/ .

https://www.geeksforgeeks.org/support-vector-machine-algorithm/ .

https://towardsdatascience.com/understanding-random-forest-58381e0602d2 .

https://www.javatpoint.com/machine-learning-random-forest-algorithm .

https://www.scikit-yb.org/en/latest/api/classifier/rocauc.htmlmulti-class-rocauc-curves .

Katthik RV, Sannasi G (2021) A fuzzy recommendation system for predicting the customers interests using sentiment analysis and ontology in e-commerce. Appl Soft Comput 108:107396 (2021). https://doi.org/10.1016/j.asoc.2021.107396

Wankhade M, Rao ACS, Kulkarni C (2022) A survey on sentiment analysis methods, applications, and challenges. Artif Intell Rev 1–50

Google Scholar  

Liu B et al (2010) Sentiment analysis and subjectivity. In: Handbook of natural language processing, vol 2, pp 627–666

Kubrusly J, Neves AL, Marques TL (2022) A statistical analysis of textual e-commerce reviews using tree-based methods. Open J Stat 12(03):357–372

Article   Google Scholar  

Lin X (2020) Sentiment analysis of e-commerce customer reviews based on natural language processing. In: Proceedings of the 2020 2nd international conference on big data and artificial intelligence. ACM, New York, NY, USA

Agarap AF (2018) Statistical analysis on e-commerce reviews, with sentiment classification using bidirectional recurrent neural network (RNN). arXiv:1805.03687

Deniz E, Erbay H, Coşar M (2022) Multi-label classification of e-commerce customer reviews via machine learning. Axioms 11(9). https://doi.org/10.3390/axioms11090436

Wassan S, Shen T, Xi C, Gulati K, Vasan D, Suhail B (2022) Customer experience towards the product during a coronavirus outbreak. Behav Neurol 2022:4279346. https://doi.org/10.1155/2022/4279346

Balakrishnan V, Shi Z, Law CL, Lim R, Teh LL, Fan Y (2022) A deep learning approach in predicting products’ sentiment ratings: a comparative analysis. J Supercomput 78(5):7206–7226. https://doi.org/10.1007/s11227-021-04169-6

Nawaz Z, Zhao C, Nawaz F, Safeer AA, Irshad W (2021) Role of artificial neural networks techniques in development of market intelligence: a study of sentiment analysis of eWOM of a women’s clothing company. J Theor Appl Electron Commerce Res 16(5):1862–1876. https://doi.org/10.3390/jtaer16050104

Kabir AI, Ahmed K, Karim R (2020) Word cloud and sentiment analysis of amazon earphones reviews with r programming language. Informatica Economica 24 :55–71. https://doi.org/10.24818/issn14531305/24.4.2020.05

Kumar JA, Trueman TE, Cambria E (2022) Gender-based multi-aspect sentiment detection using multilabel learning. Inf Sci 606:453–468. https://doi.org/10.1016/j.ins.2022.05.057

Kim S-W, Gil J-M (2019) Research paper classification systems based on TF-IDF and LDA schemes. HCIS 9(1):1–21

Mahesh B (2019) Machine learning algorithms—A review

Singh G, Kumar B, Gaur L, Tyagi A (2019) Comparison between multi-nomial and Bernoulli naïve Bayes for text classification. In: 2019 International conference on automation, computational and technology management (ICACTM), pp 593–596. https://doi.org/10.1109/ICACTM.2019.8776800

Ray S (2019) A quick review of machine learning algorithms. In: 2019 International conference on machine learning, big data, cloud and parallel computing (COMITCon). IEEE, pp 35–39

Aljedaani W, Rustam F, Ludi S, Ouni A, Mkaouer MW (2021) Learning sentiment analysis for accessibility user reviews. In: 2021 36th IEEE/ACM international conference on automated software engineering workshops (ASEW). IEEE, pp 239–246

Behera RK, Jena M, Rath SK, Misra S (2021) Co-lstm: Convolutional LSTM model for sentiment analysis in social big data. Inf Process Manage 58(1):102435

Li H, Chen Q, Zhong Z, Gong R, Han G (2022) E-word of mouth sentiment analysis for user behavior studies. Inf Process Manage 59(1):102784. https://doi.org/10.1016/j.ipm.2021.102784

Download references

Author information

Authors and affiliations.

CS Department, Mangalore University, Mangaluru, Karnataka, 574199, India

Amrithkala M. Shetty & D. H. Manjaiah

Computer and Informatics Center, Thamar university, Thamar, Yemen

Mohammed Fadhel Aljunid

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Amrithkala M. Shetty .

Editor information

Editors and affiliations.

Central University of Karnataka, Kalaburagi, Karnataka, India

N. R. Shetty

Nitte Meenakshi Institute of Technology, Bangalore, Karnataka, India

N. H. Prasad

Nitte Meenakshi Institute of Technology, Bengaluru, Karnataka, India

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper.

Shetty, A.M., Aljunid, M.F., Manjaiah, D.H. (2024). Sentiment Exploring on Feedback of E-commerce Data Using Machine Learning Algorithms. In: Shetty, N.R., Prasad, N.H., Nalini, N. (eds) Advances in Computing and Information. ERCICA 2023. Lecture Notes in Electrical Engineering, vol 1104. Springer, Singapore. https://doi.org/10.1007/978-981-99-7622-5_8

Download citation

DOI : https://doi.org/10.1007/978-981-99-7622-5_8

Published : 16 December 2023

Publisher Name : Springer, Singapore

Print ISBN : 978-981-99-7621-8

Online ISBN : 978-981-99-7622-5

eBook Packages : Engineering Engineering (R0)

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

IMAGES

  1. Sentiment Analysis On IMDB Movie Review

    sentiment analysis of imdb movie reviews

  2. IMDB Sentiment Analysis based on comment Machine Learning

    sentiment analysis of imdb movie reviews

  3. GitHub

    sentiment analysis of imdb movie reviews

  4. GitHub

    sentiment analysis of imdb movie reviews

  5. Contentbased Movie Recommendation Using Sentiment Analysis

    sentiment analysis of imdb movie reviews

  6. Sentiment analysis of IMDB reviews at a level 2, and b level 3

    sentiment analysis of imdb movie reviews

VIDEO

  1. Sentiment Analysis for Movie Reviews

  2. Sentiment analysis for customer reviews

  3. CS50P Data Cleaning for Sentiment Analysis of IMDB Movie Reviews

  4. intel

  5. EEL6825 Pattern Recognition Project Sentiment Analysis for IMDB Movie Review using TF IDF modeling

  6. 26: Sentiment Analysis

COMMENTS

  1. Sentiment Analysis of IMDB Movie Reviews

    Explore and run machine learning code with Kaggle Notebooks | Using data from IMDB Dataset of 50K Movie Reviews

  2. PDF Sentiment analysis of IMDb reviews

    The report utilizes a methodology to conduct the analysis of the sentiment analysis of IMDb reviews, as shown in Fig. 1. First, the report illustrates and feeds the data into the data cleaning and preprocess. Next, the report removes the stop words and some irrelevant words from the original data; then, the vectorization techniques are applied ...

  3. IMDb Movie Reviews Dataset

    The IMDb Movie Reviews dataset is a binary sentiment analysis dataset consisting of 50,000 reviews from the Internet Movie Database (IMDb) labeled as positive or negative. The dataset contains an even number of positive and negative reviews. Only highly polarizing reviews are considered. A negative review has a score ≤ 4 out of 10, and a positive review has a score ≥ 7 out of 10.

  4. Sentiment Analysis of IMDb Movie Reviews: A Comparative Analysis of

    There have been several studies on sentiment analysis of movie reviews. One of the earliest studies on this topic was by Pang Lee et al. where they discussed ML techniques along with N-gram model for identifying best features that were used for sentiment analysis [].Rahman et al. described the possible ways of applying ML algorithms for sentiment analysis on a Bengali movie review dataset [].

  5. Unsupervised Semantic Sentiment Analysis of IMDB Reviews

    To demonstrate this approach, I use the well-known IMDB database. Released to the public by Stanford University, this dataset is a collection of 50,000 reviews from IMDB that contains an even number of positive and negative reviews with no more than 30 reviews per movie. As noted in the dataset introduction notes, "a negative review has a score ...

  6. How to Prepare Movie Review Data for Sentiment Analysis (Text

    The reviews were originally released in 2002, but an updated and cleaned up version was released in 2004, referred to as "v2.0". The dataset is comprised of 1,000 positive and 1,000 negative movie reviews drawn from an archive of the rec.arts.movies.reviews newsgroup hosted at IMDB. The authors refer to this dataset as the "polarity ...

  7. Sentiment Analysis of IMDB Movie Reviews Using Deep Learning ...

    Both of these methods will be used in this work to evaluate the performance of k-means. In this paper, we will implement three different transformer models for sentiment analysis on a labeled IMDB dataset that contains 50,000 movie reviews. The dataset contains a balanced amount of positive and negative reviews.

  8. Performing Sentiment Analysis on Movie Reviews

    1. Introduction and Importing Data. In this article, I will be using the IMDB movie reviews dataset for this study. The dataset contains 50,000 reviews — 25,000 positive and 25,000 negative reviews. An example of a review can be seen in Fig 1, where a user gave a 10/10 rating and a written review for the Oscar-winning movie Parasite (2020).

  9. Sentiment Analysis of IMDB Movie Reviews Using Deep ...

    Sentiment Analysis on IMDb movie reviews identifies the overall sentiment or opinion expressed by a reviewer towards a movie. Many researchers are working on pruning the sentiment analysis model ...

  10. Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory

    The sentiment analysis is an emerging research area where vast amount of data are being analyzed, to generate useful insights in regards to a specific topic. It ... In this paper the Long Short-Term Memory (LSTM) classifier is used for analyzing sentiments of the IMDb movie reviews. It is based on the Recurrent Neural Network (RNN) algorithm ...

  11. Sentiment Analysis on IMDB Movie Reviews using Machine Learning and

    In this paper, sentiment analysis on IMDB movie reviews dataset is implemented using Machine Learning (ML) and Deep Learning (DL) approaches to measure the accuracy of the model. ML algorithms are the traditional algorithms that work in a single layer while deep learning algorithms work on multilayers and gives better output. This paper helps ...

  12. Sentiment Analysis of IMDb Movie Reviews

    PDF | On Mar 25, 2022, Ayanabha Ghosh published Sentiment Analysis of IMDb Movie Reviews : A comparative study on Performance of Hyperparameter-tuned Classification Algorithms | Find, read and ...

  13. Sentiment-Analysis-of-IMDB-Movie-Reviews

    Explore sentiment analysis on the IMDB movie reviews dataset using Python. This Jupyter Notebook showcases text preprocessing, TF-IDF feature extraction, and model training (Multinomial Naive Bayes, Random Forest) for sentiment classification. Ideal for understanding NLP basics and applying ML to textual data. - qh21/Sentiment-Analysis-of-IMDB-Movie-Reviews

  14. Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory

    In this paper the Long Short-Term Memory (LSTM) classifier is used for analyzing sentiments of the IMDb movie reviews. It is based on the Recurrent Neural Network (RNN) algorithm. The data is ...

  15. Sentiment Analysis on IMDB Movie Reviews

    Notebook to train an XLNet model to perform sentiment analysis. The dataset used is a balanced collection of (50,000 - 1:1 train-test ratio) IMDB movie reviews with binary labels: postive or negative from the paper by Maas et al. (2011).The current state-of-the-art model on this dataset is XLNet by Yang et al. (2019) which has an accuracy of 96.2%.We get an accuracy of 92.2% due to the ...

  16. PDF Sentiment Analysis of IMDB Movie Reviews Using Deep Learning ...

    Both of these methods will be used in this work to evaluate the performance of k-means. In this paper, we will implement three different transformer models for sentiment analysis on a labeled IMDB dataset that contains 50,000 movie reviews. The dataset contains a balanced amount of positive and negative reviews.

  17. Sentiment Classification of IMDB Movie Review Data Using a PyTorch LSTM

    The IMDB Movie Review Data The IMDB movie review data consists of 50,000 reviews -- 25,000 for training and 25,000 for testing. The training and test files are evenly divided into 12,500 positive reviews and 12,500 negative reviews. Negative reviews are those reviews associated with movies that the reviewer rated as 1 through 4 stars.

  18. Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory

    The Long Short-Term Memory (LSTM) classifier is used for analyzing sentiments of the IMDb movie reviews, based on the Recurrent Neural Network (RNN) algorithm, and results show a best classification accuracy of 89.9%. The sentiment analysis is an emerging research area where vast amount of data are being analyzed, to generate useful insights in regards to a specific topic. It is an effective ...

  19. Sentiment Analysis on IMDB Movie Reviews using VADER with Lexical

    A novel approach to enhance sentiment analysis using the IMDB Dataset is presented, which combines the well-established VADER sentiment analysis tool with Lexical Affinity and Semantic Sentiment Expansion and provides a more nuanced understanding of sentiment expressions in text. Sentiment analysis plays a crucial role in understanding public opinion and user sentiments in vast amounts of ...

  20. Sentiment analysis on IMDB using lexicon and neural networks

    To find out what other people think has been an essential part of information-gathering behaviors. And in the case of movies, the movie reviews can provide an intricate insight into the movie and can help decide whether it is worth spending time on. However, with the growing amount of data in reviews, it is quite prudent to automate the process, saving on time. Sentiment analysis is an ...

  21. Sentiment Analysis

    Sentiment relates to the meaning of a word or sequence of words and is usually associated with an opinion or emotion. And analysis? Well, this is the process of looking at data and making inferences; in this case, using machine learning to learn and predict whether a movie review is positive or negative. Maybe you're interested in knowing ...

  22. PDF Deep learning for sentiment analysis of movie reviews

    for sentiment analysis. The sentiment of reviews is binary, meaning the IMDB rating <5 results in a sentiment score of 0, and rating 7 have a sentiment score of 1. No individual movie has more than 30 reviews. The 25,000 review labeled training set does not include any of the same movies as the 25,000 review test set. In addition, there are ...

  23. Utilizing Machine Learning for Sentiment Analysis of IMDB Movie Review

    For sentiment analysis acl imdb movie review data set has been used. Lastly, the impact of stop words and number of attributes in accuracy for sentiment analysis has also been illustrated.

  24. 1258. Crypto Market Sentiment

    IMDb is the world's most popular and authoritative source for movie, TV and celebrity content. Find ratings and reviews for the newest movie and TV shows. Get personalized recommendations, and learn where to watch across hundreds of streaming providers.

  25. Sentiment Exploring on Feedback of E-commerce Data Using ...

    These reviews are important; sentiment analysis is performed on them. In this paper, we analyze the Amazon Women's Clothing E-Commerce dataset. ... including reviews of women's clothing and reviews of movies from the IMDB dataset. With an F1-score of 93.52% in the dataset for women's clothing's recommended classification, DSC did well ...