Prediction of movie success based on machine learning and twitter sentiment analysis using internet movie database data

Jyoti Tripathi, Sunita Tiwari, Anu Saini, Sunita Kumari


Nowadays, predicting the success of a new movie is a crucial task. In this work, the hybrid approach considers the movie features as well as sentiment expressed in the movie review to predict the success rate of a movie. Multiple movie features such as title, director, star cast, and writer. Are considered for prediction. The related raw data is collected from the internet movie database (IMDb) website and after pre-processing, the collected data is used to generate the supervised machine learning model. Different supervised learning models are compared and the one with the best results is used further. The mean squared error, root mean squared error and r2 score of the models generated are comparable with existing models. Further, sentiment analysis of the movie-related tweets is performed. The accuracy of best sentiment analysis model is 88.47%. Finally, the two models are combined to give the success prediction rating of new movies and the results of the hybrid model are encouraging. The proposed model may be used to find the top-rated movies of a particular calendar year.


Decision tree; Entertainment industry; Naïve Bayes; Random forest; Regression; Supervised learning; Support vector machines

Full Text:




  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics