Optimizing Machine Learning in Hospitality Industry Implementation of Random Forest Model in Forecasting Hotel Guest Length of Stay

Main Article Content

Yerik Singgalen

Abstract

This study explores the application of the Random Forest algorithm in predicting the length of stay (LoS) of hotel guests, a critical metric for optimizing operational efficiency and revenue management in the hospitality industry. The research is grounded in the growing need for predictive analytics to address challenges posed by fluctuating demand, diverse customer preferences, and dynamic market conditions. Accurate LoS predictions allow for better resource allocation, enhanced guest experiences, and optimized pricing strategies, making this study highly relevant for advancing data-driven decision-making in the sector. The methodology involved analyzing a dataset of 453 accounts, which included key features such as ratings, guest types, room preferences, and country of origin. Comprehensive data preprocessing steps, including standardization, feature selection, and dataset splitting into training and testing subsets, ensured the reliability and robustness of the predictive model. The Random Forest algorithm, known for its ability to handle non-linear relationships and high-dimensional data, was implemented to analyze patterns and relationships. The model demonstrated high accuracy, achieving a Mean Squared Error (MSE) of 1.89, Mean Absolute Error (MAE) of 0.80, and Root Mean Squared Error (RMSE) of 1.37, effectively capturing the complexity of the dataset. The findings reveal that ratings and guest types are the most influential predictors, underscoring their importance in shaping guest behaviors. While the results are promising, limitations such as dataset size and scope suggest opportunities for further research. Future studies could incorporate more extensive, diverse datasets and explore alternative algorithms to enhance predictive accuracy and adaptability. This research contributes to advancing machine learning applications in hospitality, providing actionable insights to improve operational performance, guest satisfaction, and competitive positioning.

Article Details

Section
Articles

References

Ampountolas, A., & Legg, M. P. (2021). A segmented machine learning modeling approach of social media for predicting occupancy. International Journal of Contemporary Hospitality Management, 33(6), 2001–2021. https://doi.org/10.1108/IJCHM-06-2020-0611

Chang, V., Liu, L., Xu, Q., Li, T., & Hsu, C. H. (2023). An improved model for sentiment analysis on luxury hotel review. In Expert Systems (Vol. 40, Issue 2). https://doi.org/10.1111/exsy.12580

Chang, Y. M., Chen, C. H., Lai, J. P., Lin, Y. L., & Pai, P. F. (2021). Forecasting hotel room occupancy using long short-term memory networks with sentiment analysis and scores of customer online reviews. Applied Sciences (Switzerland), 11(21). https://doi.org/10.3390/app112110291

Dang, T. D., & Nguyen, M. T. (2024). Understanding Customer Perception and Brand Equity in the Hospitality Sector: Integrating Sentiment Analysis and Topic Modeling. In Springer Proceedings in Business and Economics (pp. 413–425). https://doi.org/10.1007/978-3-031-49105-4_24

Darvishmotevali, M., Arici, H. E., & Koseoglu, M. A. (2024). Customer satisfaction antecedents in uncertain hospitality conditions: an exploratory data mining approach. Journal of Hospitality and Tourism Insights. https://doi.org/10.1108/JHTI-11-2023-0845

Dursun-Cengizci, A., & Caber, M. (2024). Using machine learning methods to predict future churners: an analysis of repeat hotel customers. International Journal of Contemporary Hospitality Management. https://doi.org/10.1108/IJCHM-06-2023-0844

Dutta, K. B., Sahu, A., Sharma, B., Rautaray, S. S., & Pandey, M. (2021). Machine learning-based prototype for restaurant rating prediction and cuisine selection. In Advances in Intelligent Systems and Computing (Vol. 1166, pp. 57–68). https://doi.org/10.1007/978-981-15-5148-2_6

Filieri, R., Lin, Z., Li, Y., Lu, X., & Yang, X. (2022). Customer Emotions in Service Robot Encounters: A Hybrid Machine-Human Intelligence Approach. Journal of Service Research, 25(4), 614–629. https://doi.org/10.1177/10946705221103937

Hafiz, E. A., & Kaur, N. (2022). Improved Hotel Recommendation System Using Machine Learning Technique. In Proceedings - 2022 IEEE World Conference on Applied Intelligence and Computing, AIC 2022 (pp. 769–773). https://doi.org/10.1109/AIC55036.2022.9848942

Hamdan, I. Z. P., & Othman, M. (2022). Predicting Customer Loyalty Using Machine Learning for Hotel Industry. Journal of Soft Computing and Data Mining, 3(2), 31–42. https://doi.org/10.30880/jscdm.2022.03.02.004

Hamdan, I. Z. P., Othman, M., Hassim, Y. M. M., Marjudi, S., & Yusof, M. M. (2023). Customer Loyalty Prediction for Hotel Industry Using Machine Learning Approach. International Journal on Informatics Visualization, 7(3), 695–703. https://doi.org/10.30630/joiv.7.3.1335

Innork, K., Polpinij, J., Namee, K., Kaenampornpan, M., Saisangchan, U., & Wiangsamut, S. (2023). A Comparative Study of Multi-class Sentiment Classification Models for Hotel Customer Reviews. In 4th Research, Invention, and Innovation Congress: Innovative Electricals and Electronics: Innovation for Better Life, RI2C 2023 (pp. 88–92). https://doi.org/10.1109/RI2C60382.2023.10355942

Jaouhar, E. M., El Kafhali, S., & Saadi, Y. (2022). A Study of Machine Learning Based Approach for Hotels’ Matching. In Lecture Notes in Networks and Systems: Vol. 489 LNNS (pp. 373–384). https://doi.org/10.1007/978-3-031-07969-6_28

Kong, C., Ren, S., Wang, H., & Zhou, H. (2024). A Study on Esports Hotel Price Prediction Based on Random Forest Model. In 2024 IEEE 2nd International Conference on Image Processing and Computer Applications, ICIPCA 2024 (pp. 290–294). https://doi.org/10.1109/ICIPCA61593.2024.10709221

Kozlovskis, K., Liu, Y., Lace, N., & Meng, Y. (2023). Application of Machine Learning Algorithms To Predict Hotel Occupancy. Journal of Business Economics and Management, 24(3), 594–613. https://doi.org/10.3846/jbem.2023.19775

Kumar, M., Kumar, C., Kumar, N., & Kavitha, S. (2024). Efficient Hotel Rating Prediction from Reviews Using Ensemble Learning Technique. Wireless Personal Communications, 137(2), 1161–1187. https://doi.org/10.1007/s11277-024-11457-w

Li, X., & Liu, C. (2020). Comparison of Machine Learning Models for Sentimental Analysis of Hotel Reviews. In IOP Conference Series: Materials Science and Engineering (Vol. 806, Issue 1). https://doi.org/10.1088/1757-899X/806/1/012029

Nassif, A. B., Darya, A. M., & Elnagar, A. (2022). Empirical Evaluation of Shallow and Deep Learning Classifiers for Arabic Sentiment Analysis. ACM Transactions on Asian and Low-Resource Language Information Processing, 21(1). https://doi.org/10.1145/3466171

Parikh, S. N., Shah, J., Sutaria, K., & Vala, B. (2023). Theoretical Evaluation of Machine Learning Approaches for Hotel Recommendation. In Proceedings - 5th International Conference on Smart Systems and Inventive Technology, ICSSIT 2023 (pp. 1130–1137). https://doi.org/10.1109/ICSSIT55814.2023.10061074

Parikh, S. N., Shah, J., Sutaria, K., & Vala, B. (2024). Machine Learning Approaches for Hotel Recommendation. In AIP Conference Proceedings (Vol. 3107, Issue 1). https://doi.org/10.1063/5.0209059

Patel, A., Shah, N., Parul, V. B., & Suthar, K. S. (2023). Hotel Recommendation using Feature and Machine Learning Approaches: A Review. In Proceedings - 5th International Conference on Smart Systems and Inventive Technology, ICSSIT 2023 (pp. 1144–1149). https://doi.org/10.1109/ICSSIT55814.2023.10061034

Prabha, R., Senthil, G. A., Nisha, A. S. A., Snega, S., Keerthana, L., & Sharmitha, S. (2022). Comparison of Machine Learning Algorithms for Hotel Booking Cancellation in Automated Method. In 2022 1st International Conference on Computer, Power and Communications, ICCPC 2022 - Proceedings (pp.413–418). https://doi.org/10.1109/ICCPC55978.2022.10072135

Qureshi, S., & Menezes, J. (2023). Prediction of Hotel Booking Cancellation Using Machine Learning Algorithms. In IET Conference Proceedings (Vol. 2023, Issue 44, pp. 140–145). https://doi.org/10.1049/icp.2024.0914

Saputro, P. H., & Nanang, H. (2021). Exploratory Data Analysis & Booking Cancelation Prediction on Hotel Booking Demands Datasets. Journal of Applied Data Sciences, 2(1), 40–56. https://doi.org/10.47738/jads.v2i1.20

Sharma, H., & Aggarwal, A. G. (2020). What factors determine reviewer credibility?: An econometric approach validated through predictive modeling. Kybernetes, 49(10), 2547–2567. https://doi.org/10.1108/K-08-2019-0537

Shifullah, K., Rakibullah, H. M., Islam, N., Raihan, H., Iqbal, M. A., Ziaul Karim, D., & Rasel, A. A. (2022). Classification of Hotel Reviews Using Sentiment Analysis and Machine Learning. In Proceedings of 2022 25th International Conference on Computer and Information Technology, ICCIT 2022 (pp. 710–715). https://doi.org/10.1109/ICCIT57492.2022.10054884

Singh, I. (2022). Dynamic Pricing using Reinforcement Learning in Hospitality Industry. In IBSSC 2022 - IEEE Bombay Section Signature Conference. https://doi.org/10.1109/IBSSC56953.2022.10037523

Taherkhani, L., Daneshvar, A., Amoozad Khalili, H., & Sanaei, M. R. (2023). Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm. Advances in Civil Engineering, 2023. https://doi.org/10.1155/2023/6029121

Trivedi, S. K., Singh, A., & Malhotra, S. K. (2023). Prediction of polarities of online hotel reviews: an improved stacked decision tree (ISD) approach. Global Knowledge, Memory and Communication, 72(8–9), 765–778. https://doi.org/10.1108/GKMC-12-2021-0197

Yoo, M., Singh, A. K., & Loewy, N. (2024). Predicting hotel booking cancelation with machine learning techniques. Journal of Hospitality and Tourism Technology, 15(1), 54–69. https://doi.org/10.1108/JHTT-07-2022-0227

Zhao, Z., Zhou, W., Qiu, Z., Li, A., & Wang, J. (2022). Research on Ctrip Customer Churn Prediction Model Based on Random Forest. In Lecture Notes on Data Engineering and Communications Technologies (Vol. 107, pp. 511–523). https://doi.org/10.1007/978-3-030-92632-8_48