Optimizing Machine Learning in Hospitality Industry Implementation of Random Forest Model in Forecasting Hotel Guest Length of Stay
Main Article Content
Abstract
This study explores the application of the Random Forest algorithm in predicting the length of stay (LoS) of hotel guests, a critical metric for optimizing operational efficiency and revenue management in the hospitality industry. The research is grounded in the growing need for predictive analytics to address challenges posed by fluctuating demand, diverse customer preferences, and dynamic market conditions. Accurate LoS predictions allow for better resource allocation, enhanced guest experiences, and optimized pricing strategies, making this study highly relevant for advancing data-driven decision-making in the sector. The methodology involved analyzing a dataset of 453 accounts, which included key features such as ratings, guest types, room preferences, and country of origin. Comprehensive data preprocessing steps, including standardization, feature selection, and dataset splitting into training and testing subsets, ensured the reliability and robustness of the predictive model. The Random Forest algorithm, known for its ability to handle non-linear relationships and high-dimensional data, was implemented to analyze patterns and relationships. The model demonstrated high accuracy, achieving a Mean Squared Error (MSE) of 1.89, Mean Absolute Error (MAE) of 0.80, and Root Mean Squared Error (RMSE) of 1.37, effectively capturing the complexity of the dataset. The findings reveal that ratings and guest types are the most influential predictors, underscoring their importance in shaping guest behaviors. While the results are promising, limitations such as dataset size and scope suggest opportunities for further research. Future studies could incorporate more extensive, diverse datasets and explore alternative algorithms to enhance predictive accuracy and adaptability. This research contributes to advancing machine learning applications in hospitality, providing actionable insights to improve operational performance, guest satisfaction, and competitive positioning.
Article Details

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
Ampountolas, A., & Legg, M. P. (2021). A segmented machine learning modeling approach of social media for predicting occupancy. International Journal of Contemporary Hospitality Management, 33(6), 2001–2021. https://doi.org/10.1108/IJCHM-06-2020-0611
Chang, V., Liu, L., Xu, Q., Li, T., & Hsu, C. H. (2023). An improved model for sentiment analysis on luxury hotel review. In Expert Systems (Vol. 40, Issue 2). https://doi.org/10.1111/exsy.12580
Chang, Y. M., Chen, C. H., Lai, J. P., Lin, Y. L., & Pai, P. F. (2021). Forecasting hotel room occupancy using long short-term memory networks with sentiment analysis and scores of customer online reviews. Applied Sciences (Switzerland), 11(21). https://doi.org/10.3390/app112110291
Dang, T. D., & Nguyen, M. T. (2024). Understanding Customer Perception and Brand Equity in the Hospitality Sector: Integrating Sentiment Analysis and Topic Modeling. In Springer Proceedings in Business and Economics (pp. 413–425). https://doi.org/10.1007/978-3-031-49105-4_24
Darvishmotevali, M., Arici, H. E., & Koseoglu, M. A. (2024). Customer satisfaction antecedents in uncertain hospitality conditions: an exploratory data mining approach. Journal of Hospitality and Tourism Insights. https://doi.org/10.1108/JHTI-11-2023-0845
Dursun-Cengizci, A., & Caber, M. (2024). Using machine learning methods to predict future churners: an analysis of repeat hotel customers. International Journal of Contemporary Hospitality Management. https://doi.org/10.1108/IJCHM-06-2023-0844
Dutta, K. B., Sahu, A., Sharma, B., Rautaray, S. S., & Pandey, M. (2021). Machine learning-based prototype for restaurant rating prediction and cuisine selection. In Advances in Intelligent Systems and Computing (Vol. 1166, pp. 57–68). https://doi.org/10.1007/978-981-15-5148-2_6
Filieri, R., Lin, Z., Li, Y., Lu, X., & Yang, X. (2022). Customer Emotions in Service Robot Encounters: A Hybrid Machine-Human Intelligence Approach. Journal of Service Research, 25(4), 614–629. https://doi.org/10.1177/10946705221103937
Hafiz, E. A., & Kaur, N. (2022). Improved Hotel Recommendation System Using Machine Learning Technique. In Proceedings - 2022 IEEE World Conference on Applied Intelligence and Computing, AIC 2022 (pp. 769–773). https://doi.org/10.1109/AIC55036.2022.9848942
Hamdan, I. Z. P., & Othman, M. (2022). Predicting Customer Loyalty Using Machine Learning for Hotel Industry. Journal of Soft Computing and Data Mining, 3(2), 31–42. https://doi.org/10.30880/jscdm.2022.03.02.004
Hamdan, I. Z. P., Othman, M., Hassim, Y. M. M., Marjudi, S., & Yusof, M. M. (2023). Customer Loyalty Prediction for Hotel Industry Using Machine Learning Approach. International Journal on Informatics Visualization, 7(3), 695–703. https://doi.org/10.30630/joiv.7.3.1335
Innork, K., Polpinij, J., Namee, K., Kaenampornpan, M., Saisangchan, U., & Wiangsamut, S. (2023). A Comparative Study of Multi-class Sentiment Classification Models for Hotel Customer Reviews. In 4th Research, Invention, and Innovation Congress: Innovative Electricals and Electronics: Innovation for Better Life, RI2C 2023 (pp. 88–92). https://doi.org/10.1109/RI2C60382.2023.10355942
Jaouhar, E. M., El Kafhali, S., & Saadi, Y. (2022). A Study of Machine Learning Based Approach for Hotels’ Matching. In Lecture Notes in Networks and Systems: Vol. 489 LNNS (pp. 373–384). https://doi.org/10.1007/978-3-031-07969-6_28
Kong, C., Ren, S., Wang, H., & Zhou, H. (2024). A Study on Esports Hotel Price Prediction Based on Random Forest Model. In 2024 IEEE 2nd International Conference on Image Processing and Computer Applications, ICIPCA 2024 (pp. 290–294). https://doi.org/10.1109/ICIPCA61593.2024.10709221
Kozlovskis, K., Liu, Y., Lace, N., & Meng, Y. (2023). Application of Machine Learning Algorithms To Predict Hotel Occupancy. Journal of Business Economics and Management, 24(3), 594–613. https://doi.org/10.3846/jbem.2023.19775
Kumar, M., Kumar, C., Kumar, N., & Kavitha, S. (2024). Efficient Hotel Rating Prediction from Reviews Using Ensemble Learning Technique. Wireless Personal Communications, 137(2), 1161–1187. https://doi.org/10.1007/s11277-024-11457-w
Li, X., & Liu, C. (2020). Comparison of Machine Learning Models for Sentimental Analysis of Hotel Reviews. In IOP Conference Series: Materials Science and Engineering (Vol. 806, Issue 1). https://doi.org/10.1088/1757-899X/806/1/012029
Nassif, A. B., Darya, A. M., & Elnagar, A. (2022). Empirical Evaluation of Shallow and Deep Learning Classifiers for Arabic Sentiment Analysis. ACM Transactions on Asian and Low-Resource Language Information Processing, 21(1). https://doi.org/10.1145/3466171
Parikh, S. N., Shah, J., Sutaria, K., & Vala, B. (2023). Theoretical Evaluation of Machine Learning Approaches for Hotel Recommendation. In Proceedings - 5th International Conference on Smart Systems and Inventive Technology, ICSSIT 2023 (pp. 1130–1137). https://doi.org/10.1109/ICSSIT55814.2023.10061074
Parikh, S. N., Shah, J., Sutaria, K., & Vala, B. (2024). Machine Learning Approaches for Hotel Recommendation. In AIP Conference Proceedings (Vol. 3107, Issue 1). https://doi.org/10.1063/5.0209059
Patel, A., Shah, N., Parul, V. B., & Suthar, K. S. (2023). Hotel Recommendation using Feature and Machine Learning Approaches: A Review. In Proceedings - 5th International Conference on Smart Systems and Inventive Technology, ICSSIT 2023 (pp. 1144–1149). https://doi.org/10.1109/ICSSIT55814.2023.10061034
Prabha, R., Senthil, G. A., Nisha, A. S. A., Snega, S., Keerthana, L., & Sharmitha, S. (2022). Comparison of Machine Learning Algorithms for Hotel Booking Cancellation in Automated Method. In 2022 1st International Conference on Computer, Power and Communications, ICCPC 2022 - Proceedings (pp.413–418). https://doi.org/10.1109/ICCPC55978.2022.10072135
Qureshi, S., & Menezes, J. (2023). Prediction of Hotel Booking Cancellation Using Machine Learning Algorithms. In IET Conference Proceedings (Vol. 2023, Issue 44, pp. 140–145). https://doi.org/10.1049/icp.2024.0914
Saputro, P. H., & Nanang, H. (2021). Exploratory Data Analysis & Booking Cancelation Prediction on Hotel Booking Demands Datasets. Journal of Applied Data Sciences, 2(1), 40–56. https://doi.org/10.47738/jads.v2i1.20
Sharma, H., & Aggarwal, A. G. (2020). What factors determine reviewer credibility?: An econometric approach validated through predictive modeling. Kybernetes, 49(10), 2547–2567. https://doi.org/10.1108/K-08-2019-0537
Shifullah, K., Rakibullah, H. M., Islam, N., Raihan, H., Iqbal, M. A., Ziaul Karim, D., & Rasel, A. A. (2022). Classification of Hotel Reviews Using Sentiment Analysis and Machine Learning. In Proceedings of 2022 25th International Conference on Computer and Information Technology, ICCIT 2022 (pp. 710–715). https://doi.org/10.1109/ICCIT57492.2022.10054884
Singh, I. (2022). Dynamic Pricing using Reinforcement Learning in Hospitality Industry. In IBSSC 2022 - IEEE Bombay Section Signature Conference. https://doi.org/10.1109/IBSSC56953.2022.10037523
Taherkhani, L., Daneshvar, A., Amoozad Khalili, H., & Sanaei, M. R. (2023). Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm. Advances in Civil Engineering, 2023. https://doi.org/10.1155/2023/6029121
Trivedi, S. K., Singh, A., & Malhotra, S. K. (2023). Prediction of polarities of online hotel reviews: an improved stacked decision tree (ISD) approach. Global Knowledge, Memory and Communication, 72(8–9), 765–778. https://doi.org/10.1108/GKMC-12-2021-0197
Yoo, M., Singh, A. K., & Loewy, N. (2024). Predicting hotel booking cancelation with machine learning techniques. Journal of Hospitality and Tourism Technology, 15(1), 54–69. https://doi.org/10.1108/JHTT-07-2022-0227
Zhao, Z., Zhou, W., Qiu, Z., Li, A., & Wang, J. (2022). Research on Ctrip Customer Churn Prediction Model Based on Random Forest. In Lecture Notes on Data Engineering and Communications Technologies (Vol. 107, pp. 511–523). https://doi.org/10.1007/978-3-030-92632-8_48