skip to main content

Optimization of Prediction for Cancellation of Hotel Room Reservation Using Decision Tree with Feature Selection and Resampling

*Eka Rahmawati  -  Information Systems, Faculty of Engineering and Informatics, Universitas Bina Sarana Informatika, Jakarta, 10450, Indonesia, Indonesia
Galih Setiawan Nurohim  -  Information Systems, Faculty of Engineering and Informatics, Universitas Bina Sarana Informatika, Jakarta, 10450, Indonesia, Indonesia
Open Access Copyright (c) 2025 Jurnal Sistem Informasi Bisnis

Citation Format:
Abstract

The hotel industry is highly competitive and faces challenges, such as fluctuating demand, intense competition, and shifting consumer preferences. One critical issue that hotels frequently encounter is the cancellation of room reservations, which disrupts operational planning and resource management and leads to significant financial losses. Accurately predicting the likelihood of reservation cancellation is essential to mitigate these negative impacts and optimize revenue management strategies. This study focuses on the development of a predictive model for hotel room reservation cancellations using a decision-tree algorithm. The Decision Tree was selected for its ability to manage complex relationships between variables and ease of interpretation, making it accessible to hotel managers without technical expertise. To enhance the performance of the model, a forward selection technique was employed to identify the most relevant features, ensuring a balance between the model complexity and predictive accuracy. Additionally, resampling techniques were applied to address class imbalance in the dataset, which is common in cancellation cases where non-cancelled reservations outnumber cancelled reservations. This study explores the prediction of hotel room reservation cancellations using a decision tree algorithm enhanced by feature selection and resampling. The model achieved an accuracy improvement to 90%, with precision and recall each increasing by 5,5% after applying these techniques. These findings suggest practical applications for improving cancellation predictions and optimizing revenue management strategies for hotels. The study provides insights into how data-driven approaches can enhance decision-making processes within the competitive hospitality industry.

Fulltext View|Download
Keywords: Hotel Reservation Cancellation; Decision Tree Algorithm; Feature Selection; Resampling Technique; Revenue Management Optimization

Article Metrics:

  1. Aldoseri, A., Al-Khalifa, K. N., & Hamouda, A. M. (2023). Re-Thinking Data Strategy and Integration for Artificial Intelligence: Concepts, Opportunities, and Challenges. In Applied Sciences (Switzerland) (Vol. 13, Issue 12). MDPI. https://doi.org/10.3390/app13127082
  2. Antonio, N., Almeida, A. de, & Nunes, L. (2017). Predicting hotel booking cancellations to decrease uncertainty and increase revenue. Tourism & Management Studies, 13(2), 25–39. https://doi.org/10.18089/tms.2017.13203
  3. Antonio, N., de Almeida, A., & Nunes, L. (2019). Big Data in Hotel Revenue Management: Exploring Cancellation Drivers to Gain Insights Into Booking Cancellation Behavior. Cornell Hospitality Quarterly, 60(4), 298–319. https://doi.org/10.1177/1938965519851466
  4. Asim Shahid, M., Alam, M. M., & Mohd Su’ud, M. (2024). A fact based analysis of decision trees for improving reliability in cloud computing. PLoS ONE, 19(12 December). https://doi.org/10.1371/journal.pone.0311089
  5. Chen, S., Ngai, E. W. T., Ku, Y., Xu, Z., Gou, X., & Zhang, C. (2023). Prediction of hotel booking cancellations: Integration of machine learning and probability model based on interpretable feature interaction. Decision Support Systems, 170. https://doi.org/10.1016/j.dss.2023.113959
  6. El-Hasnony, I. M., Barakat, S. I., Elhoseny, M., & Mostafa, R. R. (2020). Improved Feature Selection Model for Big Data Analytics. IEEE Access, 8, 66989–67004. https://doi.org/10.1109/ACCESS.2020.2986232
  7. Hamdard, Asst. P. M. S., & Lodin, Asst. P. H. (2023). Effect of Feature Selection on the Accuracy of Machine Learning Model. International Journal Of Multidisciplinary Research And Analysis, 06(09). https://doi.org/10.47191/ijmra/v6-i9-66
  8. Herrera, A., Arroyo, Á., Jiménez, A., & Herrero, Á. (2024). Forecasting hotel cancellations through machine learning. Expert Systems. https://doi.org/10.1111/exsy.13608
  9. Kappen, T. H., van Klei, W. A., van Wolfswinkel, L., Kalkman, C. J., Vergouwe, Y., & Moons, K. G. M. (2018). Evaluating the impact of prediction models: lessons learned, challenges, and recommendations. Diagnostic and Prognostic Research, 2(1). https://doi.org/10.1186/s41512-018-0033-6
  10. Khushi, M., Shaukat, K., Alam, T. M., Hameed, I. A., Uddin, S., Luo, S., Yang, X., & Reyes, M. C. (2021). A Comparative Performance Analysis of Data Resampling Methods on Imbalance Medical Data. IEEE Access, 9, 109960–109975. https://doi.org/10.1109/ACCESS.2021.3102399
  11. Kim, A., & Jung, I. (2023). Optimal selection of resampling methods for imbalanced data with high complexity. PLoS ONE, 18(7 July). https://doi.org/10.1371/journal.pone.0288540
  12. Kuo, C.-Y., Tung Chang, Y., & Chang, Y.-T. (2023). Introduction of spatial analysis approaches for Emergency Medical Services data: Using the Kaggle dataset as an example Paramedicine and Emergency Response Introduction of spatial analysis approaches for Emergency Medical Ser-vices data: Using the Kaggle dataset as an example. https://doi.org/10.30216/JPER.202310_(4).0004
  13. Kuptametee, C., & Aunsri, N. (2022). A review of resampling techniques in particle filtering framework. In Measurement: Journal of the International Measurement Confederation (Vol. 193). Elsevier B.V. https://doi.org/10.1016/j.measurement.2022.110836
  14. Maharana, K., Mondal, S., & Nemade, B. (2022). A review: Data pre-processing and data augmentation techniques. Global Transitions Proceedings, 3(1), 91–99. https://doi.org/10.1016/j.gltp.2022.04.020
  15. Mienye, I. D., & Jere, N. (2024). A Survey of Decision Trees: Concepts, Algorithms, and Applications. IEEE Access, 12, 86716–86727. https://doi.org/10.1109/ACCESS.2024.3416838
  16. Sarailidis, G., Wagener, T., & Pianosi, F. (2023). Integrating scientific knowledge into machine learning using interactive decision trees. Computers and Geosciences, 170. https://doi.org/10.1016/j.cageo.2022.105248
  17. Shahrizan, M., Rahman, A., Azliana, N., Jamaludin, A., Zainol, Z., Mohd, T., & Sembok, T. (2023). The Application of Decision Tree Classification Algorithm on Decision-Making for Upstream Business. IJACSA) International Journal of Advanced Computer Science and Applications, 14(8), 2023. https://dx.doi.org/10.14569/IJACSA.2023.0140873
  18. Yoo, M., Singh, A. K., & Loewy, N. (2024). Predicting hotel booking cancelation with machine learning techniques. Journal of Hospitality and Tourism Technology, 15(1), 54–69. https://doi.org/10.1108/JHTT-07-2022-0227

Last update:

No citation recorded.

Last update: 2025-06-14 04:46:23

No citation recorded.