Implementation of the Ensemble Machine Learning Algorithm for Student Dropout Prediction Analysis

Winarsih Winarsih; Heri Sutanto; Aris Puji Widodo

doi:10.14710/vol15iss2pp159-166

DOI: https://doi.org/10.14710/vol15iss2pp159-166

Implementation of the Ensemble Machine Learning Algorithm for Student Dropout Prediction Analysis

*Winarsih Winarsih - Doctoral Program of Information System, School of Post Graduate Studies, Diponegoro University, Jl. Imam Bardjo S.H., No. 5, Pleburan, Semarang, Indonesia 50241, Indonesia

Heri Sutanto - Department of Physics, Faculty of Science and Mathematics, Diponegoro University, Jl. Prof. Soedarto, S.H., Tembalang, Semarang, Indonesia 50275, Indonesia

Aris Puji Widodo - Department of Informatics, Faculty of Science and Mathematics, Diponegoro University, Jl. Prof. Soedarto, S.H., Tembalang, Semarang, Indonesia 50275, Indonesia

Citation Format:

Abstract

Educational Data Mining provides an effective approach to tackle numerous issues within the education sector, including the capacity to perform predictive analyses regarding student attrition based on academic information. In this research, data from the Open University Learning Analytics dataset (OULAD), which is publicly accessible, has been employed, which encompasses student information collected during online learning. We apply various Machine Learning models, including Decision Trees, Naïve Bayes, Logistic Regression, and ensemble approaches like Random Forest and AdaBoost. Among the models tested, Random Forest (RF) achieved the highest accuracy of 89.37%, along with a precision of 89.57% and a recall of 93.86%, using the data splitting approach. When employing an alternative evaluation model, specifically K-Fold Cross Validation, the maximum F1 score achieved was 9.45%. In summary, the ensemble machine learning algorithm, specifically Random Forest (RF), exhibited strong performance in predicting student academic achievement quality.

Fulltext View|Download

Keywords: Bi-criteria scheduling; Machine Learning; Student dropout prediction; Data Mining; Random Forest; Open University Learning Analytics (OULAD)

Article Metrics:

Article Info

Section: Research Articles

Language : EN

In Vol 15, No 2 (2025): Volume 15 Number 2 Year 2025

Recent articles

The 7D BIM Modeling for Building Asset Data Management Using Revit, COBIe Extension, and QR code Machine Learning Methods for Academic Achievement Prediction: A Bibliometric Review Smart Village Tourism: Barriers and Facilitators in Adopting a Smart City Perspective Using SWOT Analysis Preface JSINBIS_15 (2) 2025 Back matter JSINBIS_15 (2) 2025 Front matter JSINBIS_15 (2) 2025 More recent articles

Most cited articles

Aplikasi Penentuan Tarif Listrik Menggunakan Metode Fuzzy Sugeno Sistem Pemilihan Perumahan dengan Metode Kombinasi Fuzzy C-Means Clustering dan Simple Additive Weighting APLIKASI DIAGNOSA GEJALA DEMAM PADA BALITA MENGGUNAKAN METODE CERTAINTY FACTOR (CF) DAN JARINGAN SYARAF TIRUAN (JST) Penggunaan Jaringan Syaraf Tiruan Backpropagation Untuk Seleksi Penerimaan Mahasiswa Baru Pada Jurusan Teknik Komputer Di Politeknik Negeri Sriwijaya Sistem Informasi Geografis Berbasis Web untuk Pemetaan Sebaran Alumni Menggunakan Metode K-Means More cited articles

Al-Zawqari, A., Peumans, D., & Vandersteen, G. (2022). A flexible feature selection approach for predicting students’ academic performance in online courses. Computers and Education: Artificial Intelligence, 3(November), 100103. https://doi.org/10.1016/j.caeai.2022.100103
Alhothali, A., Albsisi, M., Assalahi, H., & Aldosemani, T. (2022). Predicting Student Outcomes in Online Courses Using Machine Learning Techniques: A Review. Sustainability (Switzerland), 14(10), 1–23. https://doi.org/10.3390/su14106199
Bagunaid, W., Chilamkurti, N., & Veeraraghavan, P. (2022). AISAR: Artificial Intelligence-Based Student Assessment and Recommendation System for E-Learning in Big Data. Sustainability (Switzerland), 14(17). https://doi.org/10.3390/su141710551
Barros, T. M., Neto, P. A. S., Silva, I., & Guedes, L. A. (2019). Predictive models for imbalanced data: A school dropout perspective. Education Sciences, 9(4). https://doi.org/10.3390/educsci9040275
Daza Vergaray, A., Miranda, J. C. H., Cornelio, J. B., López Carranza, A. R., & Ponce Sánchez, C. F. (2023). Predicting the depression in university students using stacking ensemble techniques over oversampling method. Informatics in Medicine Unlocked, 41(June). https://doi.org/10.1016/j.imu.2023.101295
Hameed, M., & Akhtar, N. (2021). Student Performance Prediction in Intelligent E-Learning for Tertiary Education How to Cite: Mustafa Hameed and Nadeem Akhtar (2021). Student Performance Prediction in Intelligent E-Learning for Tertiary Education. International Journal of Computational I. International Journal of Computational Intelligence in Control, 13(2), 293–299
Ika Alfina, Rio Mulia, Mohamad Ivan Fanany, Y. E. (1999). Hate Speech Detection in the Indonesian Language: A Dataset and Preliminary Study. 473–481
Jawad, K., Shah, M. A., & Tahir, M. (2022). Students’ Academic Performance and Engagement Prediction in a Virtual Learning Environment Using Random Forest with Data Balancing. Sustainability (Switzerland), 14(22). https://doi.org/10.3390/su142214795
Khanday, A. M. U. D., Rabani, S. T., Khan, Q. R., & Malik, S. H. (2022). Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques. International Journal of Information Management Data Insights, 2(2), 100120. https://doi.org/10.1016/j.jjimei.2022.100120
Mastour, H., Dehghani, T., Moradi, E., & Eslami, S. (2023). Early prediction of medical students’ performance in high-stakes examinations using machine learning approaches. Heliyon, 9(7), e18248. https://doi.org/10.1016/j.heliyon.2023.e18248
Renò, V., Stella, E., Patruno, C., Capurso, A., Dimauro, G., & Maglietta, R. (2022). Learning Analytics: Analysis of Methods for Online Assessment. Applied Sciences (Switzerland), 12(18), 1–10. https://doi.org/10.3390/app12189296
Rodríguez-Hernández, C. F., Musso, M., Kyndt, E., & Cascallar, E. (2021). Artificial neural networks in academic performance prediction: Systematic implementation and predictor evaluation. Computers and Education: Artificial Intelligence, 2(December 2020). https://doi.org/10.1016/j.caeai.2021.100018
Sawangarreerak, S., & Thanathamathee, P. (2020). Random forest with sampling techniques for handling imbalanced prediction of university student depression. Information (Switzerland), 11(11), 1–13. https://doi.org/10.3390/info11110519
Taamneh, M. M., Taamneh, S., Alomari, A. H., & Abuaddous, M. (2023). Analyzing the Effectiveness of Imbalanced Data Handling Techniques in Predicting Driver Phone Use. Sustainability (Switzerland), 15(13). https://doi.org/10.3390/su151310668
Tsai, J. K., & Hung, C. H. (2021). Improving adaboost classifier to predict enterprise performance after covid-19. Mathematics, 9(18), 1–10. https://doi.org/10.3390/math9182215

Last update:

No citation recorded.

Last update: 2026-01-06 07:44:02

No citation recorded.

Authors who submit the manuscripts to Journal JSINBIS must understand and agree that if the manuscript is accepted for publication, the copyright of the article belongs to JSINBIS and Diponegoro University as the journal publisher.

Copyright includes the exclusive right to reproduce and provide articles in all forms and media, including reprints, photographs, microfilm and any other similar reproductions, as well as translations. The author reserves the rights to the following:

Reproduce all or part of published material for use by the author himself as teaching material in class or oral presentation material in various forums;
Reuse part or all of the material as compilation material for the author's written work;
Make copies of published materials for distribution within the institution where the author works.

JSINBIS and Diponegoro University and the Editors make every effort to ensure that no false or misleading data, opinions or statements are published in this journal. The content of articles published in JSINBIS is the sole and exclusive responsibility of the respective authors.

Copyright transfer agreement can be found here: [Copyright transfer agreement in doc] and [Copyright transfer agreement in pdf].