skip to main content

Increasing the Accuracy of Random Forest Algorithm Using Bagging Techniques in Cases of Stunting Toddlers

*Amir Ali  -  Medical Record and Health Information, Dr. Soetomo Hospital Foundation College Of Health Sciences, Surabaya 60286, Indonesia, Indonesia
Purwanto Purwanto  -  Department of Information Systems, School of Post Graduate Studies, Diponegoro University, Jl. Imam Bardjo S.H., No. 5, Pleburan, Semarang, Indonesia 50241, Indonesia
Mundakir Mundakir  -  Department of Health Faculty, Muhammadiyah Surabaya University, Surabaya, Indonesia 60113, Indonesia
Open Access Copyright (c) 2025 Jurnal Sistem Informasi Bisnis

Citation Format:
Abstract

Increasing the accuracy value can be increased by using other algorithms. Increasing the accuracy value of a classification algorithm, the level of success of the algorithm's prediction is more precise and appropriate in providing its label. The purpose of the research is look performance of accuracy value for prediction with bagging algorithm. This research use random forest algorithm and bagging algorithm used for optimization. 12 data whose position is far from other data. 12 data deviate from the data pattern and are outliers. With z-score process, it will be processed to eliminate outlier data. After removing the outlier data, the data clean is 137 toddler data. After removing outliers and standardizing the data, the accuracy value obtained was 71% up to 100th accuracy with random forest algorithm. Optimization of a bagging algorithm to predict stunting in a dataset of toddlers that has been acquired and assessed its performance. This can be seen from the optimization of prediction results up to the 100th iteration, where the prediction accuracy results were 80.67%. Using the Random Forest algorithm and bagging techniques, the prediction of stunting in toddlers works well. Optimization of prediction results up to the 100th iteration, where the prediction accuracy results were 80.67%.

Fulltext View|Download
Keywords: Random Forest; Bagging algorithm; Outlier removal; Stunting prediction; Accuracy Improvement

Article Metrics:

  1. Abhaya, A., & Patra, B. K. (2023). An efficient method for autoencoder based outlier detection. Expert Systems with Applications, 213, 118904. https://doi.org/10.1016/j.eswa.2022.118904
  2. Aggarwal, V., Gupta, V., Singh, P., Sharma, K., & Sharma, N. (2019). Detection of spatial outlier by using improved Z-score test. In Proceedings of the International Conference on Trends in Electronics and Informatics (ICOEI 2019) (pp. 788–790). https://doi.org/10.1109/icoei.2019.8862582
  3. Bhagat, H. V., & Singh, M. (2022). NMVI: A data-splitting based imputation technique for distinct types of missing data. Chemometrics and Intelligent Laboratory Systems, 223. https://doi.org/10.1016/j.chemolab.2022.104518
  4. Curioso, I., et al. (2023). Addressing the curse of missing data in clinical contexts: A novel approach to correlation-based imputation. Journal of King Saud University - Computer and Information Sciences, 35(6). https://doi.org/10.1016/j.jksuci.2023.101562
  5. Gan, G., & Ng, M. K. P. (2017). K-means clustering with outlier removal. Pattern Recognition Letters, 90, 8–14. https://doi.org/10.1016/j.patrec.2017.03.008
  6. Gubbala, K., Kumar, M. N., & Sowjanya, A. M. (2023). AdaBoost based Random forest model for emotion classification of facial images. MethodsX, 11, 102422. https://doi.org/10.1016/j.mex.2023.102422
  7. Huda, S., et al. (2018). An ensemble oversampling model for class imbalance problem in software defect prediction. IEEE Access, 6, 24184–24195. https://doi.org/10.1109/ACCESS.2018.2817572
  8. Jamshidi, E. J., Yusup, Y., Kayode, J. S., & Kamaruddin, M. A. (2022). Detecting outliers in a univariate time series dataset using unsupervised combined statistical methods: A case study on surface water temperature. Ecological Informatics, 69. https://doi.org/10.1016/j.ecoinf.2022.101672
  9. Jiang, F., Liu, G., Du, J., & Sui, Y. (2016). Initialization of K-modes clustering using outlier detection techniques. Information Sciences, 332, 167–183. https://doi.org/10.1016/j.ins.2015.11.005
  10. Lin, E., Lin, C. H., & Lane, H. Y. (2021). Applying a bagging ensemble machine learning approach to predict functional outcome of schizophrenia with clinical symptoms and cognitive functions. Scientific Reports, 11, 1–9. https://doi.org/10.1038/s41598-021-86382-0
  11. Lin, E., Lin, C. H., & Lane, H. Y. (2022). A bagging ensemble machine learning framework to predict overall cognitive function of schizophrenia patients with cognitive domains and tests. Asian Journal of Psychiatry, 69, 103008. https://doi.org/10.1016/j.ajp.2022.103008
  12. Liu, C. L., et al. (2024). A bagging approach for improved predictive accuracy of intradialytic hypotension during hemodialysis treatment. Computers in Biology and Medicine, 172, 108244. https://doi.org/10.1016/j.compbiomed.2024.108244
  13. Liu, L., Chin, S. P., & Tran, T. D. (2019). Reducing sampling ratios and increasing number of estimates improve bagging in sparse regression. In 2019 53rd Annual Conference on Information Sciences and Systems (CISS). https://doi.org/10.1109/CISS.2019.8692865
  14. Saifudin, A., Nabillah, U. U., Yulianti, & Desyani, T. (2020). Bagging technique to reduce misclassification in coronary heart disease prediction based on random forest. Journal of Physics: Conference Series, 1477(3), 032009. https://doi.org/10.1088/1742-6596/1477/3/032009
  15. Shrifan, N. H. M. M., Akbar, M. F., & Isa, N. A. M. (2022). An adaptive outlier removal aided k-means clustering algorithm. Journal of King Saud University - Computer and Information Sciences, 34(8), 6365–6376. https://doi.org/10.1016/j.jksuci.2021.07.003
  16. Souiden, I., Omri, M. N., & Brahmi, Z. (2022). A survey of outlier detection in high dimensional data streams. Computer Science Review, 44, 100463. https://doi.org/10.1016/j.cosrev.2022.100463
  17. Zhao, Y., Li, H., Yu, X., Ma, N., Yang, T., & Zhou, J. (2022). An independent central point OPTICS clustering algorithm for semi-supervised outlier detection of continuous glucose measurements. Biomedical Signal Processing and Control, 71, 103196. https://doi.org/10.1016/j.bspc.2021.103196

Last update:

No citation recorded.

Last update: 2025-06-14 04:37:34

No citation recorded.