ANALISIS PERBANDINGAN KINERJA CART KONVENSIONAL, BAGGING DAN RANDOM FOREST PADA KLASIFIKASI OBJEK: HASIL DARI DUA SIMULASI

*Yogo Aryo Jatmiko -  Badan Pusat Statistik, Indonesia
Septiadi Padmadisastra -  Universitas Padjajaran, Indonesia
Anna Chadidjah -  Universitas Padjajaran, Indonesia
Received: 19 Mar 2018; Published: 24 Jul 2019.
Open Access Copyright (c) 2019 MEDIA STATISTIKA
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Citation Format:
Article Info
Section: Articles
Language: ID
Full Text:
Statistics: 168 206
Abstract

The conventional CART method is a nonparametric classification method built on categorical response data. Bagging is one of the popular ensemble methods whereas, Random Forests (RF) is one of the relatively new ensemble methods in the decision tree that is the development of the Bagging method. Unlike Bagging, Random Forest was developed with the idea of adding layers to the random resampling process in bagging. Therefore, not only randomly sampled sample data to form a classification tree, but also independent variables are randomly selected and newly selected as the best divider when determining the sorting of trees, which is expected to produce more accurate predictions. Based on the above, the authors are interested to study the three methods by comparing the accuracy of classification on binary and non-binary simulation data to understand the effect of the number of sample sizes, the correlation between independent variables, the presence or absence of certain distribution patterns to the accuracy generated classification method. Results of the research on simulation data show that the Random Forest ensemble method can improve the accuracy of classification.

Article Metrics: