MULTICLASS CLASSIFICATION OF MARKETPLACE PRODUCTS WITH MACHINE LEARNING

Farhan Satria Aditama; Dewi Krismawati; Setia Pramana

doi:10.14710/medstat.17.1.25-35

DOI: https://doi.org/10.14710/medstat.17.1.25-35

MULTICLASS CLASSIFICATION OF MARKETPLACE PRODUCTS WITH MACHINE LEARNING

*Farhan Satria Aditama - Directorate of Statistical Dissemination, BPS Statistics Indonesia, Jakarta, Indonesia, Indonesia

Dewi Krismawati - Directorate of Analysis and Statistics Development, BPS Statistics Indonesia, Jakarta, Indonesia, Indonesia

Setia Pramana - Politeknik Statistika STIS, Jakarta, Indonesia, Indonesia

Citation Format:

Abstract

The use of marketplace data and machine learning in the collection of commodity data can provide an opportunity for Statistics Indonesia to complete the commodity directories for various surveys. This research adopts machine learning to train a product classification model based on existing datasets to predict whether a new dataset falls into which KBKI category. The dataset contains more than 32,000 products from 26 classes consisting of product data from two biggest marketplaces in Indonesia. Algorithms used for classification include Random Forests (RF), Support Vector Machines (SVM), and Multinomial Naive Bayes (MNB). Results indicate that MNB is the most effective algorithm when considering the trade-off between accuracy and processing time. MNB achieved the highest micro-average F1 scores, with 91.8% for Tokopedia and 95.4% for Shopee, and has the fastest execution time approximately 5 seconds.

Note: This article has supplementary file(s).

Fulltext View|Download | Hasil Riset

Tidak berjudul

Subject
Type	Hasil Riset
	Download (11KB) Indexing metadata

Instrumen Riset

Tidak berjudul

Subject
Type	Instrumen Riset
	Download (75KB) Indexing metadata

Keywords: Machine Learning, Marketplace, Multiclass Classification.

Article Metrics:

Article Info

Section: Articles

Language : EN

In Vol 17, No 1 (2024): Media Statistika