BibTex Citation Data :
@article{Medstat71825, author = {Mastika Mastika and Titin Siswantining and Alhadi Bustamam}, title = {COMPARISON OF MISSING VALUE IMPUTATION USING MEAN, BAYESIAN KNN, AND NON-BAYESIAN KNN ON TEP GENE EXPRESSION DATA}, journal = {MEDIA STATISTIKA}, volume = {18}, number = {1}, year = {2025}, keywords = {Mean Absolute Error; Mean Squared Error; Normalized Root Mean Squared Error; Gaussian Process; Optimization}, abstract = {Analysis of gene expression data, particularly in cancer data, often faces challenges due to the presence of missing values. One approach to overcome this is data imputation. This study evaluates the performance of three imputation methods, namely mean imputation, K-Nearest Neighbors (KNN), and KNN with Bayesian optimization using Gaussian Process modeling, on Tumor Educated Platelets (TEP) gene expression data. Missing values were introduced using Missing Completely at Random (MCAR) gradually at levels of 5%, 10%, 15%, and up to 60%, and performance was evaluated using three metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), and Normalized Root Mean Squared Error (NRMSE). The results show that the three methods produce relatively similar performance, with differences in MAE, MSE, and NRMSE values only at a small decimal scale. Although Bayesian Optimization is expected to improve the accuracy of KNN, the resulting improvement on this dataset is not significant. These findings indicate that simple imputation such as the average and KNN-based methods still provide competitive results on TEP data with data characteristics that have 14,020,496 zeros out of a total of 16,512,496 existing values, which is approximately 84.91% of the total data.}, issn = {2477-0647}, pages = {61--72} doi = {10.14710/medstat.18.1.61-72}, url = {https://ejournal.undip.ac.id/index.php/media_statistika/article/view/71825} }
Refworks Citation Data :
Note: This article has supplementary file(s).
Article Metrics:
Last update:
Last update: 2025-10-16 20:53:46
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shall be assigned to Media Statistika journal and Department of Statistics, Universitas Diponegoro as the publisher of the journal. Copyright encompasses the rights to reproduce and deliver the article in all form and media, including reprints, photographs, microfilms, and any other similar reproductions, as well as translations.
Media Statistika journal and Department of Statistics, Universitas Diponegoro and the Editors make every effort to ensure that no wrong or misleading data, opinions or statements be published in the journal. In any way, the contents of the articles and advertisements published in Media Statistika journal are the sole and exclusive responsibility of their respective authors and advertisers.
The Copyright Transfer Form can be downloaded here: [Copyright Transfer Form Media Statistika]. The copyright form should be signed originally and send to the Editorial Office in the form of original mail, scanned document or fax :
Dr. Di Asih I Maruddani (Editor-in-Chief) Editorial Office of Media StatistikaDepartment of Statistics, Universitas DiponegoroJl. Prof. Soedarto, Kampus Undip Tembalang, Semarang, Central Java, Indonesia 50275Telp./Fax: +62-24-7474754Email: maruddani@live.undip.ac.id
Media Statistika
Department of Statistics, Faculty of Science and Mathematics, Universitas Diponegoro
Gedung F Lantai 3, Jalan Prof Jacub Rais, Kampus Tembalang
Semarang 50275
Indexing: