skip to main content

Structural Correlation Patterns in Regional COVID-19 Surveillance Data and Implications for Epidemiological Monitoring

1Department of Informatics, Sumbawa University of Technology, Indonesia

2Department of Electrical Engineering, Kookmin University, South Korea

Received: 27 Aug 2025; Revised: 12 Feb 2026; Accepted: 22 Feb 2026; Published: 24 Feb 2026.
Open Access Copyright (c) 2026 The authors. Published by Department of Informatics Universitas, Diponegoro
Creative Commons License This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Citation Format:
Abstract

The Covid-19 pandemic has had a significant impact on the health sector in various regions, including Kabupaten Sumbawa. This study aims to analyze relationships among attributes in the Covid-19 dataset using the Correlation Matrix algorithm within the CRISP-DM methodology. The dataset was obtained from the official website of the Government of Kabupaten Sumbawa, comprising 10,573 records, of which 405 were cleaned after the data cleaning process. The analysis was conducted using RapidMiner 9.9 software. The findings indicate a very strong correlation between the attributes KONTAK ERAT-DISCARDE, SUSPEK-DISCARDE, and KONFIRMASI-MENINGGAL DUNIA with the increase in total Covid-19 cases. In addition, a significant negative correlation was observed between the attribute PP-MASIH KARANTINA and the number of deaths. Furthermore, an almost perfect correlation was found between PROBABLE-DISCARDE and PROBABLE-MENINGGAL. Based on these findings, it is recommended that the government prioritize monitoring cases before they are declared discarded and strengthen the quarantine system for travelers. This study provides a data-driven foundation for formulating evidence-based pandemic response policies.

Fulltext View|Download
Keywords: Covid-19, correlation matrix, data mining, dataset attributes

Article Metrics:

  1. Y. R. T. Hutagaol, R. P. P. Sinurat, and S. M. Shalahuddin, “Strategi Penguatan Keuangan Negara Dalam Menghadapi Ancaman Resesi Global 2023 Melalui Green Economy [Strategy to Strengthen State Finances in Facing the Threat of a Global Recession in 2023 Through a Green Economy],” Jurnal Pajak dan Keuangan Negara (PKN), vol. 4, no. 1S, pp. 378–385, Dec. 2022, doi: 10.31092/jpkn.v4i1S.1911
  2. S. Sumarno, K. Karsim, D. Dwiyanto, F. Ekobelawati, and F. Christian, “Kerja jarak jauh dan produktivitas karyawan: Mengevaluasi dampak jangka panjang dari telecommuting pasca Covid-19 [Remote work and employee productivity: Evaluating the long-term impact of telecommuting post-Covid-19],” Journal of Management and Digital Business, vol. 4, no. 3, pp. 775–786, Dec. 2024, doi: 10.53088/jmdb.v4i3.1265
  3. M. H. D. R. Yahya and I. Y. N. Nanda, “Evaluasi Program Bantual Sosial Tunai (BST) Pada Masa Pandemi COVID-19 (Studi kasus; Kelurahan Langgini Kecamatan Bangkinang Kota Kabupaten Kampar) [Evaluation of the Cash Social Assistance Program During the COVID-19 Pandemic (Case Study: Langgini Village, Bangkinang District, Kampar Regency)],” SUMUR- Jurnal Sosial Humaniora, vol. 3, no. 1, pp. 27–34, Feb. 2025, doi: 10.58794/sumur.v3i1.1341
  4. F. Sari, “Implementasi Data Mining Dalam Menganalisis Tingkat Kepuasan Pelanggan Menggunakan Metode Rough Set [Implementation of Data Mining in Analyzing Customer Satisfaction Levels Using the Rough Set Method],” Jurnal Buana Informatika, vol. 8, no. 1, Jan. 2017, doi: 10.24002/jbi.v8i1.1071
  5. L. F. Azmi and L. Zahrotun, “Implementasi Data Mining untuk Estimasi Produksi Cabai menggunakan Metode Exponential Smoothing [Implementation of Data Mining for Chili Production Estimation using the Exponential Smoothing Method],” Jurnal Buana Informatika, vol. 15, no. 01, pp. 59–68, Apr. 2024, doi: 10.24002/jbi.v15i1.8333
  6. A. A. Alya Putri and S. A. Rahmah, “Implementasi Data Mining dengan Algoritma K-Means Clustering untuk Analisis Bisnis pada Perusahaan Asuransi [Implementation of Data Mining with K-Means Clustering Algorithm for Business Analysis in Insurance Companies],” Djtechno: Jurnal Teknologi Informasi, vol. 5, no. 1, pp. 139–152, Apr. 2024, doi: 10.46576/djtechno.v5i1.4537
  7. S. Ma, Y. Huang, Y. Liu, H. Liu, Y. Chen, J. Wang, and J. Xu, “Big data-driven correlation analysis based on clustering for energy-intensive manufacturing industries,” Appl Energy, vol. 349, p. 121608, Nov. 2023, doi: 10.1016/j.apenergy.2023.121608
  8. L. Xu, Y. Wang, L. Mo, Y. Tang, F. Wang, and C. Li, “The research progress and prospect of data mining methods on corrosion prediction of oil and gas pipelines,” Eng Fail Anal, vol. 144, p. 106951, Feb. 2023, doi: 10.1016/j.engfailanal.2022.106951
  9. M. Shantal, Z. Othman, and A. A. Bakar, “A Novel Approach for Data Feature Weighting Using Correlation Coefficients and Min–Max Normalization,” Symmetry (Basel), vol. 15, no. 12, p. 2185, Dec. 2023, doi: 10.3390/sym15122185
  10. N. Nur, S. Situju, and S. Aulia Rachmini, “Data Mining dan Manajemen Pengetahuan [Data Mining and Knowledge Management],” 2024. [Online]. Available: https://www.researchgate.net/publication/388498331
  11. D.-R. Nichita, M. Dima, L. Boboc, and M.-G. Hâncean, “Data analysis evidence beyond correlation of a possible causal impact of weather on the COVID-19 spread, mediated by human mobility,” Sci Rep, vol. 14, no. 1, p. 17782, Aug. 2024, doi: 10.1038/s41598-024-67918-6
  12. N. K. Bergman and R. Fishman, “Correlations of mobility and Covid-19 transmission in global data,” PLoS One, vol. 18, no. 7, p. e0279484, Jul. 2023, doi: 10.1371/journal.pone.0279484
  13. D. Conesa, C. L. Roja, T. Gullon, A. T. Campo, C. Prats, E. A. Lacalle, and B. Echebarria, “A mixture of mobility and meteorological data provides a high correlation with COVID-19 growth in an infection-naive population: a study for Spanish provinces,” Front Public Health, vol. 12, Mar. 2024, doi: 10.3389/fpubh.2024.1288531
  14. E. Cleary et al., “Comparing lagged impacts of mobility changes and environmental factors on COVID-19 waves in rural and urban India: A Bayesian spatiotemporal modelling study,” PLOS Global Public Health, vol. 5, no. 4, p. e0003431, Apr. 2025, doi: 10.1371/journal.pgph.0003431
  15. U. Kannengiesser and J. S. Gero, “Modelling the Design of Models: An Example Using CRISP-DM,” Proceedings of the Design Society, vol. 3, pp. 2705–2714, Jul. 2023, doi: 10.1017/pds.2023.271
  16. D. Pratmanto, F. F. D. Imaniawan, and V. Maarif, “Analisis Sentimen Pada Ulasan Pengguna Aplikasi Identitas Kependudukan Digital Dengan Metode Naive Bayes Dan K-Nearest [Sentiment Analysis on User Reviews of Digital Population Identity Applications Using Naive Bayes and K-Nearest Methods],” Computatio : Journal of Computer Science and Information Systems, vol. 7, no. 2, pp. 155–166, Dec. 2023, doi: 10.24912/computatio.v7i2.26322

Last update:

No citation recorded.

Last update: 2026-02-24 14:40:12

No citation recorded.