Algoritma K-Means Clustering Untuk Pengelompokan Ayat Al Quran Pada Terjemahan Bahasa Indonesia

Clustering process can make the process of grouping data so that the data in the same cluster have high similarity with the data in the same cluster. One of the clustering algorithm that is widely used is the K-Means because it has advantages such as simple, efficient, easy to understand and easy to apply. Grouping paragraph dealing with similar themes will allow users to find a theme in the Qur'an. This study aims to produce an information system that can perform grouping Quran with K-Means method. This research was conducted with a pre-processing stage process for text data, weighting by TFIDF, grouping data with K-Means clustering, labeling data for keywords. The resulting system is able to display a verse in groups associated with the keyword. The test results by using the index on the silhouette of Surah Al Fatihah generate positive value of 0.336 which means that the data in the right group, while the frequency of keywords versus the amount of data to produce a percentage of 53%, which means the keyword represents half of the data in the cluster. Tests also showed that the test results silhouette will be directly proportional to the number of clusters and inversely proportional to the number of data dimensions. To increase the value of testing required centroid method for early elections, the reduction of data dimensions and methods of measurement of distance and similarity.
Article Metrics:
- Abbas, N.H, 2009. Quran ‘Search for a Concept’ Tool and Website, Thesis Master of Science, The University of Leeds.
- Aggarwal C.C, Zhai C, 2012. Mining Text Data, Springer, New York.
- Ahlgren, P. Colliander, C., 2009. Document-document similarity approaches and science mapping : Experimental comparison of five approaches. Journal of Informetrics 3. 49-63.
- Ahmad, O., 2013. A Survey of Searching and Information Extraction on a Classical Text Using Ontology-based semantics modeling: A Case of Quran. Life Science Journal.
- Alghamdi, H.M., 2014. Arabic Web Pages Clustering And Annotation Using Semantic Class Features, Journal of King Saud University – Computer and Information Sciences 26, 388–397.
- Arifin, A.Z, Mahendra I., Ciptaningtyas H., 2010. Enhanced Confix Stripping Stemmer And Ants Algorithm For Classifying News Document In Indonesian Language, The 5th International Conference on Information & Communication Technology and Systems, pp 149-158.
- Atwell, E., Dukes, K., Sharaf, A.-B., Louw, N. H. B., Shawar, B. A., McEnery, T., et al. 2010. Understanding the Quran: A new Grand Challenge for Computer Science and Artificial Intelligence. Paper presented at the British Computer Society Workshop, Edinburgh.
- Darawaty, I, 2010. Intelegent Searching using Association Analysis for law Documents of Indonesian Government, Second International Conference on Advances in Computing, Control and Telecomunication Technologies, pp 122-124.
- Ksasbeh M.Z., 2009. Using Ontology to Define the Structure of the Holy Quran, 4th International Conference on Information Technology, Amman.
- Larose, D.T., 2005. Discovering Knowledge in Data : An Introduction to Data Mining, Wiley-Interscience, New Jersey.
- Liu B., 2007. Web Data Mining, Springer, New York.
- Manning, C.D., 2008. Introduction to Information Retrieval, Cambridge University Press, New York.
- Mardia, K.V., Kent, J.T., Bibby, J.M., 1979. Multivariate Analysis. Academic Press, London.
- Pulukadang D.R, 2014. Pendekataan Clustering untuk Pengelolaan Pengetahuan pada Sistem Manajemen Pengetahuan, Tesis Magister Sistem Informasi Undip.
- Rousseeuw, P.J., 1987. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis, Journal of Computational and Applied Mathematics 20, pg 53-65.
- Steinbach, M., Karypis, G., Kumar, V., 2000. A Comparison of Document Clustering Techniques, Technical Report of University of Minnesota, Minnesota.
Last update: 2021-01-16 00:55:48
Last update: 2021-01-16 00:55:48
-
Segmentation of Leaf Spots Disease in Apple Plants Using Particle Swarm Optimization and K-means Algorithm
Anam S.. Journal of Physics: Conference Series, 127 (1), 2020. doi: 10.1088/1742-6596/1562/1/012011 -
Soil moisture clustering using the k-means clustering method in the UNS's agricultural laboratory at Jumantono
Kurniawan Y.B.. AIP Conference Proceedings, 127 , 2020. doi: 10.1063/5.0000861 -
Business trends based on news portal websites for analysis of big data using k-means clustering
Hidayat W.. 2019 International Conference on Information and Communications Technology, ICOIACT 2019, 2019. doi: 10.1109/ICOIACT46704.2019.8938413
Penulis yang mengirimkan naskah harus memahami dan menyetujui bahwa jika diterima untuk dipublikasikan, hak cipta dari artikel adalah milik JSINBIS dan Universitas Diponegoro sebagai penerbit jurnal.
Hak cipta (copyright) meliputi hak eksklusif untuk mereproduksi dan memberikan artikel dalam semua bentuk dan media, termasuk cetak ulang, foto, mikrofilm dan setiap reproduksi lain yang sejenis, serta terjemahan. Penulis mempunyai hak untuk hal-hal berikut:
- menggandakan seluruh atau sebagian materi yang dipublikasikan untuk digunakan oleh penulis sendiri sebagai bahan pengajaran di kelas atau bahan presentasi lisan dalam berbagai forum;
- menggunakan kembali sebagian atau keseluruhan materi sebagai bahan kompilasi bagi karya tulis penulis;
- membuat salinan dari bahan yang dipublikasikan untuk didistribusikan di lingkungan institusi tempat penulis bekerja.
JSINBIS dan Universitas Diponegoro serta Editor melakukan segala upaya untuk memastikan bahwa tidak ada data, pendapat atau pernyataan yang salah atau menyesatkan yang dipublikasikan di jurnal ini. Isi artikel yang diterbitkan di JSINBIS adalah tanggung jawab tunggal dan eksklusif dari masing-masing penulis.