Implementation of K-Means Clustering for Social Assistance Recipients with Silhouette Score Evaluation

Authors

  • Herfia Rhomadhona Politeknik Negeri Tanah Laut, Indonesia
  • Wiwik Kusrini Politeknik Negeri Tanah Laut, Indonesia
  • Winda Aprianti Politeknik Negeri Tanah Laut, Indonesia
  • Jaka Permadi Politeknik Negeri Tanah Laut, Indonesia

DOI:

https://doi.org/10.47709/brilliance.v5i1.5900

Keywords:

Data Mining, K-Means Clustering, Silhouette Coefficient, Social Assistance

Abstract

The distribution of direct social assistance continues to face several challenges, particularly regarding inaccurate targeting and unequal allocation. One of the main causes of this issue is the lack of transparency in the distribution process, where assistance is often granted to individuals with familial ties to local committee members or even government officials. As a result, the groups most in need frequently do not receive the aid they deserve. This condition is also evident in Tanjung Village, Bajuin Subdistrict, Tanah Laut Regency. The manual process of grouping prospective aid recipients contributes to inaccuracies in targeting, which in turn leads to public dissatisfaction. To address this issue, this study applies the K-Means Clustering method to group potential social assistance recipients using data from 150 individuals and three main attributes: age, occupation, and income. The method clusters the data based on the similarity of characteristics, thus supporting a more equitable and efficient identification process. The evaluation is conducted using the Silhouette Coefficient to assess the quality of clustering. The results indicate that the highest Silhouette Score is achieved at k=2k = 2k=2, with a value of 0.8278, suggesting that dividing the data into two clusters provides the most optimal configuration. The Silhouette Score tends to decrease as the number of clusters increases, confirming that adding more clusters does not necessarily improve the quality of separation.

References

Amrulloh, K., Pudjiantoro, T. H., Sabrina, P. N., & Hadiana, A. I. (2022). Comparison Between Davies-Bouldin Index and Silhouette Coefficient Evaluation Methods in Retail Store Sales Transaction Data Clusterization Using K-Medoids Algorithm. In Proceedings of the 3rd South American International Industrial Engineering and Operations Management Conference, Asuncion, (pp. 1952-1961). Paraguay.

Aprianti, W., & Permadi, J. (2018). K-means clustering untuk data kecelakaan lalu lintas jalan raya di Kecamatan Pelaihari. Jurnal Teknologi Informasi dan Ilmu Komputer, 5(5), 613-620.

BPS. (2023). Kabupaten Tanah Laut dalam Angka. Pelaihari: Badan Pusat Statistik Kabupaten Tanah Laut. Retrieved from https://tanahlautkab.bps.go.id.

Fammaldo, E., & Hakim, L. (2018). Penerapan Algoritma K-Means Clustering Untuk Pengelompokan Tingkat Kesejahteraan Keluarga Untuk Program Kartu Indonesia Pintar. Jurnal Ilmiah Teknologi Infomasi Terapan, 5(1), 23-31.

Joshi, A. P., & Patel, B. V. (2020). Data Preprocessing: The Techniques for Preparing Clean and Quality Data for Data Analytics Process. Orient.J. Comp. Sci. and Technol, 13(2-3), 78–81.

Kasim, R. J., Bahri, S., & Amir, S. (2021). Implementasi Metode K-Means Untuk Clustering Data Penduduk Miskin Dengan Systematic Random Sampling. Prosiding SISFOTEK. 5, pp. 95-101. Padang: Ikatan Ahli Informatika Indonesia (IAII).

Mulyani, H., Setiawan, R. A., & Fathi, H. (2023). Optimization Of K Value In Clustering Using Silhouette Score (Case Study: Mall Customers Data). Journal Of Information Technology And Its Utilization, 6(2), 45-50.

Nugraha, R. P., Laxmi, G. F., & Riana, F. (2024). Penerapan K-Means++ Untuk Pengelompokan Mahasiswa Berpotensi Drop Out Studi Kasus: Universitas Ibn Khaldun Bogor. JATI (Jurnal Mahasiswa Teknik Informatika), 8(3), 3493-3500.

Nur, I. M., & Abdurakhman, A. (2024). Analysis of Social Vulnerability in Java Island using K-Medoids Algorithm with Variation of Distance Measurements (Euclidean, Manhattan, Minkowski). Indonesian Journal of Artificial Intelligence and Data Mining (IJAIDM), 7(2), 467-475.

Rhomadhona, H., & Permadi, J. (2019). Klasifikasi Berita Kriminal Menggunakan Naive Bayes Classifier (NBC) dengan Pengujian K-Fold Cross Validation. Jurnal Sains dan Informatika, 5(2), 108-117.

Saraswat , P., & Raj, S. (2022). Data Pre-processing Techniques in Data Mining: A Review. International Journal of Innovative Research in Computer Science & Technology (IJIRCST), 10(1), 122-125.

Sari, S., & Utamajaya, J. N. (2022). Sistem Pendukung Keputusan Penerima Bantuan Langsung Tunai Dana Desa Menggunakan Metode Algoritma K-Means Clustering. Jurnal JUPITER, 14(1), 150-160.

Shahapure, K. R., & Nicholas, C. (2020). Cluster Quality Analysis Using Silhouette Score. International Journal of Computer Applications. 182, pp. 1-5. Sydney: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA).

Singh, A. K., Mittal, S., Malhotra, P., & Srivastava, Y. V. (2020). Clustering Evaluation by Davies-Bouldin Index(DBI) in Cereal data using K-Means. 2020 Fourth International Conference on Computing Methodologies and Communication (ICCMC), (pp. 306-310). Erode, India.

Sriaji, C. D., Zaenah, Z., Istiawan, D., Prayogi, S. Y., & Mahiruna, A. (2021). Pengelompokan Tingkat Kemiskinan Kabupaten/Kota Di Provinsi Jawa Timur Mengunakan Algoritma K-Means++. Journal of Applied Statistics and Data Mining, 2(2), 29-40.

Downloads

Published

2025-05-28

How to Cite

Rhomadhona, H., Kusrini, W., Aprianti, W., & Permadi, J. (2025). Implementation of K-Means Clustering for Social Assistance Recipients with Silhouette Score Evaluation. Brilliance: Research of Artificial Intelligence, 5(1), 136–143. https://doi.org/10.47709/brilliance.v5i1.5900