Optimizing Cluster Centroids using Hybrid Firefly-Genetic Algorithm for Village Development Clustering

Authors

  • Annisa Rahma Directorate of Services Statistics, Badan Pusat Statistik RI, Jakarta, Indonesia
  • Rani Nooraeni Departement of Statistical Computing, Politeknik Statistika STIS, Jakarta, Indonesia
  • Raditya Hizra Departement of Statistical Computing, Politeknik Statistika STIS, Jakarta, Indonesia

DOI:

https://doi.org/10.20956/j.v22i3.50288

Keywords:

Village Development Index, Cluster Analysis, Genetic Algorithm, Firefly Algorithm

Abstract

Clustering in the Building Village Index (BVI) offers an alternative approach to identifying village groupings based on numerical and categorical characteristics similarities. K-Prototype (KP) is a popular clustering algorithm for handling . This study proposes the hybridization that combines the exploitation strength of the Firefly Algorithm (FA) with the exploration capability of the Genetic Algorithm (GA) to optimize centroid initialization of KP. The results show that FA-GA hybridization enables the centroid initialization process to be faster with optimal fitness compared to either a single FA or a single GA. FGAKP is the best clustering algorithm in this study because it produces the smallest Total Cost (TC) and the largest Cluster Validity (CV) index with the most efficient centroid initialization time across all training data. The implementation of FGAKP in village grouping based on BVI indicators in North Kalimantan Province in 2024 grouped 484 villages into 5 categories based on their village development potential, achieving an 11.69% reduction in TC and a 16.75% improvement in CV compared to the standard KP without optimization.

 

Author Biography

Rani Nooraeni, Departement of Statistical Computing, Politeknik Statistika STIS, Jakarta, Indonesia

Rani Nooraeni is an associate professor at the Statistics Polytechnic STIS Jakarta, specializing in the application of big data and data science. She holds a master's degree in Statistics from Padjadjaran University. Alongside her active role in teaching within the Computational Statistics program, she serves as the Editor-in-Chief of a national affiliated with the university. Her expertise lies at the intersection of statistical analysis and cutting-edge data technologies, driving innovation and advancements in the field.

References

Abdullah, A., Deris, S., Mohamad, S., Zaiton, S., & Hashim, M., 2012. A New Hybrid Firefly Algorithm for Complex and Nonlinear Problem. Advances in Intelligent and Soft Computing (AISC), 151, 673–680. https://doi.org/10.1007/978-3-642-28765-7_81

[2] Ali, N., Othman, A., & Misran, M. H., 2014. A Review of Firefly Algorithm. Asian Research Publishing Network (ARPN) Journal of Engineering and Applied Sciences, 9(10), 1732–1736. www.arpnjournals.com

[3] Elkhechafi, M., Hachimi, H., & Elkettani, Y., 2017. A New Hybrid Firefly With Genetic Algorithm for Global Optimization. International Journal of Management and Applied Science, 3(4), 47–51. http://iraj

[4] Ezzeldin, R., Zelenakova, M., Abd-Elhamid, H. F., Pietrucha-Urbanik, K., & Elabd, S., 2023. Hybrid Optimization Algorithms of Firefly with GA and PSO for the Optimal Design of Water Distribution Networks. Water (Switzerland), 15(10), 1–15. https://doi.org/10.3390/w15101906

[5] Farahani, S. M., Abshouri, A. A., Nasiri, B., & Meybodi, M. R., 2012. Some Hybrid Models to Improve Firefly Algorithm Performance. International Journal of Artificial Intelligence, 8. https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=baf0337d48539f33150622a2bfbb89fc34cf93cd

[6] Hsu, C. C., & Chen, Y. C., 2007. Mining of Mixed Data with Application to Catalog Marketing. Expert Systems with Applications, 32(1), 12–23. https://doi.org/10.1016/j.eswa.2005.11.017

[7] Huang, T., Yin, H., & Huang, X., 2024. Improved Genetic Algorithm for Multi-Threshold Optimization in Digital Pathology Image Segmentation. Sci Rep, 14(22454), 1–21. https://doi.org/10.1038/s41598-024-73335-6

[8] Izamahendra, Y., 2025. Penerapan Metode Analisis Klaster dalam Pengelompokkan Desa di Kecamatan Pariaman Timur Berdasarkan Indeks Desa Membangun 2023. Jurnal Informatika Dan Teknik Elektro Terapan, 13(2), 204–211. https://doi.org/10.23960/jitet.v13i2.6138

[9] Janosi, A., Steinbrunn, W., Pfisterer, M., & Detrano, R., 1988. Heart Disease [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C52P4X

[10] Kaur, A., Kumar, Y., & Sidhu, J., 2024. Exploring Meta‑heuristics for Partitional Clustering : Methods , Metrics , Datasets , and Challenges. Artificial Intelligence Review, 57(10). https://doi.org/10.1007/s10462-024-10920-1

[11] Kemendesa PDTT., 2016. Permendesa Nomor 2 Tahun 2016. Diakses pada 22 November 2024, dari https://peraturan.bpk.go.id/Details/150585/permendes-pdtt-no-2-tahun-2016

[12] Kemendesa PDTT., 2024. Peringkat Prov-Kab-Kec IDM 2024. Diakses pada 2 Februari 2025, dari https://idm.kemendesa.go.id/

[13] Kemendesa PDTT., 2024. SOP IDM 2024. Diakses pada 13 Desember 2024, dari https://idm.kemendesa.go.id/view/detil/3/publikasi

[14] Nand, R., & Sharma, P., 2019. Iteration Split with Firefly Algorithm and Genetic Algorithm to Solve Multidimensional Knapsack Problems. 2019 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE 2019. https://doi.org/10.1109/CSDE48274.2019.9162422

[15] Nooraeni, R., 2015. Cluster Method Using A Combination of Cluster K-Prototype Algorithm and Genetic Algorithm for Mixed Data. Jurnal Aplikasi Statistika & Komputasi Statistik, 7(2), 81–97. https://doi.org/https://doi.org/10.34123/jurnalasks.v7i2.23

[16] Nooraeni, R., & Nurfalah, G., 2022. Kajian Penerapan Jarak Euclidean , Manhattan , Minkowski , dan Chebyshev pada Algoritma Clustering K-Prototype. Sains, Aplikasi, Komputasi Dan Teknologi Informasi, 4(2), 72–82. https://doi.org/10.30872/jsakti.v4i2.9241

[17] Nooraeni, R., Suprijadi, J., & Zulhanif., 2019. K-Prototype untuk Pengelompokan Sata Campuran. Jurnal Statistika Teori dan Aplikasi: Biomedics, Industry & Business and Social Statistics, 13(1), 9–16. https://www.researchgate.net/publication/337847692

[18] Okwu, M. O., & Tartibu, L. K., 2021. Metaheuristic Optimization: Nature-Inspired Algorithms Swarm and Computational Intelligence, Theory and Applications (Vol. 927). https://doi.org/https://doi.org/10.1007/978-3-030-61111-8

[19] Pham, D. -T., Suarez-Alvarez, M. M., & Prostov , Y. I., 2011. Random Search with K -Prototypes Algorithm for Clustering Mixed Datasets. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Science, 467(2132), 2387–2403. https://doi.org/10.1098/rspa.2010.0594

[20] Radhwani, A. M. N. A. & Algamal, Z. Y., 2021. Improving K-Means Clustering based on Firefly Algorithm. Journal of Physics: Conference Series, 1897(12004), 1–8 https://doi.org/10.1088/1742-6596/1897/1/012004

[21] Revata, S., & Rino., 2023. Implementation of Data Mining Classification of People ’ s Personalities using Naïve Bayes Algorithm. Bit-Tech, 6(1). https://doi.org/10.32877/bt.v6i1.741

[22] Rovea, O., 2014. Genetic algorithm and firefly algorithm hybrid schemes for cultivation processes modelling. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8790, 1–4. https://doi.org/10.1007/978-3-662-44994-3

[23] Samita, G. R., Wisesa, W., Setiawan, E. D., Respati K. I., & Hidayat, R., 2024. Integrasi Artificial Intelligence dan Teori Bounded Rationality dalam Mengatasi Ketidakpastian Pengambilan Keputusan Bisnis di Era Big Data. Jurnal Bisnis Dan Komunikasi Digital, 2(2), 1–12. https://doi.org/10.47134/jbkd.v2i2.3460

[24] Senthilnath, J., Omkar, S. N., & Mani, V., 2011. Clustering using firefly algorithm : Performance study. Swarm and Evolutionary Computation, 1(3), 164–171. https://doi.org/10.1016/j.swevo.2011.06.003

[25] Sihotang, H. T., Albert, M., Riandari, F., & Rendell, L., 2023. Efficient Optimization Algorithms for Various Machine Learning Tasks, Including Classification, Regression, and Clustering. Idea : Future Research, 1(1). https://doi.org/http://dx.doi.org/10.35335/idea.v1i1.3

[26] Song, J., Wang, B., & Hao, X., 2024. Optimization Algorithms and Their Applications and Prospects in Manufacturing Engineering. Materials, 17(16). https://doi.org/10.3390/ma17164093

[27] Suwirmayanti, P., Putra, I. K. G. D., & Kumara, I. N. S., 2014. Optimasi pusat cluster K-Prototype dengan algoritma genetika. Majalah Ilmiah Teknologi Elektro, 13(2). https://doi.org/10.24843/10.24843/MITE.2015.v13i02p02

[28] Wang, L., Duan, H., Liu, Z., Peng, Y., & Liu, X., 2024. Research on Modeling Method for Optimal Allocation of Wellhead Targets in Large Well Clusters. Processes, 12(8), 1–11. https://doi.org/10.3390/pr12081705

[29] Yang, X. -S., 2013. Cuckoo Search and Firefly Algorithm. https://doi.org/https://doi.org/10.1007/978-3-319-02141-6

[30] Yang, X. -S., & He, X., 2013. Firefly Algorithm : Recent Advances and Applications. Int. J. Swarm Intelligence, 1(1), 36–50. https://doi.org/https://doi.org/10.1504/IJSI.2013.055801

[31] Yang, X. -S., 2010. Nature-Inspired Metaheuristic Algorithms Second Edition. Luniver Press. https://staff.fmi.uvt.ro/~daniela.zaharie/ma2016/projects/techniques/FireflyAlgorithm/Yang_nature_book_part.pdf

[32] Yuslimah., 2023. Peran Big Data terhadap Kualitas Data Statistik Indonesia di Era Digital. Madani: Jurnal Ilmiah Multidisiplin, 1(11). https://doi.org/10.5281/zenodo.10402327.

[33] Zhang, W., Jiao, C., & Zhou, Q., 2025. Firefly Algorithm with Multiple Learning Ability based On Gender Difference. Sci Rep, 15(28400), 1–31. https://doi.org/10.1038/s41598-025-09523-

Downloads

Published

2026-05-14

How to Cite

Rahma, A., Nooraeni, R., & Hizra, R. (2026). Optimizing Cluster Centroids using Hybrid Firefly-Genetic Algorithm for Village Development Clustering . Jurnal Matematika, Statistika Dan Komputasi, 22(3), 761–780. https://doi.org/10.20956/j.v22i3.50288

Issue

Section

Research Articles

Most read articles by the same author(s)