Comparison of Serial and Parallel Computation on Predicting Missing Data with EM Algorithm
DOI:
https://doi.org/10.20956/j.v18i1.14003Keywords:
Parallel Computing, Missing Data, EM AlgorithmAbstract
One way to deal with the presence of missing value or incomplete data is to impute the data using EM Algorithm. The need for large and fast data processing is necessary to implement parallel computing on EM algorithm serial program. In the parallel program architecture of EM Algorithm in this study, the controller is only related to the EM module whereas the EM module itself uses matrix and vector modules intensively. Parallelization is done by using OpenMP in EM modules which results in faster compute time on parallel programs than serial programs. Parallel computing with a thread of 4 (four) increases speed up, reduces compute time, and reduces efficiency when compared to parallel computing by the number of threads 2 (two).
References
Andri Lesmana, W., Maria Angela, K., Sri, M. 2017. Komputasi Paralel Untuk Pengolahan Prestasi Akademik Mahasiswa. Jurnal Teknologi Elektro 8.
Dempster, A.P., Laird, N.M., Rubin, D.B. 1977. Maximum Likelihood from Incomplete Data Via the Em Algorithm. Journal of the Royal Statistical Society: Series B 39, 1-22.
Lumbanraja, F.R., Aristoteles, A., Muttaqina, N.R. 2020. Analisa Komputasi Paralel Mengurutkan Data Dengan Metode Radix Dan Selection. Jurnal Komputasi 8, 77-93.
Lumbanraja, F.R., Aristoteles, A., Nadila Rizqi, M. 2020. Analisa Komputasi Paralel Mengurutan Data Dengan Metode Radix Dan Selection. Jurnal Komputasi 8, 77-93.
Morrison, D.F., Marshall, L.C., Sahlin, H.L., 1976. Multivariate Statistical Methods. McGraw–Hill Book Company, New York.
Mulya, M., Abdiansah, A. 2013. Penerapan Multi-Threading Untuk Meningkatkan Kinerja Pengolahan Citra Digital. Jurnal Generic 8, 230-237.
Musil, C.M., Warner, C.B., Yobas, P.K., Jones, S.L. 2002. A Comparison of Imputation Techniques for Handling Missing Data. Western Journal of Nursing Research 24, 815-829.
Nova, M., Mukid, M. 2011. Pendugaan Data Hilang Dengan Menggunakan Data Augmentation. Jurnal Media Statistika 4, 73-86.
Nurmawati, E., 2006. Rekayasa Program Penanganan Data Hilang (Dengan Metode Listwise, Pairwise, Dan Expectation-Maximization Algorithm). Sekolah Tinggi Ilmu Statistik, Jakarta.
Rabbani, H., Gunawan, P.H. 2018. Kinerja Openmp Pada Pengolahan Citra Dengan Model Curvature Motion. Indonesia Journal on Computing 3, 97-102.
Rastogi, S., Zaheer, H., Year. Significance of Parallel Computation over Serial Computation. pp. 2307-2310.
Richard A. Johnson, D.W.W., 2002. Applied Multivariate Statistical Analysis Fifth Edition. Prentice Hall, Upper Saddle River, New Jersey, p.^pp. 235-255.
Santoso, S., 2002. Spss Statistik Multivariat. Elex Media Komputindo, Jakarta.
Yusman, M., Aristoteles, A., Irawati, A.R. 2012. Analisis Komputasi Paralel Dan Serial Pada Algoritma Merge Sort. Jurnal Sains MIPA 18.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 Author and publisher
This work is licensed under a Creative Commons Attribution 4.0 International License.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Jurnal Matematika, Statistika dan Komputasi is an Open Access journal, all articles are distributed under the terms of the Creative Commons Attribution License, allowing third parties to copy and redistribute the material in any medium or format, transform, and build upon the material, provided the original work is properly cited and states its license. This license allows authors and readers to use all articles, data sets, graphics and appendices in data mining applications, search engines, web sites, blogs and other platforms by providing appropriate reference.