Comparison of Serial and Parallel Computation on Predicting Missing Data with EM Algorithm

Authors

  • Erna Nurmawati, Politeknik Statistika STIS
  • Robby Hasan Pangaribuan
  • Ibnu Santoso

DOI:

https://doi.org/10.20956/j.v18i1.14003

Keywords:

Parallel Computing, Missing Data, EM Algorithm

Abstract

One way to handle missing values or incomplete data is to impute them with the EM (expectation-maximization) algorithm. Because large data sets must be processed quickly, this study parallelizes a serial implementation of the EM algorithm. In the architecture of the parallel program, the controller interacts only with the EM module, while the EM module itself makes intensive use of the matrix and vector modules. Parallelization is applied with OpenMP inside the EM module, and the resulting parallel program runs faster than the serial program. Compared with two threads, running with four threads increases speedup and reduces compute time, but also reduces efficiency.
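The abstract reports that four threads give a higher speedup but a lower efficiency than two threads. A minimal sketch of how these two quantities relate is given here, assuming the conventional definitions: speedup S = T_serial / T_parallel and efficiency E = S / p for p threads. The function names and the timings are hypothetical illustrations, not measurements or code from the study.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Speedup S = T_serial / T_parallel (conventional definition, assumed here)."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("timings must be positive")
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, threads: int) -> float:
    """Efficiency E = S / p, i.e. the speedup obtained per thread."""
    if threads < 1:
        raise ValueError("threads must be at least 1")
    return speedup(t_serial, t_parallel) / threads


# Hypothetical wall-clock timings in seconds, NOT results from the study.
T_SERIAL = 12.0
T_PARALLEL = {2: 8.0, 4: 6.0}  # thread count -> parallel run time

for p, t in T_PARALLEL.items():
    s = speedup(T_SERIAL, t)
    e = efficiency(T_SERIAL, t, p)
    print(f"threads={p} time={t:.1f}s speedup={s:.2f} efficiency={e:.2f}")
```

With these made-up timings, going from two to four threads lowers the run time from 8.0 s to 6.0 s and raises the speedup from 1.50 to 2.00, yet efficiency drops from 0.75 to 0.50 because the speedup does not double when the thread count does. This is the pattern the abstract describes.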

Published

2021-09-02

Section

Research Articles