Perbandingan Performa Quadratic Discriminant Analysis Klasik dan Robust pada Data Hasil Principal Component Analysis untuk Klasifikasi Jenis Kaca
DOI:
https://doi.org/10.20956/ejsa.v7i1.44982Keywords:
Quadratic Discriminant Analysis, Robust QDA, Principal Component Analysis, Multivariate Outliers, Glass IdentificationAbstract
This study aims to compare the performance of classical Quadratic Discriminant Analysis (QDA) and Robust Quadratic Discriminant Analysis (RQDA) after applying dimensionality reduction using Principal Component Analysis (PCA) on the Glass Identification Dataset. The dataset consists of eight chemical composition variables used to classify glass types based on their elemental characteristics. Prior to classification, discriminant analysis assumptions were examined, multivariate outliers were identified, and PCA was applied to address multicollinearity and enhance data stability. The PCA results indicate that the eight original variables can be reduced to five principal components, which collectively explain 93.20% of the total data variability. Classification was then performed using classical QDA and RQDA, where the latter incorporates the Minimum Covariance Determinant (MCD) estimator to obtain robust estimates of the mean vector and covariance matrix. Model performance was evaluated using a confusion matrix and the Apparent Error Rate (APER). The results show that both QDA and RQDA achieve the same classification accuracy of 63.7%, corresponding to an APER of 36.3%. These findings suggest that the application of PCA contributes to stabilizing the data structure and reducing the influence of outliers, thereby diminishing the advantage of robust estimation in this case. Nevertheless, RQDA remains a valuable alternative for classification tasks involving datasets with strong outliers or significant deviations from multivariate normality.
References
Johnson, R. A., & Wichern, D. W. Applied Multivariate Statistical Analysis. Upper Saddle River, NJ: Pearson Prentice Hall, 2007.
Sharma, S. Applied Multivariate Techniques. New York: John Wiley & Sons, 1996.
Hubert, M., & Debruyne, M. Minimum Covariance Determinant. WIREs Computational Statistics, 2(1), 36–43, 2010.
Rousseeuw, P. J., & Van Driessen, K. A Fast Algorithm for the Minimum Covariance Determinant Estimator. Technometrics, 41(3), 212–223, 1999.
Hubert, M., & Van Driessen, K. Fast and Robust Discriminant Analysis. Computational Statistics & Data Analysis, 45(2), 301–320, 2004.
Croux, C., & Dehon, C. Robust Linear Discriminant Analysis Using S-Estimators. Canadian Journal of Statistics, 29(3), 473–493, 2001.
Rencher, A. C. Methods of Multivariate Analysis. New York: John Wiley & Sons, 2002.
Asnidar, Nirwan, & Raupong. Penerapan Analisis Diskriminan Kuadratik Robust pada Klasifikasi Desa. ESTIMASI: Journal of Statistics and Its Application, 4(1), 197–205, 2024.
Martha, U. M., Vionanda, D., Permana, D., & Zilrahmi. Perbandingan Analisis Diskriminan Kuadratik dengan Analisis Diskriminan Kuadratik Robust. UNP Journal of Statistics and Data Science, 2(4), 469–474, 2024. https://doi.org/10.24036/ujsds/vol2-iss4/315
Hadi, A. S. Identifying Multiple Outliers in Multivariate Data. New York: Wiley, 1992.
Sari, P. P., Herdiani, E. T., & Sunusi, N. Outlier Detection Using Minimum Vector Variance Algorithm with Depth Function and Mahalanobis Distance. Jurnal Matematika, Statistika & Komputasi, 17(3), 418–427, 2021. https://doi.org/10.20956/j.v17i3.12629
Mattjik, A. A., & Sumertajaya, I. M. Sidik Peubah Ganda dengan Menggunakan SAS. Bogor: IPB Press, 2011.
James, G., Witten, D., Hastie, T., & Tibshirani, R. An Introduction to Statistical Learning (2nd ed.). Springer, 2021.
Downloads
Published
Issue
Section
License
Copyright
It is the author's responsibility to ensure that his or her submitted work does not infringe any existing copyright. Authors should obtain permission to reproduce or adapt copyrighted material and provide evidence of approval upon submitting the final version of a manuscript.
