Evaluation of NMF-VAE Integrative Approach for Biclustering and Glioblastoma Biomarker Identification

Authors

  • Agatha Silalahi Universitas Indonesia
  • Titin Siswantining Universitas Indonesia

DOI:

https://doi.org/10.20956/j.v21i3.43158

Keywords:

Glioblastoma, Biclustering, Non-negative Matrix Factorization, Variational Autoencoder, Biomarker

Abstract

Glioblastoma (GBM) represents the most aggressive primary brain tumor with poor prognosis. This research develops a novel computational framework that merges the strengths of Non-negative Matrix Factorization (NMF) with Variational Autoencoder (VAE) to improve biclustering performance in GBM gene expression data analysis. Using the GSE4290 dataset, this study analyzes gene expression data from 180 samples (136 tumors and 44 normal controls). The implementation of the NMF-VAE method successfully identified 10 biclusters with coherence values of 0.711 and variance of 0.713, validated through latent space visualization and reconstruction error analysis (15-50 MSE). Differential expression analysis identified three main potential biomarkers: ANXA2, TNFRSF1A, and NAMPT, which demonstrated significant expression changes (fold change 2.5, 2.0, and 3.0) and correlated with tumor cell proliferation, inflammation, and energy metabolism. Visualization of bicluster patterns and gene expression value distributions confirmed the consistency of these biomarkers overexpression in tumor samples. These findings provide new insights into the development of gene expression-based treatment strategies for GBM patients

References

[1] Chen, L., Zhang, Y., & Liu, B., 2021. Advances in cancer treatment: A new therapeutic target, Annexin A2. Journal of Cancer Research.

[2] Chen, X., Zhang, L., Zhang, Y., Wang, W., Zhang, S., & Zhou, H., 2018. Association of TNFRSF19 with a TNF family-based prognostic model and subtypes in gliomas using machine learning. Heliyon.

[3] Cheng, Y. & Church, G.M., 2000. Biclustering of expression data. Proceedings of the International Conference on Intelligent Systems for Molecular Biology, Vol. 8, pp. 93–103.

[4] Kim, J., Lee, J. & Park, H., 2019. Biclustering analysis of transcriptome big data identifies condition-specific microRNA targets. Nucleic Acids Research, Vol. 47, No. 9, e53. Available at: https://doi.org/10.1093/nar/gkz139.

[5] Kingma, D.P. & Welling, M., 2014. Auto-encoding variational Bayes. arXiv preprint arXiv:1312.6114. Available at: https://arxiv.org/abs/1312.6114.

[6] Lee, D.D. & Seung, H.S., 1999. Learning the parts of objects by non-negative matrix factorization. Nature, Vol. 401, No. 6755, pp. 788–791. Available at: https://doi.org/10.1038/44565.

[7] Lucena-Cacace, A., Otero-Albiol, D., Jiménez-García, M.P., Peinado-Serrano, J. & Carnero, A., 2018. NAMPT as a dedifferentiation-inducer gene: NAD+ as core axis for glioma cancer stem-like cells maintenance. Frontiers in Oncology, Vol. 8, Article 292. Available at: https://doi.org/10.3389/fonc.2018.00292.

[8] Madeira, S.C. & Oliveira, A.L., 2004. Biclustering algorithms for biological data analysis: A survey. IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 1, No. 1, pp. 24–45. Available at: https://doi.org/10.1109/TCBB.2004.2.

[9] Ostrom, Q.T., Patil, N. & Barnholtz-Sloan, J.S., 2022. CBTRUS statistical report: Primary brain and other central nervous system tumors diagnosed in the United States in 2015–2019. Neuro-Oncology, Vol. 24, Suppl. 1, pp. v1–v95.

[10] Stein-O'Brien, G.L., Arora, R. & Culhane, A.C., 2018. PatternMarkers & GWCoGAPS for novel data-driven biomarkers via whole transcriptome NMF. Bioinformatics, Vol. 34, No. 15, pp. 2589–2597.

[11] Verhaak, R.G., Hoadley, K.A., Purdom, E., Wang, V., Qi, Y. et al., 2010. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell, Vol. 143, No. 5, pp. 740–752. Available at: https://doi.org/10.1016/j.cell.2010.01.004.

[12] Wang, J., Li, Y., Wang, X., Chen, W., Sun, H. & Wang, J., 2021. Advances in cancer treatment: A new therapeutic target, Annexin A2. Journal of Cancer Research.

[13] Wang, Q., Hu, B. & Verhaak, R.G.W., 2021. Tumor evolution in glioblastoma: Molecular subtypes and implications for therapy. Cancer Cell, Vol. 39, No. 8, pp. 1129–1143. Available at: https://doi.org/10.1016/j.ccell.2021.06.002.

[14] Way, G.P., Zietz, M. & Greene, C.S., 2020. How machine learning will transform biomedicine. Cell. Available at: https://doi.org/10.1016/j.cell.2020.03.022.

[15] Zhang, H., Wang, Y., Li, X., Chen, W. & Wang, H., 2023. Tumor metabolism: A new field for the treatment of glioma. Bioconjugate Chemistry. Available at: https://doi.org/10.1021/acs.bioconjchem.4c00287.

[16] Zhang, Y., Song, H., Chen, R., Liu, Y., Li, Y. et al., 2012. Annexin A2 as a prognostic biomarker in glioma patients. Cancer Gene Therapy. Available at: https://doi.org/10.1007/s11060-012-0973-6.

Downloads

Published

2025-05-14

How to Cite

Silalahi, A., & Siswantining, T. (2025). Evaluation of NMF-VAE Integrative Approach for Biclustering and Glioblastoma Biomarker Identification. Jurnal Matematika, Statistika Dan Komputasi, 21(3), 684–697. https://doi.org/10.20956/j.v21i3.43158

Issue

Section

Research Articles