Evaluation of NMF-VAE Integrative Approach for Biclustering and Glioblastoma Biomarker Identification
DOI:
https://doi.org/10.20956/j.v21i3.43158Keywords:
Glioblastoma, Biclustering, Non-negative Matrix Factorization, Variational Autoencoder, BiomarkerAbstract
Glioblastoma (GBM) represents the most aggressive primary brain tumor with poor prognosis. This research develops a novel computational framework that merges the strengths of Non-negative Matrix Factorization (NMF) with Variational Autoencoder (VAE) to improve biclustering performance in GBM gene expression data analysis. Using the GSE4290 dataset, this study analyzes gene expression data from 180 samples (136 tumors and 44 normal controls). The implementation of the NMF-VAE method successfully identified 10 biclusters with coherence values of 0.711 and variance of 0.713, validated through latent space visualization and reconstruction error analysis (15-50 MSE). Differential expression analysis identified three main potential biomarkers: ANXA2, TNFRSF1A, and NAMPT, which demonstrated significant expression changes (fold change 2.5, 2.0, and 3.0) and correlated with tumor cell proliferation, inflammation, and energy metabolism. Visualization of bicluster patterns and gene expression value distributions confirmed the consistency of these biomarkers overexpression in tumor samples. These findings provide new insights into the development of gene expression-based treatment strategies for GBM patients
References
[1] Chen, L., Zhang, Y., & Liu, B., 2021. Advances in cancer treatment: A new therapeutic target, Annexin A2. Journal of Cancer Research.
[2] Chen, X., Zhang, L., Zhang, Y., Wang, W., Zhang, S., & Zhou, H., 2018. Association of TNFRSF19 with a TNF family-based prognostic model and subtypes in gliomas using machine learning. Heliyon.
[3] Cheng, Y. & Church, G.M., 2000. Biclustering of expression data. Proceedings of the International Conference on Intelligent Systems for Molecular Biology, Vol. 8, pp. 93–103.
[4] Kim, J., Lee, J. & Park, H., 2019. Biclustering analysis of transcriptome big data identifies condition-specific microRNA targets. Nucleic Acids Research, Vol. 47, No. 9, e53. Available at: https://doi.org/10.1093/nar/gkz139.
[5] Kingma, D.P. & Welling, M., 2014. Auto-encoding variational Bayes. arXiv preprint arXiv:1312.6114. Available at: https://arxiv.org/abs/1312.6114.
[6] Lee, D.D. & Seung, H.S., 1999. Learning the parts of objects by non-negative matrix factorization. Nature, Vol. 401, No. 6755, pp. 788–791. Available at: https://doi.org/10.1038/44565.
[7] Lucena-Cacace, A., Otero-Albiol, D., Jiménez-García, M.P., Peinado-Serrano, J. & Carnero, A., 2018. NAMPT as a dedifferentiation-inducer gene: NAD+ as core axis for glioma cancer stem-like cells maintenance. Frontiers in Oncology, Vol. 8, Article 292. Available at: https://doi.org/10.3389/fonc.2018.00292.
[8] Madeira, S.C. & Oliveira, A.L., 2004. Biclustering algorithms for biological data analysis: A survey. IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 1, No. 1, pp. 24–45. Available at: https://doi.org/10.1109/TCBB.2004.2.
[9] Ostrom, Q.T., Patil, N. & Barnholtz-Sloan, J.S., 2022. CBTRUS statistical report: Primary brain and other central nervous system tumors diagnosed in the United States in 2015–2019. Neuro-Oncology, Vol. 24, Suppl. 1, pp. v1–v95.
[10] Stein-O'Brien, G.L., Arora, R. & Culhane, A.C., 2018. PatternMarkers & GWCoGAPS for novel data-driven biomarkers via whole transcriptome NMF. Bioinformatics, Vol. 34, No. 15, pp. 2589–2597.
[11] Verhaak, R.G., Hoadley, K.A., Purdom, E., Wang, V., Qi, Y. et al., 2010. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell, Vol. 143, No. 5, pp. 740–752. Available at: https://doi.org/10.1016/j.cell.2010.01.004.
[12] Wang, J., Li, Y., Wang, X., Chen, W., Sun, H. & Wang, J., 2021. Advances in cancer treatment: A new therapeutic target, Annexin A2. Journal of Cancer Research.
[13] Wang, Q., Hu, B. & Verhaak, R.G.W., 2021. Tumor evolution in glioblastoma: Molecular subtypes and implications for therapy. Cancer Cell, Vol. 39, No. 8, pp. 1129–1143. Available at: https://doi.org/10.1016/j.ccell.2021.06.002.
[14] Way, G.P., Zietz, M. & Greene, C.S., 2020. How machine learning will transform biomedicine. Cell. Available at: https://doi.org/10.1016/j.cell.2020.03.022.
[15] Zhang, H., Wang, Y., Li, X., Chen, W. & Wang, H., 2023. Tumor metabolism: A new field for the treatment of glioma. Bioconjugate Chemistry. Available at: https://doi.org/10.1021/acs.bioconjchem.4c00287.
[16] Zhang, Y., Song, H., Chen, R., Liu, Y., Li, Y. et al., 2012. Annexin A2 as a prognostic biomarker in glioma patients. Cancer Gene Therapy. Available at: https://doi.org/10.1007/s11060-012-0973-6.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Jurnal Matematika, Statistika dan Komputasi

This work is licensed under a Creative Commons Attribution 4.0 International License.

This work is licensed under a Creative Commons Attribution 4.0 International License.
Jurnal Matematika, Statistika dan Komputasi is an Open Access journal, all articles are distributed under the terms of the Creative Commons Attribution License, allowing third parties to copy and redistribute the material in any medium or format, transform, and build upon the material, provided the original work is properly cited and states its license. This license allows authors and readers to use all articles, data sets, graphics and appendices in data mining applications, search engines, web sites, blogs and other platforms by providing appropriate reference.




