Biclustering Analysis Using Plaid Model on Gene Expression Data of Colon Cancer

Titin Siswantining; Achmad Eriza Aminanto; Devvi Sarwinda; Olivia Swasti

doi:10.17713/ajs.v50i5.1195

Biclustering Analysis Using Plaid Model on Gene Expression Data of Colon Cancer

Authors

Titin Siswantining University of Indonesia
Achmad Eriza Aminanto University of Indonesia
Devvi Sarwinda University of Indonesia
Olivia Swasti

DOI:

https://doi.org/10.17713/ajs.v50i5.1195

Abstract

Unlike other typical clustering analysis, which considers column only, biclustering analysis processes a matrix into sub-matrices based on rows and columns simultaneously. One method of bicluster analysis uses the probabilistic model, like the plaid model, that provides overlapping bicluster. The plaid model calculates the value of an element given from a particular sub-matrix for each cell; thus, the value can be seen as the number of contributions of a particular bicluster. The algorithm begins with preparing the input data as a matrix, then an initial model is assessed and makes a residual matrix from the model. After that, we determine bicluster candidates, which are evaluated for its effect parameters and bicluster membership parameters. Finally, the bicluster candidate is pruned to give the optimal bicluster. We implemented the algorithm on gene expression dataset of colon cancer, where the rows and columns contain observations and types of genes, respectively. We carried out in six distinct scenarios in which each scenario uses different model parameters and threshold values. We measured the results using Jaccard index and coherence variance. Our experiments show that biclustering analysis on a model with mean, row, and column effects of colon cancer data output low coherence variance.

Downloads

Published

2021-08-25

How to Cite

Siswantining, T., Aminanto, A. E., Sarwinda, D., & Swasti, O. (2021). Biclustering Analysis Using Plaid Model on Gene Expression Data of Colon Cancer. Austrian Journal of Statistics, 50(5), 101–114. https://doi.org/10.17713/ajs.v50i5.1195

Download Citation

Issue

Vol. 50 No. 5 (2021): Regular Issue

Section

Articles

License

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

The Austrian Journal of Statistics publish open access articles under the terms of the Creative Commons Attribution (CC BY) License.

The Creative Commons Attribution License (CC-BY) allows users to copy, distribute and transmit an article, adapt the article and make commercial use of the article. The CC BY license permits commercial and non-commercial re-use of an open access article, as long as the author is properly attributed.

Copyright on any research article published by the Austrian Journal of Statistics is retained by the author(s). Authors grant the Austrian Journal of Statistics a license to publish the article and identify itself as the original publisher. Authors also grant any third party the right to use the article freely as long as its original authors, citation details and publisher are identified.

Manuscripts should be unpublished and not be under consideration for publication elsewhere. By submitting an article, the author(s) certify that the article is their original work, that they have the right to submit the article for publication, and that they can grant the above license.

Biclustering Analysis Using Plaid Model on Gene Expression Data of Colon Cancer

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Developed By

Information