The Full Bayesian Significance Test for Mixture Models: Results in Gene Expression Clustering.

Julio Michael Stern; Marcelo de Souza Lauretto; Carlos Alberto de Braganca Pereira

The Full Bayesian Significance Test for Mixture Models: Results in Gene Expression Clustering.

Julio Michael Stern, Marcelo de Souza Lauretto & Carlos Alberto de Braganca Pereira

Genetics and Molecular Research 7 (3):883-897 (2008) Copy BIBT_EX

Abstract

Gene clustering is a useful exploratory technique to group together genes with similar expression levels under distinct cell cycle phases or distinct conditions. It helps the biologist to identify potentially meaningful relationships between genes. In this study, we propose a clustering method based on multivariate normal mixture models, where the number of clusters is predicted via sequential hypothesis tests: at each step, the method considers a mixture model of m components (m = 2 in the first step) and tests if in fact it should be m - 1. If the hypothesis is rejected, m is increased and a new test is carried out. The method continues (increasing m) until the hypothesis is accepted. The theoretical core of the method is the full Bayesian significance test, an intuitive Bayesian approach, which needs no model complexity penalization nor positive probabilities for sharp hypotheses. Numerical experiments were based on a cDNA microarray dataset consisting of expression levels of 205 genes belonging to four functional categories, for 10 distinct strains of Saccharomyces cerevisiae. To analyze the method’s sensitivity to data dimension, we performed principal components analysis on the original dataset and predicted the number of classes using 2 to 10 principal components. Compared to Mclust (model-based clustering), our method shows more consistent results.

View on PhilPapers

Author's Profile

Julio Michael Stern

University of São Paulo

Archival history

Archival date: 2021-07-24
View all versions

Keywords

Gene clustering mixture models significance test gene expression data analysis

Reprint years

Analytics

Added to PP
2021-07-24

Downloads
360 (#75,981)

6 months
92 (#75,581)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

The Full Bayesian Significance Test for Mixture Models: Results in Gene Expression Clustering.

Abstract

Author's Profile

Archival history

Categories

Keywords

Reprint years

Analytics