Clustering Gene-Expression Data With Repeated Measurements

Ka Yee Yeung, University of Washington Tacoma
Mario Medvedovic
Roger E. Bumgarner

Abstract

Clustering is a common methodology for the analysis of array data, and many research laboratories are generating array data with repeated measurements. We evaluated several clustering algorithms that incorporate repeated measurements, and show that algorithms that take advantage of repeated measurements yield more accurate and more stable clusters. In particular, we show that the infinite mixture model-based approach with a built-in error model produces superior results.