"Model-Based Clustering With Data Correction for Removing Artifacts in " by William Chad Young, Adrian E. Raftery et al.

School of Engineering and Technology Publications

Title

Model-Based Clustering With Data Correction for Removing Artifacts in Gene Expression Data

Authors

William Chad Young
Adrian E. Raftery
Ka Yee Yeung, University of Washington TacomaFollow

Publication Date

2017

Document Type

Article

Abstract

The NIH Library of Integrated Network-based Cellular Signatures (LINCS) contains gene expression data from over a million experiments, using Luminex Bead technology. Only 500 colors are used to measure the expression levels of the 1000 landmark genes measured, and the data for the resulting pairs of genes are deconvolved. The raw data are sometimes inadequate for reliable deconvolution, leading to artifacts in the final processed data. These include the expression levels of paired genes being flipped or given the same value and clusters of values that are not at the true expression level. We propose a new method called model-based clustering with data correction (MCDC) that is able to identify and correct these three kinds of artifacts simultaneously. We show that MCDC improves the resulting gene expression data in terms of agreement with external baselines, as well as improving results from subsequent analysis.

Publication Title

The Annals of Applied Statistics

Volume

Issue

DOI

10.1214/17-AOAS1051

Publisher Policy

publisher's pdf

Recommended Citation

Young, William Chad; Raftery, Adrian E.; and Yeung, Ka Yee, "Model-Based Clustering With Data Correction for Removing Artifacts in Gene Expression Data" (2017). School of Engineering and Technology Publications. 276.
https://digitalcommons.tacoma.uw.edu/tech_pub/276

This document is currently not available here.

Find in your library

COinS

UW Tacoma Digital Commons

School of Engineering and Technology Publications

Title

Authors

Publication Date

Document Type

Abstract

Publication Title

Volume

Issue

DOI

Publisher Policy

Recommended Citation

Browse

Author Corner

Links

SelectedWorks Sites

UW Tacoma Digital Commons

School of Engineering and Technology Publications

Title

Authors

Publication Date

Document Type

Abstract

Publication Title

Volume

Issue

DOI

Publisher Policy

Recommended Citation

Share

Browse

Author Corner

Links

SelectedWorks Sites