Control of dataset bias in combined Affymetrix cohorts of triple negative breast cancer

Thomas Karn, Achim Rody, Volkmar Müller, Marcus Schmidt, Sven Becker, Uwe Holtrich, Lajos Pusztai

Abstract

Heterogenous subtypes of breast cancer need to be analyzed separately. Pooling of datasets can provide reasonable sample sizes but dataset bias is an important concern. We assembled a combined dataset of 579 Affymetrix microarrays from triple negative breast cancer (TNBC) in Gene Expression Omnibus (GEO) series GSE31519. We developed a method for selecting comparable datasets and to control for the amount of dataset bias of individual probesets.

Original languageEnglish
JournalGenomics data
Volume2
Pages (from-to)354-6
Number of pages3
ISSN2213-5960
DOIs
Publication statusPublished - 2014

Fingerprint

Dive into the research topics of 'Control of dataset bias in combined Affymetrix cohorts of triple negative breast cancer'. Together they form a unique fingerprint.

Cite this