Abstract
Heterogenous subtypes of breast cancer need to be analyzed separately. Pooling of datasets can provide reasonable sample sizes but dataset bias is an important concern. We assembled a combined dataset of 579 Affymetrix microarrays from triple negative breast cancer (TNBC) in Gene Expression Omnibus (GEO) series GSE31519. We developed a method for selecting comparable datasets and to control for the amount of dataset bias of individual probesets.
Original language | English |
---|---|
Journal | Genomics data |
Volume | 2 |
Pages (from-to) | 354-6 |
Number of pages | 3 |
ISSN | 2213-5960 |
DOIs | |
Publication status | Published - 2014 |