Abstract
Heterogenous subtypes of breast cancer need to be analyzed separately. Pooling of datasets can provide reasonable sample sizes but dataset bias is an important concern. We assembled a combined dataset of 579 Affymetrix microarrays from triple negative breast cancer (TNBC) in Gene Expression Omnibus (GEO) series GSE31519. We developed a method for selecting comparable datasets and to control for the amount of dataset bias of individual probesets.
| Original language | English |
|---|---|
| Journal | Genomics Data |
| Volume | 2 |
| Pages (from-to) | 354-6 |
| Number of pages | 3 |
| ISSN | 2213-5960 |
| DOIs | |
| Publication status | Published - 2014 |