Abstract
We present a new feature weighting method to improve k-Nearest-Neighbor (k-NN) classification. The proposed method minimizes the largest distance between equally labeled data tuples, while retaining a minimum distance between data tuples of different classes, with the goal to group equally labeled data together. It can be implemented as a simple linear program, and in contrast to other feature weighting methods, it does not depend on the initial scaling of the data dimensions. Two versions, a hard and a soft one, are evaluated on real-world datasets from the UCI repository. In particular the soft version compares very well with competing methods. Furthermore, an evaluation is done on challenging gene expression data sets, where the method shows its ability to automatically reduce the dimensionality of the data.
| Originalsprache | Englisch |
|---|---|
| Zeitschrift | Pattern Recognition Letters |
| Jahrgang | 52 |
| Seiten (von - bis) | 48-52 |
| Seitenumfang | 5 |
| ISSN | 0167-8655 |
| DOIs | |
| Publikationsstatus | Veröffentlicht - 15.01.2014 |
UN SDGs
Dieser Output leistet einen Beitrag zu folgendem(n) Ziel(en) für nachhaltige Entwicklung
-
SDG 3 – Gesundheit und Wohlergehen
-
SDG 9 – Industrie, Innovation und Infrastruktur
Zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver