Abstract
Predicting the survival of cancer patients is of high importance for the medical community, e.g. for evaluating therapy strategies. This study is based on lung cancer data retrieved from seven German cancer registries according to the German basic oncology dataset. After data integration and preprocessing, we predicted the survival for 6, 12, 18 and 24 months respectively using a gradient boosting algorithm. To gain insight into the decision process of the models, we identified the features that have a high impact on patient survival using permutation feature importance scores as explainability metric. They show that age at diagnosis as well as the presence of distant metastases are key factors for long-term survival. The found factors can be used in a next step for multi-variate survival analysis.
Original language | English |
---|---|
Journal | Studies in Health Technology and Informatics |
Volume | 327 |
Pages (from-to) | 457-461 |
Number of pages | 5 |
ISSN | 0926-9630 |
DOIs | |
Publication status | Published - 15.05.2025 |
Research Areas and Centers
- Research Area: Center for Population Medicine and Public Health (ZBV)
- Centers: Center for Artificial Intelligence Luebeck (ZKIL)
DFG Research Classification Scheme
- 2.22-02 Public Health, Healthcare Research, Social and Occupational Medicine
- 2.22-07 Medical Informatics and Medical Bioinformatics
Fingerprint
Dive into the research topics of 'Lung Cancer Survival Estimation Using Data from Seven German Cancer Registries'. Together they form a unique fingerprint.Datasets
-
AI-Care Binary Classification of Lung Cancer Survival
Germer, S. (Scientific Creator), Rudolph, C. (Scientific Creator), Katalinic, A. (Scientific Creator), Rath, N. (Scientific Creator), Rausch, K. (Scientific Creator) & Handels, H. (Scientific Creator), ZENODO, 24.06.2025
DOI: 10.5281/zenodo.15728131, https://zenodo.org/records/15728131
Dataset