Abstract
Foundational models, pretrained on a large scale, have demonstrated substantial success across non-medical domains. However, training these models typically requires large, comprehensive datasets, which contrasts with the smaller and more specialized datasets common in biomedical imaging. Here we propose a multi-task learning strategy that decouples the number of training tasks from memory requirements. We trained a universal biomedical pretrained model (UMedPT) on a multi-task database including tomographic, microscopic and X-ray images, with various labeling strategies such as classification, segmentation and object detection. The UMedPT foundational model outperformed ImageNet pretraining and previous state-of-the-art models. For classification tasks related to the pretraining database, it maintained its performance with only 1% of the original training data and without fine-tuning. For out-of-domain tasks it required only 50% of the original training data. In an external independent validation, imaging features extracted using UMedPT proved to set a new standard for cross-center transferability.
| Originalsprache | Englisch |
|---|---|
| Zeitschrift | Nature Computational Science |
| Jahrgang | 4 |
| Ausgabenummer | 7 |
| Seiten (von - bis) | 495-509 |
| Seitenumfang | 15 |
| DOIs | |
| Publikationsstatus | Veröffentlicht - 07.2024 |
Fördermittel
This research was funded by the German ministry of education and research (BMBF) through the project SynDICAD (01IS21067C; R.S., T.N., A.L., D.M., H.H., F.F., J.L.) and the German Research Foundation (DFG), CRC 1382 (403224013; F.K.). Our work uses datasets that are licensed under CC BY NC-SA 4.0 (ref. ), CC BY 4.0 (ref. ) and CC BY SA 4.0 (ref. ). We thank the authors of the datasets for their contributions.
| Träger | Trägernummer |
|---|---|
| Bundesministerium für Bildung und Forschung | 01IS21067C |
| Deutsche Forschungsgemeinschaft | CRC 1382, 403224013 |
Fingerprint
Untersuchen Sie die Forschungsthemen von „Overcoming data scarcity in biomedical imaging with a foundational multi-task model“. Zusammen bilden sie einen einzigartigen Fingerprint.Zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver