Zur Hauptnavigation wechseln Zur Suche wechseln Zum Hauptinhalt wechseln

Overcoming data scarcity in biomedical imaging with a foundational multi-task model

Raphael Schäfer, Till Nicke, Henning Höfener, Annkristin Lange, Dorit Merhof, Friedrich Feuerhake, Volkmar Schulz, Johannes Lotz*, Fabian Kiessling*

*Korrespondierende/r Autor/-in für diese Arbeit

Abstract

Foundational models, pretrained on a large scale, have demonstrated substantial success across non-medical domains. However, training these models typically requires large, comprehensive datasets, which contrasts with the smaller and more specialized datasets common in biomedical imaging. Here we propose a multi-task learning strategy that decouples the number of training tasks from memory requirements. We trained a universal biomedical pretrained model (UMedPT) on a multi-task database including tomographic, microscopic and X-ray images, with various labeling strategies such as classification, segmentation and object detection. The UMedPT foundational model outperformed ImageNet pretraining and previous state-of-the-art models. For classification tasks related to the pretraining database, it maintained its performance with only 1% of the original training data and without fine-tuning. For out-of-domain tasks it required only 50% of the original training data. In an external independent validation, imaging features extracted using UMedPT proved to set a new standard for cross-center transferability.

OriginalspracheEnglisch
ZeitschriftNature Computational Science
Jahrgang4
Ausgabenummer7
Seiten (von - bis)495-509
Seitenumfang15
DOIs
PublikationsstatusVeröffentlicht - 07.2024

Fördermittel

This research was funded by the German ministry of education and research (BMBF) through the project SynDICAD (01IS21067C; R.S., T.N., A.L., D.M., H.H., F.F., J.L.) and the German Research Foundation (DFG), CRC 1382 (403224013; F.K.). Our work uses datasets that are licensed under CC BY NC-SA 4.0 (ref. ), CC BY 4.0 (ref. ) and CC BY SA 4.0 (ref. ). We thank the authors of the datasets for their contributions.

TrägerTrägernummer
Bundesministerium für Bildung und Forschung01IS21067C
Deutsche ForschungsgemeinschaftCRC 1382, 403224013

    Fingerprint

    Untersuchen Sie die Forschungsthemen von „Overcoming data scarcity in biomedical imaging with a foundational multi-task model“. Zusammen bilden sie einen einzigartigen Fingerprint.

    Zitieren