Metadata matching is an important step towards integrating heterogeneous healthcare data and facilitating secondary use. MDRCupid supports this step by providing a configurable metadata matching toolbox incorporating lexical and statistical matching approaches. The matching configuration can be adapted to different purposes by manually selecting algorithms and their weights or by using the optimization module with corresponding training data. The toolbox can be accessed as a web service via programming or user interface. For every selected metadata element, the metadata elements with the highest similarity scores are presented to the user and can be manually confirmed via the user interface, while the programming interface uses a similarity threshold to select corresponding elements. An HL7 FHIR ConceptMap is used to save the matches. Manually confirmed matches may be used as new training data for the optimizer to improve the matching parameters further.
|Title of host publication
|Studies in Health Technology and Informatics
|Published - 21.08.2019