Skip to main navigation Skip to search Skip to main content

Data quality and patient characteristics in European ANCA-associated vasculitis registries: data retrieval by federated querying

Karl Gisslander*, Matthew Rutherford, Louis Aslett, Neil Basu, François Dradin, Lucy Hederman, Zdenka Hruskova, Hicham Kardaoui, Peter Lamprecht, Sabina Lichołai, Jacek Musial, Declan O'sullivan, Xavier Puechal, Jennifer Scott, Mårten Segelmark, Richard Straka, Benjamin Terrier, Vladimir Tesar, Michelangelo Tesi, Augusto VaglioDagmar Wandrei, Arthur White, Krzysztof Wójcik, Beyza Yaman, Mark A. Little, Aladdin J. Mohammad

*Corresponding author for this work

Abstract

Objectives This study aims to describe the data structure and harmonisation process, explore data quality and define characteristics, treatment, and outcomes of patients across six federated antineutrophil cytoplasmic antibody-associated vasculitis (AAV) registries. Methods Through creation of the vasculitis-specific Findable, Accessible, Interoperable, Reusable, VASCulitis ontology, we harmonised the registries and enabled semantic interoperability. We assessed data quality across the domains of uniqueness, consistency, completeness and correctness. Aggregated data were retrieved using the semantic query language SPARQL Protocol and Resource Description Framework Query Language (SPARQL) and outcome rates were assessed through random effects meta-analysis. Results A total of 5282 cases of AAV were identified. Uniqueness and data-type consistency were 100% across all assessed variables. Completeness and correctness varied from 49%-100% to 60%-100%, respectively. There were 2754 (52.1%) cases classified as granulomatosis with polyangiitis (GPA), 1580 (29.9%) as microscopic polyangiitis and 937 (17.7%) as eosinophilic GPA. The pattern of organ involvement included: lung in 3281 (65.1%), ear-nose-throat in 2860 (56.7%) and kidney in 2534 (50.2%). Intravenous cyclophosphamide was used as remission induction therapy in 982 (50.7%), rituximab in 505 (17.7%) and pulsed intravenous glucocorticoid use was highly variable (11%-91%). Overall mortality and incidence rates of end-stage kidney disease were 28.8 (95% CI 19.7 to 42.2) and 24.8 (95% CI 19.7 to 31.1) per 1000 patient-years, respectively. Conclusions In the largest reported AAV cohort-study, we federated patient registries using semantic web technologies and highlighted concerns about data quality. The comparison of patient characteristics, treatment and outcomes was hampered by heterogeneous recruitment settings.

Original languageEnglish
JournalAnnals of the Rheumatic Diseases
Volume83
Issue number1
Pages (from-to)112-120
Number of pages9
ISSN0003-4967
DOIs
Publication statusPublished - 31.10.2023

Funding

FundersFunder number
Health and Social Care, Research and Development Division, Northern Ireland
European Regional Development Fund
Health Service Executive Ballyshannon
Health Research Board203930/B/16/Z
Horizon 2020K/NCB/000058
Wellcome Trust203930
Not added13/RC/2106_P2
Horizon 2020 Framework Programme825575

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 3 - Good Health and Well-being
      SDG 3 Good Health and Well-being

    Research Areas and Centers

    • Academic Focus: Center for Infection and Inflammation Research (ZIEL)

    DFG Research Classification Scheme

    • 2.21-05 Immunology
    • 2.22-18 Rheumatology

    Fingerprint

    Dive into the research topics of 'Data quality and patient characteristics in European ANCA-associated vasculitis registries: data retrieval by federated querying'. Together they form a unique fingerprint.

    Cite this