Skip to main navigation Skip to search Skip to main content

Data-driven subclassification of ANCA-associated vasculitis: model-based clustering of a federated international cohort

Karl Gisslander*, Arthur White, Louis Aslett, Zdenka Hruskova, Peter Lamprecht, Jacek Musial, Jamsheela Nazeer, James Ng, Declan O'Sullivan, Xavier Puéchal, Matthew Rutherford, Mårten Segelmark, Benjamin Terrier, Vladimir Tesař, Michelangelo Tesi, Augusto Vaglio, Krzysztof Wójcik, Mark A. Little, Aladdin J. Mohammad, Adrian TassoniAlessandra Bettiol, Arlette Tais, Beyza Yaman, Cecil Armstrong, Dagmar Wandrei, Dipak Kalra, Fabian Schubach, François Dradin, Giacomo Emmi, Giacomo Bagni, Gabriele Ihorst, Hannelore Aerts, Hicham Kardaoui, Irene Mattioli, Iris Sengers, Jacek Musial, Jennifer Scott, John Mills, Julie Julie Power, Katarzyna Wawrzycka-Adamczyk, Kris McGlinn, Lucy Hederman, Margaret Dunne, Marco A. Alba, Maria Christofidou, Matija Crnogorac, Nathan Lea, Neil Basu, Peter Verhoeven, Raïssa de Boer, Richard Straka, Sabina Lichołai, Sabrina Arnold, Vladimir Tesar, Zdenka Hruskova

*Corresponding author for this work

Abstract

Background: Antineutrophil cytoplasmic antibody (ANCA)-associated vasculitis is a heterogenous autoimmune disease. While traditionally stratified into two conditions, granulomatosis with polyangiitis (GPA) and microscopic polyangiitis (MPA), the subclassification of ANCA-associated vasculitis is subject to continued debate. Here we aim to identify phenotypically distinct subgroups and develop a data-driven subclassification of ANCA-associated vasculitis, using a large real-world dataset. Methods: In the collaborative data reuse project FAIRVASC (Findable, Accessible, Interoperable, Reusable, Vasculitis), registry records of patients with ANCA-associated vasculitis were retrieved from six European vasculitis registries: the Czech Registry of ANCA-associated vasculitis (Czech Republic), the French Vasculitis Study Group Registry (FVSG; France), the Joint Vasculitis Registry in German-speaking Countries (GeVas; Germany), the Polish Vasculitis Registry (POLVAS; Poland), the Irish Rare Kidney Disease Registry (RKD; Ireland), and the Skåne Vasculitis Cohort (Sweden). We performed model-based clustering of 17 mixed-type clinical variables using a parsimonious mixture of two latent Gaussian variable models. Clinical validation of the optimal cluster solution was made through summary statistics of the clusters' demography, phenotypic and serological characteristics, and outcome. The predictive value of models featuring the cluster affiliations were compared with classifications based on clinical diagnosis and ANCA specificity. People with lived experience were involved throughout the FAIRVASVC project. Findings: A total of 3868 patients diagnosed with ANCA-associated vasculitis between Nov 1, 1966, and March 1, 2023, were included in the study across the six registries (Czech Registry n=371, FVSG n=1780, GeVas n=135, POLVAS n=792, RKD n=439, and Skåne Vasculitis Cohort n=351). There were 2434 (62·9%) patients with GPA and 1434 (37·1%) with MPA. Mean age at diagnosis was 57·2 years (SD 16·4); 2006 (51·9%) of 3867 patients were men and 1861 (48·1%) were women. We identified five clusters, with distinct phenotype, biochemical presentation, and disease outcome. Three clusters were characterised by kidney involvement: one severe kidney cluster (555 [14·3%] of 3868 patients) with high C-reactive protein (CRP) and serum creatinine concentrations, and variable ANCA specificity (SK cluster); one myeloperoxidase (MPO)-ANCA-positive kidney involvement cluster (782 [20·2%]) with limited extrarenal disease (MPO-K cluster); and one proteinase 3 (PR3)-ANCA-positive kidney involvement cluster (683 [17·7%]) with widespread extrarenal disease (PR3-K cluster). Two clusters were characterised by relative absence of kidney involvement: one was a predominantly PR3-ANCA-positive cluster (1202 [31·1%]) with inflammatory multisystem disease (IMS cluster), and one was a cluster (646 [16·7%]) with predominantly ear–nose–throat involvement and low CRP, with mainly younger patients (YR cluster). Compared with models fitted with clinical diagnosis or ANCA status, cluster-assigned models demonstrated improved predictive power with respect to both patient and kidney survival. Interpretation: Our study reinforces the view that ANCA-associated vasculitis is not merely a binary construct. Data-driven subclassification of ANCA-associated vasculitis exhibits higher predictive value than current approaches for key outcomes. Funding: European Union's Horizon 2020 research and innovation programme under the European Joint Programme on Rare Diseases.

Original languageEnglish
JournalThe Lancet Rheumatology
Volume6
Issue number11
Pages (from-to)e762-e770
DOIs
Publication statusPublished - 11.2024

Funding

FundersFunder number
ERN-RITA
Horizon 2020COFUND-EJP 825575

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 3 - Good Health and Well-being
      SDG 3 Good Health and Well-being

    Research Areas and Centers

    • Academic Focus: Center for Infection and Inflammation Research (ZIEL)

    DFG Research Classification Scheme

    • 2.21-05 Immunology
    • 2.22-18 Rheumatology

    Fingerprint

    Dive into the research topics of 'Data-driven subclassification of ANCA-associated vasculitis: model-based clustering of a federated international cohort'. Together they form a unique fingerprint.

    Cite this