Principals about principal components in statistical genetics

Fentaw Abegaz, Kridsadakorn Chaichoompu, Emmanuelle Génin, David W Fardo, Inke R König, Jestinah M Mahachie John, Kristel Van Steen

Abstract

Principal components (PCs) are widely used in statistics and refer to a relatively small number of uncorrelated variables derived from an initial pool of variables, while explaining as much of the total variance as possible. Also in statistical genetics, principal component analysis (PCA) is a popular technique. To achieve optimal results, a thorough understanding about the different implementations of PCA is required and their impact on study results, compared to alternative approaches. In this review, we focus on the possibilities, limitations and role of PCs in ancestry prediction, genome-wide association studies, rare variants analyses, imputation strategies, meta-analysis and epistasis detection. We also describe several variations of classic PCA that deserve increased attention in statistical genetics applications.

Original languageEnglish
JournalBriefings in bioinformatics
Pages (from-to)1-17
Number of pages17
ISSN1467-5463
DOIs
Publication statusPublished - 14.09.2018

Fingerprint

Dive into the research topics of 'Principals about principal components in statistical genetics'. Together they form a unique fingerprint.

Cite this