SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it

Bioessays 35 (9):780-786 (2013)

Abstract

Whole genome sequencing and SNP genotyping arrays can paint strikingly different pictures of demographic history and natural selection. This is because genotyping arrays contain biased sets of pre‐ascertained SNPs. In this short review, we use comparisons between high‐coverage whole genome sequences of African hunter‐gatherers and data from genotyping arrays to highlight how SNP ascertainment bias distorts population genetic inferences. Sample sizes and the populations in which SNPs are discovered affect the characteristics of observed variants. We find that SNPs on genotyping arrays tend to be older and present in multiple populations. In addition, genotyping arrays cause allele frequency distributions to be shifted towards intermediate frequency alleles, and estimates of linkage disequilibrium are modified. Since population genetic analyses depend on allele frequencies, it is imperative that researchers are aware of the effects of SNP ascertainment bias. With this in mind, we describe multiple ways to correct for SNP ascertainment bias.
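
The shift towards intermediate frequency alleles can be illustrated with a small calculation. The sketch below is not taken from the article. It assumes a standard neutral frequency spectrum (weight proportional to 1/i for a derived allele present in i of N chromosomes) and assumes that a SNP is placed on an array only if it is polymorphic in a discovery panel of n chromosomes drawn independently from the same population. The function names and parameter values are hypothetical and chosen only for illustration.

```python
def discovery_probability(p, n):
    """Chance that a biallelic SNP with derived-allele frequency p is
    polymorphic (both alleles seen) among n sampled chromosomes."""
    return 1.0 - p**n - (1.0 - p)**n


def ascertained_sfs(num_chromosomes, panel_size):
    """Return (true, ascertained) frequency spectra as normalized weights.

    Assumption: the true spectrum follows the neutral expectation 1/i,
    and ascertainment reweights each class by its discovery probability.
    """
    N = num_chromosomes
    counts = range(1, N)
    true = [1.0 / i for i in counts]
    asc = [w * discovery_probability(i / N, panel_size)
           for i, w in zip(counts, true)]
    true_total, asc_total = sum(true), sum(asc)
    return ([w / true_total for w in true],
            [w / asc_total for w in asc])


def mean_frequency(weights):
    """Mean derived-allele frequency under a normalized spectrum."""
    N = len(weights) + 1
    return sum((i / N) * w for i, w in enumerate(weights, start=1))


if __name__ == "__main__":
    true, asc = ascertained_sfs(num_chromosomes=100, panel_size=4)
    print("mean frequency, all SNPs:        ", round(mean_frequency(true), 3))
    print("mean frequency, ascertained SNPs:", round(mean_frequency(asc), 3))
```

Under these assumptions, rare variants are unlikely to be seen in a small discovery panel, so the ascertained spectrum holds fewer low-frequency alleles than the true one. A larger panel_size weakens the effect, which is consistent with the statement that discovery sample size affects the characteristics of observed variants.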
