Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
8
pubmed:dateCreated
2010-11-24
pubmed:abstractText
An important component in the analysis of genome-wide association studies involves the imputation of genotypes that have not been measured directly in the studied samples. The imputation procedure uses the linkage disequilibrium (LD) structure in the population to infer the genotype of an unobserved single nucleotide polymorphism. The LD structure is normally learned from a dense genotype map of a reference population that matches the studied population. In many instances there is no reference population that exactly matches the studied population, and a natural question arises as to how to choose the reference population for the imputation. Here we present a Coalescent-based method that addresses this issue. In contrast to the current paradigm of imputation methods, our method assigns a different reference dataset for each sample in the studied population, and for each region in the genome. This allows the flexibility to account for the diversity within populations, as well as across populations. Furthermore, because our approach treats each region in the genome separately, our method is suitable for the imputation of recently admixed populations. We evaluated our method across a large set of populations and found that our choice of reference data set considerably improves the accuracy of imputation, especially for regions with low LD and for populations without a reference population available as well as for admixed populations such as the Hispanic population. Our method is generic and can potentially be incorporated in any of the available imputation methods as an add-on.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:month
Dec
pubmed:issn
1098-2272
pubmed:author
pubmed:copyrightInfo
© 2010 Wiley-Liss, Inc.
pubmed:issnType
Electronic
pubmed:volume
34
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
773-82
pubmed:meshHeading
pubmed-meshheading:21058333-African Continental Ancestry Group, pubmed-meshheading:21058333-American Native Continental Ancestry Group, pubmed-meshheading:21058333-European Continental Ancestry Group, pubmed-meshheading:21058333-Genetic Variation, pubmed-meshheading:21058333-Genetics, Population, pubmed-meshheading:21058333-Genome, Human, pubmed-meshheading:21058333-Genome-Wide Association Study, pubmed-meshheading:21058333-Genotype, pubmed-meshheading:21058333-Hispanic Americans, pubmed-meshheading:21058333-Humans, pubmed-meshheading:21058333-Linkage Disequilibrium, pubmed-meshheading:21058333-Models, Genetic, pubmed-meshheading:21058333-Polymorphism, Single Nucleotide, pubmed-meshheading:21058333-Reference Standards, pubmed-meshheading:21058333-Sensitivity and Specificity, pubmed-meshheading:21058333-Software
pubmed:year
2010
pubmed:articleTitle
A generic coalescent-based framework for the selection of a reference panel for imputation.
pubmed:affiliation
International Computer Science Institute, Berkeley, California 94704, USA. bogdan@icsi.berkeley.edu
pubmed:publicationType
Journal Article, Research Support, U.S. Gov't, Non-P.H.S., Research Support, Non-U.S. Gov't