Skip to main content
padlock icon - secure page this page is secure

Free Content Accounting for Genotyping Errors in Tagging SNP Selection

Download Article:

You have access to the full text article on a website external to Ingenta Connect.

Please click here to view this article on Wiley Online Library.

You may be required to register and activate access on Wiley Online Library before you can obtain the full text. If you have any queries please visit Wiley Online Library


One limitation of the existing tagging SNP selection algorithms is that they assume the reported genotypes are error free. However, genotyping errors are often unavoidable in practice. Many tagging SNP selection methods depend heavily on the estimated haplotype frequencies. Recent studies have demonstrated that even slight genotyping errors can lead to serious consequences with regard to haplotype reconstruction and frequency estimation. Here we present a tagging SNP selection method that allows for genotyping errors. Our method is a modification of the pair-wise r2 tagging SNP selection algorithm proposed by Carlson et al. (2004). We have replaced the standard EM algorithm in Carlson's method with an EM that accounts for genotyping errors, in an attempt to obtain better estimates of the haplotype frequencies and r2 measure. Through simulation studies we compared the performance of our modified algorithm with that of the original algorithm. We found that the number of tags selected by both methods increased with increasing genotyping errors, though our method led to smaller increase. The power of haplotype association tests using the selected tags decreased dramatically with increasing genotyping errors. The power of single marker tests also decreased, but the reduction was not as much as the reduction in power of haplotype tests. When restricting the mean number of tags selected by both methods to be similar to the baseline number, Carlson's method and our method led to similar power for the subsequent haplotype and single marker tests. Our results showed that, by accounting for random genotyping errors, our method can select tagging SNPs more efficiently than Carlson's method. The computer program that implements our modified tagging SNP selection algorithm is available at our web site:
No References
No Citations
No Supplementary Data
No Article Media
No Metrics

Keywords: EM; genotyping errors; tagging SNPs

Document Type: Research Article

Affiliations: 1: College of Information Sciences and Technology, Penn State University Park 2: Division of Biostatistics, Dept. of Health Evaluation Sciences, Penn State College of Medicine

Publication date: July 1, 2007

  • Access Key
  • Free content
  • Partial Free content
  • New content
  • Open access content
  • Partial Open access content
  • Subscribed content
  • Partial Subscribed content
  • Free trial content
Cookie Policy
Cookie Policy
Ingenta Connect website makes use of cookies so as to keep track of data that you have filled in. I am Happy with this Find out more