Skip to main content

Table 3 Comparison of the accuracies of the imputed genotypes in different populations

From: Imputation of missing genotypes: an empirical evaluation of IMPUTE

Accuracy

Population (sample size)

AA (258)

SCA (111)

THAI (104)

NNC (270)

NNC (135)

NECS (280)

Complete missing

Overall

85.66%

(0.26,0.75,0.93,1)

87.39%

(0.02,0.77,0.95,1)

94.23%

(0.01,0.86,0.98,1)

97.05%

(0.01,0.92,0.99,1)

97.06%

(0.02,0.93,0.99,1)

96.43%

(0.01,0.91,0.99,1)

 

0.95 P.P.

96.70%

(0.00,0.92,0.99,1)

97.22%

(0.00,0.94,1.00,1)

98.06%

(0.00,0.95,1.00,1)

99.22%

(0.00,0.98,1.00,1)

99.24%

(0.00,0.98,1.00,1)

99.15%

(0.00,0.98,1.00,1)

 

Percentage

59.00%

60.77%

72.56%

80.38%

80.40%

77.43%

80% missing

Overall

85.92%

(0.29,0.75,0.93,1)

87.64%

(0.01,0.78,0.94,1)

93.98%

(0.01,0.86,0.98,1)

97.24%

(0.01,0.93,0.99,1)

97.25%

(0.01,0.93,0.99,1)

96.43%

(0.01,0.91,0.99,1)

 

0.95 P.P.

96.79%

(0.00,0.92,0.99,1)

97.37%

(0.00,0.93,1.00,1)

98.46%

(0.00,0.95,1.00,1)

99.27%

(0.00,0.98,1.00,1)

99.08%

(0.00,0.98,1.00,1)

99.08%

(0.00,0.98,1.00,1)

 

Percentage

59.03%

61.27%

72.82%

80.53%

80.44%

77.43%

  1. The columns report the accuracy of imputation when 10% of SNPs were imputed. As in Table 1, the first three rows labelled as ''Complete missing'' summarize the accuracy when the genotype data were completely removed, while the last three rows labelled ''80% missing'' summarize the accuracy when 80% of the genotype data were randomly removed. The row labelled ''Overall'' reports the median accuracy and the minimum, 1st quartile, 3rd quartile, and maximum accuracy within brackets. The row labelled ''0.95.P.P'' reports the median accuracy of the imputed genotypes when a minimum posterior probability of 0.95 was required for an imputed genotype to be acceptable. The row labelled ''Percentage'' reports the percentage of imputed genotype data that were acceptable by using the minimum posterior probability of 0.95 as requirement.