Interested in the science behind the country connection?

The country match is determined using a population genetics equation known as the Hardy-Weinberg (HW) equation. It is used to calculate how frequently your genotype (the alleles present at a locus) would be expected to occur in each of several population databases. As an example, data from four populations for the D2S1338 locus are shown. All four populations have alleles 15 through 25 in their databases, but the frequency of each allele differs among the populations. For example, allele 16 occurs at a frequency of 21.52% in population 1, 19.08% in population 2, 28.90% in population 3, and 9.23% in population 4. Differences of this size are typical of the allele frequency differences detected among populations. See the attached Chart. (Note: the allele frequencies in a population should add up to 1.00, or 100%. In this example they do not, because minor alleles below 15 and above 25 are not included.)

To determine the frequency of occurrence of a genotype, the Hardy-Weinberg (HW) equation is used. For a genotype made up of two different alleles with frequencies p and q, the expected genotype frequency is 2 x p x q. For example, if your genotype (the allele(s) present in your DNA profile) were 20,21 at the D2S1338 locus, we could determine how often we would expect to encounter a person who was 20,21 by using the HW equation. Using the frequencies of the 20 and 21 alleles in the four populations, the calculation is:

Population 1 – (0.0346 x 0.0282 x 2) = 0.00195 or 1 in 513

Population 2 – (0.2050 x 0.2450 x 2) = 0.10045 or 1 in 10

Population 3 – (0.0642 x 0.1055 x 2) = 0.01355 or 1 in 74

Population 4 – (0.0290 x 0.0691 x 2) = 0.00401 or 1 in 249

A higher genotype frequency means your genotype is more common in that population. Based on the genotype frequency at the D2S1338 locus, the ranking from the most similar to the least similar is:

Population 2
Population 3
Population 4
Population 1
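
The single-locus calculation and ranking above can be reproduced with a short script. This is only a minimal sketch, not the code ConnectMyDNA uses: the function names genotype_frequency and rank_populations are made up for illustration, and the allele frequencies are the ones quoted in the example for alleles 20 and 21 at D2S1338.

```python
# Allele frequencies for alleles 20 and 21 at D2S1338, copied from the example above.
ALLELE_FREQS = {
    "Population 1": (0.0346, 0.0282),
    "Population 2": (0.2050, 0.2450),
    "Population 3": (0.0642, 0.1055),
    "Population 4": (0.0290, 0.0691),
}


def genotype_frequency(p, q=None):
    """Hardy-Weinberg expected genotype frequency.

    Two different alleles (heterozygote): 2 * p * q.
    Two copies of the same allele (homozygote, q omitted): p * p.
    """
    if q is None:
        return p * p
    return 2 * p * q


def rank_populations(allele_freqs):
    """Return (population, genotype frequency) pairs, most frequent first."""
    freqs = {name: genotype_frequency(p, q) for name, (p, q) in allele_freqs.items()}
    return sorted(freqs.items(), key=lambda item: item[1], reverse=True)


ranking = rank_populations(ALLELE_FREQS)
for name, freq in ranking:
    print(f"{name}: {freq:.5f}")
```

Sorting from highest to lowest frequency gives the same order as the list above, with Population 2 first and Population 1 last.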

This is only meant as an example of the method used to determine the country match. For the final match determined by ConnectMyDNA, additional loci are used in the calculation, as well as many more databases. Because the individual loci are independent, the final frequency of occurrence in a population is the product of the individual locus genotype frequencies for that population.
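
Combining loci by multiplication can be sketched as follows. This is an illustration under stated assumptions, not the actual ConnectMyDNA calculation: the D2S1338 values come from the worked example, while "LOCUS_B" and its genotype frequencies are hypothetical placeholders chosen only to show the arithmetic, and the name combined_frequency is made up for this sketch.

```python
import math

# Per-locus genotype frequencies for two populations.
# D2S1338 values are from the worked example above.
# "LOCUS_B" and its values are hypothetical placeholders, not real data.
GENOTYPE_FREQS = {
    "Population 1": {"D2S1338": 0.00195, "LOCUS_B": 0.05},
    "Population 2": {"D2S1338": 0.10045, "LOCUS_B": 0.01},
}


def combined_frequency(per_locus):
    """Multiply the genotype frequencies of independent loci together."""
    return math.prod(per_locus.values())


combined = {name: combined_frequency(loci) for name, loci in GENOTYPE_FREQS.items()}
for name, freq in sorted(combined.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: {freq:.3g}")
```

With many loci the product becomes very small, so a practical implementation might sum logarithms of the frequencies instead of multiplying them directly. That is a design suggestion, not something the text above describes.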