Skip to main content
  • Methodology article
  • Open access
  • Published:

Maximum likelihood estimates of two-locus recombination fractions under some natural inequality restrictions

Abstract

Background

The goal of linkage analysis is to determine the chromosomal location of the gene(s) for a trait of interest such as a common disease. Three-locus linkage analysis is an important case of multi-locus problems. Solutions can be found analytically for the case of triple backcross mating. However, in the present study of linkage analysis and gene mapping some natural inequality restrictions on parameters have not been considered sufficiently, when the maximum likelihood estimates (MLEs) of the two-locus recombination fractions are calculated.

Results

In this paper, we present a study of estimating the two-locus recombination fractions for the phase-unknown triple backcross with two offspring in each family in the framework of some natural and necessary parameter restrictions. A restricted expectation-maximization (EM) algorithm, called REM is developed. We also consider some extensions in which the proposed REM can be taken as a unified method.

Conclusion

Our simulation work suggests that the REM performs well in the estimation of recombination fractions and outperforms current method. We apply the proposed method to a published data set of mouse backcross families.

Background

Molecular genetics has made much progress in recent years, among which linkage analysis fulfills an important role. Genetic linkage refers to the ordering of genetic loci on a chromosome and to estimating genetic distances among them, where these distances are determined on the basis of a statistical phenomenon. Statistical machinery has been used to analyze family data and to detect linkage [1–4]. The degree of linkage can be measured by recombination fraction. The proportion of recombinant haplotypes (or offspring) potentially produced by a doubly heterozygous parent is called recombination fraction, which is also the probability of occurrence of a recombination. Many map functions under different assumptions have been derived [5–7], from which the genetic distance and the recombination fraction can be mutually transformed. Human gene mapping is now an important field of science. A critical first step in finding gene loci that contribute to a genetic trait is to demonstrate linkage with a gene of known location (marker). So estimating the recombination fractions is important in linkage analysis.

In several respects, three-locus analysis yields more information than does two-locus analysis [8–11]. Three-locus linkage analysis is also an important case of multi-locus problems. Methods for detecting multilocus linkage in humans and estimation of recombination have been proposed by Lathrop et al. [12], and Lathrop [13]. More recently, Ott [3] has considered the estimation of two-locus recombination fractions for phase-unknown triple backcross families with two offspring in each family. The author gave the presentations of the estimates of the two-locus recombination fractions. Wu et al. [9] considered simultaneous estimation of linkage and linkage phases in outcrossing species. However, as mentioned in Ott [3], the estimates suggested by the author may not satisfy some natural restrictions which two-locus recombination fractions should satisfy in fact. One may not obtain a reasonable interpretation on the recombination phenomenon among loci based on the estimates. Furthermore, illegimate estimates of recombination fractions may also reduce the power to detect linkage which can provide irresponsible evidence to the researchers. In addition, the restrictions on recombination fractions given in the context are necessary in linkage analysis. For example, they can be applied to determine the locus order on the chromosome [9–11].

This estimation problem of two-locus recombination fractions in three-locus linkage analysis belongs to the constrained parameter problems which are not only important but also appear in many areas. The reader is referred to [14–17]. However, the methods provided in the literatures cannot be directly applied to the above genetics problem.

Motivated by this unsolved problem that the restrictions on recombination fractions have not been considered sufficiently, in this paper, we consider the estimation of the two-locus recombination fractions under some natural and necessary restrictions. We develop a restricted EM algorithm, called REM, which gives estimating results through taking account of the natural inequality restrictions on the two-locus recombination fractions, and the algorithm has been implemented by computer. Moreover, this algorithm can be easily generalized to other cases, and the REM performs well as a unified approach. Simulation studies show that our new method works well in each scenario and has advantages over current method, in other words, the major advantages of our method is its robustness and efficiency. An example is used to validate the application of our method to linkage analysis.

Methods

Consider three biallele marker loci, where alleles are designed as A, a; B, b; C, c at loci A, B, C, respectively, with the order of loci being A-B-C. Assume a triply homozygous parent abc/abc, and a triply heterozygous parent (A/a, B/b, C/c). For the latter, there are four possible phases: (I) ABC/abc, (II) ABc/abC, (III) AbC/aBc, (IV) Abc/aBC. As Ott [3] pointed out, under regular conditions (linkage equilibrium), each of these phases occurs with probability 1/4. When it is not the case, we let the prior probability be h i (i = 1, 2, 3, 4) in a later section, and give corresponding feasible approach.

Each offspring only receives haplotype abc from the triply homozygous parent, but receives one of the eight possible kinds of haplotypes from the heterozygous parent, which can be seen at the second column of Table 1. The last four columns of Table 1 give the conditional probabilities with which the offspring phenotypes occur given the parental phase, and the first column presents the code for each haplotype that we will use. For the phase-unknown triple backcross, each haplotype symbol listed in Table 1 just corresponds to one offspring phenotype of the markers.

Table 1 Conditional haplotype probabilities given phase produced by a triply heterozygous parent

Let θ AB , θ BC and θ AC , respectively denote two-locus recombination fractions between loci A and B, between loci B and C, and between loci A and C; g00, g01, g10 and g11 denote joint recombination fractions, where the subscript 1 represents recombination, and 0 represents non-recombination, e.g., g10 is the probability of single recombinant with a recombination for loci A and B but none for loci B and C. So it is clear that the following equations hold:

θ AB = g11 + g10, θ BC = g11 + g01, θ AC = g10 + g01. (1)

Ott [3] groups all possible two-offspring haplotype pairs into four phenotype classes with probability p k (k = 1, 2, 3, 4) according to linkage analysis regulation. These classes are reproduced in Table 2, in which the second column represents two-offspring haplotype pairs, corresponding to two phenotypes. Taking (i, j) = (5, 6) as an example, we say one of the sib pair expresses phenotype aa/Bb/Cc, and the other expresses phenotype aa/Bb/cc. There is no order relationship between i and j. The probabilities of occurrence for all 8 × 9/2 = 36 possible pairs of offspring's phenotypes can be calculated easily, e.g., the joint probability of occurrence of phenotypes aa/Bb/Cc and aa/Bb/cc (diplotypes aBC/abc and aBc/abc) is (g11g10 + g01g00)/4. It then turns out that, among the 36 probabilities, only four different values occur so that phenotypes with the same probabilities may be combined a single class and four classes are obtained.

Table 2 Phenotype classes for phase-unknown triple backcross families with two offspring

Let the total number of families (or sib pairs) observed be n, and the number of families which are grouped into class k be n k (k = 1, 2, 3, 4). Then (n1, n2, n3, n4) is multinomial distributed with probability (p1, p2, p3, p4), and ∑ k = 1 4 n k = n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaabCaeaacqWGUbGBdaWgaaWcbaGaem4AaSgabeaakiabg2da9iabd6gaUbWcbaGaem4AaSMaeyypa0JaeGymaedabaGaeGinaqdaniabggHiLdaaaa@37C5@ . The MLEs of p k 's are p ^ k = n k n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmiCaaNbaKaadaWgaaWcbaGaem4AaSgabeaakiabg2da9KqbaoaalaaabaGaemOBa42aaSbaaeaacqWGRbWAaeqaaaqaaiabd6gaUbaaaaa@34CF@ (k = 1, 2, 3, 4). Using the function relationships given in equations (1) and Table 2, as well as the property of MLE, the MLEs of θ AB , θ BC and θ AC can be obtained as Ott [3]. We call this method the unrestricted method that gives unrestricted estimates, and let θ ^ U = ( θ ^ A B U , θ ^ B C U , θ ^ A C U ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaakiabg2da9maabmaabaGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaemOqaieabaGaemyvaufaaOGaeiilaWIaf8hUdeNbaKaadaqhaaWcbaGaemOqaiKaem4qameabaGaemyvaufaaOGaeiilaWIaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaem4qameabaGaemyvaufaaaGccaGLOaGaayzkaaaaaa@4328@ denote the unrestricted MLE, where

θ ^ A B U = { 1 / 2 − 1 / 2 1 − 2 ( p ^ 3 + p ^ 4 ) , 1 / 2 , if  p ^ 3 + p ^ 4 < 1 / 2 , otherwise, θ ^ B C U = { 1 / 2 − 1 / 2 1 − 2 ( p ^ 2 + p ^ 4 ) , 1 / 2 , if  p ^ 2 + p ^ 4 < 1 / 2 , otherwise, θ ^ A C U = { 1 / 2 − 1 / 2 1 − 2 ( p ^ 2 + p ^ 3 ) , 1 / 2 , if  p ^ 2 + p ^ 3 < 1 / 2 , otherwise . MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeaabmGaaaqaaGGaciqb=H7aXzaajaWaa0baaSqaaiabdgeabjabdkeacbqaaiabdwfavbaakiabg2da9maaceqabaqbaeaabiqaaaqaaiabigdaXiabc+caViabikdaYiabgkHiTiabigdaXiabc+caViabikdaYmaakaaabaGaeGymaeJaeyOeI0IaeGOmaiJaeiikaGIafmiCaaNbaKaadaWgaaWcbaGaeG4mamdabeaakiabgUcaRiqbdchaWzaajaWaaSbaaSqaaiabisda0aqabaGccqGGPaqkaSqabaGccqGGSaalaeaacqaIXaqmcqGGVaWlcqaIYaGmcqGGSaalaaaacaGL7baaaeaafaqaaeGabaaabaGaeeyAaKMaeeOzayMaeeiiaaIafmiCaaNbaKaadaWgaaWcbaGaeG4mamdabeaakiabgUcaRiqbdchaWzaajaWaaSbaaSqaaiabisda0aqabaGccqGH8aapcqaIXaqmcqGGVaWlcqaIYaGmcqGGSaalaeaacqqGVbWBcqqG0baDcqqGObaAcqqGLbqzcqqGYbGCcqqG3bWDcqqGPbqAcqqGZbWCcqqGLbqzcqqGSaalaaaabaGaf8hUdeNbaKaadaqhaaWcbaGaemOqaiKaem4qameabaGaemyvaufaaOGaeyypa0ZaaiqabeaafaqaaeGabaaabaGaeGymaeJaei4la8IaeGOmaiJaeyOeI0IaeGymaeJaei4la8IaeGOmaiZaaOaaaeaacqaIXaqmcqGHsislcqaIYaGmcqGGOaakcuWGWbaCgaqcamaaBaaaleaacqaIYaGmaeqaaOGaey4kaSIafmiCaaNbaKaadaWgaaWcbaGaeGinaqdabeaakiabcMcaPaWcbeaakiabcYcaSaqaaiabigdaXiabc+caViabikdaYiabcYcaSaaaaiaawUhaaaqaauaabaqaceaaaeaacqqGPbqAcqqGMbGzcqqGGaaicuWGWbaCgaqcamaaBaaaleaacqaIYaGmaeqaaOGaey4kaSIafmiCaaNbaKaadaWgaaWcbaGaeGinaqdabeaakiabgYda8iabigdaXiabc+caViabikdaYiabcYcaSaqaaiabb+gaVjabbsha0jabbIgaOjabbwgaLjabbkhaYjabbEha3jabbMgaPjabbohaZjabbwgaLjabbYcaSaaaaeaacuWF4oqCgaqcamaaDaaaleaacqWGbbqqcqWGdbWqaeaacqWGvbqvaaGccqGH9aqpdaGabeqaauaabaqaceaaaeaacqaIXaqmcqGGVaWlcqaIYaGmcqGHsislcqaIXaqmcqGGVaWlcqaIYaGmdaGcaaqaaiabigdaXiabgkHiTiabikdaYiabcIcaOiqbdchaWzaajaWaaSbaaSqaaiabikdaYaqabaGccqGHRaWkcuWGWbaCgaqcamaaBaaaleaacqaIZaWmaeqaaOGaeiykaKcaleqaaOGaeiilaWcabaGaeGymaeJaei4la8IaeGOmaiJaeiilaWcaaaGaay5EaaaabaqbaeaabiqaaaqaaiabbMgaPjabbAgaMjabbccaGiqbdchaWzaajaWaaSbaaSqaaiabikdaYaqabaGccqGHRaWkcuWGWbaCgaqcamaaBaaaleaacqaIZaWmaeqaaOGaeyipaWJaeGymaeJaei4la8IaeGOmaiJaeiilaWcabaGaee4Ba8MaeeiDaqNaeeiAaGMaeeyzauMaeeOCaiNaee4DaCNaeeyAaKMaee4CamNaeeyzauMaeeOla4caaaaaaaa@D701@

Natural inequality restrictions on parameters

In parameter estimation, not only the data structure but also the restrictions on the parameters should be considered, otherwise the MLEs obtained may be unreasonable. For two-locus recombination fractions, the following inequality restrictions: θ AB ≤ θ BC + θ AC , θ BC ≤ θ AB + θ AC , θ AC ≤ θ AB + θ BC , and 0 ≤ θ AB , θ BC , θ AC ≤ 1/2 must be considered. For the given order of loci A-B-C, additional restrictions: θ AB ≤ θ AC and θ BC ≤ θ AC are required. Combining all these inequalities, the following equivalent restrictions are obtained:

{ θ A B ≤ θ A C , θ B C ≤ θ A C , θ A C ≤ θ A B + θ B C , θ A C ≤ 1 / 2. MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeabbaaaaeaaiiGacqWF4oqCdaWgaaWcbaGaemyqaeKaemOqaieabeaakiabgsMiJkab=H7aXnaaBaaaleaacqWGbbqqcqWGdbWqaeqaaOGaeiilaWcabaGae8hUde3aaSbaaSqaaiabdkeacjabdoeadbqabaGccqGHKjYOcqWF4oqCdaWgaaWcbaGaemyqaeKaem4qameabeaakiabcYcaSaqaaiab=H7aXnaaBaaaleaacqWGbbqqcqWGdbWqaeqaaOGaeyizImQae8hUde3aaSbaaSqaaiabdgeabjabdkeacbqabaGccqGHRaWkcqWF4oqCdaWgaaWcbaGaemOqaiKaem4qameabeaakiabcYcaSaqaaiab=H7aXnaaBaaaleaacqWGbbqqcqWGdbWqaeqaaOGaeyizImQaeGymaeJaei4la8IaeGOmaiJaeiOla4caaaGaay5Eaaaaaa@5B62@
(2)

These restrictions are natural and necessary.

Proposed algorithm

In this section, we propose an approach to calculate MLEs of two-locus recombination fractions under restriction (2), which works well in application. From equations (1) and Table 2, p k 's are functions of independent parameters g10, g01 and g11, and also functions of θ AB , θ BC and θ AC , so the log-likelihood function can be written as the following form

l ( θ | { n k } ) = ∑ k = 1 4 n k ln ( p k ( θ A B , θ B C , θ A C ) ) , MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiBaWMaeiikaGccciGae8hUdeNaeiiFaWNaei4EaSNaemOBa42aaSbaaSqaaiabdUgaRbqabaGccqGG9bqFcqGGPaqkcqGH9aqpdaaeWbqaaiabd6gaUnaaBaaaleaacqWGRbWAaeqaaOGagiiBaWMaeiOBa4MaeiikaGIaemiCaa3aaSbaaSqaaiabdUgaRbqabaGccqGGOaakcqWF4oqCdaWgaaWcbaGaemyqaeKaemOqaieabeaakiabcYcaSiab=H7aXnaaBaaaleaacqWGcbGqcqWGdbWqaeqaaOGaeiilaWIae8hUde3aaSbaaSqaaiabdgeabjabdoeadbqabaGccqGGPaqkcqGGPaqkaSqaaiabdUgaRjabg2da9iabigdaXaqaaiabisda0aqdcqGHris5aOGaeiilaWcaaa@5ACD@

where θ = (θ AB , θ BC , θ AC ). Our goal is to find θ ^ R = ( θ ^ A B R , θ ^ B C R , θ ^ A C R ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaakiabg2da9maabmaabaGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaemOqaieabaGaemOuaifaaOGaeiilaWIaf8hUdeNbaKaadaqhaaWcbaGaemOqaiKaem4qameabaGaemOuaifaaOGaeiilaWIaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaem4qameabaGaemOuaifaaaGccaGLOaGaayzkaaaaaa@4310@ , such that l ( θ ^ R | { n k } ) = max θ l ( θ | { n k } ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiBaWMaeiikaGccciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaakiabcYha8jabcUha7jabd6gaUnaaBaaaleaacqWGRbWAaeqaaOGaeiyFa0NaeiykaKIaeyypa0ZaaCbeaeaacyGGTbqBcqGGHbqycqGG4baEaSqaaiab=H7aXbqabaGccqWGSbaBcqGGOaakcqWF4oqCcqGG8baFcqGG7bWEcqWGUbGBdaWgaaWcbaGaem4AaSgabeaakiabc2ha9jabcMcaPaaa@4CEF@ under restriction (2), where θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ denotes the restricted MLE of θ.

We propose our restricted EM algorithm (REM) on the basis of the EM algorithm of Dempster et al. [18] as follows:

Augment the observed data {n k , k = 1, 2, 3, 4} by latent variables {n kl , k, l = 1, 2, 3, 4} to obtain a complete data set, where n k = ∑ l = 1 4 n k l MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOBa42aaSbaaSqaaiabdUgaRbqabaGccqGH9aqpdaaeWbqaaiabd6gaUnaaBaaaleaacqWGRbWAcqWGSbaBaeqaaaqaaiabdYgaSjabg2da9iabigdaXaqaaiabisda0aqdcqGHris5aaaa@3AA8@ , and {n kl , k, l = 1, 2, 3, 4} is multinomial distributed with probability {p kl , k, l = 1, 2, 3, 4}. Here, p kl are components of p k in Table 2 with p 11 = g 00 2 , p 12 = g 01 2 , p 13 = g 10 2 , p 14 = g 11 2 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiCaa3aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGH9aqpcqWGNbWzdaqhaaWcbaGaeGimaaJaeGimaadabaGaeGOmaidaaOGaeiilaWIaemiCaa3aaSbaaSqaaiabigdaXiabikdaYaqabaGccqGH9aqpcqWGNbWzdaqhaaWcbaGaeGimaaJaeGymaedabaGaeGOmaidaaOGaeiilaWIaemiCaa3aaSbaaSqaaiabigdaXiabiodaZaqabaGccqGH9aqpcqWGNbWzdaqhaaWcbaGaeGymaeJaeGimaadabaGaeGOmaidaaOGaeiilaWIaemiCaa3aaSbaaSqaaiabigdaXiabisda0aqabaGccqGH9aqpcqWGNbWzdaqhaaWcbaGaeGymaeJaeGymaedabaGaeGOmaidaaaaa@5201@ ; p21 = g00g01, p22 = g00g01, p23 = g10g11, p24 = g10g11; p31 = g00g10, p32 = g00g10, p33 = g01g11, p34 = g01g11; p41 = g00g11, p42 = g00g11, p43 = g01g10, p44 = g01g10. n kl have its interpretation, for example, n11 can be interpreted as the number of the families: (phase I → (1,1) or (8,8) or (1,8)), or (phase II → (2,2) or (7,7) or (2,7)), or (phase III → (4,4) or (5,5) or (4,5)), or (phase IV → (3,3) or (6,6) or (3,6)), where (phase I → (1,1)) denotes the event that the families have phase I, and the haplotype pairs of their offspring are (1,1), and other notations are analogous to interpret.

Because parameters θ AB , θ BC , and θ AC are equivalent to independent parameters g10, g01 and g11, we still consider parameters g10, g01 and g11 here, and restriction (2) is equivalent to the following restriction (3):

{ g 11 ≤ g 01 , g 11 ≤ g 10 , g 11 ≥ 0 , g 01 + g 10 ≤ 1 / 2. MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeabbaaaaeaacqWGNbWzdaWgaaWcbaGaeGymaeJaeGymaedabeaakiabgsMiJkabdEgaNnaaBaaaleaacqaIWaamcqaIXaqmaeqaaOGaeiilaWcabaGaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGHKjYOcqWGNbWzdaWgaaWcbaGaeGymaeJaeGimaadabeaakiabcYcaSaqaaiabdEgaNnaaBaaaleaacqaIXaqmcqaIXaqmaeqaaOGaeyyzImRaeGimaaJaeiilaWcabaGaem4zaC2aaSbaaSqaaiabicdaWiabigdaXaqabaGccqGHRaWkcqWGNbWzdaWgaaWcbaGaeGymaeJaeGimaadabeaakiabgsMiJkabigdaXiabc+caViabikdaYiabc6caUaaaaiaawUhaaaaa@5440@
(3)

Thus, finding MLE g ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2E9A@ (the restricted MLE of g = (g10, g01, g11), such that l ( g ^ R | { n k } ) = max g l ( g | { n k } ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiBaWMaeiikaGccbeGaf83zaCMbaKaadaahaaWcbeqaaiabdkfasbaakiabcYha8jabcUha7jabd6gaUnaaBaaaleaacqWGRbWAaeqaaOGaeiyFa0NaeiykaKIaeyypa0ZaaCbeaeaacyGGTbqBcqGGHbqycqGG4baEaSqaaiab=DgaNbqabaGccqWGSbaBcqGGOaakcqWFNbWzcqGG8baFcqGG7bWEcqWGUbGBdaWgaaWcbaGaem4AaSgabeaakiabc2ha9jabcMcaPaaa@4BD3@ ) under restriction (3) implies finding MLE θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ of θ under (2). The complete data log-likelihood function can be written as

l ( g | { n k l } ) = ∑ k = 1 4 ∑ l = 1 4 n k l ln ( p k l ( g ) ) , k , l = 1 , 2 , 3 , 4 , MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabdYgaSjabcIcaOGqabiab=DgaNjabcYha8jabcUha7jabd6gaUnaaBaaaleaacqWGRbWAcqWGSbaBaeqaaOGaeiyFa0NaeiykaKIaeyypa0ZaaabCaeaadaaeWbqaaiabd6gaUnaaBaaaleaacqWGRbWAcqWGSbaBaeqaaOGagiiBaWMaeiOBa4MaeiikaGIaemiCaa3aaSbaaSqaaiabdUgaRjabdYgaSbqabaGccqGGOaakcqWFNbWzcqGGPaqkcqGGPaqkaSqaaiabdYgaSjabg2da9iabigdaXaqaaiabisda0aqdcqGHris5aaWcbaGaem4AaSMaeyypa0JaeGymaedabaGaeGinaqdaniabggHiLdGccqGGSaalaeaacqWGRbWAcqGGSaalcqWGSbaBcqGH9aqpcqaIXaqmcqGGSaalcqaIYaGmcqGGSaalcqaIZaWmcqGGSaalcqaI0aancqGGSaalaaaaaa@64AE@

where p kl 's are functions of g as given above. The conditional expectation of l(g|{n kl }) when the s th step parameter values g(s)are given is

Q ( g | g ( s ) , { n k } ) = 2 [ a 1 ( s + 1 ) l n ( 1 − g 01 − g 10 − g 11 ) + a 2 ( s + 1 ) l n ( g 01 ) + a 3 ( s + 1 ) l n ( g 10 ) + a 4 ( s + 1 ) l n ( g 11 ) ] , MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyuaeLaeiikaGccbeGae83zaCMaeiiFaWNae83zaC2aaWbaaSqabeaacqGGOaakcqWGZbWCcqGGPaqkaaGccqGGSaalcqGG7bWEcqWGUbGBdaWgaaWcbaGaem4AaSgabeaakiabc2ha9jabcMcaPiabg2da9iabikdaYiabcUfaBjabdggaHnaaDaaaleaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqWGSbaBcqWGUbGBcqGGOaakcqaIXaqmcqGHsislcqWGNbWzdaWgaaWcbaGaeGimaaJaeGymaedabeaakiabgkHiTiabdEgaNnaaBaaaleaacqaIXaqmcqaIWaamaeqaaOGaeyOeI0Iaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGGPaqkcqGHRaWkcqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemiBaWMaemOBa4MaeiikaGIaem4zaC2aaSbaaSqaaiabicdaWiabigdaXaqabaGccqGGPaqkcqGHRaWkcqWGHbqydaqhaaWcbaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemiBaWMaemOBa4MaeiikaGIaem4zaC2aaSbaaSqaaiabigdaXiabicdaWaqabaGccqGGPaqkcqGHRaWkcqWGHbqydaqhaaWcbaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemiBaWMaemOBa4MaeiikaGIaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGGPaqkcqGGDbqxcqGGSaalaaa@8D1B@
(4)

where

a 1 ( s + 1 ) = n 1 ( g 00 ( s ) ) 2 p 1 ( s ) + n 2 g 00 ( s ) g 01 ( s ) p 2 ( s ) + n 3 g 00 ( s ) g 10 ( s ) p 3 ( s ) + n 4 g 00 ( s ) g 11 ( s ) p 4 ( s ) , a 2 ( s + 1 ) = n 1 ( g 01 ( s ) ) 2 p 1 ( s ) + n 2 g 00 ( s ) g 01 ( s ) p 2 ( s ) + n 3 g 01 ( s ) g 11 ( s ) p 3 ( s ) + n 4 g 01 ( s ) g 10 ( s ) p 4 ( s ) , a 3 ( s + 1 ) = n 1 ( g 10 ( s ) ) 2 p 1 ( s ) + n 2 g 11 ( s ) g 10 ( s ) p 2 ( s ) + n 3 g 00 ( s ) g 10 ( s ) p 3 ( s ) + n 4 g 01 ( s ) g 10 ( s ) p 4 ( s ) , a 4 ( s + 1 ) = n 1 ( g 11 ( s ) ) 2 p 1 ( s ) + n 2 g 11 ( s ) g 10 ( s ) p 2 ( s ) + n 3 g 01 ( s ) g 11 ( s ) p 3 ( s ) + n 4 g 00 ( s ) g 11 ( s ) p 4 ( s ) . MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeaabqqaaaaabaGaemyyae2aa0baaSqaaiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabd6gaUnaaBaaaleaacqaIXaqmaeqaaKqbaoaalaaabaGaeiikaGIaem4zaC2aa0baaeaacqaIWaamcqaIWaamaeaacqGGOaakcqWGZbWCcqGGPaqkaaGaeiykaKYaaWbaaeqabaGaeGOmaidaaaqaaiabdchaWnaaDaaabaGaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaaaakiabgUcaRiabd6gaUnaaBaaaleaacqaIYaGmaeqaaKqbaoaalaaabaGaem4zaC2aa0baaeaacqaIWaamcqaIWaamaeaacqGGOaakcqWGZbWCcqGGPaqkaaGaem4zaC2aa0baaeaacqaIWaamcqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaabaGaemiCaa3aa0baaeaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaaaiabgUcaRiabd6gaUnaaBaaabaGaeG4mamdabeaadaWcaaqaaiabdEgaNnaaDaaabaGaeGimaaJaeGimaadabaGaeiikaGIaem4CamNaeiykaKcaaiabdEgaNnaaDaaabaGaeGymaeJaeGimaadabaGaeiikaGIaem4CamNaeiykaKcaaaqaaiabdchaWnaaDaaabaGaeG4mamdabaGaeiikaGIaem4CamNaeiykaKcaaaaacqGHRaWkcqWGUbGBdaWgaaqaaiabisda0aqabaWaaSaaaeaacqWGNbWzdaqhaaqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabcMcaPaaacqWGNbWzdaqhaaqaaiabigdaXiabigdaXaqaaiabcIcaOiabdohaZjabcMcaPaaaaeaacqWGWbaCdaqhaaqaaiabisda0aqaaiabcIcaOiabdohaZjabcMcaPaaaaaGaeiilaWcakeaacqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyypa0JaemOBa42aaSbaaSqaaiabigdaXaqabaqcfa4aaSaaaeaacqGGOaakcqWGNbWzdaqhaaqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabcMcaPaaacqGGPaqkdaahaaqabeaacqaIYaGmaaaabaGaemiCaa3aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaaaOGaey4kaSIaemOBa42aaSbaaSqaaiabikdaYaqabaqcfa4aaSaaaeaacqWGNbWzdaqhaaqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabcMcaPaaacqWGNbWzdaqhaaqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabcMcaPaaaaeaacqWGWbaCdaqhaaqaaiabikdaYaqaaiabcIcaOiabdohaZjabcMcaPaaaaaGaey4kaSIaemOBa42aaSbaaeaacqaIZaWmaeqaamaalaaabaGaem4zaC2aa0baaeaacqaIWaamcqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaGaem4zaC2aa0baaeaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaabaGaemiCaa3aa0baaeaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaaaiabgUcaRiabd6gaUnaaBaaabaGaeGinaqdabeaadaWcaaqaaiabdEgaNnaaDaaabaGaeGimaaJaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaiabdEgaNnaaDaaabaGaeGymaeJaeGimaadabaGaeiikaGIaem4CamNaeiykaKcaaaqaaiabdchaWnaaDaaabaGaeGinaqdabaGaeiikaGIaem4CamNaeiykaKcaaaaacqGGSaalaOqaaiabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpcqWGUbGBdaWgaaWcbaGaeGymaedabeaajuaGdaWcaaqaaiabcIcaOiabdEgaNnaaDaaabaGaeGymaeJaeGimaadabaGaeiikaGIaem4CamNaeiykaKcaaiabcMcaPmaaCaaabeqaaiabikdaYaaaaeaacqWGWbaCdaqhaaqaaiabigdaXaqaaiabcIcaOiabdohaZjabcMcaPaaaaaGccqGHRaWkcqWGUbGBdaWgaaWcbaGaeGOmaidabeaajuaGdaWcaaqaaiabdEgaNnaaDaaabaGaeGymaeJaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaiabdEgaNnaaDaaabaGaeGymaeJaeGimaadabaGaeiikaGIaem4CamNaeiykaKcaaaqaaiabdchaWnaaDaaabaGaeGOmaidabaGaeiikaGIaem4CamNaeiykaKcaaaaacqGHRaWkcqWGUbGBdaWgaaqaaiabiodaZaqabaWaaSaaaeaacqWGNbWzdaqhaaqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabcMcaPaaacqWGNbWzdaqhaaqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabcMcaPaaaaeaacqWGWbaCdaqhaaqaaiabiodaZaqaaiabcIcaOiabdohaZjabcMcaPaaaaaGaey4kaSIaemOBa42aaSbaaeaacqaI0aanaeqaamaalaaabaGaem4zaC2aa0baaeaacqaIWaamcqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaGaem4zaC2aa0baaeaacqaIXaqmcqaIWaamaeaacqGGOaakcqWGZbWCcqGGPaqkaaaabaGaemiCaa3aa0baaeaacqaI0aanaeaacqGGOaakcqWGZbWCcqGGPaqkaaaaaiabcYcaSaGcbaGaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabd6gaUnaaBaaaleaacqaIXaqmaeqaaKqbaoaalaaabaGaeiikaGIaem4zaC2aa0baaeaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaGaeiykaKYaaWbaaeqabaGaeGOmaidaaaqaaiabdchaWnaaDaaabaGaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaaaakiabgUcaRiabd6gaUnaaBaaaleaacqaIYaGmaeqaaKqbaoaalaaabaGaem4zaC2aa0baaeaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaGaem4zaC2aa0baaeaacqaIXaqmcqaIWaamaeaacqGGOaakcqWGZbWCcqGGPaqkaaaabaGaemiCaa3aa0baaeaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaaaiabgUcaRiabd6gaUnaaBaaabaGaeG4mamdabeaadaWcaaqaaiabdEgaNnaaDaaabaGaeGimaaJaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaiabdEgaNnaaDaaabaGaeGymaeJaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaaqaaiabdchaWnaaDaaabaGaeG4mamdabaGaeiikaGIaem4CamNaeiykaKcaaaaacqGHRaWkcqWGUbGBdaWgaaqaaiabisda0aqabaWaaSaaaeaacqWGNbWzdaqhaaqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabcMcaPaaacqWGNbWzdaqhaaqaaiabigdaXiabigdaXaqaaiabcIcaOiabdohaZjabcMcaPaaaaeaacqWGWbaCdaqhaaqaaiabisda0aqaaiabcIcaOiabdohaZjabcMcaPaaaaaGaeiOla4caaaaa@A3CB@

Then the restricted estimating problem may be written as

Max Q(g|g(s), {n k }), subject to g satisfies restriction (3).

The Hessian matrix of Q(g|g(s), {n k }) for g10, g01 and g11 is negative definite, so Q(g|g(s), {n k }) is strictly concave for g10, g01 and g11. This implies that there exists one unique point g ˜ ( s + 1 ) = ( g ˜ 10 ( s + 1 ) , g ˜ 01 ( s + 1 ) , g ˜ 11 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaGaadaahaaWcbeqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabcIcaOiqbdEgaNzaaiaWaa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSiqbdEgaNzaaiaWaa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSiqbdEgaNzaaiaWaa0baaSqaaiabigdaXiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcMcaPaaa@502D@ satisfying Q ( g ˜ ( s + 1 ) | g ( s ) , { n k } ) = max g Q ( g | g ( s ) , { n k } ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyuaeLaeiikaGccbeGaf83zaCMbaGaadaahaaWcbeqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYha8jab=DgaNnaaCaaaleqabaGaeiikaGIaem4CamNaeiykaKcaaOGaeiilaWIaei4EaSNaemOBa42aaSbaaSqaaiabdUgaRbqabaGccqGG9bqFcqGGPaqkcqGH9aqpdaWfqaqaaiGbc2gaTjabcggaHjabcIha4bWcbaGae83zaCgabeaakiabdgfarjabcIcaOiab=DgaNjabcYha8jab=DgaNnaaCaaaleqabaGaeiikaGIaem4CamNaeiykaKcaaOGaeiilaWIaei4EaSNaemOBa42aaSbaaSqaaiabdUgaRbqabaGccqGG9bqFcqGGPaqkaaa@5A42@ . Following some calculation, it is easy to obtain that g ˜ 10 ( s + 1 ) = a 3 ( s + 1 ) / n , g ˜ 01 ( s + 1 ) = a 2 ( s + 1 ) / n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4zaCMbaGaadaqhaaWcbaGaeGymaeJaeGimaadabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyypa0Jaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabc+caViabd6gaUjabcYcaSiqbdEgaNzaaiaWaa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdggaHnaaDaaaleaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGVaWlcqWGUbGBaaa@5301@ and g ˜ 11 ( s + 1 ) = a 4 ( s + 1 ) / n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4zaCMbaGaadaqhaaWcbaGaeGymaeJaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyypa0Jaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabc+caViabd6gaUbaa@3EFF@ . If g ˜ ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaGaadaahaaWcbeqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@325F@ satisfies restriction (3), then g ( s + 1 ) = g ˜ ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGae83zaC2aaWbaaSqabeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpcuWFNbWzgaacamaaCaaaleqabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaaa@39E2@ in the (s + 1)th iteration for EM algorithm, otherwise, we use the Kuhn-Tucker conditions [19, 20] to deal with problem (5). Thus, we can still find a unique point ̌ g ( s + 1 ) = ( ̌ g 10 ( s + 1 ) , ̌ g 01 ( s + 1 ) , ̌ g 11 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWeuvgwd1KuV1wyUbqegmwBYfdmaGabbiadaciKaaaa=XWa3Iqabiab+DgaNnaaCaaaleqabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyypa0JaeiikaGcceaGamaiGqca9=3hddCRaem4zaC2aa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSiadaciNaq==9XWa3kabdEgaNnaaDaaaleaacqaIWaamcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGSaalcWaGacja0=FFmmWTcqWGNbWzdaqhaaWcbaGaeGymaeJaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiykaKcaaa@66DF@ , such that Q ( ̌ g ( s + 1 ) | g ( s ) , { n k } ) = max g Q ( g | g ( s ) , { n k } ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyuaeLaeiikaGYeuvgwd1KuV1wyUbqegmwBYfdmaGabbiadaciKaaaa=XWa3Iqabiab+DgaNnaaCaaaleqabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiiFaWNae43zaC2aaWbaaSqabeaacqGGOaakcqWGZbWCcqGGPaqkaaGccqGGSaalcqGG7bWEcqWGUbGBdaWgaaWcbaGaem4AaSgabeaakiabc2ha9jabcMcaPiabg2da9maaxababaGagiyBa0MaeiyyaeMaeiiEaGhaleaacqGFNbWzaeqaaOGaemyuaeLaeiikaGIae43zaCMaeiiFaWNae43zaC2aaWbaaSqabeaacqGGOaakcqWGZbWCcqGGPaqkaaGccqGGSaalcqGG7bWEcqWGUbGBdaWgaaWcbaGaem4AaSgabeaakiabc2ha9jabcMcaPaaa@6289@ under restriction (3), because Q(g|g(s), {n k }) is a strictly concave function for g10, g01 and g11 and the restriction region is a convex set. See Appendix for the Kuhn-Tucker conditions and the solving process of ̌ g ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWeuvgwd1KuV1wyUbqegmwBYfdmaGabbiadaciKaaaa=XWa3Iqabiab+DgaNnaaCaaaleqabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaaa@3AAA@ .

We give the complete REM algorithm as follows:

Let g ( 0 ) = ( g 10 ( 0 ) , g 01 ( 0 ) , g 11 ( 0 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGae83zaC2aaWbaaSqabeaacqGGOaakcqaIWaamcqGGPaqkaaGccqGH9aqpcqGGOaakcqWGNbWzdaqhaaWcbaGaeGymaeJaeGimaadabaGaeiikaGIaeGimaaJaeiykaKcaaOGaeiilaWIaem4zaC2aa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabicdaWiabcMcaPaaakiabcYcaSiabdEgaNnaaDaaaleaacqaIXaqmcqaIXaqmaeaacqGGOaakcqaIWaamcqGGPaqkaaGccqGGPaqkaaa@46A5@ be the starting point (the starting value of g(0) may be taken as g ^ U = ( g ^ 10 U , g ^ 01 U , g ^ 11 U ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaKaadaahaaWcbeqaaiabdwfavbaakiabg2da9iabcIcaOiqbdEgaNzaajaWaa0baaSqaaiabigdaXiabicdaWaqaaiabdwfavbaakiabcYcaSiqbdEgaNzaajaWaa0baaSqaaiabicdaWiabigdaXaqaaiabdwfavbaakiabcYcaSiqbdEgaNzaajaWaa0baaSqaaiabigdaXiabigdaXaqaaiabdwfavbaakiabcMcaPaaa@4131@ which can make the REM converge faster, where ( g ^ 10 U , g ^ 01 U , g ^ 11 U ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeiikaGIafm4zaCMbaKaadaqhaaWcbaGaeGymaeJaeGimaadabaGaemyvaufaaOGaeiilaWIafm4zaCMbaKaadaqhaaWcbaGaeGimaaJaeGymaedabaGaemyvaufaaOGaeiilaWIafm4zaCMbaKaadaqhaaWcbaGaeGymaeJaeGymaedabaGaemyvaufaaOGaeiykaKcaaa@3D54@ can be obtained from ( θ ^ A B U , θ ^ B C U , θ ^ A C U ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeiikaGccciGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaemOqaieabaGaemyvaufaaOGaeiilaWIaf8hUdeNbaKaadaqhaaWcbaGaemOqaiKaem4qameabaGaemyvaufaaOGaeiilaWIaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaem4qameabaGaemyvaufaaOGaeiykaKcaaa@3F20@ by equations (1));

E-step: At step s, compute the expected number of recombination events a ( s + 1 ) = ( a 1 ( s + 1 ) , a 2 ( s + 1 ) , a 3 ( s + 1 ) , a 4 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGae8xyae2aaWbaaSqabeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpcqGGOaakcqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiilaWIaemyyae2aa0baaSqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSiabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGSaalcqWGHbqydaqhaaWcbaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiykaKcaaa@5546@ from g(s);

M-step: Compute g(s+1)using a(s+1). Firstly, compute g ˜ ( s + 1 ) = ( g ˜ 10 ( s + 1 ) , g ˜ 01 ( s + 1 ) , g ˜ 11 ( s + 1 ) ) = ( a 3 ( s + 1 ) / n , a 2 ( s + 1 ) / n , a 4 ( s + 1 ) / n ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaGaadaahaaWcbeqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabcIcaOiqbdEgaNzaaiaWaa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSiqbdEgaNzaaiaWaa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSiqbdEgaNzaaiaWaa0baaSqaaiabigdaXiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcMcaPiabg2da9iabcIcaOiabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGVaWlcqWGUbGBcqGGSaalcqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaei4la8IaemOBa4MaeiilaWIaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabc+caViabd6gaUjabcMcaPaaa@71C1@ . If g ˜ ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaGaadaahaaWcbeqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@325F@ satisfies restriction (3), then g(s+1)= g ˜ ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaGaadaahaaWcbeqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@325F@ ; otherwise, then g(s+1)must belong to one of the following cases (i.e. only one case holds):

case 1. g 00 ( s + 1 ) = g 01 ( s + 1 ) = g 10 ( s + 1 ) = g 11 ( s + 1 ) = 1 / 4 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIWaamcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpcqWGNbWzdaqhaaWcbaGaeGymaeJaeGimaadabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyypa0Jaem4zaC2aa0baaSqaaiabigdaXiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabigdaXiabc+caViabisda0aaa@5433@ , if the following inequalities hold simultaneously

{ a 3 ( s + 1 ) + a 4 ( s + 1 ) > a 1 ( s + 1 ) + a 2 ( s + 1 ) , a 2 ( s + 1 ) + a 4 ( s + 1 ) > a 1 ( s + 1 ) + a 3 ( s + 1 ) , a 2 ( s + 1 ) + a 3 ( s + 1 ) + a 4 ( s + 1 ) > 3 a 1 ( s + 1 ) ; MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeWabaaabaGaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaey4kaSIaemyyae2aa0baaSqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSaqaaiabdggaHnaaDaaaleaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGHRaWkcqWGHbqydaqhaaWcbaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyOpa4Jaemyyae2aa0baaSqaaiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGSaalaeaacqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaey4kaSIaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqaIZaWmcqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaei4oaSdaaaGaay5Eaaaaaa@9240@

case 2. g 01 ( s + 1 ) = g 11 ( s + 1 ) = g 10 ( s + 1 ) = a 2 ( s + 1 ) + a 3 ( s + 1 ) + a 4 ( s + 1 ) 3 n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpcqWGNbWzdaqhaaWcbaGaeGymaeJaeGimaadabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyypa0tcfa4aaSaaaeaacqWGHbqydaqhaaqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaacqGHRaWkcqWGHbqydaqhaaqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaacqGHRaWkcqWGHbqydaqhaaqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaeaacqaIZaWmcqWGUbGBaaaaaa@62BB@ , g 00 ( s + 1 ) = a 1 ( s + 1 ) n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaemOBa4gaaaaa@3E89@ , if

{ a 3 ( s + 1 ) + a 4 ( s + 1 ) > 2 a 2 ( s + 1 ) , a 2 ( s + 1 ) + a 4 ( s + 1 ) > 2 a 3 ( s + 1 ) , 3 n / 4 ≥ a 2 ( s + 1 ) + a 3 ( s + 1 ) + a 4 ( s + 1 ) > 0 ; MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeWabaaabaGaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqaIYaGmcqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiilaWcabaGaemyyae2aa0baaSqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqaIYaGmcqWGHbqydaqhaaWcbaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiilaWcabaGaeG4mamJaemOBa4Maei4la8IaeGinaqJaeyyzImRaemyyae2aa0baaSqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGHRaWkcqWGHbqydaqhaaWcbaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyOpa4JaeGimaaJaei4oaSdaaaGaay5Eaaaaaa@8226@

case 3. g 01 ( s + 1 ) = g 11 ( s + 1 ) = a 2 ( s + 1 ) + a 4 ( s + 1 ) 2 n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdggaHnaaDaaabaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaiabgUcaRiabdggaHnaaDaaabaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaqaaiabikdaYiabd6gaUbaaaaa@511E@ , g 10 ( s + 1 ) = g 00 ( s + 1 ) = a 1 ( s + 1 ) + a 3 ( s + 1 ) 2 n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIWaamcqaIWaamaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdggaHnaaDaaabaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaiabgUcaRiabdggaHnaaDaaabaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaqaaiabikdaYiabd6gaUbaaaaa@5116@ , if

{ a 3 ( s + 1 ) a 4 ( s + 1 ) > a 1 ( s + 1 ) a 2 ( s + 1 ) , a 3 ( s + 1 ) > a 1 ( s + 1 ) , a 1 ( s + 1 ) + a 3 ( s + 1 ) ≥ a 2 ( s + 1 ) + a 4 ( s + 1 ) > 0 ; MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeWabaaabaGaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemyyae2aa0baaSqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSaqaaiabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiilaWcabaGaemyyae2aa0baaSqaaiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGHLjYScqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaey4kaSIaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg6da+iabicdaWiabcUda7aaaaiaawUhaaaaa@81A6@

case 4. g 10 ( s + 1 ) = g 11 ( s + 1 ) = a 3 ( s + 1 ) + a 4 ( s + 1 ) 2 n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdggaHnaaDaaabaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaiabgUcaRiabdggaHnaaDaaabaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaqaaiabikdaYiabd6gaUbaaaaa@5120@ , g 01 ( s + 1 ) = g 00 ( s + 1 ) = a 1 ( s + 1 ) + a 2 ( s + 1 ) 2 n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIWaamcqaIWaamaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdggaHnaaDaaabaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaiabgUcaRiabdggaHnaaDaaabaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaqaaiabikdaYiabd6gaUbaaaaa@5114@ , if

{ a 2 ( s + 1 ) a 4 ( s + 1 ) > a 1 ( s + 1 ) a 3 ( s + 1 ) , a 2 ( s + 1 ) > a 1 ( s + 1 ) , a 1 ( s + 1 ) + a 2 ( s + 1 ) ≥ a 3 ( s + 1 ) + a 4 ( s + 1 ) > 0 ; MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeWabaaabaGaemyyae2aa0baaSqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSaqaaiabdggaHnaaDaaaleaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiilaWcabaGaemyyae2aa0baaSqaaiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGHLjYScqWGHbqydaqhaaWcbaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaey4kaSIaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg6da+iabicdaWiabcUda7aaaaiaawUhaaaaa@81A4@

case 5. g 01 ( s + 1 ) = g 11 ( s + 1 ) = a 2 ( s + 1 ) + a 4 ( s + 1 ) 2 n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdggaHnaaDaaabaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaiabgUcaRiabdggaHnaaDaaabaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaqaaiabikdaYiabd6gaUbaaaaa@511E@ , g 10 ( s + 1 ) = a 3 ( s + 1 ) n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaemOBa4gaaaaa@3E8F@ , g 00 ( s + 1 ) = a 1 ( s + 1 ) n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaemOBa4gaaaaa@3E89@ , if

{ a 4 ( s + 1 ) > a 2 ( s + 1 ) , 2 a 3 ( s + 1 ) ≥ a 2 ( s + 1 ) + a 4 ( s + 1 ) > 0 , a 2 ( s + 1 ) + 2 a 3 ( s + 1 ) + a 4 ( s + 1 ) ≤ n ; MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeWabaaabaGaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg6da+iabdggaHnaaDaaaleaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGSaalaeaacqaIYaGmcqWGHbqydaqhaaWcbaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyyzImRaemyyae2aa0baaSqaaiabikdaYaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqaIWaamcqGGSaalaeaacqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaey4kaSIaeGOmaiJaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGHKjYOcqWGUbGBcqGG7aWoaaaacaGL7baaaaa@77B8@

case 6. g 10 ( s + 1 ) = g 11 ( s + 1 ) = a 3 ( s + 1 ) + a 4 ( s + 1 ) 2 n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9iabdEgaNnaaDaaaleaacqaIXaqmcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH9aqpjuaGdaWcaaqaaiabdggaHnaaDaaabaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaiabgUcaRiabdggaHnaaDaaabaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaqaaiabikdaYiabd6gaUbaaaaa@5120@ , g 01 ( s + 1 ) = a 2 ( s + 1 ) n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaemOBa4gaaaaa@3E8D@ , g 00 ( s + 1 ) = a 1 ( s + 1 ) n MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaemOBa4gaaaaa@3E89@ , if

{ a 4 ( s + 1 ) > a 3 ( s + 1 ) , 2 a 2 ( s + 1 ) ≥ a 3 ( s + 1 ) + a 4 ( s + 1 ) > 0 , 2 a 2 ( s + 1 ) + a 3 ( s + 1 ) + a 4 ( s + 1 ) ≤ n ; MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeWabaaabaGaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg6da+iabdggaHnaaDaaaleaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGSaalaeaacqaIYaGmcqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyyzImRaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGH+aGpcqaIWaamcqGGSaalaeaacqaIYaGmcqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaey4kaSIaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabgUcaRiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGHKjYOcqWGUbGBcqGG7aWoaaaacaGL7baaaaa@77BA@

case 7. g 01 ( s + 1 ) = a 2 ( s + 1 ) 2 ( a 2 ( s + 1 ) + a 3 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaeGOmaiJaeiikaGIaemyyae2aa0baaeaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaey4kaSIaemyyae2aa0baaeaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaeiykaKcaaaaa@4F54@ , g 10 ( s + 1 ) = a 3 ( s + 1 ) 2 ( a 2 ( s + 1 ) + a 3 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaeGOmaiJaeiikaGIaemyyae2aa0baaeaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaey4kaSIaemyyae2aa0baaeaacqaIZaWmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaeiykaKcaaaaa@4F56@ , g 11 ( s + 1 ) = a 4 ( s + 1 ) 2 ( a 1 ( s + 1 ) + a 4 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabigdaXiabigdaXaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaeGOmaiJaeiikaGIaemyyae2aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaey4kaSIaemyyae2aa0baaeaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaeiykaKcaaaaa@4F5A@ , g 00 ( s + 1 ) = a 1 ( s + 1 ) 2 ( a 1 ( s + 1 ) + a 4 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4zaC2aa0baaSqaaiabicdaWiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg2da9KqbaoaalaaabaGaemyyae2aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaaabaGaeGOmaiJaeiikaGIaemyyae2aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaey4kaSIaemyyae2aa0baaeaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGaeiykaKcaaaaa@4F50@ , if

{ a 2 ( s + 1 ) + a 3 ( s + 1 ) > a 1 ( s + 1 ) + a 4 ( s + 1 ) , a 4 ( s + 1 ) > 0 , a 1 ( s + 1 ) a 2 ( s + 1 ) > a 3 ( s + 1 ) a 4 ( s + 1 ) , a 1 ( s + 1 ) a 3 ( s + 1 ) > a 2 ( s + 1 ) a 4 ( s + 1 ) . MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeabbaaaaeaacqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaey4kaSIaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg6da+iabdggaHnaaDaaaleaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGHRaWkcqWGHbqydaqhaaWcbaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiilaWcabaGaemyyae2aa0baaSqaaiabisda0aqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg6da+iabicdaWiabcYcaSaqaaiabdggaHnaaDaaaleaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqWGHbqydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyOpa4Jaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabdggaHnaaDaaaleaacqaI0aanaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGSaalaeaacqWGHbqydaqhaaWcbaGaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemyyae2aa0baaSqaaiabiodaZaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabg6da+iabdggaHnaaDaaaleaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqWGHbqydaqhaaWcbaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiOla4caaaGaay5Eaaaaaa@97ED@

The above procedure is iteratively carried out until convergence. Then the restricted MLE θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ of θ in terms of the restricted MLE g ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2E9A@ can be obtained correspondingly by equations (1).

Compared to the general EM algorithm, the M-step of the REM is a little more complex. It needs some necessary discrimination, then g(s+1)can be obtained based on a(s+1). Note that g(s+1)has the closed-form solution, so it will largely improve the computational efficiency of the parameters. The restricted EM algorithm is convergent, and the restricted MLE θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ from the proposed restricted EM algorithm is a consistent estimator of the parameter θ.

Case for more offspring

It is an important fact that more offspring in each family will provide more information in linkage analysis, therefore, and we need to extend the REM algorithm to cases of multiple offspring (sibship) in each family.

We develop a strategy for estimating the two-locus recombination fractions for this case, and the proposed REM algorithm works as a unified method. Taking three-offspring case as an example, we group the observed families into 5 classes according to linkage analysis regulation, with the observed data {n k , k = 1,⋯, 5}. After data augmentation, we obtain complete data {n kl , k = 1, 2, 3, 4, 5, l = 1, 2, 3, 4}. Furthermore, the conditional expectation of the complete-data log-likelihood is

Q ( g | g ( s ) , { n k } ) = 3 [ b 1 ( s + 1 ) l n ( 1 − g 01 − g 10 − g 11 ) + b 2 ( s + 1 ) l n ( g 01 ) + b 3 ( s + 1 ) l n ( g 10 ) + b 4 ( s + 1 ) l n ( g 11 ) ] , MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyuaeLaeiikaGccbeGae83zaCMaeiiFaWNae83zaC2aaWbaaSqabeaacqGGOaakcqWGZbWCcqGGPaqkaaGccqGGSaalcqGG7bWEcqWGUbGBdaWgaaWcbaGaem4AaSgabeaakiabc2ha9jabcMcaPiabg2da9iabiodaZiabcUfaBjabdkgaInaaDaaaleaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqWGSbaBcqWGUbGBcqGGOaakcqaIXaqmcqGHsislcqWGNbWzdaWgaaWcbaGaeGimaaJaeGymaedabeaakiabgkHiTiabdEgaNnaaBaaaleaacqaIXaqmcqaIWaamaeqaaOGaeyOeI0Iaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGGPaqkcqGHRaWkcqWGIbGydaqhaaWcbaGaeGOmaidabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemiBaWMaemOBa4MaeiikaGIaem4zaC2aaSbaaSqaaiabicdaWiabigdaXaqabaGccqGGPaqkcqGHRaWkcqWGIbGydaqhaaWcbaGaeG4mamdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemiBaWMaemOBa4MaeiikaGIaem4zaC2aaSbaaSqaaiabigdaXiabicdaWaqabaGccqGGPaqkcqGHRaWkcqWGIbGydaqhaaWcbaGaeGinaqdabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaemiBaWMaemOBa4MaeiikaGIaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGGPaqkcqGGDbqxcqGGSaalaaa@8D25@

where b i ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOyai2aa0baaSqaaiabdMgaPbqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@339B@ 's have similar expressions with a i ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyyae2aa0baaSqaaiabdMgaPbqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@3399@ 's given previously. Then the other steps of the REM are the same as those for the case of two offspring, except replacing a i ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyyae2aa0baaSqaaiabdMgaPbqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@3399@ 's by b i ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOyai2aa0baaSqaaiabdMgaPbqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@339B@ 's. More offspring's cases are analogous completely. It is helpful to construct and analyze a linkage map using this kind of family data.

Case for unequal prior probabilities of linkage phases

Affected by many factors (e.g., linkage disequilibrium), each phase of a triply heterozygous parent's genotype may in fact not occur with equal prior probability, but the proposed REM can also be applied to the case of unequal phase probability as a unified method. Let each phase occur with probability h i (i = 1, 2, 3, 4), where h i is any fixed number that satisfying 0 ≤ h i ≤ 1, and ∑ i = 1 4 h i = 1 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaabCaeaacqWGObaAdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabigdaXaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaeGinaqdaniabggHiLdaaaa@373C@ . In this case, two-offspring family data needs to be grouped into 10 different phenotype classes according to linkage analysis regulation (see Table 3), and we can obtain the observed data {n k , k = 1, 2,⋯, 10}. Then we augment the observed data {n k , k = 1, 2,⋯, 10} by latent variables {n kl , k = 1, 2,⋯, 10, l = 1, 2, 3, 4} with corresponding probabilities {p kl , k = 1, 2,⋯, 10, l = 1, 2, 3, 4}. The major difference from the procedure of the REM for h i = 1/4 (i = 1, 2, 3, 4) lies in the expression of conditional expectation for each n kl (k = 1, 2,⋯, 10, l = 1, 2, 3, 4). Take n11 as an example, E ( n 11 | g ( s ) , n 1 ) = n 1 h 1 ( g 00 ( s ) ) 2 p 1 ( s ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyrauKaeiikaGIaemOBa42aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGG8baFieqacqWFNbWzdaahaaWcbeqaaiabcIcaOiabdohaZjabcMcaPaaakiabcYcaSiabd6gaUnaaBaaaleaacqaIXaqmaeqaaOGaeiykaKIaeyypa0JaemOBa42aaSbaaSqaaiabigdaXaqabaqcfa4aaSaaaeaacqWGObaAdaWgaaqaaiabigdaXaqabaGaeiikaGIaem4zaC2aa0baaeaacqaIWaamcqaIWaamaeaacqGGOaakcqWGZbWCcqGGPaqkaaGaeiykaKYaaWbaaeqabaGaeGOmaidaaaqaaiabdchaWnaaDaaabaGaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaaaaaaa@511A@ , where h1 is the assigned prior probability of phase I. Repeating the similar procedure given in the REM for h i = 1/4 (i = 1, 2, 3, 4), we find that the conditional expectation of the log-likelihood of the complete data still has the form of (4), and only the expressions of the components of a(s+1)are more complex than those given previously. Using the REM algorithm, we can obtain the restricted MLEs of the two-locus recombination fractions easily.

Table 3 Phenotype classification when each linkage phase occur with probability hi

Simulation methods

We conduct two simulation studies to evaluate the performance and robustness of the proposed REM. In the simulations, we simulate two-offspring family data.

Comparing the REM and the unrestricted method

Let θ0 = (θ AB , θ BC , θ AC ) denote the true value of the recombination fraction. In genetics, loci A and B are said to be closely linked when 0 ≤ θ AB ≤ 0.1, moderately linked when 0.1 ≤ θ AB ≤ 0.2, and loosely linked when 0.2 ≤ θ AB ≤ 0.5. To show the advantage of the REM algorithm, we consider six scenarios according to the different combinations of linkage states of loci AB and loci BC: CC, CM, CL, MM, ML, and LL, where C, M, and L denotes close, moderate, and loose linkage, respectively. In each scenario, θ AB and θ BC are respectively taken as 0.05, 0.15, and 0.35 for close, moderate, and loose linkage. θ AC is taken as three equally spaced values which all guarantee that (θ AB , θ BC , θ AC ) satisfies the natural restriction (2), and the smaller value and the larger one are near the boundary of the region composed by restriction (2), and the moderate one is inside the region. Since the triply homozygous parent only produces haplotype abc in triple backcross family, we can only consider the sampling from the heterozygous parent. For demonstrate purpose, we give the process of generating data for each θ0 in detail:

1. According to equal probability 1/4, We randomly assign a linkage phase of the heterozygous parent in one family.

2. Generate two haplotypes of two offspring from the heterozygous parent in the family according to the conditional probabilities given in Table 1. The haplotype pair (or the family) is easily classified into one of the four classes in Table 2.

3. Repeat step 1 and 2 for n = 300 times, then data {n k } for n simulated families can be obtained.

In each scenario of our simulations, for each θ0, we calculate θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ and θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ by the unrestricted method and the REM, respectively. Repeating the whole process for M = 1000 times, we obtain the averages of θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ and θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ over 1000 replicates by the two methods (see Table 4). As expected, the averages of θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ over 1000 replicates agree better with θ0 than the averages of θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ .

Table 4 The averages of estimates over 1000 replicates for 300 two-offspring families by unrestricted method and the REM

To better show the performance of the REM, we mainly use the following three measures of accuracy to compare θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ and θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ :

1. The number, denoted by KK, for which the unrestricted methods give unreasonable estimates based on 1000 replicates.

2. The standard derivations (SDs) of the estimate θ ^ i R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemyAaKgabaGaemOuaifaaaaa@3055@ ; the ratio of SDs of two kinds of estimates being rSD = SD ( θ ^ i U ) / SD ( θ ^ i R ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeeOCaiNaee4uamLaeeiraqKaeyypa0Jaee4uamLaeeiraqKaeiikaGIafqiUdeNbaKaadaqhaaWcbaGaemyAaKgabaGaemyvaufaaOGaeiykaKIaei4la8Iaee4uamLaeeiraqKaeiikaGIafqiUdeNbaKaadaqhaaWcbaGaemyAaKgabaGaemOuaifaaOGaeiykaKcaaa@4252@ , i = AB, BC, AC.

3. The mean absolute error (MAE) of the estimate θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ , where MAE = ∑ l = 1 1000 ( | θ ^ A B l R − θ A B | + | θ ^ B C l R − θ B C | + | θ ^ A C l R − θ A C | ) / 3000 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeeyta0KaeeyqaeKaeeyrauKaeyypa0ZaaabCaeaacqGGOaakcqGG8baFiiGacuWF4oqCgaqcamaaDaaaleaacqWGbbqqcqWGcbGqcqWGSbaBaeaacqWGsbGuaaGccqGHsislcqWF4oqCdaWgaaWcbaGaemyqaeKaemOqaieabeaakiabcYha8jabgUcaRiabcYha8jqb=H7aXzaajaWaa0baaSqaaiabdkeacjabdoeadjabdYgaSbqaaiabdkfasbaakiabgkHiTiab=H7aXnaaBaaaleaacqWGcbGqcqWGdbWqaeqaaOGaeiiFaWNaey4kaSIaeiiFaWNaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaem4qamKaemiBaWgabaGaemOuaifaaOGaeyOeI0Iae8hUde3aaSbaaSqaaiabdgeabjabdoeadbqabaGccqGG8baFcqGGPaqkcqGGVaWlcqaIZaWmcqaIWaamcqaIWaamcqaIWaamaSqaaiabdYgaSjabg2da9iabigdaXaqaaiabigdaXiabicdaWiabicdaWiabicdaWaqdcqGHris5aaaa@6D37@ ; the ratio of MAEs being rMAE = MAE( θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ )/MAE( θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ ).

The comparisons of estimations of two-locus recombination fraction by the unrestricted method and the REM are listed in Table 5. In each scenario, the unrestricted method gives lots of unreasonable results, i.e., the estimates do not satisfy the natural restriction (2), whereas the estimates obtained by the proposed REM all satisfy the restriction. The number KK of unreasonable estimates is larger when the true value θ0 is near the boundary of the restriction region (2), which corresponds to the larger or smaller true values of θ AC , and KK is somewhat smaller when θ0 is inside the region, which corresponds to the moderate values of θ AC . In the former situation the resulting θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ could be obtained in the whole parameter space but not in the restriction region (2). When θ0 is near the boundary of the restriction region (2), θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ is liable to be near the boundary of the region and hence likely to lie outside the boundary. However the proposed method can guarantee that θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ must be inside the restriction region at any time.

Table 5 Comparison of estimation of two-locus recombination fraction for 300 two-offspring families by the unrestricted method and the REM

It is clear to see that our REM outperforms the unrestricted method for estimating two-locus recombination fractions in each simulated scenario. The estimates obtained by the REM have smaller SDs than the unrestricted method, which is more obvious especially at least one of the intervals of AB and BC is loosely linked. This suggests that the accuracy of estimates by the REM is more higher than by the unrestricted method, and that the natural restriction (2) should be taken into account in estimating, otherwise it would have significant impact on the accuracy on practical inference. Compared to θ ^ U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdwfavbaaaaa@2F00@ , θ ^ R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaahaaWcbeqaaiabdkfasbaaaaa@2EFA@ is closer to the true value θ0 (rMAE > 1 for all groups in Table 5).

It also can be seen that the proposed REM is a robust algorithm. The REM can still give better results than the unrestricted method in each scenario even when KK is small (e.g., 1).

Evaluating the effect of interference to estimates

Interference refers to the phenomenon that crossovers in nearby intervals along a chromosome do not occur independently. Let I denote the value of interference. According the definition of interference in Strickberger [21], we have I = 1 − g 11 θ A B θ B C MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemysaKKaeyypa0JaeGymaeJaeyOeI0scfa4aaSaaaeaacqWGNbWzdaWgaaqaaiabigdaXiabigdaXaqabaaabaacciGae8hUde3aaSbaaeaacqWGbbqqcqWGcbGqaeqaaiab=H7aXnaaBaaabaGaemOqaiKaem4qameabeaaaaaaaa@3BAB@ . To better evaluate the effect of interference to the two kinds of estimations, we consider three scenarios: positive, null and negative interferences. In each scenario, we choose equal θ AC and different θ AB and θ BC corresponding to different interference values (see Table 6). For each scenario, we also simulate 300-family data, and the REM and the unrestricted method are applied to the simulated data, respectively. The whole process is repeated for 1000 times to compute the measures of accuracy given previously. The simulation results listed in Table 6 firstly show that the values of KK are very large when there exists positive (or negative) interference, and the values are small when there is no interference, while the REM gives reasonable estimates at any time. That is to say the estimating results by the unrestricted method are much affected by the interference, but the results by our REM is less affected. Secondly, the less fluctuations of SD( θ ^ A C R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaem4qameabaGaemOuaifaaaaa@3114@ ) in scenario 1 (or 3) also validate that the REM is less affected by interference. Finally, the REM outperforms the unrestricted method in each scenario (rSD > 1, rMAE > 1), especially, when negative interference is present.

Table 6 Evaluation of the effect of interference to estimates of recombination fractions

In addition, we find that the restricted EM estimate is little changed when different starting values are taken. These above results indicate that the use of the REM can yield better performance than the current unrestricted method.

A worked example

We applied our proposed method to a real data set from published literature [22]. The data set comprised of 134 individuals from a backcross of mice. Here we consider the three ordinal marker loci D2Mit365, D2Mit272 and D2Mit456 on the linkage map of chromosome 2, and we still use A, B and C to denote the three loci. According to the genotypes given in the data set, we record a haplotype code of each individual, where the haplotype is from the heterozygous parent. Two individuals are randomly grouped into one family, and we consider they are really from that family, where the treatment will not affect linkage information, because all offspring's genotypes are independent conditional on the genotypes of all parents for the data. Then we obtain n = 67 two-offspring families, and n1 = 21, n2 = 17, n3 = 14 and n4 = 15 by the classification given in Table 2. We used the proposed REM and the unrestricted method to estimate the recombination fractions based on (n1, n2, n3, n4). The MLEs of the recombination fractions are θ ^ A B R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaemOqaieabaGaemOuaifaaaaa@3112@ = 0.3166, θ ^ B C R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemOqaiKaem4qameabaGaemOuaifaaaaa@3116@ = 0.3738 and θ ^ A C R MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaem4qameabaGaemOuaifaaaaa@3114@ = 0.3738; and θ ^ A B U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaemOqaieabaGaemyvaufaaaaa@3118@ = 0.3167, θ ^ B C U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemOqaiKaem4qameabaGaemyvaufaaaaa@311C@ = 0.3942 and θ ^ A C U MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGaf8hUdeNbaKaadaqhaaWcbaGaemyqaeKaem4qameabaGaemyvaufaaaaa@311A@ = 0.3634, respectively. Obviously, the unrestricted estimates do not satisfy the second one of the natural restriction (2), and thus estimates contradict with the true order of the three markers on the linkage map of chromosome 2 [22]. According to our simulation and practical experience, the accuracy of estimation by the REM will improve by increasing sample size or by using the unrestricted estimates as initial values.

Discussion

We developed a restricted EM algorithm to calculate numerically the MLEs of two-locus recombination fractions that initially studied by Ott [3]. The method in Ott [3] may not always provide the parameter estimates satisfying the natural restriction (2), since the approach does not take the inequality restrictions into account. Our method can deal with this problem, and the real data were handled very well with the proposed method.

The performance of the REM is also illustrated using simulated data. Our simulation shows that the unrestricted method gives some unreasonable estimate results in each scenario, and thus such estimates may not provide correct interpretation of the recombination phenomenon in practice. The major advantage of the REM is its robustness and efficiency. The REM can give better results even when the number for which the unrestricted method gives unreasonable estimate results is small (e.g., KK = 1), and our estimates are more precise than those obtained by the unrestricted method. Moreover, the REM is less affected by interference, and the estimate of parameter g in M-step having the closed-form solution largely improves the computational efficiency of the parameter.

On the other hand, noticing the important fact that more offspring in each family can really provide more information in linkage analysis, we develops a strategy for estimating the two-locus recombination fractions when each observed family has more offspring, and the proposed REM algorithm works as a unified method. In practice, the method developed by Lu et al. [10] can be first adopted to obtain the estimates of probabilities h i 's of linkage phases when considering multiple offspring, then the REM is used to obtain the restricted MLEs of recombination fractions, which may improve the estimation precision. It is helpful to construct and analyze a linkage map using this kind of family data.

Recent research in genetics has shown that statistical inference about the two-locus recombination fraction offers an effective approach for constructing and analyzing a linkage map between the genetic marker and the genetic disorders. Reasonable estimates of the recombination fractions are important in gene mapping, especially in interval mapping [23–26]. Only the reasonable estimate result may identify the actual genes responsible for some trait, and it is feasible to embed the REM into interval mapping to improve the efficiency of mapping.

It is noticed that our analysis is focused on three biallelic loci. The above constrained parameter problem may become complicated if the number of loci is more than three, or some markers may have more alleles than others, for example, in outcrossing plant species. When the number of loci is more than three, we suggest that every three adjacent loci are subject to three-point analysis. We can obtain two different estimates of the recombination fraction for the same marker interval, and a better way to combine these estimates is to take a weighted mean. More alleles for each markers mean more possible linkage phases [10], which bring some difficulty to linkage analysis, however, the idea of considering the natural restriction (2) on recombination fractions should also be emphasized. Further investigation in this area is warranted.

Appendix

The Kuhn-Tucker Theorem [19, 20]

Suppose that θ* is a solution of

Max f(θ) subject to f1(θ) ≥ 0,⋯, f m (θ) ≥ 0,

where f, f1,⋯, f m : RN→ R are C1 functions. Then the following conditions hold:

(1) ∂ ∂ θ i f ( θ ∗ ) + ∑ j = 1 m λ j ∂ θ i f j ( θ ∗ ) = 0 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqGHciITaeaacqGHciITiiGacqWF4oqCdaWgaaqaaiabdMgaPbqabaaaaOGaemOzayMaeiikaGIae8hUde3aaWbaaSqabeaacqGHxiIkaaGccqGGPaqkcqGHRaWkdaaeWbqaaiab=T7aSnaaBaaaleaacqWGQbGAaeqaaKqbaoaalaaabaGaeyOaIylabaGae8hUde3aaSbaaeaacqWGPbqAaeqaaaaaaSqaaiabdQgaQjabg2da9iabigdaXaqaaiabd2gaTbqdcqGHris5aOGaemOzay2aaSbaaSqaaiabdQgaQbqabaGccqGGOaakcqWF4oqCdaahaaWcbeqaaiabgEHiQaaakiabcMcaPiabg2da9iabicdaWaaa@5210@ , i = 1,⋯, N;

(2)λ j f j (θ*) = 0, j = 1,⋯, m;

(3) f j (θ*) ≥ 0, j = 1,⋯, m;

(4)λ j ≥ 0, j = 1,⋯, m,

where (λ1,⋯, λ m ) are Lagrangian multipliers. The four conditions are called Kuhn-Tucker conditions. Specially, if f(θ) is strictly concave and the set {θ: f1(θ) ≥ 0,⋯, f m (θ) ≥ 0} is convex, the Kuhn-Tucker conditions are also sufficient, and the solution θ* is unique.

Solving equation (5) when g ˜ ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGaf83zaCMbaGaadaahaaWcbeqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaaaaa@325F@ does not satisfy restriction (3)

Because Q(g|g(s), {n k }) is a strictly concave function and the restriction region (3) is a convex set, there must be a unique solution ̌ g ( s + 1 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWeuvgwd1KuV1wyUbqegmwBYfdmaGabbiadaciKaaaa=XWa3Iqabiab+DgaNnaaCaaaleqabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaaaa@3AAA@ to equation (5) by the Kuhn-Tucker Theorem. The Lagrangian is

L(g, λ) = Q(g|g(s), {n k }) + λ1(g01 - g11) + λ2(g10 - g11) + λ3g11 + λ4(1/2 - g01 - g10),

where λ = (λ1, λ2, λ3, λ4), and λ i 's are Lagrangian multipliers. Then ̌ g ( s + 1 ) = ( ̌ g 10 ( s + 1 ) , ̌ g 01 ( s + 1 ) , ̌ g 11 ( s + 1 ) ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWeuvgwd1KuV1wyUbqegmwBYfdmaGabbiadaciKaaaa=XWa3Iqabiab+DgaNnaaCaaaleqabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeyypa0JaeiikaGcceaGamaiGqca9=3hddCRaem4zaC2aa0baaSqaaiabigdaXiabicdaWaqaaiabcIcaOiabdohaZjabgUcaRiabigdaXiabcMcaPaaakiabcYcaSiadaciNaq==9XWa3kabdEgaNnaaDaaaleaacqaIWaamcqaIXaqmaeaacqGGOaakcqWGZbWCcqGHRaWkcqaIXaqmcqGGPaqkaaGccqGGSaalcWaGacja0=FFmmWTcqWGNbWzdaqhaaWcbaGaeGymaeJaeGymaedabaGaeiikaGIaem4CamNaey4kaSIaeGymaeJaeiykaKcaaOGaeiykaKcaaa@66DF@ is a unique solution to

{ − a 1 ( s ) g 00 + a 3 ( s ) g 10 + λ 2 − λ 4 = 0 , − a 1 ( s ) g 00 + a 2 ( s ) g 01 + λ 1 − λ 4 = 0 , − a 1 ( s ) g 00 + a 4 ( s ) g 11 − λ 1 − λ 2 + λ 3 = 0 , λ 1 ( g 01 − g 11 ) = 0 , λ 2 ( g 10 − g 11 ) = 0 , λ 3 g 11 = 0 , λ 4 ( 1 / 2 − g 01 − g 10 ) = 0 , g 01 ≥ g 11 , g 10 ≥ g 11 , g 11 ≥ 0 , 1 / 2 ≥ g 01 + g 10 , λ i ≥ 0 , i = 1 , 2 , 3 , 4. MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqabeaafaqaaeadbaaaaaaabaGaeyOeI0scfa4aaSaaaeaacqWGHbqydaqhaaqaaiabigdaXaqaaiabcIcaOiabdohaZjabcMcaPaaaaeaacqWGNbWzdaWgaaqaaiabicdaWiabicdaWaqabaaaaOGaey4kaSscfa4aaSaaaeaacqWGHbqydaqhaaqaaiabiodaZaqaaiabcIcaOiabdohaZjabcMcaPaaaaeaacqWGNbWzdaWgaaqaaiabigdaXiabicdaWaqabaaaaOGaey4kaSccciGae83UdW2aaSbaaSqaaiabikdaYaqabaGccqGHsislcqWF7oaBdaWgaaWcbaGaeGinaqdabeaakiabg2da9iabicdaWiabcYcaSaqaaiabgkHiTKqbaoaalaaabaGaemyyae2aa0baaeaacqaIXaqmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaabaGaem4zaC2aaSbaaeaacqaIWaamcqaIWaamaeqaaaaakiabgUcaRKqbaoaalaaabaGaemyyae2aa0baaeaacqaIYaGmaeaacqGGOaakcqWGZbWCcqGGPaqkaaaabaGaem4zaC2aaSbaaeaacqaIWaamcqaIXaqmaeqaaaaakiabgUcaRiab=T7aSnaaBaaaleaacqaIXaqmaeqaaOGaeyOeI0Iae83UdW2aaSbaaSqaaiabisda0aqabaGccqGH9aqpcqaIWaamcqGGSaalaeaacqGHsisljuaGdaWcaaqaaiabdggaHnaaDaaabaGaeGymaedabaGaeiikaGIaem4CamNaeiykaKcaaaqaaiabdEgaNnaaBaaabaGaeGimaaJaeGimaadabeaaaaGccqGHRaWkjuaGdaWcaaqaaiabdggaHnaaDaaabaGaeGinaqdabaGaeiikaGIaem4CamNaeiykaKcaaaqaaiabdEgaNnaaBaaabaGaeGymaeJaeGymaedabeaaaaGccqGHsislcqWF7oaBdaWgaaWcbaGaeGymaedabeaakiabgkHiTiab=T7aSnaaBaaaleaacqaIYaGmaeqaaOGaey4kaSIae83UdW2aaSbaaSqaaiabiodaZaqabaGccqGH9aqpcqaIWaamcqGGSaalaeaacqWF7oaBdaWgaaWcbaGaeGymaedabeaakiabcIcaOiabdEgaNnaaBaaaleaacqaIWaamcqaIXaqmaeqaaOGaeyOeI0Iaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGGPaqkcqGH9aqpcqaIWaamcqGGSaalaeaacqWF7oaBdaWgaaWcbaGaeGOmaidabeaakiabcIcaOiabdEgaNnaaBaaaleaacqaIXaqmcqaIWaamaeqaaOGaeyOeI0Iaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGGPaqkcqGH9aqpcqaIWaamcqGGSaalaeaacqWF7oaBdaWgaaWcbaGaeG4mamdabeaakiabdEgaNnaaBaaaleaacqaIXaqmcqaIXaqmaeqaaOGaeyypa0JaeGimaaJaeiilaWcabaGae83UdW2aaSbaaSqaaiabisda0aqabaGccqGGOaakcqaIXaqmcqGGVaWlcqaIYaGmcqGHsislcqWGNbWzdaWgaaWcbaGaeGimaaJaeGymaedabeaakiabgkHiTiabdEgaNnaaBaaaleaacqaIXaqmcqaIWaamaeqaaOGaeiykaKIaeyypa0JaeGimaaJaeiilaWcabaGaem4zaC2aaSbaaSqaaiabicdaWiabigdaXaqabaGccqGHLjYScqWGNbWzdaWgaaWcbaGaeGymaeJaeGymaedabeaakiabcYcaSaqaaiabdEgaNnaaBaaaleaacqaIXaqmcqaIWaamaeqaaOGaeyyzImRaem4zaC2aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGGSaalaeaacqWGNbWzdaWgaaWcbaGaeGymaeJaeGymaedabeaakiabgwMiZkabicdaWiabcYcaSaqaaiabigdaXiabc+caViabikdaYiabgwMiZkabdEgaNnaaBaaaleaacqaIWaamcqaIXaqmaeqaaOGaey4kaSIaem4zaC2aaSbaaSqaaiabigdaXiabicdaWaqabaGccqGGSaalaeaacqWF7oaBdaWgaaWcbaGaemyAaKgabeaakiabgwMiZkabicdaWiabcYcaSiabdMgaPjabg2da9iabigdaXiabcYcaSiabikdaYiabcYcaSiabiodaZiabcYcaSiabisda0iabc6caUaaaaiaawUhaaaaa@005F@

To solve the above equations, we need to consider all possible cases for λ i = 0 or λ i > 0, i = 1, 2, 3, 4. There are totally seven possible solutions for the above equations which were just given in the previous REM algorithm.

References

  1. Elston RC, Stewart J: A general model for the analysis of pedigree data. Hum Hered. 1971, 21: 523-542.

    Article  CAS  PubMed  Google Scholar 

  2. Risch N: Linkage strategies for genetically complex traits. Am J Hum Genet. 1990, 46: 222-253.

    PubMed Central  CAS  PubMed  Google Scholar 

  3. Ott J: Phase-Unkown Triple Backcross with Two Offspring. Analysis of Human Genetic Linkage. 1999, The Johns Hopkins University Press: Baltimore, 122-124. 3

    Google Scholar 

  4. Thompson EA: Statistical Inference from Genetic Data on Pedigree. 2000, Institute of Mathematical Statistics Beachwood: Ohio

    Google Scholar 

  5. Haldane JBS: The recombination of linkage values and the calculation of distances between the loci of linked factors. J Genet. 1919, 8: 299-309.

    Article  Google Scholar 

  6. Morgan TH: The Theory of Genes. 1928, Yale University Press: New Haven

    Google Scholar 

  7. Felsenstein J: A mathematically tractable family of genetic mapping functions with different amounts of interference. Genetics. 1979, 91: 769-775.

    PubMed Central  CAS  PubMed  Google Scholar 

  8. Thompson EA: Information gain in joint linkage analysis. IMA J Math Appl Med Biol. 1984, 1: 31-49. 10.1093/imammb/1.1.31.

    Article  CAS  PubMed  Google Scholar 

  9. Wu RL, Ma CX, Painter I, Zeng ZB: Simultaneous maximum likelihood estimation of linkage and linkage phases in outcrossing populations. Theor Pop Biol. 2002, 61: 349-363. 10.1006/tpbi.2002.1577.

    Article  Google Scholar 

  10. Lu Q, Cui YH, Wu RL: A multilocus likelihood approach to joint modelling of linkage, parnet diplotype and gene order in a full-sib family. BMC Genet. 2004, 5: 20-10.1186/1471-2156-5-20.

    Article  PubMed Central  PubMed  Google Scholar 

  11. Wu RL, Ma CX, Casella G: Statistical Genetics of Quantitative Traits: Linkage, Maps, and QTL. 2007, Springer: New York

    Google Scholar 

  12. Lathrop GM, Lalouel JM, Julier C, Ott J: Strategies for multilocus linkage analysis in humans. Proc Natl Acad Sci USA. 1984, 81: 3443-3446. 10.1073/pnas.81.11.3443.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  13. Lathrop GM: Multilocus linkage analysis in humans: Detection of linkage and estimation of recombination. Am J Hum Genet. 1985, 37: 482-498.

    PubMed Central  CAS  PubMed  Google Scholar 

  14. Dykstra RL: An algorithm for restricted least squares regression. J Am Statist Assoc. 1983, 78: 837-842. 10.2307/2288193.

    Article  Google Scholar 

  15. Robertson T, Wright FT, Dykstra R: Order Restricted Statistical Inference. 1988, Wiley: New York

    Google Scholar 

  16. Liu C: Estimation of discrete distribution with a class of simplex constraints. J Am Stat Assoc. 2000, 95: 109-120. 10.2307/2669531.

    Article  Google Scholar 

  17. Shi NZ, Zheng SR, Guo JH: The restricted EM algorithm under inequality restrictions on the parameters. J Multivariate Anal. 2005, 92: 53-76. 10.1016/S0047-259X(03)00134-9.

    Article  Google Scholar 

  18. Dempster AP, Laird NM, Rubin DB: Maximum likelihood from incomplete data via the EM algorithm (with discussion). J Roy Stat Soc B. 1977, 39: 1-38.

    Google Scholar 

  19. Mokhtar SB, Shetty CM: Nonlinear Programming: Theory and Algorithms. 1979, John Wiley and Sons: New York

    Google Scholar 

  20. Anthony LP, Francis ES, Uhl JJ: The Mathematics of Nonlinear Programming. 1992, Springer-Verlag: New York

    Google Scholar 

  21. Strickberger MW: Genetics. 1985, MacMillan: New York, third

    Google Scholar 

  22. Clemens KE, Churchill G, Bhatt N, Richardson K, Noonan FP: Genetic control of susceptibility to UV-induced immunosuppression by interacting quantitative trait loci. Genes and Immunity. 2000, 1: 251-259. 10.1038/sj.gene.6363667.

    Article  CAS  PubMed  Google Scholar 

  23. Lander ES, Botstein D: Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics. 1989, 121: 185-199.

    PubMed Central  CAS  PubMed  Google Scholar 

  24. Jansen RC, Stam P: High resolution of quantitative trait into multiple loci via interval mapping. Genetics. 1994, 136: 1447-1455.

    PubMed Central  CAS  PubMed  Google Scholar 

  25. Kao CH, Zeng ZB, Teasdale RD: Multiple interval mapping for quantitative trait loci. Genetics. 1999, 152: 1203-1216.

    PubMed Central  CAS  PubMed  Google Scholar 

  26. Chen Z: The full EM algorithm for the MLEs of QTL effects and positions and their estimated variance in multiple-interval mapping. Biometrics. 2005, 61: 474-480. 10.1111/j.1541-0420.2005.00327.x.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Dr. Wen-Sheng Zhu for helpful discussions and comments on a draft of the paper. This research was supported by the National Natural Science Foundation of China (Grant Numbers 10431010 and 10701022), National 973 Key Project of China (2007CB311002), NCET-04-0310, the Jilin Distinguished Young Scholars Program (Grant Number 20030113) and the Program Innovative Research Team (PCSIRT) in University (#IRT0519).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianhua Guo.

Additional information

Authors' contributions

YZ derived the genetic and statistical model and wrote computer programs. NZS and WKF provided insightful comments to the presentation. JG conceived of ideas and algorithm. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Zhou, Y., Shi, NZ., Fung, WK. et al. Maximum likelihood estimates of two-locus recombination fractions under some natural inequality restrictions. BMC Genet 9, 1 (2008). https://doi.org/10.1186/1471-2156-9-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1471-2156-9-1

Keywords