|
|
||||||||
ANIMAL GENETICS |
Genetic Diversity Department, National Institute of Agrobiological Sciences, Tsukuba 305-8602, Japan
| Abstract |
|---|
|
|
|---|
Key Words: Genetic Evaluation Mixed Linear Model Restricted Best Linear Unbiased Prediction
| Introduction |
|---|
|
|
|---|
The original method for calculating R-BLUP, imposing the same restrictions on all animals, has been improved (Quaas and Henderson, 1976
; Itoh and Iwaisaki, 1990
, Satoh, 1998
). These methods have also been used in several studies based on field data or computer simulation (e.g., Satoh et al., 1998
; Díaz et al., 1999
; Ieiri et al., 2003). However, the breeding values of all animals need not be subjected to the same restrictions. For example, if selection criteria in each subline, such as sire and dam lines, from the same population are different, then different constraints should be imposed. Another example is that restrictions imposed on breeding values of dead animals are meaningless because the constraints should generally apply only to breeding values of animals available for selection. However, if restrictions are imposed on breeding values for only some animals in a population, calculations become more complex, requiring a matrix whose elements are coefficients of additive relationships between all animals and animals with restricted breeding values, and the second power of the matrix for writing R-BLUP equations.
The objectives of the present paper were to derive a procedure for computing R-BLUP of breeding values and to discuss its application when constraints were imposed on the additive genetic values of only some animals in a population.
| Materials and Methods |
|---|
|
|
|---|
![]() |
where yi is a vector of observations for the ith trait; bi is a vector of unknown fixed effects; Xi is a known incidence matrix relating elements of bi to yi; ui is a vector of unknown random additive genetic effects for n animals; Zi is a known incidence matrix relating elements of ui to yi; and ei is a vector of random errors. Let qk be the number of traits in the kth animal; k = 1, 2, ..., n; and 0
qk
q. The model for all traits is written as:
![]() |
where records are ordered by animals within traits. Assume that u and e are multivariates normally distributed with E(u) = E(e) = 0, var(u) = G, var(e) = R, and cov(u, e') = 0; G = G0
A, where G0 is a q x q additive genetic variance-covariance matrix for the q traits; A is an n x n additive relationship matrix for the n animals; and R is an n. x n. (n. =
kqk) error variance-covariance matrix for the q traits for the n animals.
Let the set of restrictions on u be C'u. If the same constraints are imposed on the additive genetic values of all animals, C = C0
In, where C0 is a q x r matrix with full columns rank. The number of columns of C0, r, depends on the number and type of constraints imposed: no change and/or proportional change. This will be illustrated subsequently. Kempthorne and Nordskog (1959)
defined C0 for no-change constraints. For example, if the restriction is no change for the second trait, C0 might be:
![]() |
Here r (= z) is the number of traits constrained. For the case of proportional constraints (2
p
q traits), define:
![]() | [1] |
where k'1 = [c1 c2 ... cp 1] and ci is a predetermined proportional change for Traits 1, 2, ..., p (Mallard, 1972
; Itoh and Yamada, 1987
). Note that r = p 1. Furthermore, if constraints include both no change and proportional change, the form of C0 = [Cz | Cp ], and so r = z + p 1. The R-BLUP of u, û, is obtained by solving the following equations (Quaas and Henderson, 1976
):
![]() | [2] |
where
is a vector of some solution to b and
is a vector of Lagrange multipliers. Equation [2]
involves G = G0
A; in contrast to its inverse, it is very time consuming to compute with large numbers of animals. Quaas and Henderson (1976)
represented Eq. [2]
as [3]
:
![]() | [3] |
where P = G0C0
In. Equation [3]
is simpler because it does not involve A.
Furthermore, Eq. [3]
is represented in Eq. [4]
as:
![]() | [4] |
where
![]() | [5] |
where subscripts Z, R, and N correspond to z characters with no change, p characters with proportional constraints, and q z p characters without constraints, respectively; k' = [k1'/cp 1]; and K0 is a q x (q r) matrix. Thus ûZ = 0 and ûR = k
ûP (Satoh, 1998
). Equations [3]
and [4]
are represented more simply when an animal model without repeated records is used (see Quaas and Henderson, 1976
; Satoh, 1998
).
Constraints for Some Animals in a Population
If the constraints are imposed on the additive genetic values of some animals in a population, then
![]() | [6] |
where Jm is an identity matrix with columns pertaining to n m animals without constraints deleted. We can apply different restrictions to different animals (e.g., different constraints for different groups from the same base population):
![]() |
where Cj (j = 1, 2, ..., s) is a matrix for constraints of jth group and mj is the number of animals with constraints in jth group. Thus, if the constraints are imposed on breeding values for only some animals in the population, Eq. [2]
cannot be represented as in Eq. [3]
or [4]
. As an example, for an animal model with one record in each trait, no missing observations, and the same restriction of breeding values for some of the animals, the cell in the third row and third column of the coefficient matrix in Eq. [2]
is as follows:
![]() | [7] |
where R0 is a q x q error variance-covariance matrix. Because Eq. [2]
includes computation of elements of AJm and the product of Jm'A and AJm, it is quite difficult to write R-BLUP equations. Furthermore, with repeated records, partially missing observations, and two or more constraints, Eq. [7]
is more complex. If Eq. [2]
is denoted as W
= v, an equivalent set of equations for
and û is:
![]() |
where
![]() |
where t is the number of columns of X. Then Eq. [2]
is represented as in Eq. [8]
![]() | [8] |
Equation [8]
is simpler than Eq. [2]
because it has fewer nonzero elements and does not include the second power of a submatrix of A in the cell in the third row and third column of the coefficient matrix. However, after solving Eq. [8]
, we need AJm to obtain û from û + GC
and
.
Now let û + GC
= w1 and
= w2. Then,
![]() | [9] |
Equation [9]
does not include A. It can be rewritten as
![]() | [10] |
where ûi, w1i, and w2i are subvectors for the ith trait of û, w1, and w2, respectively, and pij is the ijth element of G0C0. The coefficient matrix in Eq. [10]
is alwaysA1. Therefore, Eq. [10]
can be solved easily and separately for each trait after solving Eq. [8]
. Furthermore, from Eq. [6]
and the third row in Eq. [8]
, C'û = (C0'
Jm')û = 0. Then, to obtain û relating only to animals with restrictions (û0), û0Z = 0 (constraints with no change in some traits) and û0R = k
û0p (constraints with proportional changes), where subscripts Z, R, and p are as in Eq. [5]
.
Computational Remarks
Linear Dependencies Among Fixed Equations.
Quaas and Henderson (1976)
showed three types of linear dependencies among the fixed-effect equations of an R-BLUP procedure in which the same restrictions were imposed on the predicted breeding values of all animals. When the constraints were imposed on breeding values for only some animals in a population, the rules for eliminating the linear dependencies among the fixed effect equations were similar to those of Quaas and Henderson (1976)
and are summarized as follows: If the classes in the fixed effects include only animals with constraints, the rules are similar to those of Quaas and Henderson (1976)
; otherwise, the rules are similar to those of multiple-trait BLUP (Henderson and Quaas, 1976
).
If some records in the restricted traits are missing, some columns of X may be identical to other columns of X. In extreme cases, this occurs not only in columns of X between traits, but also in those within traits. Hence, we need to investigate which columns of X are identical to each other before constructing R-BLUP equations.
Rules for Ignoring Records of Observations.
Quaas and Henderson (1976)
indicated that if qk
r, all the records of any animals missing qk r observations are ignored in R-BLUP; exceptions to this can occur if some of the restricted traits are genetically uncorrelated with some of the unrestricted traits. However, they did not consider constraints with proportional change. The rules for ignoring records of observations are summarized below.
First assume that the observations are ordered traits within animals. Let Zk be a known incidence submatrix on the diagonal of Z, relating elements of breeding values to observations for the kth animal. If ZkG0C0 = 0, this means no restriction for the kth animal (Case 2 in Appendix). If the elements of the jth row of G0C0 are all zero, constraints are not imposed on the trait corresponding to the jth row of G0C0. The trait should be treated as an unrestricted trait. Restricted traits may be divided into three types. First, all records of the kth animal are ignored if ZkG0C0 has full row rank (Case 1 in the Appendix). If a linear combination of the columns of ZkG0C0 can be 0, linearly dependent columns of ZkG0C0 are deleted (Case 5 in the Appendix). Next, if a vector exists with only one nonzero element corresponding to the jth trait from a linear combination of columns of ZkG0C0, the records of the jth trait of the kth animal are ignored (Case 3 in the Appendix). Otherwise, the records of the kth animal are always used for R-BLUP (Case 4 in the Appendix).
For example, G0 is assumed to be
![]() |
If proportional changes in Traits 1, 2, and 4 are wanted on the basis of proportional constraints in the ratio 2:1:1, then C0 is expressed as:
![]() |
If there are only observations on the first trait, and then no restrictions for the first trait because ZkG0C0 = 0 (Case 2 in the Appendix). If there are only observations of any one trait except the first trait or the observations of the first and another trait are missing, the records are ignored because ZkG0C0 has full row rank (Case 1 in the Appendix). If the observations of only one trait other than the first trait are missing, records other than the first trait are ignored because a vector exists with only one nonzero element corresponding to the second, third, or fourth trait from a linear combination of columns of ZkG0C0 (Case 3 in the Appendix). If the observations of any two traits except the first trait are missing, the records other than the first trait are ignored (Cases 3 and 5 in the Appendix). If no observations or only those on the first trait are missing, the records are used (Case 4 in the Appendix).
| Results |
|---|
|
|
|---|
For computer simulation, the base population (G0) of 150 males and 1,500 females was mated at random to produce 3,000 males and 3,000 females. Six separate generations (G0 to G5) and two traits were simulated. One hundred fifty males and 1,500 females in G1 through G4 were selected by using a selection index with zero-gain constraints. For simplicity, the animals were assumed to have a record in each trait, and two fixed effects (sex and generation) and two random effects (additive direct genotype and error) were assumed. Breeding values of a trait of only the animals in the last generation were restricted to be zero. The total number of animals was 31,650 and the number of restricted animals was 6,000. A preconditioned conjugate gradient based iterative method using iteration on data was used to solve each R-BLUP equation. The convergence criterion was based on relative adjusted right-hand sides: ||b Ax||/||b|| < 109, where Ax and b are left- and right-hand sides of R-BLUP equations, respectively, and ||b|| represents the norm of b. All computations were performed on a personal workstation (3.06 GHz, 1.5 gigabytes RAM).
Table 1
shows the number of nonzero elements in the upper triangular matrix on the left-hand side of the R-BLUP equations and central processing unit time for each Method. The number of nonzero elements in Method II was reduced to 8.6% compared with Method I. Central processing unit time for Method II was about 41 times faster than that for Method I.
|
| Discussion |
|---|
|
|
|---|
The method R-BLUP can be used for selection with two practical constraints: 1) zero change in one or a few traits and 2) proportional change for several or all traits. Itoh and Iwaisaki (1990)
found that a canonical transformation technique could be applied to R-BLUP to decrease the number of equations for an animal model. However, the method has a limitation in that models must be identical for all traits and there cannot be any partially missing observations. Satoh (1998)
showed a simple method for computing R-BLUP of breeding values; this procedure works, in particular, without repeated records. These computations for formulating BLUP equations are relatively simpler than those of Quaas and Henderson (1976)
; however, restrictions of breeding values should essentially be imposed only on candidates for evaluation.
The calculation of R-BLUP imposing the same restrictions on all animals is easier (Method III). However, if one wants to apply it only to candidates for evaluation, it is difficult to write R-BLUP equations because the need to include matrix A. In this case, however, Eq. [8]
and [9]
were available to compute R-BLUP. Method II gave much better performance than Method I in both memory requirement and central processing unit time, even if computing time was affected by the algorithm and programming to some extent. The technique in the present study can be easily applied to animal models with more than two random effects.
| Implications |
|---|
|
|
|---|
| Appendix |
|---|
|
|
|---|
from Eq. [3]
and û:
![]() |
where S = R1 R1ZP(P'Z'R1ZP)P'Z'R1 (Quaas and Henderson, 1976
). Now consider that the observations are ordered traits within animals; then S is a block diagonal matrix,
![]() |
where Sk is a q x q matrix peculiar to the kth animal and
![]() | [11] |
where Rk1 is the inverse of the error variance-covariance matrix for the q traits for the kth animal, and Zk is a known incidence matrix relating elements of breeding values to observations for the kth animal. We use the following four lemmas in order to show the property of the columns of Sk.
Lemma 1.
When vectors x0 and x1 are linearly independent (LIN), for A nonsingular, vectors y0 = Ax0 and y1 = Ax1 are LIN. The proof of this lemma is that if y0 and y1 are linearly dependent, then for some constant c
0, Ax0 = cAx1. Because A1 exists, x0 = cx1, but this contradicts the assumption that x0 and x1 are LIN. Therefore y0 and y1 are LIN.
Lemma 2.
When X has full low rank, then AX(X'AX)X'A = A. The proof of this lemma is that when X has full low rank, XX' is non-singular (Searle, 1971
). Pre- and post-multiplying X'AX(X'AX)X'AX = X'AX by (XX')1X and X'(XX')1, respectively, gives AX(X'AX)X'A = A.
Lemma 3.
For A nonsingular and positive definite (p.d.), A1 is p.d. The proof of this lemma is that any p.d. matrix can be written as A = UDU', where U is the matrix of eigenvectors and D is a diagonal matrix of eigenvalues (all of which are positive). Then the inverse is U(D1)U' because UU' = I, and still diagonals of (D1) are positive, so the inverse is p.d.
Lemma 4.
For P
0 and X
0, the rows and columns of P = I X(X'X)X' are orthogonal to and LIN of columns of X. The proof of this lemma is that X(X'X)X' is symmetric (Searle, 1971
) and P = I X(X'X)X' is symmetric. Because (X'X)X' is a generalized inverse of X, PX = X X(X'X)X'X = 0, the rows and columns of P are orthogonal to the columns of X. Assume that the columns of P depend on those of X. Then P = XA for some A
0, and P = PP = PXA = 0. Therefore the assumption is false, and so the columns of P are LIN of those of X. Because P is symmetric, the rows of P are LIN of those of X.
Because the columns of C0 are LIN and G0 is nonsingular, the columns of G0C0 are LIN by Lemma 1. We can classify ZkG0C0 into the following five cases: ZkG0C0 has full row rank (Case 1); ZkG0C0 = 0 (Case 2); a linear combination of the column vectors of ZkG0C0 has only one nonzero element (Case 3); a linear combination of the column vectors of ZkG0C0 has more than one nonzero element (Case 4); a linear combination of the column vectors of ZkG0C0 is 0 (Case 5).
Case 1
For Case 1 if ZkG0C0 has full row rank, using Lemma 2 leads to
![]() |
Therefore, Sk = 0.
Cases 2 to 5
Cases 2 through 5 apply to situations when ZkG0C0 does not have full row rank.
Case 2.
If ZkG0C0 = 0, then Rk1 is p.d. from Lemma 3 because Rk is p.d. Hence, C0'G0Zk'Rk1ZkG0C0 = 0 only when ZkG0C0 = 0. This means that the submatrices of the kth animal of the third row and the third column in Eq. [3]
are all zero. Therefore Sk = Rk1.
Case 3.
If a linear combination of the column vectors of ZkG0C0 has only one nonzero element then Rk1 can be written as Lk'Lk for a nonsingular Lk because Rk1 is p.d. From [11], Sk = Lk'[Ik LkZkG0C0(C0'G0Zk'Lk'LkZkG0C0)C0'G0Zk'Lk'] Lk.
Let Wk = Ik LkZkG0C0(C0'G0Zk' Lk'LkZkG0C0)C0'G0Zk'Lk'. Because the rows of Wk are orthogonal to the columns of LkZkG0C0 by Lemma 4, for any vector t
0, the rows of Wk are orthogonal to LkZkG0C0t. Hence, WkLkZkG0C0t = 0; then the rows of WkLk are also orthogonal to ZkG0C0t. This means that the jth column vector of WkLk is 0 if only the jth element in a linear combination of the columns of ZkG0C0 is zero. Therefore, the jth column vector of Sk = Lk'WkLk is 0. Then the records of the jth trait of the kth animal are ignored.
Case 4.
If a linear combination of the column vectors of ZkG0C0 has more than one nonzero element, and if LkZkG0C0t has more than one nonzero element, then each row of Wk has more than one nonzero element because columns of Wk are orthogonal to LkZkG0C0t. Therefore, the column vectors of Sk relating traits with records are not 0. Then the records of the kth animal are always used for R-BLUP.
Case 5.
If a linear combination of the column vectors of ZkG0C0 is 0, then some linearly dependent columns of ZkG0C0 exist, because for some vector t
0, ZkG0C0t = 0. Because the rows of WkLk are orthogonal to the columns of ZkG0C0, they are also orthogonal to the linear combinations of columns that eliminate linearly dependent columns of ZkG0C0. Then if the linear combinations of columns that eliminate linearly dependent columns of ZkG0C0 have only one nonzero element corresponding to the jth trait, the records of the trait are ignored (refer to Case 3); otherwise, the records are always used (refer to Case 4).
1 Correspondence: Kannondai 2-1-2, Tsukuba-shi 305-8602 (phone: 81-29-838-7041; fax: 81-29-838-7408; e-mail: hereford{at}affrc.go.jp).
Received for publication September 29, 2003. Accepted for publication March 1, 2004.
| Literature Cited |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |