|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ANIMAL GENETICS |
Animal Breeding and Genomics Centre, Wageningen University, PO Box 338, 6700 AH Wageningen, the Netherlands
| Abstract |
|---|
|
|
|---|
Key Words: cage performance genetic evaluation group laying hen
| INTRODUCTION |
|---|
|
|
|---|
Moreover, in current breeding schemes, pure lines in nucleus breeding herds are usually housed individually and selected on the basis of individually recorded traits, whereas at the commercial level their offspring are kept in groups. Individual housing at the nucleus breeding herds is not only more expensive, but might also give rise to genotype x environment interaction (e.g., Besbes and Ducroq, 2003
) that can reduce the response to selection at the commercial level. Therefore, it would be very attractive to extend genetic evaluations of farm animals by incorporating information collected at the group level. This was recognized by Olson et al. (2006)
who used simulations to calculate breeding values and accuracies for pooled and individual observations.
The objective of this paper was to develop a method for estimating genetic parameters and predicting breeding values from information collected on groups of animals (pooled information) and compare its relative efficiency with the use of information collected on individual animals. For our analysis, data on individual and pooled BW of laying hens were available. The methodology was also applied to the trait egg production, for which no individual measurements were available.
| MATERIALS AND METHODS |
|---|
|
|
|---|
The experiment was carried out by licensed and authorized personnel under approval of Institut de Sélection Animale B.V.
The animal population used in this study consisted of about 2,500 laying hens from 12 distinct genetic lines housed together in a single stable of an experimental farm for a laying period of 51 wk (wk 19 to 69 in terms of age of the hens), from June 2004 to June 2005. These lines were either of Rhode Island Red type (RIR; laying brown eggs) or of White Leghorn type (WL; laying white eggs).
Both during the rearing and the experimental period, the hens were housed in a total of 641 cages of different size (from 2 to 5 birds); only cages with 4 hens were used in this study (
560 cages). About one-half of the cages were composed of full sibs, and the other one-half had a random composition. The number of hens and cages per line and origin is presented in Table 1
.
|
The data consisted of individual records of the BW of the hens at 19 (BW19, onset of the egg production), 27 (BW27), 43 (BW43), and 51 (BW51) wk of age. These were then pooled (i.e., summed by cage) to derive the group observations, thus allowing for comparison between genetic evaluations from individual and pooled observations. Because of software limitations, only cages with 4 hens were used. As a consequence of mortality, the number of cages with 4 hens decreased over time. In addition to BW, egg production of the entire laying period (wk 19 to 69) of 550 cages was used. For this trait there were no individual observations, but only pooled data were available.
The 4 BW traits were all normally distributed. To account for non-normality of egg production, the power transformation method of Box and Cox (1964)
was applied to produce standardized variates, z(t), according to the following formula:

where y is the original observation, Gy the geometric mean of the data, and t is the parameter by which data are normalized. The value of t = 5, empirically derived, was used to approximate a normal distribution.
From the pedigree files, 4 generations of ancestors were extracted for the genetic analysis; sires had 1 to 30 daughters, whereas dams had 1 to 5 daughters. This is consistent with a hierarchical structure in which a rooster is mated to more females, whereas a female is only mated to 1 male.
Theoretical Background
Following the work of Olson et al. (2006)
, we set up the mixed model equations (MME) for the prediction of breeding values and the estimation of variance components using group observations.
In the case of individual records, the model, in matrix notation, is

with y being the vector of observations, b and a the vectors of fixed and random effects, with their respective incidence matrices X and Z, and e a vector of random residuals. Var(y) = Var(a) + Var(e), which, after substitution, becomes Var(y) = ZGZ' + R, with G = A
2a and R = I
2e.
Solutions for fixed effects and BLUP of a (i.e., â) can be then obtained by solving the usual MME (Henderson, 1975
, 1984
):

In a numerical example, we have 4 founding animals and measured individual records on 6 of their progeny, 2 from group A, 2 from group B, and 2 from group C;
, the ratio between residual and genetic variance, is 2 (h2 of 0.33). The pedigree and data for the example are in Table 2
.
|



In this case, Var(a) = G = A
2a, where A is the additive relationship matrix, and Var(e) = R = I
2a, where I is an identity matrix (i.e., assuming that there are no residual correlations between animals of the same group). Taking the diagonal elements of the inverse of the MME (Henderson, 1975
), the accuracies of predictions can be calculated with the following formula:

where di is the ith diagonal element of the MME–1 and
is the ratio of variances

|
![]()
where the * symbol indicates the modified elements of the equation. The MME therefore are



Groups are now the experimental units (the observations), and, therefore, some of the elements of the MME must consequently be modified. The matrices X* and Z* now reflect the composition of each group: in the X matrix, the number of times each fixed effect is present in each group is given, whereas in the matrix Z in each row there are as many 1s as animals in the group. The vector y* consists of the sums of the individual observations and e* is the sum of the residuals. Vector a remains unchanged, and in this example, with only a general mean, the vector b also remains unchanged. The variance of the genetic effect is also unmodified and equal to A
2a; on the other hand, because e*
e, the variance of the residuals is now Var(e*) = R* = D
2e: assuming that there are no residual correlations between animals of the same group, D is not an identity matrix but a diagonal matrix with elements nj representing the number of animals that contributed to the jth pooled observation. This affects also the value of
, which is now n times that for individual observations. In our example, where we have 2 observations in each group,
* = 2 x
= 4. If groups are not of equal size, the relation
* = n x
does not hold and R*–1 must be used in the MME.
With this approach, BLUP of the fixed and random effects can be obtained in the very same way from the pooled observations, setting up the MME as previously outlined and solving them by direct inversion or iteratively. The solutions for the animals of the numerical example are reported in Table 3
. For some animals EBV from pooled data differ considerably from EBV based on individual observations. The example illustrates that for instance full sib individuals 5 and 6, who are in the same group (A), have the same EBV when estimates are based on pooled observations. Individual 7, which is also a full sib of individuals 5 and 6, but is in another group (B), has an EBV different from 5 and 6. Following the procedure outlined by Mrode (2005)
, the solutions can be partitioned into parent averages, yield deviations, and progeny contributions; this helps in explaining the differences between the 2 models. The main reason behind the observed differences is that the pooling of records leads to loss of information and reduced variation: in the case of pooled data, deviations from the mean are, in general, smaller than those computed from individual records. Animal 6, for instance, has an individual record of 15, which is a 20% deviation from the overall mean (12.25); its pooled observation is 25, which deviates only 2% from the overall mean of the pooled data (24.5). Its weighted yield deviation is, therefore, 0.575 in the individual model and 0.0616 in the pooled model, and this explains the different EBV in the 2 situations (0.675 vs. –0.079). The same applies to animal 3, whose EBV depends entirely on the contributions of its progeny; its 2 offspring have yield deviations of –26% and –10%, and –14% and +10% in the individual and pooled models, respectively, thus explaining the difference observed in its EBV in the 2 situations (–0.7083 vs. –0.0891). The reduced variation of pooled observations not only affects the yield deviations, but also the parent averages and the progeny contributions.
The loss of information and the reduced variation in the case of pooled observations as compared with individual observations lead to decreased accuracy of the estimates, especially for individuals with phenotypic observations only (and no offspring).
Data Analysis
Variance components and breeding values for BW and egg production were estimated from the MME with a REML procedure. The Asreml software package was used for the analysis (Gilmour et al., 2002
). The function and() in Asreml was used to fit multiple genetic effects per observation. For the analysis of individual and pooled data, respectively, the models described in Eq. [1] and [2] of the previous section were used. Because all hens were kept in a single stable and on the same tier of the battery system, the only fixed effect considered in this study was the effect of line.
| RESULTS |
|---|
|
|
|---|
Basic statistics of the data are summarized in Table 4
. The number of hens in the analysis gradually decreased over time because of mortality and the restriction that only cages with 4 hens were included. The average BW at 19 wk was 1,405 g and increased gradually to 1,874 g at 51 wk. The coefficient of variation for individual observations of BW was approximately 13%. As pooled observations are based on 4 individuals, standard deviations of pooled observations are expected to be twice that of the individual observations (i.e., for independent observations). Table 4
shows that the standard deviations for pooled observations were 3.2 to 3.3 times the standard deviation of individual observations, suggesting positive covariances among observations on cage mates.
|
|
Breeding Values
Table 6
shows the mean accuracy of the estimated breeding values for all hens with an observation, and for sires and dams with at least 10 and 4 progeny, respectively. The accuracy of EBV from pooled observations is consistently less than that of EBV from individual records. The decrease in accuracy for pooled observations is greater for hens with observations than for sires with at least 10 offspring or for dams with at least 4 offspring.
|

Table 6
also shows correlations between the estimated breeding values based on individual and pooled observations for 3 groups of individuals: all hens with an observation, sires with more than 10 daughters, and dams with more than 4 daughters. The correlations between EBV from individual and pooled observations are between 0.70 and 0.75 when all hens with an observation are considered. For sires with at least 10 daughters, correlations are between 0.81 and 0.89, depending on the trait. For dams with 4 daughters, correlations between EBV were between 0.85 and 0.88.
| DISCUSSION |
|---|
|
|
|---|
The results show that for the same number of animals, EBV and heritability estimates based on pooled data are less accurate than those based on individual observations. However, in practice comparison of estimates based on an equal number of phenotypes rather than based on an equal number of individuals might be more appropriate. The use of pooled data offers some advantages: they are often easier and cheaper to collect, they may be in some cases the only data available, and they may sometimes better reflect the commercial environment where animals are kept, thus avoiding bias due to possible genotype x environment interaction.
At the commercial level only pooled data are available. These data may be of great interest for breeding companies to select pure line individuals. Because animals at the commercial level are almost always cross-bred individuals, this aspect should be accounted for (e.g., Wei and van der Werf, 1995
).
Presence of GxE interaction for animals kept individually or in groups will reduce the response to selection at the commercial level. Merks (1989)
found a genetic correlation of 0.64 for backfat thickness measured on individually housed pigs at central test stations and on group-housed pigs on commercial fattening farms. In broilers, Zerehdaran et al. (2005)
estimated a genetic correlation of 0.80 between BW in group housing and individual cages. These estimates clearly suggest the presence of genotype x environment interaction for individually and group-housed animals.
Therefore, selecting individuals based on group performance may be convenient in many situations. It will make genetic evaluation of farm animals more flexible and more accurate by including information collected under commercial conditions where animals are kept in groups.
Accuracies of Selection
The accuracy of selection based on pooled observations, for all hens with records and sires and dams with at least 10 or 4 offspring is, respectively, 65 to 72% and 72 to 88% of the accuracies that can be obtained when using individual observations. However, this reduced accuracy should be interpreted in the context of direct vs. indirect selection. The breeding goal of a breeding company is the trait under commercial conditions (i.e., group housing). If testing is under individual housing, the genetic correlation between group and individual housing is relevant. For the same selection intensity, the ratio of the selection response for direct and indirect selection is a function of the accuracies for both situations and the genetic correlation between the traits (Falconer, 1989
). Similarly, the ratio between accuracies based on pooled and individual data provides a threshold for the genetic correlation between individual and group housing below which pooled data would result in a greater selection response (for the same selection intensity). Therefore, in the present study a genetic correlation below 0.72 for sires would compensate for the loss of accuracy due to pooled observations.
Fixed Effects
The relationship matrix enables estimation of the breeding value of each animal based on pooled observations. However, some differences in the definition of systematic environmental effects may arise when using either individual or pooled records. In the numerical example presented in this paper, only the fixed effect of the mean was considered and the vector of fixed effects was the same in both models, but this will not always be the case. Examples are when pooled phenotypes are being produced by animals of different sex or breeds. In these situations, the design matrix for the fixed effects must be modified to reflect the contribution of the fixed effect classes to the pooled phenotype. A consequence will be that systematic environmental effects can be estimated less accurately for pooled data when compared with a situation where individual observations are available.
Group Composition
In the present study about one-half of the cages were composed of full-sibs and the other one-half had a random composition. The numerical example used to outline the procedure illustrated that 2 full-sibs in group A (animals 5 and 6) had the same EBV when estimated based on pooled data, and, therefore, the composition of the groups will have an impact on breeding value estimation and estimation of variance components. Group observations are in fact disentangled through the pedigree structure. Family groups seem to be better for productivity and social welfare (P. Bijma, Wageningen University, Wageningen, the Netherlands, personal communication); therefore, on one hand having groups of closely related animals poses statistical challenges; on the other hand, they offer a better social environment and genetic background for animal performance and welfare. The appropriate cage composition, therefore, needs to be chosen to balance these 2 contrasting aspects.
Effects of Selection and Competition
According to theory, the residual variance estimated from pooled observations should be 4 times that estimated from individual observations. Results are in close agreement with this hypothesis for BW19 and BW27, but less so for BW43 and BW51. The reason why results diverged from theoretical expectations later in life might in part be that we limited this study to groups with 4 hens per cage. Excluding cages with less than 4 individuals might introduce a culling bias. This may be partly accounted for by using a multivariate approach, where the unselected trait (BW19) is analyzed simultaneously with the selected traits (BW27, BW43, and BW51), and all animals have observations on at least the trait on which selection is based (Pollak et al., 1984
).
An alternative explanation might be the presence of within-group competition effects (Bijma et al., 2007a
,b
), which have been shown to play a role in several traits of agricultural interest as, for instance, survival in laying hens and growth rate (Ellen et al., 2007
). The phenotype of an individual might depend not only on its own genotype, but also on the phenotypes or the genotypes of group mates and this alters the variance structure of the model. The consequences of these competition effects might accumulate over time and in this way bias the estimated genetic parameters. If competition effects play a role, this will affect the analyses based on individual observations, as well as those based on pooled data, but they might be affected differently. Therefore, in such cases, inclusion of the associative genetic effects in the analysis might be required (Bijma et al., 2007a
,b
).
This paper indicates that pooled observations can be used to estimate variance components and breeding values. The use of pooled data, therefore, offers interesting possibilities for breeding companies. In this study we focused on the use of pooled data from groups of the same size; there is scope for extension to groups of any size. We found that heritability estimates based on pooled data and individual observations tended to differ at older ages, which might be due to effects of selection, competition, or both.
| Footnotes |
|---|
2 Corresponding author: filippo.biscarini{at}wur.nl
Received for publication November 27, 2007. Accepted for publication May 23, 2008.
| LITERATURE CITED |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |