|
|
||||||||
ANIMAL GENETICS |

* Departamento de Mejora Genética Animal, INIA, Madrid, Spain; and
and
Unidad de Bioinformática, CBMSO-CSIC, Madrid, Spain
| Abstract |
|---|
|
|
|---|
Key Words: Bayesian Analysis Beef Cattle Clusters Contemporary Groups Genetic Evaluation
| Introduction |
|---|
|
|
|---|
Several ad hoc criteria have been used to compare alternative definitions of CG (Schmitz et al., 1991
; Sivarajasingam, 1993
; Crump et al., 1997
; Van Bebber et al., 1998
). These criteria include estimates of within- CG variance, residual variances, effective number of progeny, and accuracy of genetic evaluations. Other criteria based on the likelihood have been explored to a lesser extent. In most situations, alternative models arising from different definitions of CG are not nested, so likelihood ratio tests are not appropriate. Akaikes information criterion can be used in these cases, but it does not account for the amount of information available and tends to favor more complex models when the amount of data is sufficiently large. The Bayesian analysis provides the tools for model selection in a more general framework.
The goal of this study was to compare alternative definitions of CG for weaning weight of Avileña Negra Ibérica beef cattle using several criteria that include classical and Bayesian criteria for comparison of models.
| Materials and Methods |
|---|
|
|
|---|
|
ModelsDefinition of CG
The following model, currently used in genetic evaluations for this breed, was used:
![]() | [1] |
where yijklno is the weaning weight of animal n, CGi is the effect of the ith contemporary group, sxj is the effect of the sex of the animal (j = 1,2), dagek is the effect of the kth class of the age of the dam (k = 1,6), where groups of age consisted of dams being 2, 3, 4, 5 to 9, >9 yr old or having unknown age at calving, sfl is the effect of the lth class for supplement feed (l = 1,2), with animals receiving or not supplementary feed before weaning, b is the linear regression coefficient of weight on age of calf at weaning (agen), udn is the direct additive genetic effect of the nth animal, umo is the maternal additive genetic effect of o, dam of animal n, po is the maternal permanent environmental effect of o, and eijklno is the error term for record yijklno.
Six alternative definitions of CG were used. A minimum of five observations per CG class was required in all cases. Contemporary groups were formed within herd according to "conventional" seasons (spring, summer, autumn, and winter [HYS]) or months (HYM), or, following the "natural" calving pattern. The CG classes for the "natural" clusters were obtained in two steps. In a first step, animals were sorted by birth date within herd. Starting from the first date of birth, a CG was formed if adding 30 or 90 d to the first birth date in the group resulted in groups that included five or more animals. Once the list of ordered birth dates was read, some animals could not be assigned to any CG and were discarded. Two data sets resulted from this first step, HC30-30 and HC90-90, for natural fixed period herd clusters, spanning 30 or 90 d, respectively. In the second step, adaptive strategies involving two time limits were developed. Animals not attached to any CG for the HC30-30 definition were assigned to the previous/subsequent CG if the difference between its date of birth and the date of birth of the first/last animal in that CG did not exceed 90 d, for the HC30-90 definition, or 180 d, for the HC30-180 definition. After this second step, some animals still remained unassigned to a CG and had to be discarded from the corresponding data sets.
The six data sets with alternative definitions of CG were analyzed with Model [1]. Both classical approaches and Bayesian procedures were used. For the classical analyses, BLUP procedures were employed. Variance components used in the current evaluation system were obtained from the literature and were also used in these analyses. These values were 111.7, 64.7, and 10.6 kg2 for the additive genetic direct and maternal variances and direct-maternal genetic covariance, respectively, 49.3 kg2 for the maternal permanent environmental variance, and 363.3 kg2 for the residual variance.
For the Bayesian analyses, variances and other parameters involved in the model were considered unknown. Data were assumed to be generated from a multivariate normal distribution (MVN), according to the stochastic process:
![]() |
![]() |
where b is the vector containing the systematic effects previously defined (CG, sx, dage, sf, and b); ud is the vector of direct genetic effects; um is the vector of maternal additive genetic effects; p is the vector of maternal permanent environmental effects;
is the residual variance; X, Zd, Zm, and W are the corresponding incidence matrices; and R is the (co)variance matrix of residuals.
Conjugate priors were used for location and scale parameters. Proper-vague priors were employed in order to assign low degree of belief to the prior information and to allow the computation of the marginal density of the data and the Bayes factor (BF), which is not well defined for improper priors (Gelman et al., 1995
). For the location parameters, MVN distributions were assumed:
![]() |
![]() |
where
is a positive, scalar hyper parameter, large enough to give small weight to prior information (more precisely,
= 106); u contains the vectors for direct and maternal additive genetic effects; G =
u
A and P = I
are the additive genetic and maternal permanent environmental (co)variance matrices, respectively; A is the relationship matrix;
u is the matrix containing the (co)variances among the additive direct and maternal genetic; and
is the maternal permanent environmental variance.
For the dispersion parameters, scaled inverse
2 and inverse Wishart (IW) distributions were used:
![]() |
where parameters
i and
are interpreted as degrees of belief and a priori values for the residual, maternal permanent environmental variances and for the additive genetic (co)variance matrix. A value of four was assigned to the degrees of belief for the
2 distributions to provide low weight to the prior information. Degrees of belief for the IW distribution were 12. In both cases, degrees of belief were larger than the respective matrix dimensions to avoid degenerate forms (Gelman et al., 1995
). Numerical values for the scalars
and
and the matrix Su2 were those currently used in genetic evaluations of this breed.
Posterior marginal inferences on parameters of interest were drawn from their corresponding conditional posterior distributions through a Gibbs sampling scheme. The specifications for the Gibbs sampling implementation were based on the Raftery and Lewis (1992)
criterion, which was determined using the public domain program Gibbsit v.2.0 (http://lib.stat.cmu.edu/general/gibbsit). Alternatively, the coupling method analysis (Johnson, 1996
; García-Cortés et al., 1998
) was performed. Figure 2
presents the convergence of two independent chains run with the same seed and different starting values for two components, the direct genetic variance and the direct-maternal genetic covariance, for two definitions of CG, HYM, and HC30-90, to illustrate convergence of the parameters of interest. Other components of variance under other definitions showed similar patterns. As shown in this figure, convergence to the stationary distribution seems to be reached at approximately 4,000 iterations. The Raftery and Lewis (1992)
criterion yielded a substantially lower number of iterations for convergence. Finally, a safe burn-in period of 20,000 iterations was carried out for all analyses, and a total of 100,000 iterates was carried out after burn-in. Posterior mean and standard deviation of the variance components were obtained with results from all iterates after burn-in. However, posterior distributions (not shown), including modes and high-density regions were obtained using a thinning parameter of 40 iterations. Effective chain size ranged from 27 for the genetic covariance in the HC30-30 model to 254 for the maternal permanent environmental variance in the HYS model.
|
The between and within CG variance were obtained using the SAS proc Varcomp (SAS Inst., Inc., Cary, NC) under the type I option for the model,
![]() | [2] |
where yij* is the observation corrected by the corresponding BLUE and BLUP solutions for other than the CG effect in Model [1].
Effective number of progeny for the evaluation of sires for the direct genetic effect was obtained for sire j as:
![]() |
where nij is the number of progeny of sire j in CG i, ni. is the total number of records in CG i, and cj is the number of CG where sire j has progeny.
The accuracy of prediction of breeding values was measured as reliability, computed by the well-known formula:
![]() |
where Ri is the reliability of the prediction of genetic value for animal i; PEVi is the corresponding prediction error variance;
is the direct or maternal additive genetic variance when considering prediction of direct or maternal genetic values, respectively; and Fi is the inbreeding coefficient for animal i.
Variance components used in the routine genetic evaluation and estimates obtained from the Bayesian analysis were used to obtain the between and within CG variances and the accuracy of predicted breeding values for all models.
Bayesian Criteria.
The BF (Newton and Raftery, 1994
; Kass and Raftery, 1995
) and the cross-validation predictive densities of the data (Gelfand et al., 1992
), considered to be common tools for model checking and sensitivity analysis in the Bayesian framework (e.g., Gelman et al., 1995
), were computed to assess the performance of the alternative models based on the different definitions of the CG. Both criteria have the advantage of being applicable in a general framework, without the requirements needed in other procedures such as the likelihood ratio test, only applicable to nested models.
The BF for two competing models, Models [1] and [2], was computed as the ratio of their two corresponding marginal densities of the data (MD) under each model. The MD for each model was obtained from the Newton and Raftery (1994)
estimator:
![]() |
where H is the total number of iterations after the burn-in period and f(y |
(i)) is the conditional distribution of the data given the parameters involved in each model at iteration i.
The cross-validation predictive densities were defined as the set of univariate densities (Gelfand et al., 1992
; Gelfand, 1996
),
![]() |
where y(r) is the vector of observations with observation yr excluded, and Yr is a random variable which has distribution f(Yr |y(r)), which gives the values of the unobserved Yr that would most likely be observed when the model has been fitted to y(r); Gelfand, 1996
). Comparison of models was done by computing the expected value of the checking function, g = Yr yr, for all the data with respect to its univariate predictive density, dr = EYr|y(r) [Yr yr], with the best model being the one n having minimum
An importance sampling, with the joint posterior density of the parameters given the data as an importance distribution, was implemented to evaluate every dr. The log of the MD (LMD) and D statistics were obtained for each datum within the Gibbs sampling process (details of the implementation can be found in López-Romero et al., 2003
).
Because the value of MD varies with the size of the data set, the total data set containing all 12,740 initial observations (data in CG with less than five observations were not discarded) was used to compute the LMD for each alternative definition of CG. The D statistic was computed for both the total data set (DT) and the specific data sets of different size associated to the alternative CG models (Ds).
Computer programs employed to solve the BLUP equations and to obtain corresponding accuracies, as well as programs used to carry out the Bayesian analyses, were developed by the authors in Fortran90 language.
| Results |
|---|
|
|
|---|
|
|
|
|
Criteria for comparison of models from the Bayesian analysis are shown in Table 5
. The predictive ability of excluded observations of the alternative models measured in the total (DT) and specific data sets (Ds) ranked models in the same order. Low values for both statistics indicate better predictive ability of the model. The DT values were variably larger than the corresponding Ds values, probably because in the total data set, where no observations are discarded, some CG had very few observations (less than five). The difference between DT and DS statistics was greatest for the HYM and HC30-30 definitions, where the loss of data when forming the specific CG was largest. The HYS and HC90-90 models showed lower predictive ability of missing observations than HYM and HC30-30 models or models with two time limits. Ranking of models was the same when the LMD statistic was considered. The quantity 2log BF, which provides a numerically more tractable figure than the BF and is analogous to the likelihood ratio test, ranged from 185.0, when comparing more similar models such as HC30-90 and HC30-180, to 1,214.6, when comparing most distant models HYM and HC90-90. In all cases, the BF showed strong evidence favoring definitions with CG spanning shorter periods of time.
|
| Discussion |
|---|
|
|
|---|
In this study, definitions involving 90 d (HYS and HC90-90) had the largest CG size, but also largest average time spans, whereas definitions involving 30 d (HYM and HC30-30) yielded the smallest CG size and also the shortest average time spans, as expected. Adaptive definitions involving two time limits resulted in CG spanning intermediate time periods without a significant improvement in average CG size compared with CG of 30 d. Loss of information was minimal with these definitions.
The use of what we called classical, ad hoc criteria to provide information on what definition would be optimal in terms of bias and accuracy of prediction is not adequate because these criteria depend on the variance components associated to each model. The use of the same parameters in the BLUP analyses with all models allowed inferences about the effects of the amount and structure of available information on accuracy of genetic value prediction and comparison of between and within CG variation from different models on the same scale. With the same parameters, models with CG spanning 90 d (HYS and HC90-90) increased the effective number of progeny for predicting direct genetic values of sires, with a slight increase of accuracy. However, these strategies yielded a suboptimal partition of between and within CG, indicating that a part of the environmental variation in the CG effects remains unadjusted when CG span larger periods of times. This result was expected due to the variable environmental conditions during a year and across years and the extensive production system of the Avileña-Negra Ibérica breed. However, the comparison with the same parameters is not strictly correct because true variance components are not expected to be equal for all models. In particular, the residual variance would be expected to decrease as more environmental variation is accounted for by the CG effect (i.e., for CG spanning shorter periods of time), with no differences expected for the other components. In this study, estimates of the residual variance followed the expected trend, but the genetic components were significantly affected by the definition of CG. Larger estimates of the direct variance component were found for the HYS and HC90-90 definitions. As a result, when estimates of the variance components obtained for each alternative definition were used, the HYS and HC90-90 definitions resulted in substantially larger calculated accuracies for the predicted direct genetic effects, with no clear pattern for the maternal effects. However, the suspected bias in the estimated variance components makes this result questionable. Legarra et al. (2002)
, in a comparison of different contemporary group definitions for dairy sheep in the Bayesian context, did not find large differences in the estimates of the variance components across definitions, except for the residual component. In this population, the low degree of connectedness may be affecting the estimated genetic components. The lack of genetic ties among herds would be expected to induce an underestimation of the genetic components of variance because only the intraunit variation is captured in the estimation process (e.g., Clément et al. 2001
). This is likely to be the case in this study, where genetic variances and corresponding heritabilities seem to be smaller than the values usually provided in the literature for weaning weight in beef cattle, for models including CG spanning 30 d or adaptive CG. Adaptive CG definitions only contained a small proportion of data in CG spanning more than 30 d. For definitions of CG spanning larger periods of time, HYS and HC90-90, the environmental variation due to changes in environmental conditions throughout the year may not be properly removed by the CG effect. Due to the lack of sufficient genetic ties, this unadjusted environmental variability might have been erroneously assigned to the direct genetic effect, resulting in inflated estimates for the direct genetic variance.
In an attempt to provide a theoretically better approximation for the accuracy of the genetic value estimation under alternative models, the variance of the posterior distribution of genetic values, var(u |y), was also obtained (data not shown). This value would provide a measure of the accuracy of the marginal estimates of the genetic effects taking account of the uncertainty about other parameters of the model, including the variance components. Accuracies of estimated genetic values obtained from the variance of the posterior distribution of the genetic values were nearly identical to the accuracies obtained in the BLUP analyses (shown in Table 4
). In this case, the corresponding posterior means obtained in the Bayesian analyses are considered as true values of the variance components. Therefore, accounting for uncertainty of estimates of variance components in the Bayesian analyses leads to the same conclusions as found with the BLUP analyses, which would be expected when the amount of information is large enough.
In contrast to use of several ad hoc statistics to judge the effectiveness of alternative models, using Bayesian model selection provides a procedure in which the evidence from the data, the prior odds, the model dimensionality, and the sample size are combined automatically in a single formula (Sorensen and Gianola, 2003
). In this study, the Bayes factors and the cross validation predictive statistic ranked models in the same way. Both criteria indicated models with CG spanning 30 d as the most plausible and best predictors of missing data, and models with CG spanning 90 d as the worst. Adaptive definitions yielded intermediate values. These results again suggest that models with definitions of CG spanning larger periods of time might not properly account for environmental variation, which might lead to bias in estimates of the unknown parameters.
Other checking functions could have been used to compare the predictive ability of the models. In particular, the discrepancy between the probability that the predicted record is smaller than the observed record, and the expected value of 0.5 for this probability has been also used as a checking function in animal breeding (Varona et al., 1997
; Legarra et al., 2002
). This function measures systematic directional bias in prediction of unobserved records under alternative models. Systematic over- or underestimation of observations due to alternative definitions of CG are not expected in this case. Nevertheless, this checking function was tried for three alternative definitions in this study (data not shown). The average of the checking function value over all observations was nearly null and was approximately 0.20 for the three models analyzed when the absolute value of the difference between the probability of underestimation and the expected value of 0.5 was employed. These results indicate that, as expected, no systematic under- or overestimation occurred for alternative definitions of CG in this data, and that approximately 20% of the time we expect some kind of deviation of the predicted over the true value of the record.
Schmitz et al. (1991)
, comparing different strategies to cluster animals in CG according to different periods, reached somewhat different conclusions from the ones obtained in this study. Those authors found that the size of contemporary groups was more important than the strategy for forming CG in terms of improving accuracy of genetic evaluations. They also found no bias in genetic evaluations except for CG spanning several years, due to a time trend in milk production of Holstein cattle. They concluded that CG with 15 to 40 observations would be optimal. In our study, where environmental conditions are much more variable than in the dairy cattle case, the bias introduced by defining CG spanning longer periods of time seems to have a larger effect than the size of the CG.
With respect to the way of defining CG given one or two time spans, Crump et al. (1997)
argued that the use of cluster procedures that rely on distances between consecutive birth dates within a herd allows for the formation of "natural" contemporary groups, whereas with the fixed herd-year-season groups, seasons may not reflect the calving pattern within herds. These authors also argue that using two time spans instead of one promotes obtaining more evenly sized groups within herds. In our study, the use of "natural" CG strategies with one time span (HC30-30 or CG 90-90) did improve the CG size and decreased the amount of discarded records. However, definitions HC30-30 and HC90-90 did not provide better results than the corresponding strategies defined by conventional months (HYM) or seasons (HYS), in terms of plausibility of the model to fit the data or predictive ability, nor in achieving noticeable improvement in accuracy of prediction of genetic values. This might be explained by the fact that the relatively small improvement in size of CG with "natural" strategies resulted in longer periods of time associated with those CG. The use of two time limits (HC30-90, HC30-180) had a larger effect on the CG size than the use of only one limit of time, but also resulted in longer time spans. In addition, conventional months and seasons may align reasonably well with the calving pattern. If that is the case, the advantages of the "natural" grouping strategy may be reduced, as pointed out by Crump et al. (1997)
.
All the CG definitions proposed in this study imply a lack of continuity in the CG effect over time, given that animals born 1 d apart can be assigned to different CG. Several approaches have been used in the animal breeding area to avoid this problem: assignment of observations to more than one CG according to fuzzy classification techniques (Strandberg and Grandinsson, 1997
) or similar procedures (Sivarajasingam, 1993
), considering CG as random effects (Van Vleck, 1987
; Ugarte et al., 1992
; Visscher and Goddard, 1993
), allowing in some cases for covariances among observations in the same CG (Chauhan and Thompson, 1986
; Wade et al., 1990
). However, no clear advantages were found in those studies. In our study, the use of procedures that assign observations to more than one CG might not be very advantageous, particularly for CG spanning shorter periods of time, where the lack of continuity is less critical. Moreover, using fuzzy classification has been found to produce instability of CG solutions when having a small proportion of observations in any one class (Strandberg and Grandinsson, 1997
). Considering CG as random effects might help in improving the low accuracy of prediction of genetic values. However, bias of the prediction of genetic values may occur if an association between level of CG and genetic level of animals exists, which is difficult to evaluate in this population due to the connectedness problems. The assumption of stationary mean required to use simple autoregressive models may not be adequate in this population where a strong environmental influence seems to be present over the year. Therefore, the increase in computing cost and the difficulties in obtaining reliable estimates for procedures accounting for a continuous time trend might be counterbalanced by sufficiently large improvements in accuracy of prediction in this population. The use of those procedures might be less relevant if CG spanning shorter periods of time are to be used.
| Implications |
|---|
|
|
|---|
| Footnotes |
|---|
2 Correspondence: Ctra. de la Coruña Km. 7.5; 28040 Madrid (phone: +34-91-3476742; fax: +34-91-3572293; e-mail: mjc{at}inia.es).
Received for publication April 7, 2004. Accepted for publication August 27, 2004.
| Literature Cited |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. Vasconcelos, F. Santos, A. Bagnato, and J. Carvalheira Effects of Clustering Herds with Small-Sized Contemporary Groups in Dairy Cattle Genetic Evaluations J Dairy Sci, January 1, 2008; 91(1): 377 - 384. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. C. Cantet, A. N. Birchmeier, A. W. C. Cayo, and C. Fioretti Semiparametric animal models via penalized splines as alternatives to models with contemporary groups J Anim Sci, November 1, 2005; 83(11): 2482 - 2494. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |