J. Anim Sci.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J. Anim Sci. 2008. 86:1047-1056. doi:10.2527/jas.2007-0653
© 2008 American Society of Animal Science

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
jas.2007-0653v1
86/5/1047    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hickey, J. M.
Right arrow Articles by Veerkamp, R. F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hickey, J. M.
Right arrow Articles by Veerkamp, R. F.

ANIMAL GENETICS

Estimation of accuracy and bias in genetic evaluations with genetic groups using sampling1

J. M. Hickey*,{dagger},{ddagger},2, M. G. Keane{dagger}, D. A. Kenny{ddagger}, A. R. Cromie§, H. A. Mulder* and R. F. Veerkamp*

* Animal Breeding and Genomics Centre, Animal Sciences Group, PO Box 65, 8200 AB, Lelystad, the Netherlands; and {dagger} Grange Beef Research Centre, Teagasc, Dunsany, Co. Meath, Ireland; and {ddagger} School of Agriculture, Food and Veterinary Medicine, College of Life Sciences, University College Dublin, Belfield, Dublin 4, Ireland; and § Irish Cattle Breeding Federation, Shinagh House, Bandon, Co. Cork, Ireland


    Abstract
 Top
 Abstract
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Accuracy and bias of EBV are important measures of the quality of genetic evaluations. A sampling method that accounts for the uncertainty in the estimation of genetic group effects was used to calculate accuracy and bias of estimated effects. The method works by repeatedly simulating phenotypes for multiple traits for a defined data and pedigree structure. These simulated values are analyzed using BLUP with genetic groups in the relationship matrix. Accuracies and biases are then calculated as correlations among and differences between true and estimated values across all replicates, respectively. The method was applied to the Irish beef production data set for 15 traits and with 15 genetic groups to account for differences in breed means. Accuracy and bias of estimated genetic group effects, estimated comparisons between genetic group effects, EBV within genetic group, and EBV across genetic group were calculated. Small biases were detected for most estimated genetic group effects and most estimated comparisons between genetic group effects. Most of these were not important relative to the phenotypic SD of the traits involved. For example, a bias of 0.78% of the phenotypic SD was detected for carcass conformation in Aberdeen Angus. However, one trait, calf quality, which had few performance records in the data set, displayed larger bias, ranging from –10.31 to 5.85% of the phenotypic SD across the different estimated genetic group effects. Large differences were observed in the accuracies of genetic group effects, ranging from 0.02 for feed intake in Holstein, which had no data recorded, to >0.97 for carcass conformation, a trait with large amounts of data recorded in the different genetic groups. Large differences were also observed in the accuracies of the comparisons among genetic group effects. The accuracies of the EBV within genetic group and EBV across genetic group were sometimes different; for example, carcass conformation in Belgian Blue had an average accuracy within genetic group of 0.69 compared with an average accuracy across genetic group of 0.89. This suggests that the accuracy of genetic groups should be taken into account when publishing EBV across genetic groups.

Key Words: accuracy • bias • data quality • genetic groups • multiple breed genetic evaluations


    INTRODUCTION
 Top
 Abstract
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
The quality of genetic evaluations is reflected by the accuracy and bias of estimated effects. Accuracy reflects the quality and quantity of the data, whereas bias reflects the systematic estimation error. When genetic evaluations using BLUP assume homogeneous means in founders, the accuracy and bias of an EBV are the correlation and difference between the EBV and its corresponding true value, respectively.

To account for heterogeneous means in founders, genetic groups in the relationship matrix are used in BLUP models (Westell et al., 1988Go). Such models are common in within-breed (Phocas and Laloe, 2004Go), across-breed (Sullivan et al., 1999Go), and international (Schaeffer, 1994Go) evaluations. In these circumstances the accuracy of an EBV across genetic group depends on the accuracy of the estimated genetic group effects and the EBV within genetic group, and any sampling covariances among these effects (Van Vleck et al., 1992Go). Bias in estimated genetic group effects or EBV within genetic group may bias EBV across genetic group and may be due to sampling covariances among these.

Ideally the exact method would be used to calculate the accuracy of EBV within or across genetic group. Because this is infeasible in most national data sets, several methods to approximate accuracy within genetic group have been developed. These give biased approximations for certain data structures (e.g., Tier and Meyer, 2004Go). Unbiased approximations of accuracy within genetic group can be calculated using sampling (Garcia-Cortes et al., 1995Go; Fouilloux and Laloe, 2001Go). No method exists to approximate accuracy across genetic group and methods exist only to calculate retrospective bias (Reverter et al., 1994Go).

The aim of this work was to extend the sampling method to evaluate genetic group effects and EBV within and across genetic groups, in terms of accuracy and bias, and to use this method to evaluate the Irish multiple-breed beef cattle data set.


    MATERIALS AND METHODS
 Top
 Abstract
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Animal Care and Use Committee approval was not obtained for this study because no animals were used; rather, only an existing database was used (The Irish Cattle Breeding Federation Database).

Accuracy of an EBV is calculated as the correlation between the EBV and its corresponding true breeding value. Bias is calculated as the difference between the EBV and its corresponding true breeding value. For animal i,


Formula

where r wgi is the accuracy of the EBV within genetic group (ûwgi) and u wgi is the true breeding value within genetic group. Using a sampling process, the variances and covariances required for r wgi can be estimated from the empirical distribution of ûwgi and u wgi (Fouilloux and Laloe, 2001Go), and bias can be detected using the mean error across all samples.

When genetic groups are used in the relationship matrix, u agi = bi + u wgi and ûagi = Formulai + û wgi, where u agi is the true breeding value across genetic group and bi is the true genetic group effect for animal i. The sampling method can be extended to account for this and for accuracy and bias of Formulai and ûagi, and comparisons among different Formulai and ûagi can be calculated.

Methodology

Simulation of True Genetic Values. In a genetic evaluation model with genetic groups, all animals in the pedigree trace to a founder genetic group. A matrix, B, of dimension g x n, of founder mean values for each genetic group g represented in the pedigree values for each of the n traits, is simulated. For each trait, the founder mean values for each genetic group are simulated from a normal distribution with a mean of zero and a variance equal to the breed variance for that trait in the population under analysis.

For each animal i in the pedigree, a vector uagi = bi + uwgi of true breeding values for each of n traits is simulated, dependent upon the status of i’s parents j and k. If both j and k are unknown, then each element in bi is given the average of its founder genetic groups and each element of uwgi is simulated as LGz. LG is obtained by Cholesky decomposition of VG, the genetic covariance matrix of the traits, and z is a multivariate random-sampled vector with a mean of zero and a covariance matrix I.

If one parent, say j, is known, then bi is given the average of the genetic group value of the known parent, bj, and the founder genetic group value for the unknown parent, Bg, and Formula. If both parents j and k are known, then bi is taken to be the average of bj and bk, and Formula. This results in a matrix of true breeding values with a distribution N(Qm, A {otimes} VG), where Q is an incidence matrix relating animals to m, the means of the founder genetic groups of which they are composed, and A is the relationship matrix between all animals in the pedigree.

Simulation of Phenotypic Values. A vector yi of phenotypic values for each trait is generated for each animal i as yi = uagi + ei, where ei = LEz is a vector of random residual values for each trait, and LE is obtained by Cholesky decomposition of VE, the residual covariance matrix for the traits. Values of fixed effects do not affect the distribution of random variables (Garcia-Cortes et al., 1995Go) and are simulated with values of zero.

Estimation of Genetic Group Values and Breeding Values. Mixed model equations are set up using genetic groups in the relationship matrix and appropriate fixed effects and solved to obtain estimated genetic group values and EBV across genetic group. The EBV within genetic group are obtained by subtracting estimated genetic group values, weighted by the appropriate breed composition, from the EBV across genetic group.

Sampling Process and Calculation of True Accuracies. The whole process is repeated several times, and the accuracy of the genetic group effects, the accuracy of the EBV within genetic group, the accuracy of the EBV across genetic group, and the accuracies of comparisons among the different genetic groups are calculated as the correlations between the true and estimated values across all of the replicates. Bias in these estimates is calculated as the average of the differences in the true and estimated values across all of the replicates. As the number of replications increases, estimates of accuracy and bias converge to their true values. To determine the number of replicates required to negate the effects of sampling error, confidence intervals for different values of true accuracy for different numbers of replicates were calculated using Fisher z transformations (Figure 1Go). As the true accuracy was reduced, the confidence interval increased. As a compromise between computing time and sampling error, we deemed 350 replicates to be sufficient, because the confidence interval varied between 0.002 and 0.104.


Figure 1
View larger version (26K):
[in this window]
[in a new window]

 
Figure 1. Confidence intervals for different numbers of replicates for different levels of accuracy.

 
Application to Data

This method was applied to the Irish multiple-breed beef cattle data set used for the January 2007 routine national genetic evaluation, which evaluated the direct genetic effects for 15 traits using data on purebred and crossbred animals of 35 breeds, of which 8 breeds dominated. Maternal effects are currently not included in the Irish beef genetic evaluation due to data limitations. Of the 15 traits, 8 (carcass weight, carcass conformation, carcass fatness, cull cow BW, weaning weight, live BW, feed intake, and calf quality) were breeding goal traits, and 7 were correlated linear type traits included as predictors. Most of the 493,092 animals with records on at least one trait had information only on subsets of traits (Table 1Go), with different breeds tending to have records on particular subsets of traits. Most of the information for some breeds and traits came from crossbreds; for others, it came from pure-breds. For example, data on carcass weight, conformation, and fatness were dominated by Holsteins. Data for these traits in most of the other breeds were from the offspring of sires of these other breeds and Holstein dams. Feed intake was primarily recorded on purebred sires of terminal sire breeds at a central performance test station under carefully controlled data recording and environmental conditions. Linear type traits were primarily recorded on purebred Charolais and Limousin. With the exception of the feed intake and some weaning and live BW records, all data were recorded under field conditions.


View this table:
[in this window]
[in a new window]

 
Table 1. Number of purebred AI sires of each breed, total number of records, and average number of records per AI sire for each of the traits for each of the 7 breeds1 with the most AI sires in the Irish Cattle Breeding Federation database
 
Fifteen genetic groups were defined, 1 for each of the 14 most-numerous breeds and 1 for the remaining breeds. The matrix B was simulated with each trait having a mean of zero and a variance that was set equal to the variance of the genetic group solutions from the January 2007 routine evaluation (Table 2Go). The VG and VE from the same evaluation were also used (Table 2Go and Table 3Go). Only animals having a phenotypic value for a trait in the January 2007 routine evaluation were given a simulated phenotypic value.


View this table:
[in this window]
[in a new window]

 
Table 2. Breed ({sigma}b2), additive genetic ({sigma}a2) and residual ({sigma}e2) variance used to generate true breeding values for the 15 traits
 

View this table:
[in this window]
[in a new window]

 
Table 3. Heritabilities (on the diagonal), genetic correlations (above the diagonal), and residual correlations (below the diagonal) used to simulate true breeding values for the 15 traits1
 
Accuracy and bias of estimated genetic group effects and of the comparisons among genetic groups, and accuracy of the EBV within genetic groups and of the EBV across genetic groups were calculated for sires recorded as AI sires in the Irish Cattle Breeding Federation database.

Computation was carried out on several computers, one of which was a 64-bit PC, with a 2.40-GHz, AMD Operaton dual-core processor, and 8 gigabytes of RAM. A program was written in Fortran 90 to simulate the true breed effects, true breeding values, and phenotypes. Mixed model equations were solved using a version of PEST (Groeneveld, 1990Go) compiled for a 32-bit PC.


    RESULTS
 Top
 Abstract
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Computing Time

On the 64-bit PC, 4 replicates could be run simultaneously in 462 min, with the solving of the mixed model equations taking 98.6% of this time. All 350 replicates could have been completed on this machine in 674 h.

Bias and Accuracy of Estimated Effects

Results presented here cover the 7 numerically most important breeds in the data set for 6 breeding goal traits. Results are not presented for the minor breeds, for the linear type traits, or for carcass weight and carcass fatness. Carcass weight and carcass fatness have data structure almost identical to carcass conformation, with any differences in accuracy and bias being due to differences in relevant variance components and correlations. Accuracy and bias of estimated genetic group values, of the comparisons between different estimated genetic group values, and of EBV within genetic group and across genetic group of AI sires are presented.

Bias and Accuracy of Genetic Group Effects. Small biases in the estimated genetic group effects were observed for most traits (Table 4Go). When expressed as a percentage of the phenotypic SD of the traits involved, most of these were not important (Table 2Go) and only in 3 cases were the biases significantly different from zero (P < 0.05): carcass conformation for the Simmental genetic group, and calf quality for the Belgian Blue and Hereford genetic groups. For carcass conformation the bias ranged from 0.00 to 1.63% of the phenotypic SD. Calf quality was an exception. It displayed much more bias than the other traits, ranging from –10.31 to 5.85% of the phenotypic SD across the different estimated genetic group effects.


View this table:
[in this window]
[in a new window]

 
Table 4. Mean bias across 350 replicates, expressed as a percentage of the phenotypic SD of the trait, of estimated genetic group effects for the 7 numerically most important genetic groups1 for 6 breeding goal traits
 
Large differences were observed in the accuracy of the genetic group effect for different breeds and traits (Table 5Go). For some traits the genetic group effects were well estimated [e.g., carcass conformation in all breeds (>0.97)], for others they were moderately estimated [e.g., weaning weight in Aberdeen Angus (0.50), cull cow BW in Charolais (0.53), feed intake in Limousin (0.46)], and for some they were poorly estimated [e.g., feed intake in Holstein (0.02), calf quality in Simmental (0.26)]. The accuracy of the genetic group effect was related to the numbers of records for that trait within that genetic group and to the numbers of records on crossbred animals with part of their ancestry within that genetic group. As the numbers of records increased, the accuracy of the genetic group effect increased. In Holsteins, carcass conformation averaged 77.53 records per sire and the accuracy of the genetic group effect was 0.99, whereas feed intake had no records for this breed and the accuracy of the genetic group effect was 0.02. However, this relationship was not directly proportional and was offset by the differing information content in the records of the different breeds (Figure 2Go).


View this table:
[in this window]
[in a new window]

 
Table 5. Accuracy of estimated genetic group effects for the 7 numerically most important genetic groups1 for 6 breeding goal traits
 

Figure 2
View larger version (15K):
[in this window]
[in a new window]

 
Figure 2. Accuracy of estimated genetic group effect compared with the number of records of offspring with sires in each of the genetic groups for 6 genetic groups (Aberdeen Angus, Belgian Blue, Charolais, Hereford, Limousin, and Simmental) and 2 traits (cull cow weight and calf quality).

 
Bias and Accuracy of Comparisons Among Genetic Group Effects. Small biases were observed in the estimated comparisons between the different breed groups for most traits (Table 6Go). When expressed as a percentage of the phenotypic SD of the traits involved, most of these were not important, and these biases were only significantly different from zero (P < 0.05) in 8 instances: comparisons for live BW between Belgian Blue and each of Aberdeen Angus, Hereford, Holstein, and Limousin and comparisons for calf quality between Hereford and each of Aberdeen Angus, Belgian Blue, Holstein, and Simmental. There was some disparity between bias in the estimates of individual genetic group effects and bias in the estimates of comparisons between genetic groups. For example, even though the estimated genetic group effect for Belgian Blue was significantly biased (4.86% of the phenotypic SD) in the case of calf quality, the comparisons between Belgian Blue and all other breeds with the exception of Hereford were not biased. Carcass conformation in the Simmental genetic group showed a similar pattern.


View this table:
[in this window]
[in a new window]

 
Table 6. Mean bias across 350 replicates, expressed as a percentage of the phenotypic SD of the trait, of estimated comparisons between genetic group effects for the 7 numerically most important genetic groups1 for 6 breeding goal traits
 
Large differences were observed in the accuracy of estimated comparisons between different genetic group effects for different traits (Table 7Go). For carcass conformation (>0.97) and live BW (>0.92) the accuracy of comparison between all genetic groups was always high. For weaning weight it ranged from moderately low (0.31) to moderately high (0.80). For feed intake the accuracy of the comparisons among the different genetic groups ranged from very low (0.06) to moderate (0.44). Large differences also existed in the accuracy of estimated comparisons between individual genetic groups and other genetic groups. For example, the accuracy of comparisons between Aberdeen Angus and other breeds for cull cow BW varied from 0.54 (comparison with Limousin) to 0.83 (comparison with Holstein).


View this table:
[in this window]
[in a new window]

 
Table 7. Accuracies of within trait breed comparisons among 7 breeds for 6 breeding goal traits
 
Accuracies of EBV Within Genetic Group and Across Genetic Group. Average accuracy of EBV within genetic group and average accuracy of EBV across genetic group for sires recorded as AI sires in the ICBF database for the 7 numerically most important breeds for 6 of the breeding goal traits are given in Table 8Go.


View this table:
[in this window]
[in a new window]

 
Table 8. Average accuracy of EBV within genetic group (rwg), and EBV across genetic group (rag) for AI sires of the 7 numerically most important breeds1 for 6 breeding goal traits
 
Average accuracy of EBV within genetic group ranged from 0.29 for calf quality in Hereford to 0.83 for carcass conformation in Holstein. Although traits with large numbers of records per sire in the different breeds tended to have greater average accuracy of EBV within genetic group, traits with few records per sire did not always have lower average accuracy of EBV within genetic group, because of the use of correlated traits in the evaluation. For example, average numbers of records per Holstein sire for weaning weight was 1.46, for live BW was 3.34, and for feed intake was zero, yet their respective average accuracy of EBV within genetic group were similar: 0.62, 0.67, and 0.65.

Average accuracies of EBV across genetic group ranged from 0.20 for calf quality in Simmental to 0.93 for carcass conformation in Holstein and were dependent upon the accuracy of the estimated genetic group effect and accuracy of EBV within genetic group. The accuracy of EBV across genetic group may benefit or suffer from having high or low accuracy of the estimated genetic group effects or from having high or low accuracy of EBV within genetic group. For example, average accuracy of EBV within genetic group for carcass conformation (0.77) and weaning weight (0.79) in Limousin have similar values, but the large differences in the values of the accuracy of the estimated genetic group effect for these traits (carcass conformation: 0.99, weaning weight: 0.57) create large differences in average accuracy of EBV across genetic group: 0.90 and 0.75.


    DISCUSSION
 Top
 Abstract
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
The quality of genetic evaluations when genetic groups are included in the relationship matrix was assessed. Accuracy and bias of estimated genetic group effects and of the comparisons among estimated genetic group effects were evaluated, and the accuracy of EBV within genetic group and of EBV across genetic group were calculated. When applied to Irish data the method helped to estimate accuracy and bias of breeding values and to identify weaknesses and strengths in the current data recording and evaluation system.

Accuracy of EBV Across Genetic Group

The accuracy of EBV across genetic group of an animal was determined by the accuracy of the estimated genetic group effects of the genetic groups that constitute an animal and the animal’s accuracy of EBV within genetic group. The relative importance of these 2 components depended on the ratio of breed to genetic variance, the accuracy with which these components are estimated, and the sampling covariances among the genetic group effects and among the animal’s individual genetic effect and the genetic group effects. Where there is poor partitioning of these effects, the sampling covariances among them may become important.

The accuracy of the estimated genetic group effect and the accuracy of EBV across genetic group provide information about how well these effects are estimated for individuals. However, as selection decisions involve comparing alternatives, the accuracy of comparisons among these effects may be more important. Publishing the accuracies of every pair-wise comparison would be cumbersome under the current system, in which an active bull list of the top 75 sires for total beef merit is published in Ireland. Web-based delivery of sire breeding advice as part of customized selection indices (Garrick, 2005Go) would make the delivery of these accuracies feasible.

Advantages of the Methodology

The quality of genetic evaluations depends on the quality of the data used in the analysis and on the quality of the model used to analyze it. The quality of models can be assessed with goodness-of-fit tests such as the R2, Akaike (Akaike, 1973Go), and Bayesian (Schwarz, 1978Go) information criterion, or based on the predictive ability of models by estimating effects on a randomly selected portion of the data and then using these estimated effects to predict records for the remaining data (e.g., Stone, 1974Go; Perez-Enciso et al., 1993Go; Olesen et al., 1994Go; Urioste et al., 2003Go).

Assessing the quality of the data involved in genetic evaluations has received less attention than the assessment of the quality of models, and the methods developed do not provide complete assessment. Current methods to check data quality use basic statistics, determine the consistency of results and sire variances from consecutive evaluations, check for trends in Mendelian sampling (Klei et al., 2002Go), or approximate the accuracy of individual EBV within genetic group (e.g., Tier and Meyer, 2004Go). These generate summary statistics that are difficult to interpret, or give incomplete assessment, or fail to account for all of the issues known to affect data quality. Data mining techniques are being developed to assess data quality in genetic evaluations (Banos et al., 2003Go), but although these promise undoubtedly powerful insights, they require specialist training.

The sampling method used in this study assesses the data quality in genetic evaluations using statistical parameters and techniques with which practitioners of animal breeding are familiar. It takes full account of issues influencing data quality, such as effective numbers of direct and correlated records contributing to an animal’s EBV, any number of fixed effects each with any number of levels, missing records, and connectedness among animals and sub-populations.

In addition, the sampling method can be used to assess a breeding program in terms of its potential to provide genetic improvement. Deterministic methods can be used to predict the genetic gain within a breeding program (Wray and Hill, 1989Go) and these have been expanded to situations involving crossbred as well as purebred data (e.g., Bijma and van Arendonk, 1998Go). However, deterministic methods have some drawbacks. The sampling approach used in this study can take full account of the structure existing in the data to determine the accuracy of EBV. The accuracy of EBV are proportional to the potential of response to selection (Falconer and Mackay, 1996Go). In a breeding program involving multiple genetic groups, response to selection is affected by both the accuracy of EBV within genetic group and the accuracy of EBV across genetic groups. Although the accuracy of EBV within genetic group may be acceptable (e.g., for feed intake in Holstein due to correlated information) and genetic gain can be made within a breed, the accuracy of the estimated genetic group effect, and, consequently, the accuracy of EBV across genetic groups may not be acceptable and efficiency of across breed selection would be reduced.

The sampling method allows alternative data recording scenarios to be tested. Phenotypic records could be simulated for animals in the pedigree that do not have phenotypes in the real data to determine the effect that recording this information would have on the accuracy and bias of the estimated effects before embarking on potentially expensive data recording. By looking at areas of strength and weakness for the different traits, suggestions could be made about where the priorities lie for efforts to increase or decrease the effective numbers of records.

The sampling method calculates accuracy and bias assuming that the true genetic model is used in the evaluations. This is obviously a simplification of the real genetic model and parameters are only estimated. For example, in the Irish model no account is taken of heterogeneity of variances in the different genetic groups or of effects of nonadditive genetic variance. Modeling of these effects is difficult because of the number of parameters to be estimated and the quality of the data needed for these estimations (e.g., Lutaaya et al., 2002Go; Legarra et al., 2007Go). In addition, because the method is based on BLUP, it does not account for possible biases due to unrecorded selection or unrecorded preferential treatment of animals. The true accuracy and bias of genetic evaluations might therefore be somewhat different from their calculated accuracy.

The computing required for this analysis was extensive, especially for the calculation of accuracies for routine genetic evaluations. However, several steps can be taken to reduce the computing time required and therefore allow routine application. Using more modern breeding value estimation software could reduce the overall time requirements. Using MiX99 (Lidauer et al., 2006Go) compiled for a 64-bit PC, the mixed model equations for each replicate could have been solved in 366 min instead of the 462 min required with PEST. To use this method to routinely assess the quality of genetic evaluations with genetic groups in the relationship matrix, the seed number used in the simulation and the solutions of the mixed model equations could be stored for each replicate. Then, for the subsequent genetic evaluation, one could update the files of simulated values with values simulated for new records included since the previous genetic evaluation and use the solutions of the matching replicate from the previous genetic evaluation as starting values for solving the mixed model equations. This study used 350 replicates to calculate accuracy and bias of estimated effects. Using fewer replicates would have reduced the computing time. For animals with a true accuracy of 0.50, reducing the number of replicates from 350 to 200 would have only increased the 95% confidence interval of their accuracy from 0.07 to 0.10 (Figure 1Go).

Bias and Accuracy of Estimated Effects in the Irish Beef Cattle Population

The Irish multiple-breed beef cattle data set is extremely unbalanced. Certain breeds are only mated with certain others, purebred data are unavailable for certain breeds, and the amount of data recorded on the different traits differs within breeds. In spite of such imbalances the genetic evaluation model applied to the Irish multiple-breed beef cattle population allows the estimation of breed effects and comparisons among breed effects without serious systematic bias. The large differences in accuracy of the estimated genetic group effects and in the accuracy of the comparisons among the genetic group effects reflect large differences in the quality and quantity of the information used to estimate these effects and comparisons. From the results it is clear that some feed intake records on Holstein animals are required to get a reasonable estimate of the breed effect. The EBV within breed are reasonably accurate due to correlated information; hence, there is no need for continuous recording of Holstein animals for feed intake. For the beef breeds the focus should be on increasing the effective numbers of records to improve the accuracy of estimated genetic group effects and consequently the accuracy of EBV across Limousin, Simmental, and Charolais breeds for cull cow BW, and Aberdeen Angus and Hereford for weaning weight. The calf quality trait exhibited more bias than feed intake even though calf quality tended to have more records per sire in the different breeds. Feed intake was recorded on animals that were selected for performance testing based on their EBV for other traits. At the same time, performance testing was done by comparing animals from different breeds under equal circumstances. Calf quality was recorded on commercial animals sold for export. Although these were likely to be superior commercial animals, they were not selected on the basis of the other traits recorded. Therefore, better correction was likely for selective recording of feed intake compared with calf quality.

In conclusion, a method was developed to assess the quality of genetic evaluations in which animals from different genetic groups are compared, in terms of accuracy and bias of estimated effects and of comparisons among effects. Accuracy of EBV within genetic group and EBV across genetic group was very different in some instances, with the differences due to the accuracies of the estimated genetic group effects. Further theoretical work is required to quantify the effect of the ratio of breed group variance to genetic variance on the relative importance of the different components contributing to the accuracy of EBV across genetic group and to quantify the effect of sampling covariances among these components. This method can be used to calculate accuracy of multiple-breed EBV.


    Footnotes
 
1 The authors acknowledge Mario Calus, Marco Pool, Debbie Wilhelmus, Ross Evans, Donagh Berry, and Ismo Stranden for useful discussions and the Irish Cattle Breeding Federation for providing the data. Back

2 Corresponding author: John.Hickey{at}wur.nl

Received for publication October 12, 2007. Accepted for publication January 29, 2008.


    LITERATURE CITED
 Top
 Abstract
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 


Akaike, H. 1973. Information theory and an extension of the maximum likelihood principle. Pages 267–281 in 2nd Int. Symp. Inf. Theory, Budapest, Hungary. Akademiai, Budapest, Hungary.

Banos, G., P. A. Mitkas, Z. Abas, A. L. Symeonidis, G. Milis, and U. Emanuelson. 2003. Quality control of national genetic evaluation results using data mining techniques: A progress report. Interbull Bull. 31:8–15.

Bijma, P., and J. A. M. van Arendonk. 1998. Maximizing genetic gain for the sire line of a crossbreeding scheme utilizing both purebred and crossbred information. Anim. Sci. 66:529–542.

Falconer, D. S., and T. F. C. Mackay. 1996. Introduction to Quantitative Genetics. 4th ed. Longman, New York, NY.

Fouilloux, M. N., and D. Laloe. 2001. A sampling method for estimating the accuracy of predicted breeding values in genetic evaluation. Genet. Sel. Evol. 33:473–486.[CrossRef][Medline]

Garcia-Cortes, L. A., C. Moreno, L. Varona, and J. Altarriba. 1995. Estimation of prediction error variances by resampling. J. Anim. Breed. Genet. 112:176–182.

Garrick, D. J. 2005. Making the Web Equal Profit: Surfing for Genetics. Pages 105–111 in Proc. 37th Annu. Mtg. Beef Improvement Federation, Billings, MT.

Groeneveld, E. 1990. PEST User’s Manual. University of Illinois, Urbana.

Klei, L., T. Mark, F. Fikse, and T. Lawlor. 2002. A method for verifying genetic evaluation results. Interbull Bull. 29:178–182.

Legarra, A., J. K. Bertrand, T. Strabel, R. L. Sapp, J. P. Sanchez, and I. Misztal. 2007. Multi-breed genetic evaluation in a Gelbvieh population. J. Anim. Breed. Genet. 124:286–295.[CrossRef][Medline]

Lidauer, M., I. Stranden, K. Vuori, and E. Mantysaari. 2006. Mix99 User Manual. MTT, Jokioinen, Finland.

Lutaaya, E., I. Misztal, J. W. Mabry, T. Short, H. H. Timm, and R. Holzbauer. 2002. Joint evaluation of purebreds and crossbreds in swine. J. Anim. Sci. 80:2263–2266.[Abstract/Free Full Text]

Olesen, I., M. Pérez-Enciso, D. Gianola, and D. L. Thomas. 1994. A comparison of normal and nonnormal mixed models for number of lambs born in Norwegian sheep. J. Anim. Sci. 72:1166–1173.[Abstract]

Perez-Enciso, M., R. J. Tempelman, and D. Gianola. 1993. A comparison between linear and Poisson mixed models for litter size in Iberian pigs. Livest. Prod. Sci. 35:303–316.[CrossRef]

Phocas, F., and D. Laloe. 2004. Should genetic groups be fitted in BLUP evaluation? Practical answer for the French Al beef sire evaluation. Genet. Sel. Evol. 36:325–345.[CrossRef][Medline]

Reverter, A., B. L. Golden, R. M. Bourdon, and J. S. Brinks. 1994. Technical note: Detection of bias in genetic predictions. J. Anim. Sci. 72:34–37.[Abstract]

Schaeffer, L. R. 1994. Multiple-country comparisons of dairy sires. J. Dairy Sci. 77:2671–2678.[Abstract]

Schwarz, G. 1978. Estimating the dimension of a model. Ann. Statist. 6:461–464.[CrossRef]

Stone, M. 1974. Cross-validatory choice and assessment of statistical predictions. J. R. Stat. Soc. Ser. B. Method. 36:111–147.

Sullivan, P. G., J. W. Wilton, S. P. Miller, and L. R. Banks. 1999. Genetic trends and breed overlap derived from multiple-breed genetic evaluations of beef cattle for growth traits. J. Anim. Sci. 77:2019–2027.[Abstract/Free Full Text]

Tier, B., and K. Meyer. 2004. Approximating prediction error covariances in multiple-trait and random regression models. J. Anim. Breed. Genet. 121:77–89.[CrossRef]

Urioste, J. I., R. Rekaya, D. Gianola, W. F. Fikse, and K. A. Weigel. 2003. Model comparison for genetic evaluation of milk yield in Uruguayan Holsteins. Livest. Prod. Sci. 84:63–73.[CrossRef]

Van Vleck, L. D., A. F. Hakim, L. V. Cundiff, R. M. Koch, J. D. Crouse, and K. G. Boldman. 1992. Estimated breeding values for meat characteristics of crossbred cattle with an animal model. J. Anim. Sci. 70:363–371.[Abstract]

Westell, R. A., R. L. Quaas, and L. D. Van Vleck. 1988. Genetic groups in an animal model. J. Dairy Sci. 71:1310–1318.[Abstract/Free Full Text]

Wray, N., and W. G. Hill. 1989. Asymptotic rates of response from index selection. Anim. Prod. 49:217–227.



This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
jas.2007-0653v1
86/5/1047    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hickey, J. M.
Right arrow Articles by Veerkamp, R. F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hickey, J. M.
Right arrow Articles by Veerkamp, R. F.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS