J. Anim Sci.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Legarra, A.
Right arrow Articles by Bertrand, J. K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Legarra, A.
Right arrow Articles by Bertrand, J. K.
J. Anim. Sci. 2004. 82:1564-1571
© 2004 American Society of Animal Science


ANIMAL GENETICS

Constructing covariance functions for random regression models for growth in Gelbvieh beef cattle

A. Legarra, I. Misztal1 and J. K. Bertrand

Department of Animal and Dairy Science, University of Georgia, Athens 30602-2771

Abstract

Genetic parameters for a random regression model of growth in Gelbvieh beef cattle were constructed using existing estimates. Information for variances along ages was provided by parameters used for routine Gelbvieh multiple-trait evaluation, and information on correlations among different ages was provided by random regression model estimates from literature studies involving Nellore cattle. Both sources of information were combined into multiple-trait estimates; corrected for continuity, smoothness, and general agreement with literature estimates; and extrapolated to 730 d. Covariance functions using standardized Legendre polynomials were fit for the following effects: additive genetic (direct and maternal), and animal and maternal permanent environment. Residual variances at different ages were fitted using linear splines with three knots. Fit was by least squares. The order of polynomials was varied from third to sixth. Increasing the fit beyond cubic provided small improvements in R2 and increased the number of small eigenvalues of covariance matrices, especially for the additive effect. Parameters for a random regression model in beef cattle can be constructed with negligible artifacts from literature estimates. Formulas can easily be modified for other types of polynomials and splines.

Key Words: Beef Cattle • Genetic Evaluation • Random Regression

Introduction

Models with covariance functions using polynomials, also known as random coefficient or random regression models (Kirkpatrick et al., 1990Go), are routinely applied in the genetic evaluation of dairy, but are used less frequently in the beef cattle industry. In the United States, the genetic evaluation of beef cattle is based on a multiple-trait model (birth, 205 d, and 365 d; BIF, 2002Go). Weights close to given ages are preadjusted, whereas remaining weights are discarded. Random regression models allow for the use of all available records without preadjustment or editing, and would provide estimates of breeding values at any age.

There are few reports of genetic parameter estimates for random regression models in beef cattle (Albuquerque and Meyer, 2001Go; Meyer, 2002Go; Nobre et al., 2003aGo). These estimates seem to contain artifacts due to 1) the tendency of polynomials to provide poor fit at the extremes, 2) an uneven distribution of data points, and 3) the use of small datasets due to high computing costs (Misztal et al., 2000Go). As a result of those problems, Nobre et al. (2003b)Go found that evaluations from random regression models with parameters estimated from the data were worse than from multiple-trait models.

Misztal et al. (2000)Go described the so-called "constructive approach" to form artifact-free estimates of parameters of random regression models. The basic idea was to assemble functions of variances along the trajectory and of correlations across two trajectories, construct multiple-trait model parameters for a large number of traits, and then fit random regression model parameters, as in Kirkpatrick et al. (1990)Go. In dairy cattle, some parameters of a random regression model were derived from multiple-trait model parameters (Van Der Werf et al., 1998Go; Kettunen et al., 2000Go; Emmerling et al., 2002Go). The purpose of this study was to develop artifact-free parameters for a random regression model in Gelbvieh beef cattle using multiple-trait model parameters estimates for this breed, literature estimates, and heuristics.

Materials and Methods

Model
A multiple-trait model, as in BIF (2002)Go, was assumed. An equivalent random regression model should account for the following sources of variation: animal and maternal additive genetic, animal permanent environment, maternal permanent environment, and residual. A residual effect in a multiple-trait model is equivalent to permanent environmental and residual effects in a random regression model. Thus, residual (co)variances in a multiple-trait model should be decomposed into permanent environmental and residual variances in the corresponding random regression model.

Algorithm
The algorithm used to estimate the random regression model parameters was as follows:

  1. Using literature, for all effects, prepare estimates of variances for weights at several ages and estimates of correlations among different weights;
  2. For each random effect except the residual:
    1. set a vector of ages d of size nd
    2. using variances from 1, obtain a vector of variances v corresponding to days in d by linear interpolation or, beyond points obtained from literature, by linear extrapolation.
    3. similar to b, calculate a matrix of correlations among days in d: C.
    4. Construct a multiple-trait model matrix of (co)variances Gn, by multiplying each correlation in C by the correspondent standard deviations of the variances in v:


      where {odot} is the Hadamard product.

    5. Let fi(x) be an ith regression for day x. Matrix {Phi} is a nd x np matrix of functions of np degree (e.g., polynomials) for all days in d. Matrices {Phi} need not be the same for different random effects. Parameters of the random regression model: Kn of dimension np x np can be estimated as in Kirkpatrick et al. (1990)Go by solving ordinary least squares:


      where E is a matrix of residuals. In a linear form:


      where {otimes} is the Kronecker product and vec is an operator that stacks full-stored matrices in vectors (Searle, 1982Go). Operator vech, which stacks half-stored symmetric matrices, was used by Kirkpatrick et al. (1990)Go; however, in our work, it often resulted in negative-definite estimates of Kn. Least squares result in equal weights given to each (co)variance. A better match for selected points can be obtained by using weighted least squares.


  3. Residual effect in the multiple-trait model is modeled in the random regression model as a sum of permanent environment plus random regression model residual. Subsequently the residual (co)variance matrix R0 is modeled in the random regression model as a sum of permanent environment (Kp) and a diagonal error variance (Rr):


    where {Phi} are matrices of functions as defined previously. Let diag(Rr)={Theta}kr, where {Theta} is a matrix of functions, not necessarily the same as before, and kr is a vector of parameters for the residual effect in the random regression model. Solutions of Kp and kr can be obtained by least squares in the following system of equations:


    where F is a matrix of dimensions (nd x nd) x (size of kr) such that the ith row of F is equal to the jth row of {Theta} if i = (j – 1)2 + 1, and to a row of zeros otherwise, and where {varepsilon} is a vector of residuals.

Implementation
The vector of ages was set to d' = (1, 31, 61, ... 691, 721, 733). Literature estimates for variances were those used at the University of Georgia for routine genetic evaluation of Gelbvieh for 1, 205, and 365 d. Correlations among traits were initially obtained from Nobre et al. (2003a)Go for the missing data set and visually corrected based on common sense (i.e., after visualization of the corresponding patterns between different ages and detection of obvious artifacts such as "holes"); correlations among 1, 205, and 365 d were similar from the two sources. As the variance of maternal permanent environmental effect in the multiple-trait model at birth was considered 0 and the random regression model requires nonzero variances, that variance was arbitrarily set to 0.02 kg2. As the estimate of direct-maternal correlations from Nobre et al. (2003a)Go contained many artifacts, it was arbitrarily set to –0.2. Parameters as prepared are shown on a three-trait scale in Table 1Go.


View this table:
[in this window]
[in a new window]
 
Table 1. Parameters at 1, 205, and 365 d as initially created and as corrected for additive direct and maternal genetic, maternal permanent environment (MPE), and residual effects
 
Regressions for all effects except the residuals were standardized Legendre polynomials from order 3 (quadratic) to 6 (quintic). For simplicity, the same degree was fit for all polynomials at the same time. For residuals, regressions were linear splines with knots at 1, 205, and 365 d. Direct and maternal additive genetic parameters were estimated simultaneously.

Therefore the random regression model for the Gelbvieh data to be hypothetically evaluated with the parameters obtained from this work would be:


where y is a weight at a given (standardized) age t; {Phi}i is the ith order Legendre polynomial as a function of t; ai, mi, mpei, and pei are the ith random additive direct, additive maternal, maternal permanent effects environment, and animal permanent environment associated with each respective Legendre polynomial, respectively. Variance components for these effects were denoted as K previously (with different subscripts): n is the order of the Legendre polynomials, and {varepsilon} is the associated residual whose variance component varies with age and is denoted as Rr.

Analyses
Evaluation of the estimated parameters was done in several ways: 1) fit, measured as R2; 2) size and sign of the eigenvalues of the parameters; 3) comparison of original and estimated correlations of effects at 1, 205 and 365 d; 4) agreement in plots of original and estimated variances along the trajectory; and 5) agreement in plots of original and estimated correlations among ages. Computer programs for estimation and for visualization were written in Scilab (http://www-rocq.inria.fr/scilab/), a matrix-programming environment. These programs are available from the authors upon request.

Results and Discussion

Table 2Go shows the R2 and the number of small (lower than 0.1% of the highest one) eigenvalues for each effect; the direct and maternal effects are treated as one effect. High-order Legendre polynomials resulted in parameters with very small eigenvalues. Some of these eigenvalues in fifth- and sixth-order polynomials were negative; the methodology does not guarantee positive-definiteness of the result. However, because reducing the rank of the random regression model covariance matrix (i.e., setting these eigenvalues to 0) provided almost identical fit in terms of R2, these eigenvalues were unimportant.


View this table:
[in this window]
[in a new window]
 
Table 2. Fit (R2) and number of small eigenvalues using polynomials of different order, for additive direct and maternal, maternal permanent environment (MPE) and permanent environment (PE) using polynomials of order 3 (quadratic) to 6 (quintic)
 
The proposed and estimated variances along time for quadratic, cubic, and quartic fits are shown in Figures 1Go to 3Go. The fit appears to be very good even with the quadratic fit for all but the maternal effects. The maternal effects have an assumed abrupt change at 205 d, and this is difficult to model with polynomials. In general, it may be more practical to change the proposed shape of these curves than to increase the dimension of the covariance functions. Figure 3Go is arranged to show how in a random regression model the sum (the "total" residual) of the permanent environmental effect and the residual is equivalent to the residual in a multiple-trait model.



View larger version (18K):
[in this window]
[in a new window]
 
Figure 1. Original and fitted variances for direct additive effect, using polynomials of order 3 (quadratic) and 4 (cubic). The quartic polynomial is overlapped with a cubic polynomial. The estimated Nellore direct variance (Nobre et al., 2003aGo) is included as a reference.

 


View larger version (24K):
[in this window]
[in a new window]
 
Figure 3. Original and fitted variances for residual effects using polynomials of order 3 (quadratic) to 5 (quartic).

 
For (co)variances in this paper, cubic polynomials provide an adequate fit, while avoiding numerical problems. The initial (co)variances were of order 6 x 6 for the additive effects and 3 x 3 for the remaining effects, and the cubic fit results were in the order 7 x 7 and 3 x 3, respectively, if dimensions with near-zero eigenvalues are deleted. Slightly higher order of parameters for random regression model than for multiple-trait parameters matrices is usually necessary because Legendre polynomials require higher order to fit less regular functions. One should note that a perfect fit is not required because such a fit would increase computations and susceptibility to numerical errors, and BLUP is robust to slightly incorrect parameters. However, reduction of dimensionality by the elimination of small eigenvalues may result in poor fit at extremes of the trajectory (Foulley and Robert-Granié, 2002Go). In the case of the cubic fit, such reduction for the direct effect resulted in lower direct variance for birth weight, but negligible differences for (co)variances associated with weaning weight and yearling weight. Reduction for the other effects had much lower influence on the (co)variances. It seems that reduction of dimensionality due to elimination of small eigenvalues may not be prudent in all cases. We believe cubic polynomials to be a good compromise in this case. The values of the parameters (variance componentes of the random regression model) estimated with the cubic fit are shown in the Appendix.

In general, the agreement between the fit provided by cubic polynomials (Table 3Go) with those proposed in Table 1Go is quite good, with the exceptions of those correlations that were considered as null in the Gelbvieh multiple-trait model but not in the random regression model. However, their importance in practice is small because of the low correlations (in the maternal additive effect) or variances (in the maternal permanent effect). If necessary, one can achieve a better fit for selected points by using weighted least squares.


View this table:
[in this window]
[in a new window]
 
Table 3. Parameters at 1, 205, and 365 d as estimated with the cubic fit for additive direct and maternal genetic, maternal permanent environment (MPE), and residual effects
 
Figure 4Go shows the pattern of genetic correlations between the additive effects at different times. Holes in such a curve indicate fluctuating correlations, which are unlikely to be biologically realistic. Such a hole can be observed in the pattern of correlations between direct effects at 50 and 150 d of age; however, the additive correlations in this age range are low. The rest of the correlation pattern is reasonably smooth and it is considered as acceptable. "Slices" of this graph at 1 and 205 d of age are shown in Figures 5Go and 6Go, including the original "constructed" multiple-trait model parameters and the fit provided by quadratic to quartic polynomials. The quadratic polynomials do not provide a good fit, whereas the cubic polynomials provide essentially the same fit as quartic.



View larger version (23K):
[in this window]
[in a new window]
 
Figure 4. Correlations fitted for Gelbvieh between different ages for effects direct (lower right corner) and maternal (upper left corner), using cubic polynomials.

 


View larger version (14K):
[in this window]
[in a new window]
 
Figure 5. Original and fitted correlations of direct additive effect in d 1 (top) or d 205 (bottom), with effects in other days using polynomials of order 3 (quadratic) to 5 (quartic).

 


View larger version (15K):
[in this window]
[in a new window]
 
Figure 6. Original and fitted correlations of maternal additive effect in d 1 (top) or d 205 (bottom), with effects in other days using polynomials of order 3 (quadratic) to 5 (quartic).

 
Figure 7Go shows the direct-maternal genetic correlation at the same age along the trajectory. The pattern estimated in Nellore was extremely erratic and it is shown as a reference. The arbitrary correlation of –0.2, which was assumed in this study, was well approximated with any polynomial.



View larger version (20K):
[in this window]
[in a new window]
 
Figure 7. Correlations between direct and maternal additive effects at the same age as estimated in Nellore cattle (Nobre et al., 2003aGo), as constructed, and as fitted with polynomials of order 3 (quadratic) to 5 (quartic).

 
Other Polynomials and Splines
Parameters were fit for a random regression model using Legendre polynomials. Other functions, such as fractional polynomials by Foulley and Robert-Granié (2002)Go or splines (White et al., 1999Go; Torres and Quaas, 2001Go), could have resulted in better fit at a lower dimensionality. In particular, the use of linear splines would allow the use of the multiple-trait model parameters without changes for the genetic and maternal permanent effects, but not for the residual or permanent environment. However, models with splines may have poor numerical properties when points are strongly correlated. Computer programs developed for this study could be easily modified to work with other functions or polynomials.

Use of random regression models with polynomials, splines, or other general functions may lead to highly correlated regressions and subsequently poor numerical properties and low convergence rates if solutions are obtained by iteration. Covariates in random regression models can be reparameterized to result in diagonal (co)variances (Van der Werf, 1998Go) and subsequently much improved numerical properties (Lidauer et al., 2003Go; Nobre et al., 2003bGo). For an order of fit high enough, such reparameterized covariates are likely to be very similar regardless of which polynomial/spline/functions were used initially.

Implications

Parameters for random regression models for growth traits in beef cattle can be obtained without estimation from the data by using multiple-trait models’ parameter estimates derived from the data or from the literature and common-sense corrections. The calculations allow for testing the necessary degree of fit in negligible amount of time. Cubic Legendre polynomials seem to provide a reasonable fit with minimal artifacts.



View larger version (17K):
[in this window]
[in a new window]
 
Figure 2. Original and fitted variances for maternal genetic and maternal permanent effects using polynomials of order 3 (quadratic) to 5 (quartic).

 
Appendix

Estimates of the parameters (variances and covariances) of the random regression model using cubic Legendre polynomials and values in Table 1Go (second row). The term "a" stands for direct additive genetic, "m" for maternal additive genetic, "mpe" for maternal permanent environment, and "pe" for individual animal permanent environment. The subscript of each random effect is the order of the associated term in the polynomial.


Additive genetic (direct and maternal)
Item a1 a2 a3 a4 m1 m2 m3 m4
a1 413.83 181.33 –48.98 –19.42 –40.24 –5.58 5.29 –3.09
a2 181.33 139.83 1.42 –25.42 –15.94 –2.21 2.09 –1.22
a3 –48.98 1.42 47.78 21.83 2.01 0.28 –0.26 0.15
a4 –19.42 –25.42 21.83 27.17 –0.38 –0.05 0.05 –0.03
m1 –40.24 –15.94 2.01 –0.38 58.68 8.03 –12.80 4.69
m2 –5.58 –2.21 0.28 –0.05 8.03 9.93 0.84 –2.61
m3 5.29 2.09 –0.26 0.05 –12.80 0.84 5.51 –0.93
m4 –3.09 –1.22 0.15 –0.03 4.69 –2.61 –0.93 2.56
Maternal permanent environment
Item mpe1 mpe2 mpe3 mpe4
mpe1 72.81 11.06 –15.39 6.63
mpe2 11.06 4.44 –1.01 0.36
mpe3 –15.39 –1.01 5.57 –0.34
mpe4 6.63 0.36 –0.34 1.87
Animal permanent environment
Item pe1 pe2 pe3 pe4
pe1 730.90 342.30 –125.12 –59.95
pe2 342.30 227.60 –6.21 –26.95
pe3 –125.12 –6.21 87.53 31.14
pe4 –59.95 –26.95 31.14 20.88
Residual variance: Values at days 1, 205, and 365
8.83
121.94
214.40

1 Correspondence—phone: 706-542-0951; fax: 706-583-0274; e-mail: ignacy{at}uga.edu.

Received for publication August 29, 2003. Accepted for publication February 24, 2004.

Literature Cited



Albuquerque, L. G., and K. Meyer. 2001. Estimates of covariance functions for growth from birth to 630 days of age in Nellore cattle.J. Anim. Sci. 79:2776–2789.[Abstract/Free Full Text]

BIF. 2002. Guidelines for Uniform Beef Improvement Programs. Beef Improv. Fed., Athens, GA.

Emmerling, R., E. A. Mäntysaari, and M. Lidauer. 2002. Reduced rank covariance functions for a multi-lactation test-day model. CD-ROM communication No. 17-03 in Proc. 7th World Congr. Genet. Appl. Livest. Prod., Montpellier, France.

Foulley, J. L., and C. Robert-Granié. 2002. Basic tools for the statistical analysis of longitudinal data via mixed linear models. Course Notes. Montpellier, France.

Kettunen, A., E. A. Mäntysaari, and J. Pösö. 2000. Estimation of genetic parameters for daily milk yield of primiparous Ayrshire cows by random regression test-day models. Livest. Prod. Sci. 66:251–261.

Kirkpatrick M., D. Lofsvold, and M. Bulmer. 1990. Analysis of the inheritance, selection and evolution of growth trajectories. Genetics 124:979–993.[Abstract]

Lidauer, M., E. A. Mäntysaari, and I. Strandén. 2003. Comparison of test-day models for genetic evaluation of production traits in dairy cattle. Livest. Prod. Sci. 79:73–86.

Meyer, K. 2002. Estimates of covariance functions for growth of Australian beef cattle from a large set of data. CD-ROM communication No. 11-01 in Proc 7th World Congr. Genet. Appl. Livest. Prod., Montpellier, France.

Misztal, I., T. Strabel, J. Jamrozik, E. A. Mäntysaari, and T. H. E. Meuwissen. 2000. Strategies for estimating the parameters needed for different test-day models. J. Dairy Sci. 83:1125–1134.[Abstract]

Nobre, P. R. C., I. Misztal, S. Tsuruta, J. K. Bertrand, L. O. C. Silva, and P. S. Lopes. 2003a. Analysis of growth curves of Nellore cattle by multiple-trait and random regression models. J. Anim. Sci. 81:918–926.[Abstract/Free Full Text]

Nobre, P. R. C., I. Misztal, S. Tsuruta, J. K. Bertrand, L. O. C. Silva, and P. S. Lopes. 2003b. Genetic evaluation of growth in Nellore cattle by multiple-trait and random regression models. J. Anim. Sci. 81:927–932.[Abstract/Free Full Text]

Searle, S. R. 1982. Matrix Algebra Useful for Statistics. John Wiley and Sons, New York, NY.

Torres, R. A. A., Jr., and R. L. Quaas. 2001. Determination of covariance functions for lactation traits on dairy cattle using random-coefficient regressions on B-splines. J. Anim. Sci. (Suppl. 1)79:112. (Abstr.)

Van Der Werf, J. H. J., M. E. Goddard, and K. Meyer. 1998. The use of covariance functions and random regressions for genetic evaluation of milk production based on test day records. J. Dairy Sci. 81:3300–3308.[Abstract]

White, I. M. S., R. Thompson, and S. Brotherstone. 1999. Genetic and environmental smoothing of lactation curves with cubic splines. J. Dairy Sci. 82:632–638.[Abstract]


This article has been cited by other articles:


Home page
J ANIM SCIHome page
J. P. Sanchez, I. Misztal, and J. K. Bertrand
Evaluation of methods for computing approximate accuracies of predicted breeding values in maternal random regression models for growth traits in beef cattle
J Anim Sci, May 1, 2008; 86(5): 1057 - 1066.
[Abstract] [Full Text] [PDF]


Home page
J ANIM SCIHome page
J. P. Sanchez, I. Misztal, I. Aguilar, and J. K. Bertrand
Genetic evaluation of growth in a multibreed beef cattle population using random regression-linear spline models
J Anim Sci, February 1, 2008; 86(2): 267 - 277.
[Abstract] [Full Text] [PDF]


Home page
J ANIM SCIHome page
K. R. Robbins, I. Misztal, and J. K. Bertrand
A practical longitudinal model for evaluating growth in Gelbvieh cattle
J Anim Sci, January 1, 2005; 83(1): 29 - 33.
[Abstract] [Full Text] [PDF]


Home page
J ANIM SCIHome page
J. Bohmanova, I. Misztal, and J. K. Bertrand
Studies on multiple trait and random regression models for genetic evaluation of beef cattle for growth
J Anim Sci, January 1, 2005; 83(1): 62 - 67.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Legarra, A.
Right arrow Articles by Bertrand, J. K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Legarra, A.
Right arrow Articles by Bertrand, J. K.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS