|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ANIMAL GENETICS |



* Faculdade de Ciências Agrárias e Veterinárias, UNESP, Jaboticabal, SP, 14884900, Brazil;
and
IRTA, Unidad de Cunicultura, Caldes de Montbui, 68140, Spain;
and
Departamento de Ciencia Animal, Universidad Politécnica de Valencia, 46071, Spain;
and
Centre UdL-IRTA, Lleida, 25198, Spain;
and
# Faculdade de Medicina Veterinária e Zootecnia, UNESP, Botucatu, SP, 18618000, Brazil; and
|| Faculdade de Medicina, USP, Ribeirão Preto, SP, 14049900, Brazil
| Abstract |
|---|
|
|
|---|
Key Words: Bayesian analysis beef cattle growth curve longitudinal data selection Von Bertalanffy
| INTRODUCTION |
|---|
|
|
|---|
An alternative would be to use early predictors of adult BW estimated from individual growth curves. The usual functions applied to describe growth are based on differential equations that seek a biological interpretation (Arango and Van Vleck, 2002
). The parameters of these equations have a biological meaning related to initial conditions, growth rate, or adult BW, all linked to economic aspects of production. Usually, the growth function is predicted for each animal and, as a second step, environmental effects and (co)variance components of the parameters are estimated, ignoring the adjustment errors from the first step. Information on relatives is not considered when the curve parameters are estimated (i.e., not all information is used, and animals with few records cannot be included in the analysis).
The objective of this work was to evaluate the Nelore cattle growth curve parameters using a nonlinear model in a nested Bayesian procedure that allowed estimation of the joint posterior distribution of growth curve parameters, their (co)variance components, and the environmental and additive genetic components affecting them.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Data
Data were supplied by the ANCP (Associação Nacional de Criadores e Pesquisadores Ribeirão Preto, São Paulo, Brazil), which has been running the Nelore Breeding Program since 1987 (Lôbo et al., 2005
). Calves were born throughout the year, with a natural concentration in spring and summer, and were weaned and weighed at approximately 210 d of age. Besides weaning, animals were weighed at 90-d intervals from birth to 540 d of age, and those selected for reproduction were continuously weighed every 90 d.
Records were extracted for animals that were reared on pasture without supplemental feeding, did not have a foster dam, did not receive veterinary treatment, had dams older than 2 and younger than 16 yr of age, had birth weights greater than 20 kg, had all BW within the range given by the mean of all animals with the same age ± 3 SD, had at least 5 valid BW (i.e., records meeting the above conditions), and belonged to a contemporary group with at least 6 animals. The contemporary group included sex, herd, management group, and year and month of birth. The data set consisted of a total of 145,961 BW recorded from 15,386 animals, which were the offspring of 501 sires and 7,574 dams. Pedigree information was obtained for 5 generations, resulting in a total of 29,897 animals in the relationship matrix. A greater number of records were taken at the beginning of the growth process, until 540 d of age for males and females, and adult BW were available mainly from females. The data set had 5,661 BW records of 1,420 animals that were more than 4 yr of age.
The Model
A hierarchical model was applied to describe the growth curve of each animal. Each individual had a growth trajectory described by a nonlinear function, and each parameter of this function was considered to be affected by genetic and environmental effects that were described by a linear model. In the first stage of the model, it was assumed that individual growth curves followed the Von Bertalanffy function (Von Bertalanffy, 1957
):
![]() |
where yij is the observed BW of individual i at age j, ai can be interpreted as the average mature BW maintained independent of short-term fluctuations, bi is a time scale parameter related to the initial conditions, ki is a parameter related to the rate of maturing, and
ij is the fitting error, which was considered to be independent and normally distributed among individuals. The fitting error variance (
2) was considered to be the same for all animals at any time, and thus:
![]() |
In a second stage of the hierarchical model, genetic and environmental effects were described in a linear model, explaining the variation of growth curves among individuals. Considering
as a vector with the parameters a, b, and k for all individuals:
![]() |
where ß is the vector of environmental effects (contemporary group and age of dam nested in the order of calving); u is the vector of additive genetic effects; c is the vector of maternal permanent environmental effects; X, Z, and W are the incidence matrices corresponding to each effect; and e is the vector of residuals associated with each parameter. These residuals were also considered independent among individuals and normally distributed. However, because of the biological meaning of the parameters, the residual covariances between parameters from the same individual were considered different from zero. Therefore, for each individual i, cov(eai, ebi)
0, cov(eai, eki)
0, and cov(ebi, eki)
0.
Details about the application of hierarchical models for growth in animal breeding can be found in Varona et al. (1997)
, Blasco et al. (2003)
, and Piles et al. (2003)
.
The Inference
The joint posterior distribution for individual growth curve parameters, their (co)variance components, and environmental and additive genetic effects was estimated under a hierarchical Bayesian framework. Markov Chain Monte Carlo Methods (MCMC) were applied, using Metropolis-Hastings and Gibbs sampling algorithms (Casella and George, 1992
; Chib and Greenberg, 1995
; Sorensen and Gianola, 2002
).
The Conditional Density of Data. Assuming independence among individuals, the conditional distribution of data y, given the curve parameters, was a product of normal distributions:
![]() | [1] |
where N is the number of individuals with data and ni is the number of BW recorded on each individual i.
The density of the growth curve parameters, given the genetic and environmental effects, was a multivariate normal distribution:
![]() | [2] |
where G is the additive genetic (co)variance matrix; P is the maternal permanent environmental (co)variance matrix; R the residual (co)variance matrix between the parameters a, b, and k; and I is an identity matrix.
The Priors. A Bayesian probability model requires assigning prior distributions for all unknown parameters. Normal prior distributions were assumed for the environmental and genetic effects:
![]() |
![]() |
![]() |
where m and V are subjective means and (co)variances for the prior beliefs about the systematic effects, NA is the number of animals in the genealogy, Nd is the number of dams of animals with records, I is an identity matrix of order N, and A is the numerator relationship matrix.
Prior distributions for the (co)variance matrices and fitting error variance were assumed to be flat, with limits that guaranteed proper posterior distributions, as follows:
![]() |
The Posterior Distributions. Bayes theorem, assuming independence between priors, can be expressed by
![]() |
The sampling methods require random independent draws from the conditional posterior distribution for each parameter. Supposing
ik is the kth growth curve parameter for the ith animal, and
_ik are the other parameters for the ith animal and all parameters for all other animals:
![]() |
The fully conditional distributions of parameters of hierarchical multistage models were derived by Sorensen and Gianola (2002)
. In the present work, the fully conditional distributions of growth curve parameters are products of the conditional distribution of data (Eq. [1]) and the prior distribution of the growth curve parameters (Eq. [2]). The same conditional structure was presented by Varona et al. (1999)
in a simulation study applying the Von Bertalanffy function.
Supposing
i,
i, and
i are the means of parameters given the genetic and environmental effects, and raa, rbb, rkk, rab, rak, and rbk are the corresponding elements of the inverse of the residual (co)variance matrix (R), then the fully conditional distribution of parameter a can be written as
![]() |
where
![]() |
and
![]() |
The fully conditional distribution of parameter b can be written as
![]() |
where
![]() |
and
![]() |
The fully conditional distribution of parameter k can be written as
![]() |
where
![]() |
and
![]() |
Therefore, the parameter a could be easily sampled from a normal distribution, but the conditional posterior densities of parameters b and k did not have a known form. In these cases, the Metropolis-Hastings algorithm with normal proposal distribution centered on the values of bi and ki sampled in the immediately previous iteration was used.
The mixed model equations were constructed as if the observed traits were the growth curve parameters:
![]() |
The conditional posterior distributions for each location parameter ßi, ui, and ci were normal distributions defined by the coefficients and the right-hand side (RHS) of the mixed model equations built considering
ik as the record:
![]() |
where ß_i, u_i, and c_i, are the vectors including the current values of these effects after discarding the ith one, and
is the corresponding element from the coefficient matrix of the mixed models equations.
Letting np be the number of parameters of the growth model, the posterior conditionals of the (co)variance matrices were the following inverted Wishart distributions:
![]() |
The conditional posterior distribution of the fitting error variance was an inverted
2:
![]() |
The Chains.
Chains of 110,000, 330,000, and 550,000 samples, with different starting values, were carried out with sampling intervals of 10, 30, and 50, respectively. From each chain, the last 10,000 samples were used to estimate the features of marginal distributions; the burn-in period was greater than the minimum required according to the Raftery and Lewis (1992)
method. Convergence was tested for each chain using the criterion of the Geweke (1992)
and Gelman and Rubin (1992)
test. Autocorrelation between samples and features of marginal distributions of Monte-Carlo error (Geyer, 1992
) were calculated.
The Goodness of Fit.
The overall goodness of fit was checked by calculating the square of the correlation between observed BW and the predicted values marginal densities means. A cross-validation predictive density was applied to assess the goodness of fit in different parts of the curve. The observed values, yij, were compared with their prediction, Yij, obtained using all of the other data, y_ij. The predictive density of observed data is
![]() |
where y_ij is the vector of observed values discarding the observation being predicted (yij). It was considered that, if the model was adequate, the observed value, yij, is a realized random sample from its respective predictive distribution, f(Yij | y_ij), and the evaluation of model fitting was based on the comparison of the observed value with some characteristic of this distribution.
Applying the checking function proposed by Gelfand et al. (1992)
, [g(Yij ; yij) = 1 if Yij < yij and g(Yij ; yij) = 0 if Yij
yij], the E[g(Yij ; yij) | y_ij] with respect to f(Yij | y–ij) was computed for each observed record. This expectation showed the probability of a predicted value being lower than the observed one. The average of these expectations for all individuals at each jth point of the growth curve showed the goodness of fit in different parts of the trajectory. If the model fit the data properly, the averages should be close to 0.5. The estimation of E[g(Yij ; yij) | y_ij] required the computation of the following integral using MCMC methods:
![]() |
which required knowledge of f(Yij |
, y_ij) and f(
| y_ij), which means that resampling all parameters of the growth curve is needed to predict each datum. This would require extremely demanding computing. To avoid this problem, an importance sampling procedure (Gelman et al., 1995
) was applied using f(
| y) as the sampling distribution for all f(
| y_ij). If Nmc samples of the vector of parameters of the model have been drawn from f(
| y), a Monte Carlo estimator of the expectations is
![]() |
The Algorithm. A program was written using Fortran 90. Techniques related to memory use, such as data work file reduction by storing observations, pedigree information, and location parameters on separate files, increased effectiveness of the software by decreasing the reading time from the disk and avoiding storage of large amounts of missing information. The algorithm can be outlined as follows:
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Lack of convergence was not detected for any chain. The autocorrelations between samples were low in almost every chain. Heritabilities and genetic correlation estimates showed greater autocorrelations than other parameters, but this did not lead to greater estimates of Monte Carlo SE. Visual inspection of sample trace plots indicated that the sampling processes were appropriate. Features of marginal posterior densities obtained from chains with different lengths, or the same lengths but different starting values, were very similar.
The goodness of fit was considered adequate. The square of the correlation coefficient between the means of marginal densities of predicted values and observed values was 0.94. Global criteria of fit have the disadvantage of depending more on the last part of the curve than on the first part, due to a scale effect (Blasco et al., 2003
). Absolute values of the errors are greater at the adult BW than at the beginning of the growth curve. Because the model could fit better in some parts of the curve than others, and there was a specific interest in the goodness of fit of each parameter, the whole growth trajectory was examined. The probability of a predicted BW being greater or lower than the true value provided a parameter of adjustment quality for each animal at each point of its growth curve. Figure 1
shows the averages of Gelfands checking function at each measured point of the growth curve. At the beginning, there was a trend of obtaining predictions greater than the observations. The model showed a better fit at the end of the growth curve, after adult BW were reached. The expectations were near 0.5 after maturity was reached; none of the predicted values showed a high probability of being lower than the observed one.
|
Variance Components
Features of the posterior marginal density of the variance components showed a large additive genetic variance for mature BW (Table 1
). An important additive genetic component for this parameter has also been described for beef cattle by DeNise and Brinks (1985)
, Oliveira et al. (1994)
, and Meyer (1995)
. The parameter a of growth curves could be used as a selection criterion to control adult BW increases when selecting for growth rate, mainly in situations when slaughter weight is reached before maturity and adult BW can be recorded for only a few animals. Mature BW was also influenced by maternal permanent environmental effects, probably due to substantial maternal effects on BW until weaning (Albuquerque and Meyer, 2001
) and a carryover effect on postweaning growth. Rumph et al. (2002)
compared different models for estimation of genetic parameters for mature BW of Hereford cattle and reported that the model that best fit the data included maternal genetic and maternal permanent environmental effects. Archer et al. (1998)
compared growth curves from Angus cattle selected for high or low growth rate from birth to a year of age and reported that the effect of maternal environmental on growth was carried through to maturity; low lines calves reared by a high-line dam were heavier and taller at maturity compared with their naturally reared counterparts. Few studies have considered maternal effects when fitting growth curves of beef cattle. Including maternal effects in the models and separating them into genetic and environmental components need to be investigated further.
|
Covariance Components
Covariance estimates showed a negative genetic correlation between mature BW and parameter b (Table 2
). This result was expected because greater values of parameter b lead to lower birth weights [birth weight = a*(1 – b)3] and it is known that BW at different ages show a large positive genetic correlation (Albuquerque and Meyer, 2001
). Greater values of adult BW are associated with greater BW in all periods of growth. DeNise and Brinks (1985)
also reported a negative genetic correlation between the corresponding a and b parameters of the Brody growth curve in beef cattle. These parameters are similarly interpreted in Brody and Von Bertalanffy curves.
|
The length of the HPD95% of the genetic correlation between parameters b and k was too large to permit reliable conclusions, but these parameters clearly showed large positive phenotypic and residual correlations. Regardless of that, the relationship between the initial conditions and subsequent growth is explained well by the large and positive genetic correlations between early BW.
The correlation between maternal permanent environmental effects of parameters a and k suggested that, besides additive genetic effects, variations observed in both parameters in the same direction could also be caused by these effects. Maternal environment may have an overall influence on growth curves, but its relevance for parameters b and k was not clear. Optimum mature size for beef cattle is commonly debated. Cattle vary widely in body size, and the optimum depends on the production system. In general, selection programs have placed emphasis on greater growth rates. This led to changes in body size at all ages, which may be evaluated under specific production characteristics.
In the present work, adjustment error variance was considered to be constant along the curve, when obviously it should be smaller at the beginning of the growth process due to a scale effect. Growth data commonly present time-series errors, which were partially overlooked in this study. Considering other error structures leads to both overparameterization of the model and lack of convergence of the MCMC chains, as previous analyses of the data showed. Using less complex error structures is a usual procedure to avoid these problems, although it leads to some underestimation of the variance components of the parameters (Glasbey, 1979
).
Mature BW, predicted by parameter a of the growth curves, could be included in selection programs. This parameter could be predicted by combining individual BW with information derived from the parents mature BW, as well as BW of other relatives. Selection to change the slope of the growth curve without modifying adult BW would be inefficient because their genetic correlation is large. However, adult BW could be considered in a selection index with its corresponding economic weight to improve the overall efficiency of beef cattle production.
| Footnotes |
|---|
2 Corresponding author: lgalb{at}fcav.unesp.br
Received for publication October 9, 2006. Accepted for publication July 18, 2007.
| LITERATURE CITED |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. Forni, D. Gianola, G. J. M. Rosa, and G. de los Campos A dynamic linear model for genetic analysis of longitudinal traits J Anim Sci, December 1, 2009; 87(12): 3845 - 3853. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Forni, D. Gianola, and G. J. M. Rosa Predictive ability and covariance parameters of dynamic linear models for analysis of longitudinal traits1 J Anim Sci, December 1, 2009; 87(12): 3854 - 3864. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Forni, M. Piles, A. Blasco, L. Varona, H. N. Oliveira, R. B. Lobo, and L. G. Albuquerque Comparison of different nonlinear functions to describe Nelore cattle growth J Anim Sci, February 1, 2009; 87(2): 496 - 506. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |