|
|
||||||||
ANIMAL GENETICS |

,2
* EMBRAPA Pecuária Sul (Brazilian Agricultural Research Corporation Southern Cattle and Sheep Center), Bagé, RS 96401-970, Brazil; and
and
Department of Animal Science, Michigan State University, East Lansing 48824
| Abstract |
|---|
|
|
|---|
Key Words: Bayesian Inference Multiple Breeds Outlier Robustness Residual Heteroskedasticity Structural Variance Models
| Introduction |
|---|
|
|
|---|
The objectives of this study were to 1) present alternative multiple-breed hierarchical Bayes models based on Gaussian or heavy-tailed specifications of residual heteroskedasticity, and 2) apply the proposed models to a dataset on postweaning gains of Nelore-Hereford cattle raised in diverse environments, so as to characterize the magnitude of potentially important sources of residual heteroskedasticity and the associated influence on breed group specific heritabilities.
| Materials and Methods |
|---|
|
|
|---|
The first stage of the model specifies the conditional sampling density of the n x 1 data vector y. We write the linear model for subject or element j of y as
![]() | [1] |
where µ is a fixed reference or overall mean parameter, ß is a p x 1 vector of fixed effects (e.g., gender, age of dam, heterotic effects, etc.), u is a q x 1 vector of random effects, and x'j, and z'j are known row incidence vectors connecting yj to ß and u, respectively. Finally,
across all j = 1, 2, ..., n, where
2ej represents the residual variance that is specific for yj.
We write
2ej to be a multiplicative function of fixed and random effects:
![]() | [2] |
Here, the term
2e functions as a reference parameter in Eq. [2], much like µ does in Eq. [1], albeit on a multiplicative rather than an additive scale. Secondly,
= [
1
2 ...
o]' specifies regression parameters that potentially influence residual heteroskedasticity using information in covariates pj = [p1j p2j ... p oj ]' specified for subject j. Examples of elements in
include not only overall or global regression parameters, but also classification specific regression parameters using, for example, geographically specific breed proportion or breed heterozygosity coefficients as separate covariates in pj. Now, [
1(j)
2(j) ...
r(j) ] specifies the "fixed" effects for levels of each of r different sets of classification factors (e.g., sex of calf) associated with subject j. For example, if the third level (i = 3) of a second factor is associated with subject j, then
2(j) specifies the effect of the third element of
2 = {
2i}t2i = 1 specifying the t2 heteroskedastic effects for the second factor as further presented later. In other words, if
= [
1'
2' ...
r']' is used to specify all fixed classification effects influencing residual heteroskedasticity, then
being a row incidence vector connecting log(
) to
based on the logarithm of Eq. [2]. Similarly,
specifies the "random" effects for levels of each of s different sets of classification factors (e.g., contemporary groups) associated with subject j. Finally, wj represents an optional random "noise" term as useful for specifying heavy-tailed alternatives to a Gaussian residual distribution as illustrated later.
Equation [2] is similar to expressions considered by Foulley et al. (1992)
and much subsequent work, except that previous presentations have been based extensively on the logarithm of Eq. [2], thereby rendering an expression in
as a linear function of fixed and random effects. On that logarithmic scale, log(w1j) then represents an additional source of random variability that has not been previously considered in heteroskedastic residual modeling applications. Note that random regression parameters additionally are possible in Eq. [2] for specifying, for example, subject-specific changes in residual variability over repeated measurements.
In the second stage, we specify our prior assumptions on all unknowns defined in Eq. [1] and [2]. For classical random effects, we adopt the typical structural prior:
![]() | [3] |
In multiple-breed models, G(
) includes, at the very least, a function of more than one genetic (co)variance parameter for additive genetic effects as defined by Lo et al. (1993)
. That is,
includes
2Ab as the additive variance of breed b, b = 1, 2, ..., B, and
2Sbb' as the variance due to the segregation between breed b and b', where b'
b, and b' = 1, 2, ..., B. Note that
2Sbb' can be interpreted as the additional genetic variability observed in the F2 generation relative to the F1 generation. Additional hierarchical modeling details on multiple-breed random effects are provided in Cardoso and Tempelman (2004)
. In addition, u may include nongenetic effects such as contemporary groups (CG) with variance
2CG, which are then also included in
. Finally, subjective multivariate normal or bounded uniform priors can be adopted on ß.
As noted above and by others (Sorensen and Gianola, 2002
), the Bayesian distinction between classical fixed effects ß and random effects u is based on the specification of subjective vs. structural priors. That is, a structural prior density implies a pure distributional characterization of the elements of u such that elements of
structurally define the hierarchical model and hence are inferred. Conversely, parameters defining subjective priors on ß are specified to be "known," either to ensure a proper joint posterior density or to incorporate knowledge from personal beliefs or previous studies. We have previously drawn very similar distinctions between these two classes of priors for mixed effects modeling of residual variances (Kizilkaya and Tempelman, 2005
). Here, we adopt subjectively specified inverted gamma distributions for the fixed effects influencing residual heteroskedasticity as follows:
![]() | [4] |
![]() | [5] |
and
![]() | [6] |
Here,
ki represents the fixed effect of level i of the tk x 1 vector of fixed effects
influencing residual heteroskedasticity as defined within each factor k = 1, 2, ..., r. Furthermore, the hyperparameters
*e, ß*e,
*m, ß*m,
*ki, and ß*ki are presumed to be known. To facilitate parameter identifiability in the absence of suitably informative prior specifications, such as Eq. [4], [5], and [6], or based on the use of bounded uniform priors, the effect
ktk, corresponding to the last level tk within each factor k, is constrained (i.e.,
ktk= 1, k = 1, 2, ..., r). This type of parameterization is not conceptually different from the corner parameterization (Clayton, 1996
) typically invoked for elements of ß in regular linear models software and previously adapted by Kizilkaya and Tempelman (2005)
in a simpler heteroskedastic error specification.
As with classical random effects u, structural rather than subjective inverted gamma priors are specified on random effects influencing residual heteroskedasticity (Kizilkaya and Tempelman, 2005
):
![]() | [7] |
Here,
li represents the random effect of level i within the tl x 1 vector of random effects vl of factor l, l = 1, 2, ..., s. It can be seen that E(
li) = 1 with
(and hence only defined for
l > 2) for i = 1, 2, ..., tl, such that the standard deviation
and the CV are synonymous for random effects within each factor l = 1, 2, ..., s. Note that as
l
, the influence of random factor l on residual heteroskedasticity diminishes. With the structural specification in Eq. [7], there is a borrowing of information across the tl effects of vl, very much analogous to what is done for classical random effects modeling of location parameters u as in Eq. [3] (Robinson, 1991
).
Now, there are several possible distributional assumptions on the weights {wj}, each yielding a different heavier-tailed specification on the distribution of the residuals ej relative to the use of the Gaussian distribution (see also Lange and Sinsheimer, 1993
; Rosa et al., 2003
). We specifically consider two heavy-tailed alternatives to the Gaussian or normal distribution: the Students t and the Slash distributions. The Students t residual distribution has been recently noted as promising for mitigating the effects of preferential treatment and/or deviant observations (Stranden and Gianola, 1998
; Pinheiro et al., 2001
), whereas the Slash distribution was demonstrated to be a better fit relative to the Students t for modeling residual effects in other applications (Rosa et al., 2003
).
The Gaussian distribution requires no distributional specification of wj in Eq. [2], with wj = 1 for all j = 1, 2, ...., n. A distributional specification on wj represents one way of modeling the lack of fit of the marginal density of ej to a Gaussian distribution. Let us momentarily consider the simpler homoskedastic error specification (i.e.,
2ej =
2e for all j), such that conditionally using Eq. [2],
. Given a distributional specification p(wj|
e), conditional on
e, the marginal density p(ej|
2e,
e) of ej is determined by p(ej|
2e,
e) =
ej|
2e,wj)p(wj|
e)dwj. For example, consider a Gamma
specification on wj:
![]() | [8] |
It then can be readily shown that the marginal density p(ej|
2e,
e) is a Students t density with scale parameter
2e and df
e, such that the marginal residual variance
. Alternatively wj may be specified as having the following density:
![]() | [9] |
thereby leading to a Slash distribution specification for p(ej|
2e,
e), with scale parameter
2e and df
e such that the marginal variance of ej is
. It is possible to extend heavy-tailed specifications to heteroskedastic specifications using
2ej from Eq. [2] rather than
in p(ej|
2ej) in tandem then with Eq. [8] or [9]. The resulting marginal residual densities would then be, respectively, heteroskedastic marginal Students t or Slash densities with subject-specific scale parameters as specified by the numerator of Eq. [2].
We recommend the same inverted gamma prior densities on breed-specific genetic variances and segregation variance and remaining variance components in
as considered by Cardoso and Tempelman (2004)
. Similarly, arbitrarily vague or noninformative subjective prior densities could be specified on
l, l = 1, 2, ..., s, and on
e as well.
Markov Chain Monte Carlo Inference
Hierarchical models with several stages, such as those considered in this paper, are particularly amenable to Bayesian inference using Markov chain Monte Carlo (MCMC) methods. The full conditional densities of all unknown parameters/quantities or blocks thereof necessary for inference based on MCMC are derived in the Appendix to this paper.
Application to Analysis of Nelore-Hereford Data on Postweaning Gain
Data Source.
Data analyzed in this study consisted of 22,717 postweaning gain (PWG) records of a beef cattle population comprising Herefords, Nelores, and their crosses under genetic evaluation in Brazil. The records were collected between 1974 and 2000 under a large-scale breeding program called "Conexão Delta G." Extensive details on the characteristics and structure of the data are provided in Cardoso and Tempelman (2004)
.
Candidate Models.
The linear mixed model in Eq. [1] was based on very similar specifications as previously outlined in Cardoso and Tempelman (2004)
. The elements of ß in Eq. [1] included the classification effects of region and sex, the linear effect of the length of the postweaning test period, the linear and quadratic effects of age of dam and their interactions with the additive effects of dam breed, the Nelore breed effect (ßA1), the Nelore-Hereford dominance or heterotic effect (ßD12), and the Nelore-Hereford additive x additive epistatic effect (ßAA12). The corresponding subject-specific covariate information for ßA1, ßD12, and ßAA12are the proportion of Nelore alleles fj1, the heterozygosity coefficient fj12, and the epistatic coefficient fj1(1 fj1) for subject j, respectively; further clarification of these terms is provided in Cardoso and Tempelman (2004)
. Additionally, interactions between the Nelore breed effect with the linear effect of postweaning test period length and the classification of sex were modeled in ß. Other interactions were not considered for reasons provided in Cardoso and Tempelman (2004)
.
The random effects u in Eq. [1] included the effects of 940 normally, independently, and identically distributed (NIID) CG effects with variance component
2CG in addition to the additive genetic effects of 40,082 animals, including ancestors with no data of their own. As clearly outlined previously by Cardoso and Tempelman (2004)
, the (co)variances of the genetic effects depend on breed-specific genetic variances
2A1 and
2A2 for the Nelores and Herefords, respectively, as well as for the segregation variance
2S12 between the two breeds.
Six different specifications of the distribution of the residuals were considered as based on a two-way factorial arrangement. The first factor was defined by the presence or absence of residual heteroskedasticity in a marginal sense (i.e., present based on the specification of Eq. [2] vs. absent based on the homoskedastic specification of
). The second factor was based on three alternative specifications for the marginal density of the residuals: Gaussian, Students t, or Slash. We labeled the six models M1 through M6.
M1: Homoskedastic Gaussian Error.
This model was previously used by Cardoso and Tempelman (2004
), whereby residuals were assumed to be normally, independently, and identically distributed with common residual variance
2e.
M2: Homoskedastic Students t Error.
In this model, the residuals are specified to have independently and identically distributed Students t densities with common scale parameter
2e and common degrees of freedom
e. This specification is conditionally equivalent to
, with wj for j = 1, 2, ..., n, having independent
Gamma
densities as indicated previously.
M3: Homoskedastic Slash Error.
Here, the residuals are specified to have independently and identically distributed Slash densities with common scale parameter
2e and common degrees of freedom
e. This specification is conditionally equivalent to
, where wj for j = 1, 2, ..., n, are specified as independently distributed with density as in Eq. [9].
M4: Heteroskedastic Gaussian Error. As with M1, the residuals are specified to be independently and normally distributed, but with heterogeneous residual variances as determined by Eq. [2] and constant wj = 1 for j = 1, 2, ..., n.
M5: Heteroskedastic Students t Error.
As with M2, the residuals are specified to be independently distributed as independent Students t with common degrees of freedom
e but with heterogeneous scale parameter as specified by the numerator of Eq. [2]. Recall that this specification is conditionally equivalent to the distributional assumption for ej in Eq. [2] followed by the
distribution as specified in Eq. [8].
M6: Heteroskedastic Slash Error.
As with M3, the residuals are specified to have independently distributed Slash densities with common degrees of freedom
e, but with heterogeneous scale parameters as specified by the numerator of Eq. [2]. Recall that this specification is conditionally equivalent to distributional assumption for each ej in Eq. [2] in tandem with each wj having the density specified in Eq. [9].
For each of the three heteroskedastic error models M4 through M6, the specific model chosen for residual heteroskedasticity was based on a single fixed classification effect for sex, representing the male effect by
1, thereby requiring the identifiability constraint
2 = 1 for females. The model also included fixed regression parameters
1 and
2, using information on Nelore breed proportion (p1j = fj1) and heterozygosity coefficient (p2j = fj12), respectively, for subjects j = 1, 2, ..., n. Finally, the effects v = [
1
2 ...
940]' of 940 CG within a single random effects factor also were used to model residual heteroskedasticity with a CV of
as based on Eq. [7]. For example, the residual variance for a particular F1 male animal j (i.e., fj1 = 0.50, fj12 = 1.00) in CG 2 based on the Gaussian heteroskedastic error model would be determined as
2e
1(
1)0.50(
2)1.00
2, whereas a female within the same breed group and CG would have a specified residual variance of
2e (
1)0.50(
2)1.00
2.
Prior Specifications.
For all six models, we used the same linear mixed model in Eq. [1] and the same subjective or structural prior specifications on all parameters with the natural exceptions on wj, j = 1, 2, ..., n and
e, where applicable. Bounded uniform priors were placed on all identifiable elements of ß,
, and
. Conjugate but highly dispersed inverted gamma prior densities were specified for variance components in
, specifically in a manner similar to that for
2e in Eq. [5]. Specifically, the corresponding hyperparameters for
2CG were
*CG = 2.5 and ß*CG = 2125. Similarly, for both
2A1 and
2A2, the hyperparameters were
*A = 2.5 and ß*A = 200, whereas for
2S12, the hyperparameters were
*S = 2.5 and ß*S12 = 25. These specifications also were used previously for the same data in Cardoso and Tempelman (2004)
. For the reference parameter
2e , we specified
*e = 2.5 in Eq. [5], with ß*e chosen such that the prior mean on the marginal residual variance was the same for each of the three residual density types (i.e., Gaussian, Students t or Slash). Now ß*e was chosen to be based on a REML estimate of 350 for
2e using the PWG data. Accordingly, ß*e = 2.5 x 350 for both Gaussian error models (M1 and M4),
for both Students t error models (M2 and M5), and
for both Slash error models (M3 and M6), where E(
e) denotes the prior mean of
e within the respective model. These specifications ensure the same prior mode or mean on the marginal residual variance for each model.
Finally, we specified a gamma prior on
but based on hyperparameters
*
= 0.03 and ß*
= 0.01, leading to a prior mean of E(
) = 3 but with very large prior variance of var(
) = 300. We similarly specified gamma prior densities for
e using
*
e= 0.04 and ß*
e= 0.01 for Students t error models (M2 and M5), and
*
e= 0.015 and ß*
e= 0.01 for the Slash error models (M3 and M6), such that the respective prior means were E(
e) = 4 and E(
e) = 1.5; these two prior means corresponded to a similar degree of kurtosis for these two distributions as based on a simulation study.
Definition of Genetic Parameters.
The additive genetic variance
2Ag of any individual belonging to breed group g was obtained by
2Ag = fg1
2A1 + fg2
2A2 + 2(fs1fs2 + fd1fd2)
2S12 (Lo et al., 1993
), where fib indicates the allelic proportion of breed b in breed group i. Here b = 1 and b = 2 specify Nelore and Hereford, respectively, and superscript i specifies either the breed group of the individual itself (i = g), the breed group of that individuals sire (i = s), or the breed group of the individuals dam (i = d). Within each of the three homoskedastic error models (M1 through M3), the marginal residual variance (
2E) was the same for each breed group, being
, respectively. For Model M4, the marginal residual variance specific for breed group g (
2Eg) was determined using the expression
2Eg =
2e
11
fg11
fg122, noting that E(
l) = 1 across all CG. Similarly, the marginal residual variance was specified to be
. Here,
1 = 0.3197 represents the proportion of calves with records that were male such that we report inference on residual variance weighted according to the calf sex proportions in the data structure. Alternatively, setting
1 = 1 would specify
2 Eg with respect to male calves and
= 0 specifies
2 Eg relative to female calves within breed group g. The marginal phenotypic variance for subjects in breed group g was then specified as
2 Pg =
2 Eg +
2 Ag, such that the additive heritability was h2Ag =
2 Ag/
2 Pg for the same breed group. Note, as in our previous work (Cardoso and Tempelman, 2004
), we deliberately omit
2CG in the determination of
2Pg to facilitate comparisons of our heritability estimates with intraherd estimates reported in the animal breeding literature (e.g., Koots et al., 1994
), in which CG are more commonly modeled as fixed effects.
Definition of Overall Marginal Residual Variance:
For the three homoskedastic error models (M1 through M3), the overall marginal residual variance
2E is naturally defined to be identical for all breed groups using
, respectively. Similarly, an overall marginal residual variance
2E as weighted by the existing calf sex proportion (
1 = 0.3197), proportion of calves that were Nelore (
1 = 0.2281), and average heterozygosity coefficient (
12 = 0.4347) was determined for each heteroskedastic error model. Hence, for models M4 through M6,
2E was defined by
, respectively.
MCMC Implementation.
The length of the MCMC chain for PWG was 200,000 cycles after 15,000 cycles of burn-in for all six models. Samples were saved every 10 cycles after burn-in, such that 20,000 samples were saved for posterior inference. Means, modes, and SD of the parameters were obtained from their respective marginal posterior densities. Furthermore, 95% credible sets or posterior probability intervals (95% PPI), as based on the 2.5th and 97.5th percentiles of each posterior density, were determined as interval estimates. For each dispersion parameter, the initial monotone sequence approach (Geyer, 1992
) was used to calculate effective sample sizes (ESS) based on Sorensen et al. (1995)
, which estimates the number of independent samples with information content equivalent to that contained within the 20,000 postburn-in dependent samples.
Model Choice Criterion.
We considered the pseudo-Bayes Factor (PBF) (Gelfand, 1996
) as a means for model choice. The PBF involves the evaluation of the first stage density in Eq. [1] for each MCMC cycle; let p(yj |y(j), Mf) be the conditional predictive ordinate (CPO) for observation yj under Model Mf, f = 1, 2, ..., 6. The CPO is intended to be a cross-validation density, suggesting what values of yj are likely when Model Mf is fitted to all other observations y(j) except yj. Writing
= [ß' u'
2e
' v' w']' where an MCMC approximation for the CPO is estimated by a harmonic mean
(Gelfand, 1996
), with
(i) denoting the postburn-in MCMC sample for
i = 1, 2, ..., G = 20,000.
An approximate log marginal likelihood (LML) over all observations can be obtained by the following: n
![]() |
![]() |
Finally, for comparing, say, models M1 and M2, the corresponding PBF is determined to be PBF1:2 = exp(LML1 LML2).
| Results and Discussion |
|---|
|
|
|---|
The log marginal likelihood functions (LML1 through LML6) for models M1 through M6 were computed to be 100,563, 99,623, 99,711, 99,455, 99,052, and 99,098, respectively. Based on the PBF as derived from these LML, the heteroskedastic t-error model provided the best fit to the PWG data among the six different residual specifications considered. In fact, this model had a PBF of 9.328 x 1019 compared with the next best-fitting model, the heteroskedastic slash error model. Furthermore, it can be definitively concluded that the conventional homoskedastic Gaussian error model was the poorest choice for this data set, given that its PBF relative to comparisons with the other five models always approached zero. It can be further noted from the LML values that heteroskedasticity was a more important specification relative to heavy-tailedness for the distribution of the residuals, particularly as the heteroskedastic Gaussian error model fitted the data much better than either non-Gaussian homoskedastic error models. Nevertheless, the Students t distribution always was a better fit to the data compared with the Slash distribution, whether those comparisons were made within homoskedastic or heteroskedastic error specifications.
Robustness to Outliers
Based on the homoskedastic Gaussian error model, there were 45 standardized residuals lying outside the range of ± 4.0 SD as estimated by the square root of the posterior mean of
2e. Moreover, the estimated kurtosis of this standardized residual distribution was 2.72, indicating that the distribution was somewhat leptokurtic. These features potentially explain our results of a substantially better fit of non-Gaussian error models to the data. The estimated skewness of the estimated residuals was 0.34, being of moderate magnitude; however, this skewness may be an artifact of heteroskedasticity not considered in a homoskedastic error model. Bayesian hierarchical modeling procedures that allow for skewness in residuals have been developed (Fernandez and Steel, 1998
) and implemented in animal breeding (Von Rohr and Hoeschele, 2002
); our model hierarchy could be extended accordingly.
The posterior distributions of the robustness parameters (
e) were reasonably symmetric for all non-Gaussian error models (data not shown). The posterior means ± SD of
e were 7.33 ± 0.48 and 2.20 ± 0.09 for the heteroskedastic Students t and Slash error models, respectively, whereas corresponding estimates for the homoskedastic Students t and Slash error models were 4.79 ± 0.21 and 1.66 ± 0.05, respectively. Note that higher values of
e were associated with heteroskedastic error relative to homoskedastic error specifications. This association was somewhat expected as residuals that otherwise may seem to be substantial outliers under homoskedastic error specifications, and thereby contributing to lower estimates of
e, may not be as such in subclasses with large variances as determined using heteroskedastic error models.
Assessment of Heteroskedasticity Sources
Fixed Effects and Regression Parameters.
Because the Slash error specification was uniformly poorer than the Students t error specification for fit to the PWG data using both homoskedastic and heteroskedastic error specifications, we will not discuss results involving the Slash error distribution further. Posterior inferences on fixed effects for residual heterogeneity obtained by the two remaining heteroskedastic error models are presented in Table 1
.
|
1) for residual variability was very similar under the two heteroskedastic error models. It also was surprising that the posterior density was not further concentrated away from unity as other investigators have concluded sex to be a significant source of residual heteroskedasticity for growth traits in beef cattle (Garrick et al., 1989
1 > 1|y) = 0.9378 for the heteroskedastic Students t error model. We believe the lack of sharper posterior inference (i.e., statistical power) on
1 may have been partly attributable to relatively poor environmental conditions, such that male calves were not allowed to fully express their extra growth potential relative to female calves; the average daily gain by male calves was 0.43 ± 0.17 kg, whereas it was 0.34 ± 0.13 kg for females.
There was no obvious evidence that residual variability depended on breed proportion based on the wide 95% PPI for
1 including unity in Table 2
. The effects of heterozygosity (
2) on residual heteroskedasticity, however, was more convincing. That is, it was determined that Pr(
2 > 1|y) = 0.0449 for the heteroskedastic Students t error model, indicating that as heterozygosity (i.e., heterosis) increases, residual variability decreases. This result is consistent with the theory that heterozygosity acts as homeostatic buffer to minimize environmental variation (Lerner, 1954
).
|

, thereby allowing information borrowing across CG. Animals in the same CG were kept under the same environmental, management, and feeding conditions throughout their productive life, as CG were defined by animals being born in the same herd, year, and season. Posterior inference on 
is presented using both heteroskedastic error models in Table 1
in both models indicated evidence of significantly large residual heteroskedasticity across CG. It is interesting to note that the largest element of the posterior mean of v, under the Students t heteroskedastic error model was 5.57, whereas the smallest was 0.28. In other words, some CG were estimated to have residual variances that were up to 20 times greater than in others. These results are very similar to those of Kizilkaya and Tempelman (2005)
for birth weights and calving ease scores in Italian Piedmontese cattle. Using posterior means of elements of v may be an effective way for identifying CG that are favorable for providing uniformity of various economically important traits.
It is instructive to note that the posterior mean ± posterior SD of 
(0.84 ± 0.07) under the heteroskedastic Gaussian error model was slightly larger than that (0.72 ± 0.06) estimated for the heteroskedastic Students t error model. This result was somewhat expected as the Students t error specification should attenuate the effects of residual outliers, which would otherwise inflate certain CG-specific variance estimates and therefore estimates of 
under a Gaussian error specification.
We did not explicitly consider region as a fixed-effects factor for modeling residual heteroskedasticity. Because CG are nested within region, regional differences in residual variability can be investigated by examining elements of the posterior mean of v, or estimated scaling factors, by region. The means ± the SD of these estimates were 0.93 ± 0.39, 0.97 ± 0.46, and 1.02 ± 0.57, respectively, for Regions 1, 2, and 3. Hence, there was no evidence of significant regional differences in average residual variances. The variability of these estimated scaling factors, however, tended to increase from Regions 1 to 2 to 3, such that the number of highly variable CG was greatest in Region 3. This result, however, may be due in part to the larger number of CG in this region (621 in Region 3 vs. 198 in Region 1 and 121 in Region 2), thereby increasing the probability of observing extremely highly or lowly variable CG. At any rate, regional differences in the degree of residual heteroskedasticity across CG may be possible. Our model could be readily extended to allow regional differences in the degree of residual heteroskedasticity with region-specific parameters (e.g.,
region, region = 1, 2, and 3) if needed.
Regional differences in residual heteroskedasticity also could be due to differences in the level of production and environmental quality as variances tend to be proportional to mean responses for growth traits in beef cattle (Koots et al., 1994
). Farms belonging to Region 1 were generally located in poorer environments compared with Regions 2 and 3 (i.e., the mean PWG were 82.4 ± 21.2 kg, 105.5 ± 39.5 kg, and 107.7 ± 39.3 kg for Regions 1, 2, and 3, respectively). Larger posterior means of v tended to be associated with larger CG-specific posterior means of u, with an estimated correlation coefficient between these two variables of 0.40 (i.e., CG with above-average performances also tended to be more variable). A positive correlation estimate of a similar nature, but for random genetic rather than for CG effects, has been reported for litter size in sheep (San Cristobal-Gaudy et al., 2001
), whereas, conversely, a negative correlation estimate was observed for litter size in pigs by Sorensen and Waagespeterson (2003). Both studies modeled this relationship by specifying u and log(v) for the same factor as multivariate normal with a genetic correlation between the two classes of random effects.
Variance Components and Heritabilities
Despite the same variance-covariance specifications for u as based on
across the six different models employed to analyze PWG, inference on the genetic components of
changed considerably depending on the residual distributional specification. Posterior inferences of elements of
and for the marginal residual variance (
2E) based on homoskedastic error and heteroskedastic error models are presented in Tables 2
and 3
, respectively.
|
2E widely overlapped, and point estimates (posterior means and modes) were relatively constant across the different residual distributional specifications in Tables 2
2E using the homoskedastic Gaussian error model compared with all other models. This was not surprising because non-Gaussian error models are better able to accommodate the extraneous variation given the additional lack-of-fit term wj in Eq. [2] (Stranden and Gianola, 1999
2CG increased by over 10% in other models relative to the Gaussian homoskedastic model.
Among the genetic variance components, the segregation variance
2S12 was the least affected by the different model specifications, as its posterior mean and mode were quite similar across the four models (Tables 2
and 3
). Conversely, breed specific genetic variances were widely affected by the heteroskedastic versus homoskedastic error specifications. Based on point and interval estimates, Herefords seemed to have a substantially larger genetic variance than Nelores (i.e.,
2A2
2A1 ), under both homoskedastic error models (Table 2
), whereas the opposite (i.e.,
2A1
2A2 ) was observed using heteroskedastic error models (Table 3
).
Despite a seemingly nonsignificant influence of breed proportion on genetic variance under both heteroskedastic error models (Table 3
), the effect of heteroskedasticity on breed group-specific phenotypic variances (
2Pg) was nevertheless quite appreciable, as seen from Table 4
. The wide 95% PPI for
2Pg of the Nelore breed obtained using both heteroskedastic error models indicated poor precision for inferring upon this parameter. This result was anticipated because purebred Nelores were only represented by parents without records, such that information to estimate that breed group-specific genetic and residual variance derived primarily from their crossbred progeny. Greater uncertainty on the Nelore
2Pg using either heteroskedastic error model then better reflects this data structure limitation compared with the relatively sharp 95% PPI obtained using the homoskedastic error models also observed from Table 4
. Conversely, for breed groups with substantial amounts of data, corresponding phenotypic variance estimates should be rather constant across candidate models. This was, to some extent, observed on Herefords and
Nelores (Brafords) in Table 4
. However, the largest discrepancy in posterior means of phenotypic variance between models within breed groups with records was 9.6% for F1 cattle between the two Students t error models; this result is most likely due to the significant effect of
2 on the residual variance of F1 animals in the heteroskedastic error model.
|
|
l |y) > 3) comprised Herefords exclusively. Therefore, Hereford genetic variance estimates are expected to be biased upwards when the residual variance is assumed to be homoskedastic across CG.
We previously reported substantial rerankings of animals for genetic evaluations between statistical models that either do or do not account for heterogeneous genetic variances across multiple-breed groups (Cardoso and Tempelman, 2004
). We observed rankings to be even more discrepant between homoskedastic error and heteroskedastic error models, in agreement with Hill (1984)
, even when heterogeneous genetic variances are modeled across breed groups in both models (data not shown).
| Implications |
|---|
|
|
|---|
| Appendix |
|---|
|
|
|---|
We write
for
2ej defined in Eq. [2]. With flat bounded priors on identifiable elements of ß, the FCD of
can be seen to be
![]() | (A1) |
where
![]() |
provided that X is full rank and
![]() |
with 1 being a n x 1 unitary vector. Univariate sampling strategies for drawing from Eq. [A1] are provided in Wang et al. (1994)
.
Using the prior density in Eq. [4], the FCD for
2e can be demonstrated to have an inverted gamma density with parameters
. Similarly, other parameters defining residual heteroskedasticity have inverted gamma FCD. For example, the FCD for
ki or the ith ordered element of
k that specifies effects for factor k, k = 1, 2, ..., s can be shown to be inverted gamma with parameters
is an indicator variable such that
[k]ij = 1 if the subscript address for the element of interest having row address i within
k is the same as that associated with observation j; otherwise
[k]ij = 0. Furthermore,
indicates the number of occurrences that
ki was associated with a particular observation, and
specifies the product of all elements of
= [
1'
2' ...
']', except
ki, that connect with yj. Similarly, the FCD of
li the ith ordered effect in vl can be shown to be inverted gamma with parameters
is an indicator variable, such that
[l]ij = 1 if the subscript address for the element of interest having row address i within vl is the same as that associated with observation j; otherwise
[l]ij = 0. In addition,
is defined analogously to
, but for effects in v = [v1' v2'... vr']', connected to yj and not including
li.
The FCD of
m, m = 1, 2, ..., p is not as readily recognizable as the other components of Eq. [2]:
![]() | (A2) |
![]() |
where
m denotes all other elements of
m except
m. Hence, we sampled from Eq. [A2] using a Metropolis-Hastings (MH) step. This step was based on a random walk algorithm (Chib and Greenberg, 1995
) using an inverted gamma distribution as the proposal density with scale parameter equal to the last sample value of
m times the shape parameter of the proposal density. This shape parameter was tuned during the first half of the burning period, such that the acceptance of proposed values was intermediate for optimal MCMC mixing (Chib and Greenberg, 1995
).
The FCD for the weights wj depend on the choice of p(wj |
). Adopting the specification in Eq. [8], thereby leading to the Students t specification, these FCD correspond to a series of gamma densities with parameters
represents the numerator of Eq. [2]. On the other hand, adopting Eq. [9] leads to a marginal Slash residual density, the corresponding FCD are truncated gamma densities (0 < wj < 1), with parameters
.
The FCD of
l also is not recognizable. That is, it can be shown (see also Kizilkaya and Tempelman, 2005
) that
![]() | (A3) |
where
l refers to all elements of
except
l and p(
l) is an arbitrarily defined prior. Here, Eq. [A3] does not have recognizable form and also requires MH sampling. In this case, we sampled
l = log(
l) using a random walk scheme with a Gaussian proposal density centered at the value of
l in the previous cycle and with the variance of the proposed density tuned at the midpoint of MCMC burn-in for optimal MCMC mixing (Chib and Greenberg, 1995
).
The FCD for the robustness parameter
e depends on the choices of p(wj |
e) and, of course, on p(
e). Under the Students t specification (i.e., p(wj |
e) as in Eq. [8]), the FCD of
e is given by
![]() | (A4) |
which does not have a recognizable form regardless of the specification on p(
e) but can be sampled using a MH algorithm similar to that adapted for Eq. [A2] and [A3].
Alternatively, adopting the Slash specification with p(wj |
e) as provided in Eq. [9] and a conveniently chosen conjugate Gamma
prior (see also Rosa et al., 2003
), the FCD of
e also can be shown to be gamma with parameters
.
The reader interested in the structure of and strategies for sampling from the the FCD for various elements of
in a multiple-breed animal model is referred to the Appendix of Cardoso and Tempelman (2004)
.
| Footnotes |
|---|
2 Correspondence: 1205 Anthony Hall (phone: 517-355-8445; fax: 517-353-1699; e-mail: tempelma{at}msu.edu).
Received for publication November 15, 2004. Accepted for publication March 21, 2005.
| Literature Cited |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
F. F. Cardoso, G. J. M. Rosa, and R. J. Tempelman Accounting for outliers and heteroskedasticity in multibreed genetic evaluations of postweaning gain of Nelore-Hereford cattle J Anim Sci, April 1, 2007; 85(4): 909 - 918. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |