|
|
||||||||


,3


* Roslin Institute, Roslin, Midlothian EH25 9PS, U.K.;
and
Institut de Recerca i TecnologiaAgroalimentàries, Area de Producció Animal Centre UdL-IRTA, Lleida 25198, Spain;
and
Sygen International PLC, University of Cambridge, Department of Pathology, Cambridge CB2 1QP, U.K.;
and
Swedish University of Agricultural Sciences, Uppsala S-751 24, Sweden; and
and
Universitat Autònoma de Barcelona, Bellaterra, 08193 Catalunya, Spain.
| Abstract |
|---|
|
|
|---|
Key Words: Best Linear Unbiased Prediction Genomes Least Squares Pigs Quantitative Trait Loci Variance Components
| Introduction |
|---|
|
|
|---|
Variance component methods have been promising when tested on simulated data, but there are few reports of their application to real data. In the present study, we tested the applicability of the VC methods to real data by reanalyzing a QTL confirmation experiment and compared this to results obtained by half-sib (HS) regression models. We also compared the performance of the MCMC (Heath, 1997
) and DET (Pong-Wong et al., 2001
) methods to estimate IBD scores with regard to their performance in QTL detection.
| Material and Methods |
|---|
|
|
|---|
|
|
|
Quantitative trait locus analyses were performed within breed and company, resulting in 650 region x trait x population analyses. Variance component analyses were performed at 1 to 5 cM intervals along every candidate region. On average, four positions were evaluated for every candidate region. Following the two-step approach proposed by George et al. (2000)
, the IBD scores were estimated for all positions within the two-generation pedigree of every population. These IBD scores were subsequently used to model the covariance for a putative QTL in a random mixed model. The IBD scores were estimated using an adapted version of the QTL mapping software LOKI (Heath, 1997
). This program uses a MCMC approach to obtain IBD scores in arbitrary pedigrees with missing marker data and unknown haplotypes. Thompson and Heath (1999)
present a detailed description of the method, whereas George et al. (2000)
give an overview of IBD estimation methods in arbitrary pedigrees. We did not evaluate convergence of the MCMC sampler, but instead, we used 10,000 iterations for every position, which is substantially more than the recommended figure of 10 times the number of animals in the pedigree.
For 15 population x region combinations, the estimation of IBD scores was repeated using the DET approach proposed by Pong-Wong et al. (2001)
. Their approach combines the recursive algorithm of Wang et al. (1995)
with the DET approach to estimate IBD between sibs from Knott and Haley (1998)
. To prevent complicated integration over all possible haplotype phases, the Wang et al. (1995)
algorithm is only implemented for nearest phase-known markers.
In the second step, an animal model for the quantitative trait, including a random QTL effect, is fitted for every position:
![]() | [1] |
where y is a (m x 1) vector of phenotypes, X is a (m x s) design matrix, ß is a (s x 1) vector of fixed effects (e.g., sex), Z is an (m x q) incidence matrix relating animals to phenotypes, u is a (q x 1) vector of polygenic effects, v is a (q x 1) vector of additive genotypic QTL effects, W is an (m x q) incidence matrix relating litters to phenotypes, c is the (q x 1) vector of random litter effects, and e is a residual vector. The random genetic effects u, v, and c are assumed to be distributed as multivariate normal densities with mean zero and variances
,
, and
, respectively. Matrix A is the standard additive genetic relationship matrix and G is the (q x q) (co)variance matrix for the additive QTL effects, represented by the proportion of alleles IBD (George et al., 2000
). The VC analyses were performed using ASREML (Gilmour et al., 1998
). A test statistic for a given location was obtained by running an animal model without a QTL effect:
![]() | [2] |
Twice the difference between the logarithms of the likelihood of (1) vs. (2) was used as a log likelihood ratio (LR) test. For hypothesis testing, we imposed a nominal threshold of 5% by assuming that the LR would follow a mixture of a
2 distribution with 1 df and a peak at zero (Self and Liang, 1987
). This may seem anticonservative because we tested multiple positions for every candidate region. To claim significant linkage for new QTL, Lander and Kruglyak (1995)
advocated the use of genome-wide thresholds. However, for many regions and traits in the present study, we aimed to confirm published QTL within commercial lines. For this purpose, Lander and Kruglyak (1995)
recommended the use of 0.01 nominal P-values to claim "confirmed linkage." The same authors also stated that any evidence for QTL exceeding the nominal 5% level should still be reported, even though this is not convincing evidence for the existence of a QTL. To facilitate the comparison between the two methods, we used the 5% nominal level as the basis for detection of a putative QTL. Comparisons were also made at the 1 and 0.1% level to evaluate the effect of statistical stringency on the results.
| Results and Discussion |
|---|
|
|
|---|
Differences Between Half-Sib and Variance Component Results
Figure 1
shows a comparison between the results obtained with the VC analyses and those obtained with the HS analyses by Evans et al. (2003)
. Note that the slope of the trendline is less than unity, at least in part as a result of a large number of VC analyses giving a LR of zero (Figure 1
). Using the nominal 5% significance threshold, the two methods agree in showing 42 QTL as significant. However, 19 QTL are detected only using the VC analyses, whereas 46 QTL, detected by Evans et al. (2003)
using the HS analyses, were not confirmed by the VC analyses. When comparing at the 1% level, 11 QTL were detected under both models, 17 under HS only and 6 only under VC. At the 0.1% level, only three QTL are significant under both models, whereas three others are significant only under the HS model. The comparison between the two methods with regard to detection of QTL seems fairly robust to the choice of threshold. This suggests that the discrepancies between the results reflect something more than just differences in Type-I error between the two methods. In both cases, when only one method detected a QTL, the test statistic for the other model varied between nearly significant to completely insignificant (Figure 1
). In order to understand the discrepancies between the HS and VC results, it is important to note the methodological differences between the two methods. In the HS model, an allele substitution effect is estimated as a fixed effect for every sire, independent of the other half-sib families. The maternally inherited QTL alleles are assumed to be randomly distributed between half-sibs, and the maternal genotypes are only used to increase the number of offspring that are informative for the inheritance of the sire allele. In the VC model, the variance explained by the QTL is estimated across all animals, assuming segregation of the QTL in both parents. From the 46 QTL that were only detected by the HS analyses, in 23 cases, only a single sire was inferred to be heterozygous for the QTL, whereas in five other cases, only two out of the nine sires that made up the population were heterozygous for the QTL. When a QTL is segregating at such a low frequency, it could be missed by the VC analyses because the power of detection depends on the variance explained by the QTL across the population.
|
|
![]() | [3] |
which follows a
distribution. The results for these additional analyses are summarized in Table 4
. For the examples where both HS and VC methods detected a QTL, the joint P-value is always <0.05, even though there is significant evidence for a QTL in the dams in only one example (Table 4
). For the analyses where the QTL is only detected under the HS analyses, none of the maternal HS analyses shows any evidence for a QTL. Although the joint P-values are still significant for all but a single example (Table 4
), they are larger than those from the paternal HS analyses. For the examples where the HS showed no evidence for a QTL, the maternal HS analyses showed significant evidence in four out of six cases. The joint P-values were <0.05 for three cases, whereas all of them were <0.25 (Table 4
). The comparison between the joint P-values and those from the VC analyses is compromised by the fact that results from the separate optimization of a paternal and maternal model under HS are compared with results from the joint optimization under the VC analyses. Nevertheless, the joint P-values provide a better comparison between the HS and VC results and give insight into the mechanisms underlying any discrepancies between the methods. Although we only looked at 17 cases, this provides some evidence that differences in QTL allele frequencies between sexes cause discrepancies between VC and HS analyses. These differences are probably due to sampling but could also reflect effects of selection when the parents originate from specific dam and sire lines. Furthermore, differences between paternal and maternal models could be explained by genomic imprinting, where the allele coming from one parent is silenced in the offspring.
|
To our knowledge, only Zhang et al. (1998)
have compared the performance of HS and VC analyses on real data. They reported that both methods agree well with regard to the QTL positions, but they did not compare the significance of the QTL under the two methods, although their Table 5 lists several cases where the QTL was only detected under one of the methods. However, such a comparison would be complicated by the fact that they derived the thresholds for the two methods in different ways. It must be noted that Zhang et al. (1998)
analyzed a single large granddaughter design, whereas the present study looked at 10 moderately sized HS designs.
Use of Deterministic Identity by Descent Scores
For 15 within-population regions, 127 analyses were repeated with IBD scores obtained by the DET method. These combinations, highlighted in Figure 1
, were chosen to represent all populations and to include a least one putative QTL from either the HS or VC analyses for each within-population region. The 127 analyses represent 15 cases where both VC and HS methods detected a QTL, nine where VC detected a QTL and HS did not, 15 where HS detected a QTL and VC did not, and 88 cases where neither method identified a QTL. The results of the VC analyses using deterministic IBD are compared with both the original HS analyses and the VC analyses using MCMC-derived IBD scores in Figure 2
. The VC results using deterministic IBD methods agree very well with those obtained using MCMC methods (Figure 2B
). Given the close agreement between the MCMC and DET methods to obtain IBD scores, the latter should be preferred when analyzing large amounts of data with relatively few missing markers. It must be noted that the population structure of the present experiment is still fairly simple (few additional links between families), and may therefore not offer the best comparison between MCMC and DET.
A determining factor for the feasibility of genome scans with VC analyses is the computation time that is required. To analyze five positions in a candidate region for a single trait, the complete analyses (including six ASREML runs) using DET were about nine times faster than those using MCMC (175 and 1,664 central processing unit seconds on a DEC Alpha XP1000 [Hewlett-Packard, Palo Alto, CA] with a 500-MHz processor, respectively). When using DET, the ASREML analyses and the processing of the results became the limiting factor in place of getting the IBD scores. Bayesian analyses for arbitrary pedigrees have also been proposed (Uimari et al. 1996
; Bink and Van Arendonk, 1999
), but applications to real data are limited because of computational requirements of these methods (Van Kaam et al., 2002
). A major advantage of the two-step approach (George et al., 2000
), compared with Bayesian methods (Bink and Van Arendonk, 1999
), is that once the IBD scores are estimated, a large number of traits or models can be evaluated without the need to repeat the IBD estimation.
The two-step VC approach can accommodate arbitrary pedigrees and, when using ASREML, a wide range of genetic and statistical models. These include multivariate analyses, time series, and random regression models (Gilmour et al., 1998
). A prerequisite for using more complicated models is the availability of sufficient amounts of data (i.e., large enough genotyped pedigrees). The possibility of using up to six user-defined covariance matrices in ASREML allows exploration of alternative genetic models in addition to the additive model used in the present study. Hanson et al. (2001)
proposed a framework to test for imprinting in sib-pair studies using a VC approach. Shete and Amos (2002)
provided a formal derivation for the methods proposed by Hanson et al. (2001)
and explored the sensitivity of tests for imprinting to differences in recombination fractions between males and females. The methods of Pong-Wong et al. (2001)
allow for parent-specific allelic IBD scores, which could subsequently be used to model separate paternal and maternal QTL effects. However, it is not clear how the sib-pair methodology generalizes to arbitrary pedigrees and how differences between maternal and paternal family sizes affect the power to distinguish Mendelian from imprinted QTL.
We have demonstrated the feasibility of QTL analysis using VC on a large amount of real data, equivalent to a full genome scan. The VC analyses performed well, especially when considering that the marker information was patchy (only 1 to 3 markers per region) and that the number of phenotyped animals was relatively small for reliable estimation of VC. Although the current population structure seems sufficient to detect QTL, larger populations are recommended for more reliable estimation of QTL effects.
Although the VC analyses showed few additional QTL beyond the HS analyses, they provided useful information. The present results increased confidence in those QTL that were detected by both methods and warrant closer scrutiny of the ones that were detected by only one of the methods. The experimental structure was very simple and was designed to be analyzed under a HS model. Advantages of VC methods could be more prominent when phenotypes are also available on the parents, because this information is ignored by the HS methods. As more advanced methodology is becoming available all the time, its usefulness can ultimately only be assessed by the analysis of real data. This will also facilitate further refinement of these methods. When the family structure permits, we recommend HS regression models for the initial analyses of QTL experiments within commercial lines. The advantages are the computational speed and straightforward interpretation of HS analyses, which can be performed online using QTL Express software (Seaton et al., 2002
). Given the effort and money that go into QTL mapping experiments, alternative methods should always be explored to exploit all the information that is present in the experiment. In this context, VC analyses are very useful to reanalyze data because they take all additive genetic relationships into account and provide QTL breeding values for all animals. Any discrepancies between the methods will point to QTL that need closer scrutiny.
| Implications |
|---|
|
|
|---|
| Footnotes |
|---|
3 Present address: Centro Ricerche Studi Agroalimentari FPTP-CERSA, LITA, 20090 Segrate, Italy. ![]()
2 Correspondence phone: +44-131-5274460; fax: +44-131-4400434; E-mail: dj.dekoning{at}bbsrc.ac.uk.
Received for publication January 13, 2003. Accepted for publication April 28, 2003.
| Literature Cited |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
Y. Uemoto, Y. Nagamine, E. Kobayashi, S. Sato, T. Tayama, Y. Suda, T. Shibata, and K. Suzuki Quantitative trait loci analysis on Sus scrofa chromosome 7 for meat production, meat quality, and carcass traits within a Duroc purebred population J Anim Sci, November 1, 2008; 86(11): 2833 - 2839. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. van Wijk, H. Buschbell, B. Dibbits, S.C. Liefers, B. Harlizius, H. C. M. Heuven, E. F. Knol, H. Bovenhuis, and M. A. M. Groenen Variance component analysis of quantitative trait loci for pork carcass composition and meat quality on SSC4 and SSC11 J Anim Sci, January 1, 2007; 85(1): 22 - 30. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. van Wijk, B. Dibbits, E. E. Baron, A. D. Brings, B. Harlizius, M. A. M. Groenen, E. F. Knol, and H. Bovenhuis Identification of quantitative trait loci for carcass composition and pork quality traits in a commercial finishing cross J Anim Sci, April 1, 2006; 84(4): 789 - 799. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Estelle, A. Mercade, J. L. Noguera, M. Perez-Enciso, C. Ovilo, A. Sanchez, and J. M. Folch Effect of the porcine IGF2-intron3-G3072A substitution in an outbred Large White population and in an Iberian x Landrace cross J Anim Sci, December 1, 2005; 83(12): 2723 - 2728. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Vidal, J. L. Noguera, M. Amills, L. Varona, M. Gil, N. Jimenez, G. Davalos, J. M. Folch, and A. Sanchez Identification of carcass and meat quality quantitative trait loci in a Landrace pig population selected for growth and leanness J Anim Sci, February 1, 2005; 83(2): 293 - 300. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Varona, O. Vidal, R. Quintanilla, M. Gil, A. Sanchez, J. M. Folch, M. Hortos, M. A. Rius, M. Amills, and J. L. Noguera Bayesian analysis of quantitative trait loci for boar taint in a Landrace outbred population J Anim Sci, February 1, 2005; 83(2): 301 - 307. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Szyda, Z. Liu, F. Reinhardt, and R. Reents Estimation of Quantitative Trait Loci Parameters for Milk Production Traits in German Holstein Dairy Cattle Population J Dairy Sci, January 1, 2005; 88(1): 356 - 367. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |