|
|
||||||||
ANIMAL GENETICS |


* Roslin Institute (Edinburgh), Roslin, Midlothian EH25 9PS, U.K.;
and
Institute of Cell and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, U.K.; and
and
SAC, Edinburgh EH9 3JG, U.K.
| Abstract |
|---|
|
|
|---|
Key Words: Fat Genome Growth Muscle Suffolk Texel
| Introduction |
|---|
|
|
|---|
Compared with other livestock species, there is less published literature on QTL identification in sheep populations. Surprisingly few QTL have been published for traits of direct relevance to meat production, apart from studies of individual major genes, such as the callipyge locus (Freking et al., 2002
) and other candidate genes.
In the United Kingdom, sire-referencing schemes (SRS) in terminal sire sheep breeds create ideal populations for QTL detection. Through widespread use of artificial insemination with semen from high-genetic-merit sires, they create large half-sib families. Moreover, live weight measures and ultrasonic measurements of muscle and fatness are standardized across farms and centrally collated. Such data provide a unique opportunity to investigate segregation of QTL relevant to meat production within commercial populations. This study aimed to determine whether there is evidence for the segregation of QTL affecting growth, muscling, or fatness in the large half-sib families of the United Kingdom terminal sire sheep SRS.
| Materials and Methods |
|---|
|
|
|---|
|
Blood samples were collected at approximately 6 mo of age. For fresh samples, DNA was extracted from the blood using a standard salt extraction method and a phenol-chloroform extraction was used on blood samples that had been frozen. For sires, DNA was available from either blood or semen samples. Blood samples were not collected from dams; hence, dams were not genotyped.
Selection of Genomic Regions
Several previous studies have indicated the presence of a major gene (or genes) for production traits, relevant to growth and carcass traits, in sheep or other mammalian species. These include growth effects on sheep chromosome 1 around the transferrin gene (Kmiec 1999
), muscling effects on sheep chromosome 2 around the myostatin gene (e.g., Broad et al., 2000
), and growth effects around IGF1 (located on sheep chromosome 3) in cattle (Stone et al., 1999
). The leptin gene (located on sheep chromosome 4) has been extensively studied in numerous species and is recognized as a gene with a major influence on fat deposition. The calpastatin gene (located on sheep chromosome 5) interacts with the callipyge gene to affect muscling (Freking et al., 1999
). Previous studies in sheep (Walling et al., 2000
) and on the homologous region in cattle (Casas et al., 2000
) have highlighted a locus affecting growth on sheep chromosome 6. Associations have been shown between GH1 (located on sheep chromosome 11) and cattle growth (Taylor et al., 1998
). Chromosome 18 contains the callipyge gene (Freking et al., 2002
) and the Carwell longissimus muscle (ribeye) muscling locus (Nicoll et al., 1998
). Finally, the major histocompatibility complex (MHC) is located on sheep chromosome 20. Studies in cattle (Elo et al., 1999
) and pigs (Walling et al., 1998a
) have found effects for growth and fatness in the homologous MHC regions of their genomes.
Each of these regions was chosen for further study. Initially, the regions described on chromosomes 2, 3, 4, 5, 11, 18 and 20, were investigated in Texel and Suffolk lambs, and subsequently the regions on chromosomes 1 and 6 were investigated in the Suffolk animals only.
Genotyping Strategy
Informative marker panels were developed separately for each sire, in up to nine of the selected candidate regions of the sheep genome (Table 1
). This was achieved by initially genotyping each sire for all available microsatellite markers across each candidate region and then selecting heterozygous markers at approximately 10-cM intervals wherever possible. All offspring were subsequently genotyped for selected markers that were heterozygous in their sire. Relative marker locations were verified by producing linkage maps using Cri-Map (Green et al., 1990
). These were in close agreement with previous studies (Maddox et al., 2001
), indicating accurate genotype data. Given the comprehensive nature of the Maddox et al. (2001)
data set, marker orders and their relative positions were those published by Maddox et al. (2001)
.
Data Analysis
Preparation of Phenotypic Data.
The six phenotypic measurements (8WW, ScanWT, Mus, Fat, MusWT, and FatWT) were precorrected for known fixed effects and covariates. To achieve this, data collected on all farms in the SRS over the last decade were used, giving data sets of approximately 87,000 and 75,000 records for the Suffolk and Texel SRS, respectively. From these complete data sets, fixed effects for flock-year, sex, birth-rearing rank, and age of dam were estimated for each trait using ASREML (Gilmour et al., 1999
), and these estimates were then used to precorrect the phenotypic records on genotyped animals, including ultrasonic scanning traits corrected for age at scanning. Animals without a complete phenotypic record were removed from the analysis.
Information Content.
Information content was calculated at 1-cM intervals across all the regions under investigation in each population in this study for each analysis. The information content of an individual marker is the proportion of animals in which the allele inherited from the sire can be unambiguously identified. Information content at genome position i was calculated as var(pi)/0.25, where pi is the inheritance probability for each offspring included in the analysis and 0.25 is the expected variance of inheritance probabilities for a fully informative marker.
Estimation of QTL Position and Effects for a Single-QTL Model.
The probability of inheriting a particular sire chromosome at a particular position was calculated for each offspring from the genotype data at 1-cM intervals along each chromosome, using the method of Knott et al. (1996)
. A small number of uninformative offspring (i.e., offspring with genotypes identical to their sire) were removed from the analysis for that particular genomic region. Each of the adjusted phenotypes was then regressed on the inheritance probabilities, at each location on each chromosome. For each regression, an F-ratio of the full model including the inheritance probability vs. the same model without the inheritance probability was calculated. The location with the largest F-ratio was taken to be the best estimated position for a QTL for each trait.
The QTL effects were estimated using two models, within family and breed level. First, in recognition of the fact that the samples were from outbred populations and only a small number of families were available, analyses were applied to each individual sire family. The test within each family produces an F-ratio with 1 df in the numerator and (m 2) df in the denominator, where m is the number of progeny in the analyzed family. Second, breed-level QTL were investigated. To remove scale differences between families arising from between-family heterogeneity of variance, the variance of each phenotypic measurement was standardized within each family to one. Standardized adjusted phenotypes were regressed on the inheritance probabilities for all animals genotyped for the genomic region under investigation at each location. Estimates of the putative QTL effect were calculated for each sire included in the analysis for the genomic region under investigation. This test produces an F-ratio with n df in the numerator, where n is the number of sires analyzed and (
m) 2n df in the denominator.
Three-Generation Analysis.
Our data set comprised two extended families: in the Suffolk breed, Sire 1973 was the progeny of Sire 561, and in the Texel breed, Sire 424 was the progeny of Sire 1. Additional analyses were performed on these related families. In genomic regions where the two sires had at least one informative marker in common, inheritance probabilities for the progeny of the older sire were calculated for the paternal haplotype inherited by the younger sire using an extension of the method of Knott et al. (1996)
. Inheritance probabilities for the same allele were also calculated in the progeny of the younger sire also using the method of Knott et al. (1996)
. Inheritance probabilities of the alleles shared by the two sires in the younger family were calculated by multiplying inheritance probabilities of the sire with those of the progeny. Adjusted phenotypes were subsequently regressed onto the haplotype inheritance probabilities. A single-QTL effect was estimated for the entire family.
To illustrate the methodology to derive inheritance probabilities, consider the simplified pedigree in Figure 1
, in which Sire A is mated to Dam B, producing offspring C and D, and Male offspring D is mated to Dam E, producing Offspring F and G. Each animal is genotyped for three linked markers as shown. For simplicity, the dam genotypes are included and are unique. Probabilities for inheriting alleles 1, 2 and 6 from Animal A are all 1 because these are the alleles inherited by the younger sire (Animal D) regardless of the phase in Sire A. Likewise, probabilities of inheriting alleles 4, 5, and 3 from Animal A are all 0. Inheritance probabilities between markers are calculated using the recombination fractions between flanking markers and the current position. Assigning probabilities of 1 to common haplotype alleles in the younger sire avoids all offspring with 0 inheritance probabilities in the final generation and hence not contributing to the analysis. Likewise, the inheritance probability for alleles 2 and 6 in Animal F is also 1, and the inheritance probability of allele 7 at these marker locations is 0. Final inheritance probabilities for Animals F and G are obtained by multiplying their inheritance probabilities with those of Animal D at each location along the chromosome.
|
Significance Thresholds.
The 5% regional thresholds were calculated for each trait, for each analysis using permutation testing (Churchill and Doerge, 1994
). This threshold was adjusted using a Bonferroni correction to produce a 5% genomewide threshold, assuming the total genome size of the sheep to be 3,500 cM (Maddox et al., 2001
). For example, the 5% genomewide P-value from a 100-cM chromosome would be equivalent to the (100/3,500) x 0.05 = 0.0014 chromosomal P-value. In addition, a suggestive threshold (Lander and Kruglyak, 1995
) equivalent to one false-positive result per genome scan was calculated with a P-value 20 times larger than the genomewide P-value.
Confidence Intervals.
If the largest F-ratio indicated a QTL at the suggestive level, one- and two-LOD (LOD, logarithm of the odds of linkage) support intervals were produced by taking the region of the chromosome encompassed when reducing the largest F-ratio by the equivalent of an LOD score of either 1 or 2 (Lander and Botstein, 1989
). This calculation was preferred to the bootstrap method (Visscher et al., 1996
), which produces conservative intervals around areas of higher marker density (Walling et al., 1998b
, 2002
) and typically produced intervals covering the whole chromosome with these data owing to the shallow pedigree structure and consequently low numbers of recombination events (results not shown).
Two-QTL Model.
When the largest F-ratio indicated a QTL at the suggestive level, a two-QTL analysis was applied. A grid search was performed at 1-cM intervals along the same chromosome. The best-fitting two-QTL model was tested against the model fitting only one QTL using an F-ratio with 1 df in the numerator.
The program QTL Express (Seaton et al., 2002
) was used for the breed and family-level analyses for single- and two-QTL models and the calculation of the thresholds.
| Results |
|---|
|
|
|---|
|
|
|
Three-Generation Analysis
Two regions in each of the joint families produced suggestive statistical evidence for a QTL. The regions were chromosome 3 and 6 in the Suffolk family and chromosomes 2 and 18 in the joint Texel family (T1/6), and the results are summarized in Table 4
. The distribution of the test statistics for those traits with significant evidence for a QTL and the information content across the chromosome are shown in Figures 3
through 6
. The most statistically significant effects were for muscle depth (both unadjusted and adjusted for weight) on chromosome 18 in the Texel family. The estimated position was at 87 cM between markers OB2 and CSSM18, 6 cM proximal in comparison to the individual analysis for T6, with the two-LOD support including all locations distal to marker TGLA122. The information content across the two-LOD support region remained above 0.65 at all locations. The effect of the QTL was 1.31 mm and 1.15 mm for Mus and MusWT, respectively, slightly smaller than, although not significantly different from, the result from the single-QTL analysis. The other significant effect in the Texel family on chromosome 2 affected fat depth with an effect of 0.61 mm at 235 cM around the location of marker BM2113, with an information content of 0.43. The F-ratio was 7.99. This represented a decrease in significance in comparison to the individual analysis for T1; the size of effect remained the same but the estimated positions differed by over 100 cM.
|
|
|
Two-QTL Model
When the data presented evidence of a suggestive QTL, a two-QTL model was applied to the data. The results presented in Table 5
represent cases where the inclusion of a second QTL was statistically significant at a nominal 5% level, and the two putative QTL were located in different marker intervals. Owing to the collinearity between locations within the same interval, estimates of two QTL locations within the same interval are unreliable and highly variable, generally being inflated. This typically leads to an optimal two-QTL model with the loci closely positioned and acting in repulsion, with large effects that are similar but opposite. As observed by Whittaker et al. (1996)
, it is impossible to map nonisolated QTL; hence, such results have not been included.
|
The data for 8-wk weight on chromosome 2 for Family T3 also produced nominal statistical evidence for a second QTL. The two QTL acted in repulsion, with the first QTL with similar position (134 cM) and effect (1.55 kg) to the single-QTL model. The second and larger QTL is 55 cM distal at 189 cM, with an opposing effect of 2.27 kg. The only other analysis to produce statistical evidence for a second QTL in the region under investigation was that of the muscle data for Family T6 and joint Family T1/6. In this result, both the QTL from the two-QTL model were proximal to that from the single-QTL model located at approximately 54 cM and 82 cM. The effects were in repulsion, and the more distal QTL having the larger effect.
Whereas the second QTL were statistically significant at a nominal level for all the results in Table 5
, none were significant at the suggestive significance level that was applied to the single-QTL models.
| Discussion |
|---|
|
|
|---|
From the breed-level analyses, statistical evidence was found for a QTL on chromosome 1. The effect on muscle depth was located 50 cM proximal to the transferrin gene, one of a family of metal-binding proteins with an in-vivo preference for ferric iron. Interestingly, the second significant QTL in the two-QTL analysis was located only 12 cM distal to location of the transferrin gene. Given the confidence intervals for the QTL positional estimate and the sparse knowledge of the sheep genome, the transferrin gene is a possible candidate gene for the observed effects. However, there are many genes in this region and further experiments are required to test specific hypotheses regarding candidate genes.
From the analysis of individual families, the most significant result affected fatness on chromosome 2. The estimated position was 26 cM distal to the myostatin locus, responsible for the double muscling phenotype in cattle. The position and the effects are in good agreement with a previous Texel study in New Zealand (Broad et al., 2000
). The locus detected by Broad et al. (2000)
not only affects muscle measurements, but also is reported to affect fat depth, in agreement with this study. One other study in sheep has also reported effects on fatness around the myostatin region (Marcq et al., 2002
). Although a causative effect of the myostatin locus on fatness in these populations cannot be unambiguously inferred, these results are consistent with the observation that double-muscled animals have carcasses with a low percentage of fat. Therefore, mounting evidence suggests that this region of the sheep genome contains a gene or number of genes with significant effects on carcass composition.
Another significant region detected in the three-generation analysis was on chromosome 18. Effects on muscle depth traits were detected in Texel Family T1 and more significantly in the joint Family T1/6. The position of this effect corresponds to the region of the chromosome containing both the callipyge gene and the Carwell locus. Given the complex nature of callipyge inheritance, in which only heterozygous offspring inheriting the mutation from their sire express the phenotype, as well as the observation that the callipyge effect is localized to the loin region, it is unlikely that the effect in this population is attributable to the segregation of callipyge alleles; however, the effect is similar to the description of the Carwell gene. The position of the Carwell gene was reported to be around 2 to 6 cM telomeric of CSSM18 (Nicoll et al. 1998
); this study found an effect approximately 2 cM telomeric of CSSM18 in the individual family analysis and 5 to 10 cM centromeric of CSSM18 in the joint family analyses. The effect was on muscle depth of the longissimus dorsi, with no effect on fatness, although the position of the phenotypic measurement differs slightly between this and the previous study, namely, 3rd lumbar vs. 12th rib, respectively. The size of the effect in the work of Nicoll et al. (1998)
indicated an increase of muscle depth adjusted for live weight in Australian Poll Dorset sheep of 2.97 mm (SE = 0.82), which is not significantly different from the effect in the Texel families in this study (1.15 to 2.00 mm). The effect was more significant in the three-generation analysis when the model was adjusted for live weight, which is also in agreement with previous findings (Nicoll et al. 1998
), and suggests that the primary effect is an alteration in muscle shape. Given the similarities between the previous description of the Carwell locus and these results, it can be hypothesized that the same or similar locus was also segregating in the flocks used in this study.
The use of a candidate region approach seems to be successful in finding QTL in the chosen regions; however, it is not possible to estimate the number of effects that were not detected as a result of being outside of the genomic regions in this study. Nonetheless, the strategy of selecting genomic regions that have previously been reported to contain QTL affecting similar traits seems to be effective. This study suggests that where constraints that prevent a full genome scan exist, selection of genomic regions previously reported to be associated with effects on traits similar to those being studied is a valid strategy to detect QTL.
The analysis of these data used three differing methodologies assuming different underlying models of any putative gene. The breedwide analyses are most powerful when a QTL is segregating in all the sires. Given the outbred nature of the population and assuming a biallelic QTL, even in the most favorable scenario with QTL allele frequencies of 0.5 on average, the effect will not be segregating in 50% of the sires. Given the limited number of sires likely to be heterozygous for the QTL, the residual variation from other nonsegregating families is such that only very large QTL will be found to be statistically significant. Despite this, for any given significance level, the breed-level analyses are likely to be the most powerful. This study found a greater number of suggestive QTL from the within-family analyses, but the results were not adjusted for the number of families tested.
The mapping of QTL involves substantial multiple testing. This study encompassed 42.3% of the sheep genome and attempted to correct for multiple positional testing using the suggestive and 5% genomewide thresholds suggested by Lander and Kruglyak (1995)
. The study did not adjust for the 12 families tested in the within-family analyses, nor did it correct for the three traits analyzed (growth, muscle, and fat). Given the number of tests, it is likely that some results are Type 1 errors (identifying a QTL when there is not one present); however, increasing the stringency of the thresholds would increase the frequency of Type 2 errors (failing to identify a QTL present in the region studied). Adjusting thresholds for the number of traits analyzed seems to be too stringent, as results previously declared significant at a suggestive or 5% genomewide level may no longer reach the new required threshold simply because of the addition of a new trait in the analyses. Such a result would be discarded despite no change in the evidence for a putative QTL. Hence, in accordance with the recommendation of Lander and Botstein (1995), suggestive results are reported with the recognition of the potential for false positive results.
The three-generation analysis has a number of advantages over the individual family analyses. The analyses benefit from increased power owing to the additional numbers of progeny. Furthermore, the extra generation can generate additional recombination events, which leads to improved resolution of the mapped position. This is usually a better means of improving the positional estimate of the QTL than increasing marker density, which generally adds little precision (Visscher et al., 1996
). In other words, additional recombination events are more likely to be observed through the addition of more animals rather than more markers.
If confined to a single generation, Meuwissen et al. (2002)
demonstrated through work on a QTL for twinning rate that a combination of both linkage and linkage disequilibrium mapping is able to fine-map QTL. Using only large half-sib families, similar to the data analyzed in this study, Meuwissen et al. (2002)
were able to map a QTL within a region less than 1 cM. Application of this methodology to the sheep data used in this study may improve the current resolution of the identified QTL.
In summary, this study has identified at least three regions of the genome associated with muscle and fat depth in both Suffolk and Texel sheep that lend themselves to additional studies as well as exploitation possibilities. As described earlier, tracking haplotypes defining these chromosomal regions through subsequent generations in the same families can, in principle, enhance short-term genetic progress if no recombination occurs within the defined region, or improve the precision of the estimated QTL effect if observed recombinations do occur. In other words, our results present an opportunity to simultaneously improve the estimated position and effect of the desired locus as well as contribute to the benefits of marker-assisted selection.
| Implications |
|---|
|
|
|---|
|
|
| Footnotes |
|---|
2 Correspondencephone: +44 131 5274463; fax: +44 131 4400434; e-mail: stephen.bishop{at}bbsrc.ac.uk.
Received for publication August 25, 2003. Accepted for publication April 30, 2004.
| Literature Cited |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
B. D. Rodgers and D. K. Garikipati Clinical, Agricultural, and Evolutionary Biology of Myostatin: A Comparative Review Endocr. Rev., August 1, 2008; 29(5): 513 - 534. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. L. Johnson, J. C. McEwan, K. G. Dodds, R. W. Purchas, and H. T. Blair Meat quality traits were unaffected by a quantitative trait locus affecting leg composition traits in Texel sheep J Anim Sci, December 1, 2005; 83(12): 2729 - 2735. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. L. Johnson, J. C. McEwan, K. G. Dodds, R. W. Purchas, and H. T. Blair A directed search in the region of GDF8 for quantitative trait loci affecting carcass traits in Texel sheep J Anim Sci, September 1, 2005; 83(9): 1988 - 2000. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |