J. Anim Sci.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Dodds, K. G.
Right arrow Articles by Sise, J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dodds, K. G.
Right arrow Articles by Sise, J. A.
J. Anim. Sci. 2005. 83:2271-2279
© 2005 American Society of Animal Science


ANIMAL GENETICS

Genetic evaluation using parentage information from genetic markers1

K. G. Dodds2, M. L. Tate3 and J. A. Sise

AgResearch, Invermay Agricultural Centre, Mosgiel 9032, New Zealand


    Abstract
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 
Genetic evaluation relies on pedigree information to account for the trait information on individuals and their relatives. Recording pedigrees may place unfavorable restrictions on the management of breeding populations, such as the use of single-sire mating groups and the observation of parturition. The use of DNA marker information is an alternative method to identify parents, but it is difficult to assign the parents unambiguously for all progeny in extensively farmed livestock without the use of very many markers. We present methods that use DNA information on parentage within a genetic evaluation system that allow for genotyping errors and for the parentage information to be incomplete, with probabilities assigned to possible parent pairs (i.e., fractional parentage assignment). Two of these methods use a computing strategy that circumvents the high memory requirements associated with the application of previous methods designed for use with fractional parentage assignment. This strategy has an additional advantage of allowing the same statistical models to be used in the evaluation as with recorded pedigrees. The use of DNA marker-based parentage for genetic evaluation is associated with lower genetic gain (at the same survival levels) than by using the true pedigree. This decrease in gain depends on a number of factors, including trait heritability and the DNA markers used. The methods we have described show how DNA marker information could be used to replace traditional pedigree recording.

Key Words: Breeding Value • DNA Marker • Genetic Evaluation • Parentage • Pedigree


    Introduction
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 
The genetic evaluation of animals and plants relies (apart from in its simplest form, selection based only on an individual’s phenotype) on the collection of pedigree information on the population under consideration, as well as trait information on these individuals and/or their relatives. This is combined with estimated genetic parameters to provide EBV, which can be used to select individuals to be parents. The need to record pedigree often places restrictions on the management of breeding populations (e.g., the use of single-sire mating groups and the observation of parturition or use of single pens for mothers during parturition).

An alternative to traditional recording of pedigree is to use DNA marker information to identify parents; however, with extensively farmed livestock, DNA markers commonly have been used only to verify or exclude putative relationships rather than to assign parentage per se. This is due to the cost of DNA marker typing, and because the usually large number of potential parents makes it difficult to unambiguously assign the parents for all progeny (Dodds et al., 1998Go; Laughlin et al., 2003Go).

We describe methods that allow the use of DNA information on parentage within a genetic evaluation system. The system we describe is designed to be used when the parentage information is incomplete, with probabilities, allowing for genotyping errors, assigned to possible parent pairs. We also describe a computing strategy that circumvents high memory requirements associated with the application of previous methods designed for incomplete pedigree information. We describe these methods in terms of their application to farmed livestock species and assigning parent-pairs, but the general method also could be applied to plant breeding or with single parent assignment.


    Materials and Methods
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 
Management Information
To apply DNA parenting technology, information on the possible parents of an individual must be available and genetic samples from these animals obtained. The potential parents could be considered to be all those on a property during the relevant periods (mating for sires, parturition for dams). However, it is preferable to decrease the number of possibilities and thereby to decrease the "work" required by the genetic markers. This is achieved through accurate recording of the mating group (groups of sires and dams that have the opportunity to mate) and the parturition group (groups of dams maintained together from before parturition begins until their offspring are tagged, along with their collective offspring from that birthing period). The possible parent pairs for an offspring are those with the dam from the same parturition group as the offspring, and the sire from the same mating group as that dam. It is possible to manipulate the formation of these groups in a way that increases the chance of assigning parentage (Sise et al., 2001Go).

Genotyping and Parentage Probabilities
The offspring and their potential parents are to be genotyped with a number of DNA markers. We assume that the markers used are codominant, as is common practice in DNA parentage applications.

Marshall et al. (1998)Go present formulas for calculating the relative likelihood of the genotype data given that a particular male is the sire of an offspring. These formulas are given for both with and without data on the dam and her genotypes, and they allow for a specified level of genotyping errors. The likelihood is calculated relative to the likelihood of the genotypes given that there is no relationship between the individuals involved. We extend these formulas to find the relative likelihood of a particular male-female pair being the parents of an offspring (i.e., simultaneous assignment of both parents).

Suppose there are m markers scored. For a specific marker, let Ai represent the ith allele and pi its frequency, i = 1, 2, . . ., v. For each marker, calculate the likelihood, given parentage, as follows:


and the likelihood, given there is no relationship between the individuals, as follows:


where e is the assumed rate of genotyping errors, P(go), P(gm), P(gf), and P(gp) are the probabilities of the offspring, putative mother, putative father, and combined putative parent-pair genotypes respectively, and the T(.|.) are transition probabilities. For example, T(gp | go) is the probability that the parents were genotypes gp given the offspring is go. Formulas for these quantities are presented in Tables 1Go and 2Go, assuming Hardy-Weinberg equilibrium. The same results could be obtained by considering transitions in the opposite direction (as e.g., T(go | gp) P(gp) = T(gp | go) P(go)). We use the forms in Tables 1Go and 2Go because they can be presented and programmed more succinctly. The same error rate is applied to all markers and classes (sire, dam, or offspring) of individuals; however specific rates for each of these could be applied.


View this table:
[in this window]
[in a new window]
 
Table 1. Probabilities used in the calculation of parentage likelihoods when assigning parent pairsa
 

View this table:
[in this window]
[in a new window]
 
Table 2. Probabilities used in the calculation of parentage likelihoods when assigning a single parenta
 
The relative likelihood is then


If each of the possible parents is a priori considered to be equally likely, then the posterior probability of parentage given the marker information is the likelihood for that parentage divided by the sum of the likelihoods for all possible parentages. We refer to this as the "parentage probability" and to its sum over dams as the "sire probability." In practice, we consider only those that are more likely than a randomly chosen pair of parents (L > 1) or some other cutoff that does not exclude parents that would otherwise have had moderate parentage probability. In some situations, it may not be possible to determine or DNA sample the complete set of possible parents, or it may not be done with certainty. The set of possible parents for which likelihoods are calculated would then include pairs where one or both parents are unassigned. In the former case, the calculations from Table 2Go of Marshall et al. (1998)Go are used; in the later case, the relative likelihood to be used is one. A more extreme case of this situation is one in which one sex of parents is not DNA sampled (commonly the mothers), and it is the relationship to only the other sex of parents that is to be considered. An example of parentage probability calculations is shown in the Appendix.

Estimating Breeding Values
Average Relationship Matrix Method.
Genetic evaluation can be undertaken using the average relationship matrix method (ARM) as described by Henderson (1988)Go. Relationship matrix calculations are undertaken by allowing each of the possible parentages, weighted by their probabilities as calculated above. Genetic evaluation requires the inverse of this matrix, and algorithms are available for calculating this directly (Perez-Enciso and Fernando, 1992Go).

An added complication is that genetic evaluation models often include terms that depend on the parentage (particularly the dam). This includes effects such as birth date, birth rank, rearing rank, age of dam, and breed composition. These can be derived as the weighted (by the parentage probabilities) means of the values derived from each possible parentage. If offspring are tagged during or at the end of the rearing period, then the number of offspring reared can be estimated from parentage records. Normally, the parturition date and litter size of the dam are not observed (one of the reasons for using DNA parenting is to allow birthing groups to be undisturbed), but for livestock species, these effects can be estimated through the use of pregnancy scanning. Once these average effects have been derived, they need to be incorporated into the genetic evaluation model. For terms that were modeled as class effects, this will generally require a modification to the model used, as values intermediate to those used in a known-pedigree system are now possible. The simplest solution is to use rounded values, but it may be more accurate to use a suitable functional relationship (e.g., a polynomial or a spline for birth rank). This would require investigation for each specific situation (i.e., trait and effect being modeled). An alternative method that avoids these difficulties is presented in the Pedigree Sampling Method section. Standard genetic evaluation, using BLUP, involves solving a large set of linear equations (of dimension the number of individuals plus the number of fixed effects levels, if only direct genetic effects are considered), and thus, sparse matrix methods are often used to decrease the computational burden. This is effective because there typically is a large proportion of individuals that are unrelated; however, by allowing multiple parentage possibilities, DNA marker-based pedigree analyses involve a relationship matrix that is much less sparse, thereby increasing the computation required. In particular, the increased memory demands decrease the size of the population that can be evaluated. The Pedigree Sampling Method section describes an alternative computing strategy that circumvents the increased memory demands.

Pedigree Sampling Method.
The method we describe here is a type of multiple-imputation strategy. Missing information (parentage) is filled in by sampling from the possibilities weighted by their probabilities. For example, suppose an individual has one possible sire and two possible dams with equal probability. A set of parents is chosen for that individual by assigning the only possible sire and randomly choosing (with equal probability) one of the two possible dams. Once all the unknown (but partially known) parent relationships have been sampled, the genetic evaluation proceeds identically to the recorded parentage situation, and the EBV is stored. The process is then repeated a suitable number of times (100 is used in the simulations presented below). Finally, the set of EBV from each of these evaluations is averaged to give the final EBV to report. We refer to this method as the pedigree sampling (PS) method. Sen and Churchill (2001)Go used a similar multiple-imputation strategy in the analysis of QTL data. Each sampled parentage allows for the calculation of parent-dependent effects in much the same way they are calculated with known parentage. These generally are of the types that are available with full parentage recording and can therefore be handled by standard genetic evaluation software. The breed is the average of the sampled parents’ breeds, the age of dam is the age of the sampled dam, and the rearing rank is calculated as the number of progeny with the same sampled dam (in that sampling). Birth rank and approximate birth date can be obtained from the sampled dam’s pregnancy scanning information. It will sometimes be the case that a dam is assigned more progeny than her (inferred) litter size. In these cases, the rearing rank could be considered equal to the birth rank for the genetic evaluation.

Upper Quartile Pedigree Sampling Method.
When performing pedigree sampling, some samples seem more likely than others, particularly with respect to family sizes. For example, we expect a dam to have no more progeny assigned to her than her (estimated) litter size, we might expect the number of progeny per sire to follow a particular distribution, and we expect litter mates to often be full-sibs. Intuitively, we would prefer samplings where these expectations are mainly met. Here we investigate an ad hoc method, the upper quartile pedigree sampling (UQPS) method, which favors samplings with more likely family sizes, to see whether this might give a more efficient evaluation procedure. A weighting is applied to each sample. The specific details will depend on the species and production system. We use weightings, which might be used with prolific sheep, as follows. Initially, the weight is set to one. For each dam with progeny, the weight is multiplied by 0.75(s–1), where s is the number of different sires assigned for the progeny of that dam. For each dam, the weight is multiplied by 4w, where w is found in Table 3Go according to the number of progeny detected by scanning the dam and the number of progeny assigned (reared). These values approximately represent the (relative) probabilities of rearing the given number of progeny given the scanned number of progeny (and allowing for errors in the scanning process). The values are multiplied by four to aid computation. For combinations not shown, a value of w = 0.01 was used. Samples with higher weights have family sizes closer to expectation than those with low weights. Once a set of samples has been taken and their weights calculated, only the upper quartile (on weight) is retained for averaging to give EBV.


View this table:
[in this window]
[in a new window]
 
Table 3. Values of w for different combinations of offspring scanned and reared for use in calculating the weightings for the upper quartile pedigree sampling (UQPS) method
 
Simulation Study
A small simulation was undertaken to see how the methods perform. A population was generated with parameters that are typical for a prolific sheep breeding operation using DNA markers that could give moderate discrimination among potential parents. Fifty replicates were simulated.

Animals.
The pedigree simulated comprised three generations. This allowed some genetic similarities between the parents, as would normally be the case in practice. The first (grandparent) generation consisted of unrelated animals: five grandsires and 20 granddams for the paternal line, and 10 grandsires and 120 granddams for the maternal line. The second (parent) generation was generated from the respective paternal/maternal grandparent lines. The number of progeny produced from each mating was based on a distribution of 0.15, 0.45, 0.35, 0.04, and 0.01 for one to five, respectively. These progeny were randomly assigned a sex, resulting in 20 to 30 potential sires and in excess of 100 potential dams. Of these, 10 were chosen as sires and 100 selected as dams from their respective groups. The third (progeny) generation was generated from these with the same litter size distribution as in the second generation (giving an average litter size of 2.3) and randomly assigned a sex. Progeny were randomly removed (to mimic deaths) according to birth rank, in the proportions 0.15, 0.15, 0.30, 0.5, and 0.5 for one to five lambs, respectively, giving an average of 1.7 progeny reared per dam. True birth rank and rearing rank (assuming no cross-mothering) data were stored for all progeny. Any individuals in the first two generations that were not ancestors of the progeny were removed from analysis.

Genetic Markers.
Marker data were simulated for up to eight markers with allele frequencies as shown in Table 4Go. These markers are based on a set of sheep markers currently being used by a commercial genotyping facility. The allele frequencies for these markers were estimated from a set of unrelated animals from a single breeding operation. The first six of these markers were used in the simulations (as in the commercial genotyping facility), unless stated otherwise. Genotypes were assigned to grandparents by randomly selecting alleles based on the allele frequencies. Alleles for the subsequent generations were selected randomly from each parental set of alleles. To conservatively model the proportion of results from a commercial high-throughput laboratory, 10% of genotypes were randomly removed from mothers and progeny. No genotypes were removed from the sires. All genotypes were assigned without error. Once a set of genotypes is generated, those from the grandparent generation are ignored, whereas those for the parent and progeny are used to calculate parentage probabilities. The rate of genotype errors was assumed to be 1% when performing the likelihood calculations, and only those parentages with L > 1, and for which the parentage probability exceeded 0.02, were retained. All relevant animals belonged to a single mating or lambing group.


View this table:
[in this window]
[in a new window]
 
Table 4. Characteristics of markers simulated
 
Trait Values.
Grandparents were assigned genetic values for a trait with a known mean (40), residual SD (8.4), and heritability (0.3). Genetic values were determined for subsequent generations using known pedigree relationships and quantitative genetics theory. Phenotypes (y) for the progeny generation were produced from their genetic values (g), a component (c(BR,RR)) that depended on their birth rank (BR; number of progeny born to their dam) and rearing rank (RR; number of progeny reared by their dam), and a randomly sampled environmental component (e) as follows:


The values used for the BR/RR component were c(1,1) = 0, c(2+,1) = 1.4, c(2,2) = 4, c(2+,1) = 5, c(1+,3) = 7, c(1+,4) = 10, and c(1+,5) = 12, where, for instance, 2+ denotes a value of 2 or greater. Phenotypes were not modified according to sex. The true genetic values of the progeny were retained for comparisons.

Breeding Value Estimation.
The EBV were calculated using ASREML (Gilmour et al., 2002Go), with the progeny trait values and (estimated) genetic relationships between the parent and progeny generations. For each of the replicate pedigrees, five different analyses were performed using the true pedigree, the ARM method (by supplying ASREML with the inverse relationship matrix), the PS method, the UQPS method, and the best pedigree (BP; i.e., the parentage with the highest parentage probability). The PS and UQPS methods used the same 100 samplings. The model included fixed effects of sex and BR/RR (with BR of three or more grouped and similarly for RR). Analyses were performed both with the true BR/RR and with the estimated values. Estimated BR and RR were rounded to the nearest integer with the ARM method. It was assumed that the litter size of each dam was known. Correlations were found between breeding values estimated by each of the methods and between these and the true genetic values. Results were partitioned into progeny, sires, and dams.


    Results
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 
Under the conditions simulated, approximately 12% (SD over replicates = 4%) of progeny had a highest parentage probability of at least 0.9, whereas the highest sire probability exceeded 0.9 for 57% (SD = 8%) of progeny. The best parentage (regardless of probability) was the true parentage for 62% (SD = 5%) of progeny, whereas the greatest sire probability belonged to the true sire 89% (SD = 4%) of the time. The number of true parents assigned by the best parentage averaged 1.55 (SD = 0.07), whereas the number of true grandparents was 3.35 (SD = 0.11).

Means (over simulation replicates) of the correlations between breeding values calculated using the true pedigree and by the various methods are shown in Table 5Go. The ARM and the PS methods were similar if birth and rearing ranks are known. Because ARM uses rounded average birth and rearing ranks, when these are estimated, it performed slightly worse than the PS method in this situation. Alternative methods of modeling average birth and rearing ranks may improve the ARM method. The UQPS method gave similar results to the PS method, which suggests that it may be possible to devise a weighting scheme or another method that uses prior family size information for improving the process, for example to decrease computation at the genetic evaluation step. The BP method was the poorest of those considered.


View this table:
[in this window]
[in a new window]
 
Table 5. Mean and standard error of correlations between breeding values calculated using the true pedigree and by the other methods
 
The accuracy of EBV is the correlation between them and the true breeding value (which are known within the simulations). Table 6Go shows the accuracies for the three groups of animals using the true pedigrees and each of the other methods for the situation where birth and rearing rank are estimated. The ratio (with pedigree known) of the accuracies gives the relative genetic gain from using DNA marker-based pedigrees compared with that with known pedigrees.


View this table:
[in this window]
[in a new window]
 
Table 6. The accuracy of the estimate breeding value and relative genetic gain for sires, dams, and progeny, with true pedigrees and by the other methods for the situation where birth and rearing rank is estimated
 

    Discussion
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 
In many species, the need to record pedigree places restrictions on the management of breeding populations. For example, to be certain of pedigree in farmed species such as the sheep, ewes must be single-sire mated and at lambing, the birth must be observed and the lamb uniquely tagged. There are a number of drawbacks with this situation. A high level of labor input and suitable containment for many small groups of animals are required. Single-sire matings risk low pregnancy rates if the sire is not active or infertile. Intensive recording often causes stress to dam groups during parturition, which may increase mis-mothering and loss of offspring. However, as the intensity of recording is relaxed, error rates in parentage assignment increase, which in turn results in lower genetic gains. The requirement for intensive recording often means that selection of breeding stock is undertaken in a substantially different environment to commercial production systems, casting doubt on the relevance of the selection procedure. In some other species, for example in aquaculture, it may not be possible to attach identifiers to the offspring at birth. This means that to capture parentage information, offspring groups must be reared together until they are large enough to tag, and therefore, genetic differences are confounded with the early rearing environment, decreasing the effectiveness of selection.

Intensive recording at birth is often used to gain information on effects to be used in the genetic evaluation in addition to pedigree. If dead offspring are collected and genotyped along with live offspring, then litter size could be estimated as the average number of dead plus live offspring assigned to the dam. This would involve a greater genotyping cost without increasing the number of candidates for selection. It also is likely to be unpractical or inaccurate due to unrecovered dead offspring. A more feasible solution is to use pregnancy scanning records to give estimates of parturition date and litter size. Pregnancy scanning is gaining widespread adoption as a management tool in sheep, particularly highly fecund strains, to allow differential feeding of ewes bearing different numbers of lambs. Experienced scanning operators can achieve high accuracies (90 to 97%) in diagnosing fetal numbers, with errors tending to be undercounts for the higher litter sizes (Fowler and Wilkins, 1984Go; Logue et al., 1987Go). Using pregnancy scanning results to partition a flock could result in lambing contemporary groups that would contain lambs all of the same parity (i.e., a birth rank could be assigned based only on birthing group membership).

Using DNA marker information to identify parents enables the management constraints of visual recording and confinement of animals during mating and parturition to be relaxed. The use of DNA testing has been widely used in the breeding of high-value animals (such as horses) to monitor the accuracy of pedigree records. Such parentage matching is typically very reliable and effective for an offspring with few possible parents, such as a known mother with two or a few potential fathers. If all but one of the possible parent-pairs can be excluded, the remaining pair is declared the true parents.

With extensively farmed livestock, however, it is difficult to unambiguously assign the parents for all progeny (Dodds et al., 1998Go; Laughlin et al., 2003Go). For example, in sheep where 10 sires may be mated with 1,000 ewes, a lamb could have one of 10,000 possible combinations of parents. Parentage assignment requires that all 9,999 incorrect parent-pairs are excluded. Sherman et al. (2004)Go achieved 86% unambiguous paternity assignment with 11 microsatellite markers when there were 26 possible sires. Sise et al. (2001)Go investigated management strategies to improve unambiguous assignment.

Parentage matching strategies should allow for genotyping errors. A common method is to require two or more exclusions among the marker tests. Although this works reasonably well in situations where many markers are typed, it is somewhat arbitrary and is less useful when fewer markers are scored. It also makes it more difficult to exclude incorrect parent-pairs, exacerbating the problem of excluding large numbers of false parentage possibilities.

A number of statistical methods have been developed to attempt to assign pedigree in natural populations, but typically only aim to identify the sire or to assign the most likely sire. The objective of these studies is to provide information on population parameters, such as the distribution of mating success (e.g., Devlin et al., 1988Go; Dickinson and McCulloch, 1989Go; Neff et al., 2001Go). One of these methods (Devlin et al., 1988Go) allows paternity to be spread over several possible fathers, a method the authors refer to as "fractional paternity." Another application is the generation of putative sibships to allow the estimation of genetic parameters (Thomas and Hill, 2000Go). In all of these cases, the parentage assignment of specific individuals is of lesser importance, as opposed to the case of selection of breeding individuals, where the parentage may have a large influence on which individuals are chosen. Methods have been proposed to account for multiple possibilities of sire pedigrees during the genetic evaluation process (Henderson, 1988Go; Foulley et al., 1990Go; Perez-Enciso and Fernando, 1992Go; Cardoso and Tempelman, 2003Go). Although mentioned as a possibility, these methods have not demonstrated the use of genetic markers in assigning probabilities to possible parentages.

We have described methods that allow the use of DNA information on parentage within a genetic evaluation system. The simulation results show that these methods allow much of the genetic progress that could have been made had the true parents been known. This is particularly true for the groups of animals where the most selection pressure is applied (sires and progeny). The BP also can be used but with lower genetic progress. Relative genetic gain results for the ARM and PS methods are similar if birth/rearing rank is known (both 97% for progeny), but PS performed better than ARM when birth/rearing rank was estimated (e.g., for progeny the values are 94% for ARM compared with 96% for PS). The birth/rearing rank known results also give an indication of the outcome for traits that are not influenced by such effects. If fixed effects can be modeled suitably or are absent from the model, lower memory costs may make the ARM method more practical than with current technology.

The DNA marker-based fractional parentage methods of genetic evaluation work well because the "mistakes" in parentage tend to be conservative. By this we mean that if the most likely parentage is not the correct one, it will often comprise close relatives of the true parents, as they will often have similar genotypes. This also will be the case for other parentages with nonzero probabilities. In this way, the system is tolerant to the incompleteness of the parentage information. In our simulations the best pedigrees had, on average, 84% of the grandparents correct, compared with 77% of parents correct (62% of offspring had both parents correct, and another 30% had one parent correct).

Many factors may affect the amount of genetic progress achieved using DNA markers. These include the number of DNA markers used, missing animals and genotypes from the dataset, and the heritability of the trait under test. We have made a brief assessment of the effect of these factors on genetic progress by undertaking additional simulation runs. Results are presented as the relative genetic progress achieved in progeny using the PS method compared with the true pedigree, which achieved 96% in the situation presented in the results section. Changing the number of DNA markers from six to four or eight (the first four and eight markers, respectively, in Table 4Go) gave a 2% decrease or 2% increase, respectively, in the relative genetic gain. The methods that gave lower relative genetic gain (Table 6Go) were more influenced by the number of markers used, so that with eight markers the relative genetic gain ranged from 94 to 98% across the four methods. Changing the heritability from 0.3 to 0.1 or 0.5 gave a 1% decrease or a 2% increase, respectively, in the relative genetic gain, reflecting the lower dependence on information from relatives as the heritability increases. Decreasing the number of missing genotypes in the mothers and progeny from 10% missing to 5%, increased the relative genetic gain by 1% with the six DNA markers. When the genotypes for dams and progeny were generated with a 1% error rate, there was a negligible change to the relative genetic gain. The parentage analysis used assumes all animals are present and available for the analysis. Practically speaking, in a farm situation, some animals may be unrecorded and unavailable for inclusion in the analysis. Although the effect of 5 to 10% of mothers missing does not greatly affect the overall result, if one of the 10 sires used is absent from the analysis, the relative genetic gain decreases by 1%.

The results presented compare the gain against the "gold standard" of perfect pedigree recording. In practical farming systems this is seldom, if ever, achieved. A number of reports have investigated the level of pedigree errors (Table 7Go). There also have been a number of reports investigating the decrease in genetic gain due to errors in pedigree recording. Israel and Weller (2000)Go, Banos et al. (2001)Go, and Spelman (2002)Go found decreases of 4 to 15% in genetic gain in dairy cattle due to 10 to 15% errors in pedigree recording. Differences in these results are likely to be due to differences in the selection programs, but the results indicate that the gains with the proposed marker-based methods relative to what can practically (due to pedigree errors) be achieved would be up to 10% greater than those tabulated above.


View this table:
[in this window]
[in a new window]
 
Table 7. Reported estimated pedigree error rates
 
The number of samplings needed for the PS or UQPS methods will depend on several factors. For example, traits with low heritability are likely to need more samplings than traits with high heritability because of their greater reliance on pedigree information. In our simulations that used the true birth/rearing rank, the correlations of true pedigree EBV with those using either the PS or ARM methods had similar SE, indicating that there was little extra variability with the PS method, in this case with 100 samplings. The extra variability induced by PS could be monitored by collating the SE of individuals’ EBV over the samplings. The number of samplings could be set to ensure that some summary statistic (e.g., mean or upper quartile) of these is below a specified threshold. This threshold could be set relative to the spread of EBV within the cohort(s) where selection is taking place; it is the rankings in these animals that we would like to remain unchanged with increased samplings.

We have used a somewhat simple approach when considering the possible parent combinations—all are considered equally likely a priori. It may be possible to use additional information to construct unequal prior probabilities for each parentage. For example, if mate sires are only present during certain portions of the mating period, then the expected conception date would favor mate sires present at that time; however, this also would increase the complexity of practical recording systems.

The methods we have described show how DNA marker information could be used to replace traditional pedigree recording. This is associated with lower genetic gain (at the same survival levels) but enables relaxed management strategies. The pedigree sampling system has been implemented for a number of large sheep breeding operations in New Zealand and is being implemented into the Sheep Improvement Limited (Wellington, New Zealand) system as a routine option.


    Implications
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 
Genetic evaluation relies on pedigree information to account for the trait information on individuals and their relatives. A set of DNA markers can be used to provide parentage information, while allowing certain animal management constraints to be relaxed. Unambiguous parentage assignment in extensively farmed livestock requires highly informative marker systems; therefore, to decrease genotyping costs, a genetic evaluation system that can be used when the parentage information is incomplete has been developed. One option when implementing this system is to use a particular computing strategy that circumvents high memory requirements. Users of DNA parentage will need to consider tradeoffs among genetic gain, computational complexity, genotyping costs, manual parentage recording accuracy, animal husbandry labor costs, and potential differences in fertility and survival. The methods show how DNA marker information could be used to replace traditional pedigree recording.


    Appendix
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 
Table A1Go contains an example illustrating the calculation of parentage probabilities. For simplicity, the calculations are shown for one marker assuming an error rate (e) of zero and for one progeny, two potential sires, and five potential dams. The marker used is the first marker from Table 4Go, and alleles are labelled A, B, C, etc., in the order shown in that table. Transition probabilities and the probability of the parental genotypes are calculated using the formulas in Table 1Go when both potential parents are genotyped, and using the formulas in Table 2Go when only one potential parent is genotyped.


View this table:
[in this window]
[in a new window]
 
Table A1. Example of parentage probability calculations
 


    Footnotes
 
1 This work was partly funded by Ovita Ltd. We also acknowledge the help of Sheep Improvement, Ltd., and Rissington Breedline, Ltd. in the practical development of breeding values systems using DNA parentage. Back

3 Current address: Ovita Ltd., P.O. Box 5520, Dunedin, New Zealand. Back

2 Correspondence: Private Bag 50034 (phone: +64-3489-9083; fax: +64-3489-9037; e-mail: ken.dodds{at}agresearch.co.nz).

Received for publication February 9, 2005. Accepted for publication June 21, 2005.


    Literature Cited
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Implications
 Appendix
 Literature Cited
 


Alexander, G., D. Stevens, and B. Mottershead. 1983. Problems in the accurate recording of lambing data. Aust. J. Exp. Agric. Anim. Husb. 23:361–368.

Banos, G., G. R. Wiggans, and R. L. Powell. 2001. Impact of paternity errors in cow identification on genetic evaluations and international comparisons. J. Dairy Sci. 84:2523–2529.[Abstract]

Cardoso, F. F., and R. J. Tempelman. 2003. Bayesian inference on genetic merit under uncertain paternity. Genet. Sel. Evol. 35:469–487.[Medline]

Crawford, A. M., M. L. Tate, J. C. McEwan, G. Kumaramanickavel, K. M. McEwan, K. G. Dodds, P. A. Swarbrick, and P. Thompson. 1993. How reliable are sheep pedigrees? Proc. N. Z. Soc. Anim. Prod. 53:363–366.

Devlin, B., K. Roeder, and N. C. Ellstrand. 1988. Fractional paternity assignment: theoretical development and comparison to other methods. Theor. Appl. Genet. 76:369–380.

Dickinson, J. L., and C. E. McCulloch. 1989. Use of maximum likelihood estimation to apportion broods among potential sires. Anim. Behav. 38:719–721.

Dodds, K. G., R. W. McDonald, and M. L. Tate. 1998. Probability of parentage assignment using genetic markers. Proc. 6th World Cong. Genet. Appl. Livest. Prod., Armidale, Australia. 25:637–640.

Dodds, K. G., M. L. Tate, J. C. McEwan, and A. M. Crawford. 1996. Exclusion probabilities for pedigree testing farm animals. Theor. Appl. Genet. 92:966–975.

Foulley, J. L., R. Thompson, and D. Gianola. 1990. On sire evaluation with uncertain paternity. Genet. Sel. Evol. 22:373–376.

Fowler, D. G., and J. F. Wilkins. 1984. Diagnosis of pregnancy and number of foetuses in sheep by real-time ultrasonic imaging. I. Effects of number of foetuses, stage of gestation, operator and breed of ewe on accuracy of diagnosis. Livest. Prod. Sci. 11:437–450.

Gilmour, A. R., B. J. Gogel, B. R. Cullis, S. J. Welham, and R. Thompson. 2002. ASReml user guide release 1.0. VSN International Ltd, Hemel Hempstead, UK.

Henderson, C. R. 1988. Use of an average numerator relationship matrix for multiple-sire joining. J. Anim. Sci. 66:1614–1621.[Abstract/Free Full Text]

Israel, C., and J. I. Weller. 2000. Effect of misidentification on genetic gain and estimation of breeding value in dairy cattle populations. J. Dairy Sci. 83:181–187.[Abstract]

Laughlin, A. M., D. F. Waldron, B. F. Craddock, G. R. Engdahl, R. K. Dusek, J. E. Huston, C. J. Lupton, D. N. Ueckert, T. L. Shay, and N. E. Cockett. 2003. Use of DNA markers to determine paternity in a multiple-sire mating flock. Sheep and Goat Res. J. 18:14–17.

Logue, D. N., J. T. Hall, S. McRoberts, and A. Waterhouse. 1987. Real-time ultrasonic scanning in sheep: the results of the first year of its application on farms in south-west Scotland. Vet. Rec. 121:146–149.[Abstract]

Marshall, T. C., J. Slate, L. E. B. Kruuk, and J. M. Pemberton. 1998. Statistical confidence for likelihood-based paternity inference in natural populations. Mol. Ecol. 7:639–655.[Medline]

Neff, B. D., J. Repka, and M. R. Gross. 2001. A Bayesian framework for parentage analysis: The value of genetic and other biological data. Theor. Pop. Biol. 59:315–331.[Medline]

Perez-Enciso, M., and R. L. Fernando. 1992. Genetic evaluation with uncertain parentage—a comparison of methods. Theor. Appl. Genet. 84:173–179.

Sen, S., and G. A. Churchill. 2001. A statistical framework for quantitative trait mapping. Genetics 159:371–387.[Abstract/Free Full Text]

Sherman, G. B., S. D. Kachman, L. L. Hungerford, G. P. Rupp, C. P. Fox, M. D. Brown, B. M. Feuz, and T. R. Holm. 2004. Impact of candidate sire number and sire relatedness on DNA polymorphism-based measures of exclusion probability and probability of unambiguous parentage. Anim. Genet. 35:220–226.[Medline]

Sise, J. A., K. G. Dodds, A. G. S. Ramsden, and M. L. Tate. 2001. Optimising DNA parentage testing in sheep. Proc. Assoc. Adv. Anim. Breed. Genet. 14:317–320.

Spelman, R. J. 2002. Utilisation of molecular information in dairy cattle breeding. Proc. 7th World Cong. Genet. Appl. Livest. Prod., Montpellier, France. Communication No. 22–02.

Thomas, S. C., and W. G. Hill. 2000. Estimating quantitative genetic parameters using sibships reconstructed from marker data. Genetics 155:1961–1972.[Abstract/Free Full Text]

Visscher, P. M., J. A. Woolliams, D. Smith, and J. L. Williams. 2002. Estimation of pedigree errors in the UK dairy population using microsatellite markers and the impact on selection. J. Dairy Sci. 85:2368–2375.[Abstract/Free Full Text]

Wang, S., and W. C. Foote. 1990. Protein polymophism in sheep pedigree testing. Theriogenology 34:1079–1085.

Welch, R. A. S., and R. Kilgour. 1971. Mismothering in Romney sheep. Proc. N. Z. Soc. Anim. Prod. 31:41.


This article has been cited by other articles:


Home page
J ANIM SCIHome page
A. L. Van Eenennaam, R. L. Weaber, D. J. Drake, M. C. T. Penedo, R. L. Quaas, D. J. Garrick, and E. J. Pollak
DNA-based paternity analysis and genetic evaluation in a large, commercial cattle ranch setting
J Anim Sci, December 1, 2007; 85(12): 3159 - 3169.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Dodds, K. G.
Right arrow Articles by Sise, J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dodds, K. G.
Right arrow Articles by Sise, J. A.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS