J. Anim. Sci. 2003. 81:150-155
© 2003 American Society of Animal Science
On-line prediction of yield grade, longissimus muscle area, preliminary yield grade, adjusted preliminary yield grade, and marbling score using the MARC beef carcass image analysis system1
S. D. Shackelford2,
T. L. Wheeler and
M. Koohmaraie
Roman L. Hruska U.S. Meat Animal Research Center, USDA, ARS, Clay Center, NE 68933-0166
2 Correspondence:
phone: 402-762-4223; fax: 402-762-4149; E-mail:
shackelford{at}email.marc.usda.gov.
 |
Abstract
|
|---|
The present experiment was conducted to evaluate the ability of the U.S. Meat Animal Research Centers beef carcass image analysis system to predict calculated yield grade, longissimus muscle area, preliminary yield grade, adjusted preliminary yield grade, and marbling score under commercial beef processing conditions. In two commercial beef-processing facilities, image analysis was conducted on 800 carcasses on the beef-grading chain immediately after the conventional USDA beef quality and yield grades were applied. Carcasses were blocked by plant and observed calculated yield grade. The carcasses were then separated, with 400 carcasses assigned to a calibration data set that was used to develop regression equations, and the remaining 400 carcasses assigned to a prediction data set used to validate the regression equations. Prediction equations, which included image analysis variables and hot carcass weight, accounted for 90, 88, 90, 88, and 76% of the variation in calculated yield grade, longissimus muscle area, preliminary yield grade, adjusted preliminary yield grade, and marbling score, respectively, in the prediction data set. In comparison, the official USDA yield grade as applied by online graders accounted for 73% of the variation in calculated yield grade. The technology described herein could be used by the beef industry to more accurately determine beef yield grades; however, this system does not provide an accurate enough prediction of marbling score to be used without USDA grader interaction for USDA quality grading.
Key Words: Beef Carcasses Instrumentation Processing
 |
Introduction
|
|---|
At present, beef carcass value is primarily a function of USDA quality (a subjective estimate of meat palatability) and USDA yield grades (a subjective estimate of carcass composition). Although expert calculated USDA yield grade is a relatively accurate predictor of carcass composition (Abraham et al., 1980), the development of an objective system to predict beef carcass cutability has been an industry priority. In 1997, Shackelford et al. (1998) developed a system to predict beef carcass cutability based on image analysis of the 12th-rib cross section that was removed from carcasses for tenderness classification. The ARS entered into a cooperative research and development agreement with IBP, Inc. to adopt this technology for application directly to beef carcasses. The resulting beef carcass image analysis system was designed to predict beef carcass value-determining characteristics based on an image of the 12th-rib cross section that is used for quality and yield grading. The system was designed to be functional under industrial conditions without modification of conventional slaughter, dressing, trimming, and ribbing procedures. This experiment was conducted to evaluate the ability of this system to predict calculated yield grade, longissimus muscle area, preliminary yield grade, adjusted preliminary yield grade, and marbling score under commercial beef processing conditions.
 |
Materials and Methods
|
|---|
Experiment 1. Predictive Accuracy
Carcasses.
Image analysis was conducted on the beef-grading chain immediately after the conventional USDA beef quality and yield grades were applied at IBPs Lexington, NE, and Amarillo, TX, beef processing facilities. These two beef processing plants were chosen for this experiment in order to include any possible differences among plants in chilling regimen, ribbing method, grading chain speed, and cattle type. At each facility, 400 carcasses were selected for inclusion in this experiment in 10 groups of 40 consecutive carcasses on the bloom chain prior to grading and image analysis. Carcasses were selected in groups of consecutive carcasses so that image analysis was conducted under the same conditions as would be applied in those facilities (i.e., grading rates of 350 to 425 carcasses per hour). The number of carcasses selected in each group was limited to 40 because that was the number of carcasses that could be off-railed onto a single regrade rail for expert evaluation of quality and yield grade factors. By sampling 20 groups of carcasses, the sample included carcasses from 57 different livestock producer lots and included extreme variation in all carcass grade traits. The number of cattle sampled per lot ranged from 1 to 40.
A single technician conducted the image analysis for the entire experiment. The technicians duties included: 1) determining which side of the carcass should be imaged in order to provide the most accurate prediction of yield grade (i.e., which side had the least severity of s.c. fat removed or which side was most correctly or completely ribbed); 2) preparing the longissimus muscle cross section for image analysis (i.e., removal of bone dust, pieces of fat, exudate, water, and tags from the longissimus muscle); 3) properly positioning the camera unit on the 12th-rib cross section; 4) triggering the computer to initialize the image acquisition and analysis process; and 5) observing the analyzed image to see that the process was completed correctly. Another technician recorded which side of the carcass was imaged.
Carcass Grade Data.
After the image analysis process was completed, carcasses were transferred to stationary rails in regrade bays with ample lighting and space for expert evaluation of quality and yield grade factors. Hot carcass weight (HCW) and official USDA quality and yield grades were recorded.
For the imaged side of each carcass, three meat scientists independently traced the outline of the 12th-rib longissimus muscle cross section onto acetate paper. Subsequently, each acetate tracing was digitized using a flatbed scanner, and the longissimus muscle area was measured in triplicate using image analysis (Image-Pro Plus, version 4.1, Media Cybernetics, Silver Springs, MD). The overall mean longissimus muscle area, which was calculated as the simple mean of all nine observations (triplicate tracing x triplicate measurement), was used for subsequent calculation of yield grade and other analyses.
A team of three supervisory level personnel from the Standardization and Meat Grading branches of the Livestock and Seed Program of USDA Animal Meat Science determined preliminary yield grade (measured on the side that was imaged), adjusted preliminary yield grade, kidney, pelvic, and heart fat percentage, and marbling score. As with conventional quality grading, marbling score was recorded for the side of the carcass with the highest degree of marbling.
Statistical Analysis.
For each plant, the carcasses were ranked by observed calculated yield grade, and alternating carcasses were assigned to either a calibration data set, which was used to develop regression equations, or a prediction data set, which was used to validate the regression equations (Neter et al., 1989). The calibration and prediction data sets each contained a total of 400 carcasses (200 from each plant). This method of assignment ensured that the calibration and prediction data sets had similar simple statistics for carcass traits (Table 1
).
Regression equations were developed using two sets of independent variables. The first set included image analysis traits and HCW. The second set only included image analysis traits.
Because there were 63 independent variables, the RSQUARE procedure (SAS Inst., Inc., Cary, NC) would not solve for high-order equations. Therefore, backward stepwise regression was used to narrow the pool of predictors to 40 for each dependent variable. For each dependent variable, the RSQUARE procedure of SAS was used to select the best 1- to 10-variable equation from the appropriate unique set of 40 independent variables. For each dependent variable, the highest-order equation in which the partial significance of each component was less than 0.001 was selected as the regression for that trait and was tested against the prediction data set. Mallows (1973) CP statistic was calculated for each equation. However, because the CP statistic exceeded the number of variables for each equation, it was not used in the equation selection process.
Experiment 2. Repeatability
As before, image analysis was conducted on the beef-grading chain at IBPs Lexington, NE, beef-processing facility. Images were captured in triplicate on each of 200 carcasses at chain speed by repeating the process of camera unit placement on the cross section, image acquisition, and camera unit removal. Because of the time required to repeat the process of positioning the camera on the longissimus muscle cross section three times, it was not possible to sample consecutive carcasses for this experiment. Instead, alternating carcasses were tested. For each trait, repeatability was calculated as
2carcass/(
2carcass +
2error).
 |
Results and Discussion
|
|---|
Approximately two-thirds of the carcasses in the calibration (68%) and prediction data (65%) sets were steers, and approximately one-third of the carcasses were heifers. Simple statistics of the calibration and prediction data sets are presented in Table 1
. The SD of yield grade was greater for the present data sets (1.06 and 1.04) than for the 1990, 1995, and 2000 National Beef Quality Audits (SD = 0.9, 0.8, and 0.87, respectively; Lorenzen et al., 1993; Boleman et al., 1998; McKenna et al., 2001). In comparison to the National Beef Quality Audit-2000 (McKenna et al., 2001), our data sets contained a substantially higher percentage of yield grade 1 carcasses (26 vs 12%) and a lower percentage of yield grade 3 carcasses (25 vs 39%).
Prediction of Marbling Score.
When HCW was included in the independent variable set, it was not one of the variables in the "best" equation. However, inclusion of HCW altered the process of variable selection slightly and resulted in different equations being selected as the best equation when HCW was included and excluded from the independent variable set. In each case, a nine-variable regression equation accounted for 79% of the variation in marbling score in the calibration data set and 76 or 75% of the variation in marbling score in the prediction data set (Table 2
). Steiner et al. (2000) reported that 66 or 49% of the variation in expert marbling scores was accounted for by the CVS Computer Vision System or VIAscan (both of which are marketed by Research Management Systems, Inc., Calgary, Alberta, Canada), respectively. Although the proportion of variation in expert marbling scores accounted for by this system is greater than that accounted for by CVS or VIAscan, this system is not accurate enough to be used for USDA quality grading. For the prediction data set, the mean absolute error was 39% of one degree of marbling and the error of prediction was greater than one marbling degree for 4.8% of the carcasses.
View this table:
[in this window]
[in a new window]
|
Table 2. Prediction equations for estimating yield grade, longissimus area, preliminary yield grade (PYG), adjusted PYG, and marbling score using image analysis variables in combination with hot carcass weight or alone
|
|
Prediction of Longissimus Muscle Area.
A 10-variable regression equation that included HCW and image analysis variables accounted for 91 and 88% of the variation in longissimus muscle area in the calibration and prediction data sets, respectively (Table 2
). A 10-variable regression equation that included only image analysis variables accounted for 90 and 87% of the variation in longissimus muscle area in the calibration and prediction data sets, respectively. Cannell et al. (1999) reported that the VIAscan system predicted 88% of the variation in longissimus muscle area. Steiner et al. (2000) reported that the CVS and VIAscan systems predicted 81 and 69% of the variation in longissimus muscle area, respectively.
Prediction of Preliminary Yield Grade.
A three-variable regression equation that included HCW and image analysis variables accounted for 91 and 90% of the variation in preliminary yield grade in the calibration and prediction data sets, respectively (Table 2
). A five-variable regression equation that included only image analysis variables accounted for 91 and 90% of the variation in preliminary yield grade in the calibration and prediction data sets, respectively. Cannell et al. (1999) reported that the VIAscan system predicted 71% of the variation in preliminary yield grade.
Prediction of Adjusted Preliminary Yield Grade.
A six-variable regression equation that included HCW and image analysis variables accounted for 88% of the variation in adjusted preliminary yield grade in the calibration and prediction data sets (Table 2
). A five-variable regression equation that included only image analysis variables accounted for 88% of the variation in adjusted preliminary yield grade in the calibration and prediction data sets. Cannell et al. (1999) reported that the VIAscan system predicted 72% of the variation in adjusted preliminary yield grade. Steiner et al. (2000) reported that the CVS and VIAscan systems predicted 44 and 76% of the variation in adjusted preliminary yield grade, respectively.
Prediction of Yield Grade.
A four-variable regression equation that included HCW and image analysis variables accounted for 91 and 90% of the variation in expert yield grade in the calibration and prediction data sets, respectively (Table 2
). A seven-variable regression equation that included only image analysis variables accounted for 86 and 84% of the variation in expert yield grade in the calibration and prediction data sets, respectively. Belk et al. (1997) proposed a system where an instrument would be used to measure longissimus muscle area, online USDA graders would assess carcass fatness, and yield grade would be calculated using a computer. Steiner et al. (2000) reported that such a system could account for 81 or 74% of the variation in expert yield grade if longissimus muscle area was measured by CVS or VIAscan, respectively. Thus, it appears that yield grade can be predicted more accurately by our system than by augmented yield grading. In our experiment, the official USDA yield grade as applied by online graders accounted for 73% of the variation in calculated yield grade.
The percentage of carcasses in the prediction data set assigned to each yield grade by "official" USDA graders, expert yield grades, and image analysis predicted yield grades is presented in Table 3
. The percentage of carcasses assigned to yield grade 1 was lower (P < 0.01) for "official" USDA grades than expert yield grades. In contrast, image analysis assigned the appropriate percentage of carcasses to yield grade 1. Image analysis assigned more carcasses to yield grade 1 than did the USDA graders during "official" online grading. This suggests that if this technology were implemented, the percentage of carcasses that would be assigned yield grade 1 payment premiums would increase.
View this table:
[in this window]
[in a new window]
|
Table 3. Percentage of carcasses in the prediction data set assigned to each yield grade by "official" USDA graders, expert yield grades, and image analysis-predicted yield grades
|
|
There are numerous ways in which predicted yield grade could be used in a formula or grid to determine carcass value. A regression equation could be used to linearly relate yield grade to value. However, that probably is not appropriate because value is likely not linearly related to yield grade. For example, although the average yield difference between yield grade 4.2 carcasses and yield grade 3.8 carcasses is small, the value difference is often quite large because specifications for many product lines exclude yield grade 4 or higher carcasses. Therefore, if predicted yield grade from this technology is used in a value-determining formula, it is likely that value would be assigned using a grid, even though that is not the most accurate method of using image analysis-predicted yield grade. The structure of such a grid will affect the amount of information that will be retained from predicted yield grade. For example, as a continuous variable, image analysis-predicted yield grade accounted for 90.5% of the variation in observed yield grade in the prediction data set. When those carcasses were assigned to a grid that was structured the same as the current yield grades (i.e., five classes divided at 2.0, 3.0, 4.0, and 5.0), predicted yield grade only accounted for 80.8% of the variation in observed yield grade. That is, 56% ([90.5 - 80.8]/[90.5 - 73.3]) of the gain in predictive accuracy that could be obtained by using image analysis-predicted yield grade vs the official USDA yield grade would be lost by assigning the carcasses to a grid in that manner. Because 6% of carcasses had yield grades below 1.0, a portion of the loss in information that occurred in this scenario arose from the broad grouping of all carcasses with a predicted yield grade less than 2.0 into the same class. By classifying carcasses with predicted yield grades less than 1.00 separately from those with predicted yield grades between 1.00 and 2.00, a greater proportion of the variation in observed yield grade was accounted for by the yield grade classification (Table 4
). As the width of the yield-grade classes was decreased from 1.0 to 0.2 units, there was an increase in the proportion of the variation in observed yield grade accounted for by the yield grade classification. For the prediction data set, essentially all of the predictive accuracy was retained when carcass were classified into 30 yield-grade classes in 0.2 increments as <0.2, 0.2 to 0.4, 0.4 to 0.6, ..., 5.4 to 5.6, 5.6 to 5.8,
5.8 (Table 4
). However, practical application of predicted yield grade in a grid structure might require the use of fewer classes. Cannell et al. (1999) reported that expert yield grades assigned to the nearest tenth accounted for more (74 vs 66%) of the variation in closely trimmed subprimal cut yields than expert yield grades assigned to the whole grade. In their analysis, Cannell et al. (1999) included all carcasses with yield grades less than 2.0 in the same yield grade.
View this table:
[in this window]
[in a new window]
|
Table 4. Effect of the structure of the yield-grade classes to which image analysis-predicted yield grade is applied on the proportion of variation in calculated yield grade that is accounted for by the yield grade classification
|
|
Repeatability of Image Analysis (Exp. 2).
Repeatability estimates for predicted calculated yield grade, longissimus muscle area, preliminary yield grade, adjusted preliminary yield grade, and marbling score were 0.99, 0.98, 0.99, 0.99, and 0.97, respectively. That is, the process of collecting and analyzing the images was highly repeatable.
 |
Implications
|
|---|
The technology described herein could be used by the beef industry to more accurately determine beef yield grades and, thus, should help facilitate value-based beef marketing systems. This technology may improve the ability of packers to identify which carcasses should be fabricated into closely trimmed cuts. Given the complexity of beef marketing, where changes in yield grade are often antagonistic toward quality grade, it is difficult to predict the impact of this technology. Producers may use information from an objective grading instrument in selection strategies to improve the leanness of beef.
 |
Footnotes
|
|---|
1 Names are necessary to report factually on available data; however, the USDA neither guarantees nor warrants the standard of the product, and the use of the name by USDA implies no approval of the product to the exclusion of other products that may also be suitable. The authors are grateful to J. W. Wise, S. Cave, W. G. Goad, and C. R. Rains for providing the expert estimates of yield and quality grade factors. The authors are grateful to P. Bambrick, P. Beska, K. Brockmoller, T. Burch, D. Danilson, H. Eger, P. Grueber, P. Haagensen, A. Hinz, I. Hwang, J. Johnson, L. Kershner, C. Koritzky, B. McFarlane, L. Nemec, R. Rothe, and S. Roggenbauer for their assistance in the execution of this agreement, and to M. Bierman and C. Grummert for their secretarial assistance. The authors are grateful to M. E. Dikeman, K. A. Leymaster, J. W. Savell, J. W. Wise, and J. Wray for their review of this manuscript. 
Received for publication June 26, 2002.
Accepted for publication August 20, 2002.
 |
Literature Cited
|
|---|
Abraham, H. C., C. E. Murphey, H. R. Cross, G. C. Smith, and W. J. Franks Jr. 1980. Factors affecting beef carcass cutability: An evaluation of the USDA yield grades for beef. J. Anim. Sci. 50:841851.[Abstract/Free Full Text]
Belk, K. E., J. A. Scanga, J. D. Tatum, J. W. Wise, and G. C. Smith. 1997. Simulated instrument augmentation of USDA yield grade application to beef carcasses. J. Anim. Sci. 76:522527.
Boleman, S. L., S. J. Boleman, W. W. Morgan, D. S. hale, D. B. Griffin, J. W. Savell, R. P. Ames, M. T. Smith, J. D. Tatum, T. G. Field, G. C. Smith, B. A. Gardener, J. B. Morgan, S. L. Northcutt, H. G. Dolezal, D. R. Gill, and F. K. Ray. 1998. National Beef Quality Audit1995: Survey of producer-related defects and carcass quality and quantity attributes. J Anim. Sci. 76:96103.[Abstract/Free Full Text]
Cannell, R. C., J. D. Tatum, K. E. Belk, J. W. Wise, R. P. Clayton, and G. C. Smith. 1999. Dual-component video image analysis system (VIASCAN) as a predictor of beef carcass red meat yield percentage and for augmenting application of USDA yield grades. J. Anim. Sci. 77:29422950.[Abstract/Free Full Text]
Lorenzen, C. L., D. S. Hale, D. B. Griffin, J. W. Savell, K. E. Belk, T. L. Frederick, M. F. Miller, T. H. Montgomery, and G. C. Smith. 1993. National beef quality audit: Survey of producer-related defects and carcass quality and quantity attributes. J. Anim. Sci. 71:14951502.[Abstract]
Mallows, C. L. 1973. Some comments on Cp. Technometrics. 15:661.
McKenna, D. R., P. K. Bates, D. L. Roeber, T. B. Schmidt, D. S. Hale, D. B. Griffin, J. W. Savell, J. B. Morgan, T. H. Montgomery, and G. C. Smith. 2001. National Beef Quality Audit-2000: Results of carcass assessments. J. Anim. Sci. 79(Suppl. 1):62 (Abstr.).
Neter, J., W. Wasserman, and M. H. Kutner. 1989. Applied Linear Regression Models. 2nd ed. Irwin, Homewood, IL.
Shackelford, S. D., T. L. Wheeler, and M. Koohmaraie. 1998. Coupling of image analysis and tenderness classification to simultaneously evaluate carcass cutability, longissimus area, subprimal cut weights, and tenderness of beef. J. Anim. Sci. 76:26312640.[Abstract/Free Full Text]
Steiner, R., A. M. Wylie, D. J. Vote, D. L. Roeber, R. C. Cannell, R. J. Richmond, K. Markey, J. W. Wise, M. E. OConnor, R. R. Jones, K. E. Belk, J. A. Scanga, J. D. Tatum, and G. C. Smith. 2000. Real time augmentation of USDA yield grade application to beef carcasses using state-of-the-art VIA instrumentation. Final report to the National Cattlemens Beef Association and the Cattlemens Beef Board. Colorado State Univ., Fort Collins.