|
|
||||||||

* Department of Animal Sciences and
and
Department of Statistics, Colorado State University, Fort Collins 80523
| Abstract |
|---|
|
|
|---|
10%) when the mean consumer rating for tenderness, juiciness, or flavor for a steak was 5 or higher on the hedonic scale. As mean consumer sensory ratings for tenderness, juiciness, or flavor decreased from 3 to 5, the probability of acceptance of steaks by consumers diminished rapidly in a linear fashion. These results suggest that small changes in consumer sensory ratings for these sensory traits have dramatic effects on the probability of acceptance of steaks by consumers. Marbling score displayed a weak (adjusted R2 = 0.053), yet significant (P < 0.01), relationship to acceptance of steaks by consumers, and the shape of the predicted probability curve for steak acceptance was approximately linear over the entire range of marbling scores (Traces67 to Slightly Abundant97), suggesting that the likelihood of consumer acceptance of steaks increases approximately 10% for each full marbling score increase between Slight to Slightly Abundant. The predicted probability curve for consumer acceptance of steaks was sigmoidal for the WBSF model, with a steep decline in predicted probability of acceptance as WBSF values increased from 3.0 to 5.5 kg. Changes in WBSF within the high (>5.5 kg) or low (<3.0 kg) portions of the range of WBSF values had little effect on the probability of consumer acceptance of steaks.
Key Words: Beef Consumer Satisfaction Marketing Palatability Prediction Tenderness
| Introduction |
|---|
|
|
|---|
| Materials and Methods |
|---|
|
|
|---|
Data presented in this report were from the same source as data reported by Platter et al. (2003)
. Detailed descriptions of the cattle management history, experimental procedures through harvest, and strip loin sample collection were provided in the preceding report (Platter et al., 2003
). Briefly, strip loins (Institutional Meat Purchase Specification 180; USDA, 1988
) were collected from the right sides of 550 carcasses originating from crossbred steers of various biological types. Strip loins were transported to the Colorado State University Meat Laboratory, where a single 2.54-cm-thick steak was removed from the anterior end of each strip loin. The remaining portion of the strip loin was placed in a vacuum-sealed bag and aged at 2°C for 14 d. After the aging period, the strip loin sections were frozen and stored at -20°C. Strip loin sections were fabricated (in the frozen state) into 2.54-cm-thick steaks using a band saw (model 5700, Hobart, Troy, OH). The first steak from the anterior end of each strip loin was identified and placed in an individual vacuum-sealed bag for subsequent WBSF determination. The next two steaks were identified, individually vacuum-packaged, and stored for subsequent untrained, consumer taste panel analysis. Upon completion of the fabrication process, steaks were sorted for intended use and returned to frozen storage (-20°C). Samples used for sensory analysis were stored frozen for approximately 180 d.
Shear Force Determination
The 14-d aged steak from each strip loin section was removed from freezer storage and allowed to thaw for 24 h at 2°C. Steaks were cooked on an electric conveyor grill (model TGB-60, Magikitchn, Quakertown, PA) for 6 min and 35 s at a setting of 176°C to a target internal temperature of 70°C. After cooking, each steak was allowed to equilibrate to room temperature (22°C), and 6 to 10 1.27-cm-diameter cores were removed from each steak parallel to the muscle fiber orientation. Each core was sheared once, perpendicular to the muscle fiber orientation, using a Warner-Bratzler shear machine (G-R Electric Manufacturing Co., Manhattan, KS), and peak shear force measurements were recorded and averaged to obtain a single WBSF value for each steak.
Consumer Sensory Panels
Branson Research Associates (Bryan, TX) performed extensive demographic analyses of the Denver, CO, metropolitan area to select a test population representative of the age, income, and ethnic background of the U.S. population. Consumers were contacted by telephone and prescreened to ensure that they were at least 18 yr of age and consumers of beef products. Four testing sites, located in the Denver metropolitan area (Denver, Lakewood, Arvada, and Littleton), were used to conduct a total of 25 consumer panel sessions that included a total of 489 consumers. Because this sample population represented consumers from diverse age levels, income levels, and ethnic backgrounds, and included consumers categorized as "beef-eaters," the consumer demographic profile was considered to be an acceptable population to model relationships of consumer sensory ratings, WBSF, and marbling scores to overall acceptance of steaks by consumers.
Within a session, paired steaks from 22 different carcasses were selected for sensory evaluation. Steak identification numbers, as well as order of service to consumer panelists, were assigned randomly to each of the 20 consumers per session. Frozen steaks prepared for consumer panel evaluation were thawed at 2°C for 24 h and cooked for approximately 15 min on electric grills (model GGR64, Salton, Inc., Mt. Prospect, IL) designed to heat steaks from both sides, simultaneously, to a final internal temperature of 70°C. A Type K thermocouple (Omega Engineering Inc., Stamford, CT) was placed in the geometric center of each steak, and internal temperature was monitored during cooking using a microprocessor thermometer (model HH21, Omega Engineering Inc.). Steaks were cut into 1.3 x 1.3 x 2.5-cm portions, covered, and placed in a warming oven (49°C) until consumers were served. One steak from each pair of steak samples from each carcass identification number was prepared for serving during the first half of the session, whereas the other matched steak was prepared for serving in the last half of the session to minimize any changes in sensory attributes associated with holding samples for longer periods of time in a warming oven.
Consumer panel evaluation procedures used for this study were approved by the Colorado State University, Use of Humans in Research Committee. At each location, consumers were randomly seated at tables arranged in a circular order in a room containing standard fluorescent lighting. Instructions regarding the structure of the ballot and sampling procedures for the steak samples were provided verbally to the consumers in each session. Panelists were provided double-distilled, deionized water and saltless saltine crackers and were instructed to take a bite of cracker and a drink of water before evaluating each sample to cleanse their palates and to minimize sensory fatigue between samples.
Consumers rated each steak sample for like/dislike of tenderness, flavor, and juiciness using nine-point, end-anchored hedonic scales (1 = like extremely and 9 = dislike extremely). Additionally, consumers were asked if they were satisfied ("yes" or "no") with the overall eating quality of each sample.
Statistical Approach and Rational
The statistical technique chosen to model the relationship between consumer sensory ratings, marbling, or WBSF and overall acceptability of steaks by consumers was logistic regression analysis. Logistic regression analysis models the relationship between a binary or ordinal response variable (e.g., "yes" or "no" response) and one or more explanatory variables (Ott and Longnecker, 2001
). Logistic regression transforms the dependent variable into a logit variable (the natural log odds of the response variable occurring or not). After transformation of the dependent variable, logistic regression applies maximum likelihood estimation, and, in this way, can estimate the probability that a certain event will occur. Maximum likelihood estimation seeks to maximize the log likelihood that the observed values of the dependent variable will be predicted from the observed values of the independent variables. Unlike ordinary least squares regression, logistic regression does not require normally distributed variables, does not assume linearity of relationship between the independent variables and the dependent variable, and does not assume homoscedasticity (Hosmer and Lemeshow, 2000
). The binomial response variable in this case was the overall consumer acceptance ("yes" or "no") of a steak. Explanatory variables included WBSF value, marbling score, or mean consumer sensory rating for tenderness, juiciness, or flavor.
Numerous studies have documented the existence of a tenderness gradient across longissimus muscle steaks that may impact WBSF and sensory tenderness ratings of longissimus muscle (Smith et al., 1969
; Wheeler et al., 1996
; Kerth et al., 2002
). Additionally, the ASTM (1968)
Manual on Sensory Testing Methods, Special Technical Bulletin 434, stated that, although the variability of consumer preference testing is high, the use of larger numbers of panelists would improve the discrimination of the test. These reports suggest that both location of the sample within a steak and panelist bias may influence individual consumer sensory ratings. Averaging consumer sensory ratings by steak reduces the variance of consumer panel data (Polkinghorne et al., 1999
). Nevertheless, outlier sensory rating responses made by individual consumers can greatly influence average steak sensory ratings. Recently, researchers employed by Meat and Livestock Australia developed standards for a new grading system (Meat Standards Australia) to describe palatability of beef based solely on the use of sensory results from consumer testing (Thompson, 2002
). Outliers among consumer sensory ratings were prevalent in the dataset used for the development of Meat Standards Australia, requiring researchers to devise a method to minimize the impact of these observations on the palatability prediction equation (Polkinghorne et al., 1999
). In that analysis, average consumer sensory ratings were calculated after four responses (the two highest and the two lowest sensory ratings) out of a total of 10 individual consumer responses for each steak were removed from the dataset (Polkinghorne et al., 1999
). In our study, 7 to 10 consumers were asked if they were satisfied (yes or no) with the overall eating quality of each steak. Effects of individual consumer bias were minimized in our analyses by using an average consumer satisfaction rating, by steak (without removal of outliers), to designate whether the overall palatability of a steak was acceptable to consumers. Steaks were designated as acceptable, in our analyses, if two-thirds or more (an average of 66% or more) of the consumers sampling each steak indicated that they were satisfied with its overall eating quality.
Data Analysis
Analyses were conducted to examine the correlation structure of the data using the PROC CORR procedure, and logistic regression equations were developed using the PROC LOGISTIC procedure of SAS (SAS Inst., Inc., Cary, NC). Generalized adjusted coefficients of determination were calculated for each model using the RSQ option of the LOGISTIC procedure. Predicted probability values were obtained from each logistic regression model, and the accuracy of these predictions was tested against actual observations in the original dataset via a classification table using a procedure that approximates an unbiased "jackknifing" method (SAS, 1999
). The predictive accuracy of each model also was tested on a separate population (Wheeler et al., 2002
) by computing the predicted probability values of overall acceptance of steaks by consumers and comparing these predictions with actual observations via a classification table.
| Results and Discussion |
|---|
|
|
|---|
Frequency distributions for consumer demographic attributes are reported by category as a percentage of the total sample population (Table 1
). Approximately 56% of the consumer panelists were female and 44% were male. A high percentage (80%) of panelists had some form of postsecondary education, and about 52% were married. The age of consumers in the sample population ranged from 18 to 84 yr, and the mean age of consumers was 46 yr. Of the incomes reported, the highest percentage of consumers had a total household income of between $40,000 and $49,000/yr, but incomes ranged from less than $10,000/yr to more than $90,000/yr among panelists. A high percentage of consumers reported their ethnicity as Caucasian (76%), followed by African-American (11%), and Hispanic (8%). A very low percentage of the sample population reported an ethnicity of American Indian and Asian or Pacific Islander.
|
|
Distribution of marbling scores of steaks in the experimental sample is provided in Figure 1
. According to data from the 2000 National Beef Quality Audit, only 0.43% of carcasses from fed steers and heifers had Moderately Abundant or Abundant marbling scores and only 0.02% of carcasses had marbling scores of Practically Devoid (McKenna et al., 2002
). The mean marbling score of steaks used in this analysis was 458 ± 105 (Small58) with a range in marbling score from Traces67 to Slightly Abundant97, resulting in a sample that closely resembled the range in marbling scores of the majority of U.S. fed steer and heifer carcasses.
|
|
Results of the analysis of the correlation structure of the data revealed moderate to high correlations (P < 0.05) among mean marbling scores, WBSF values, and mean consumer palatability ratings (Table 3
). The correlation between consumer tenderness ratings and WBSF values was moderately high (r = 0.63). Marbling scores were moderately correlated with WBSF, consumer tenderness ratings, consumer juiciness ratings, and consumer flavor ratings (r = -0.31, -0.27, -0.34, -0.22, respectively). High, positive correlations (r = 0.80 to 0.84) were observed among all consumer sensory ratings.
|
Statistics describing the strength of the relationship between the independent variables and the dependent variable, the discriminatory power, and the success of a regression equation are different for logistic regression than for ordinary linear regression (Hosmer and Lemeshow, 2000
). There is no direct analog to the coefficient of determination (R2) in logistic regression; however, a generalized R2 (R2 adj) that attempts to measure the strength of association of a logistic regression model has been proposed (Nagelkerke, 1991
). The area under the receiver operating characteristic curve (c-statistic) is a measure of the discriminatory power of the logistic equation (Hosmer and Lemeshow, 2000
). The value of the c-statistic can range from 0.50 (the models predictions are no better than chance) to 1.0 (the model can perfectly discriminate the observed responses in the sample). Classification tables are often used to tally correct and incorrect estimates of the logistic regression model and measure the predictive accuracy of the model (SAS, 1995
). A 2 x 2 classification table for binary response variables can be created by comparing the predicted outcomes of the model with the actual observations in the dataset. When the value of the estimated probability of an observation is greater than, or equal to, 0.50, the observation is classified as a predicted event (i.e., an acceptable steak). When the estimated probability is less than 0.50, the observation is classified as a predicted nonevent (i.e., an unacceptable steak). From the classification table, the probability that the model correctly classifies the sample data (percentage correct) can be calculated. Additionally, the ratio of the number of correctly classified acceptable steaks to the total number of acceptable steaks (sensitivity) and the ratio of the number of correctly classified unacceptable steaks to the total number of unacceptable steaks (specificity) can be derived from a classification table.
Sensory Rating Models
Figure 3
displays plots of the predicted probability curves for overall acceptance of steaks by consumers as derived from the cumulative logit response functions of average steak consumer ratings for tenderness, juiciness, or flavor. The R2 adj values for predicting overall acceptance of steaks by consumers were 0.574, 0.516, and 0.520 for models using average steak consumer ratings for tenderness, juiciness, and flavor, respectively. The discriminatory power of the sensory rating models was high (c-statistic = 0.869 to 0.891), and the models correctly classified a large percentage of the observations (percentage correct = 77.7 to 79.3%). Sensitivity percentages were higher than specificity percentages for consumer tenderness (83.2 vs. 75.2%), juiciness (78.6 vs. 76.7%), and flavor (82.1 vs. 75.6) rating models predicting overall acceptance of steaks by consumers (data not shown). Ordinary least squares regression analysis performed by Huffman et al. (1996)
indicated that consumer tenderness ratings accounted for the most variation (R2 = 0.56) in overall consumer palatability ratings for steaks prepared in a restaurant setting, whereas consumer flavor ratings accounted for the most variation (R2 = 0.67) in overall consumer palatability ratings for steaks prepared in the home. In that study, a three-variable ordinary, least squares regression model that included tenderness, juiciness, and flavor ratings accounted for 79% of the variation in consumer overall palatability ratings (Huffman et al., 1996
). Similarly, in the current study, a three-variable logistic regression equation including average steak consumer ratings for tenderness, juiciness, and flavor had a stronger relationship (R2 adj = 0.62) to overall steak acceptance and more discriminatory power (c-statistic = 0.908) than single consumer sensory rating models (data not shown).
|
10%) when the average consumer rating for tenderness, juiciness, or flavor for a steak was five or higher. As mean consumer sensory ratings for tenderness, juiciness, or flavor decreased from three to five, the probability of overall acceptance of steaks by consumers diminished rapidly in a linear fashion. These results suggest that small changes in consumer sensory ratings for these sensory traits have dramatic effects on the probability of overall consumer acceptance of steaks. Additionally, these results suggest a mean consumer sensory rating for tenderness, juiciness, or flavor which is at, or beyond, the mid-point of a nine-point scale would result in a low probability of overall steak acceptance by consumers. Marbling Score Model
The probability curve for overall acceptance of steaks by consumers, as predicted by the cumulative logit response functions of marbling scores, is presented in Figure 4
. The R2 adj value for predicting overall steak acceptance for the marbling score model was 0.053, suggesting a relatively weak relationship between marbling score and overall consumer acceptability. The marbling score model had a c-statistic of 0.574, and correctly classified 54.5% of the observations, indicating low discriminatory power of the model. These results are consistent with research that has shown marbling scores to have a low to moderate relationship with overall beef palatability ratings (Crouse and Smith, 1978
; Tatum et al., 1982
; Smith et al., 1984
).
|
Shear Force Model
A majority of consumers in this (Table 2
) and other studies (Huffman et al., 1996
; CCA, 2002
) indicated that tenderness is the single most important palatability trait for determining overall steak acceptance. Warner-Bratzler shear force is routinely used by scientists as an objective measurement of meat tenderness and, despite criticism, has remained the most popular and accurate instrumental measurement of meat tenderness (Wheeler et al., 1997
). Otremba et al. (1999)
and Wheeler et al. (1996)
reported WBSF value correlations (r) of -0.68 and -0.85, respectively, with trained panelist overall tenderness ratings for longissimus steaks. Shackelford et al. (1995)
reported that a single variable regression equation using WBSF values explained 73% of the variation in trained panelist overall tenderness ratings for longissimus muscle steaks. Because tenderness is an important driver of overall steak acceptability and WBSF is used as an objective measure of tenderness, it is reasonable to assume that WBSF values could be used in a logistic regression model to predict overall acceptability of steaks by consumers.
Figure 5
represents a plot of the predicted probability curve for overall consumer acceptance of steaks resulting from the cumulative logit response functions of WBSF value. The strength of the relationship between consumer acceptance of steaks and WBSF values was moderate (R2 adj = 0.225). The WBSF model had moderate discriminatory power (c-statistic = 0.738) and correctly classified 66.7% of the observations. Boleman et al. (1997)
demonstrated that consumer perceptions of tenderness and overall satisfaction of beef top loin steaks were affected when steaks were segmented into categories based on WBSF value. Lorenzen et al. (2003)
reported low correlations of WBSF values with "in-home" consumer panelist ratings of tenderness (r = -0.26) and consumer overall likeness ratings (r = -0.18) for top loin steaks, but attributed the lack of relationship of these traits, in part, to the variation in steak preparation encountered during "in-home" consumer studies.
|
An advantage of the current form of analysis describing the relationship of WBSF values to overall acceptance of steaks by consumers is that the relationship can be compared over a wide range of WBSF values. The predicted probability curve for overall acceptance of steaks by consumers was sigmoid shaped for the WBSF model, with a steep decline in predicted probability of acceptance as WBSF values increased from 3.0 to 5.5 kg. Changes in WBSF within the high (>5.5 kg) or low (<3.0 kg) portions of the range of WBSF values had little effect on the probability of consumer acceptance of steaks.
Validation of Logistic Regression Models
Consumer sensory ratings, marbling score, and WBSF value models developed in our analysis were tested against the population described by Wheeler et al. (2002)
. Predicted probabilities for observations in the validation dataset were calculated from the logistic regression equations developed from the original population and compared with actual consumer acceptance ratings via a classification table. Of the three sensory ratings models, the tenderness-rating model was the most accurate (78.7%), whereas the juiciness and flavor rating models were less accurate (63.7 and 68.7%, respectively) for determining whether two-thirds of consumers would have rated steaks as acceptable. The equations from the marbling score and WBSF models were 57.3 and 70.7% accurate, respectively, for determining whether two-thirds of consumers would have rated steaks as acceptable. Applying the logistic regression equations to a separate dataset for validation purposes produced results similar to those observed in the original analyses.
| Implications |
|---|
|
|
|---|
1 Correspondencephone: 970-491-6530; fax: 970-491-0278; E-mail: dtatum{at}lamar.colostate.edu.
Received for publication February 18, 2003. Accepted for publication June 9, 2003.
| Literature Cited |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
R. J. Rathmann, J. M. Mehaffey, T. J. Baxa, W. T. Nichols, D. A. Yates, J. P. Hutcheson, J. C. Brooks, B. J. Johnson, and M. F. Miller Effects of duration of zilpaterol hydrochloride and days on the finishing diet on carcass cutability, composition, tenderness, and skeletal muscle gene expression in feedlot steers J Anim Sci, November 1, 2009; 87(11): 3686 - 3701. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Radunz, S. C. Loerch, G. D. Lowe, F. L. Fluharty, and H. N. Zerby Effect of Wagyu- versus Angus-sired calves on feedlot performance, carcass characteristics, and tenderness J Anim Sci, September 1, 2009; 87(9): 2971 - 2976. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. A. Schneider, J. D. Tatum, T. E. Engle, and T. C. Bryant Effects of heifer finishing implants on beef carcass traits and longissimus tenderness J Anim Sci, August 1, 2007; 85(8): 2019 - 2030. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Okumura, K. Saito, H. Sakuma, T. Nade, S. Nakayama, K. Fujita, and T. Kawamura Intramuscular fat deposition in principal muscles from twenty-four to thirty months of age using identical twins of Japanese Black steers J Anim Sci, August 1, 2007; 85(8): 1902 - 1907. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Gruber, J. D. Tatum, J. A. Scanga, P. L. Chapman, G. C. Smith, and K. E. Belk Effects of postmortem aging and USDA quality grade on Warner-Bratzler shear force values of seventeen individual beef muscles J Anim Sci, December 1, 2006; 84(12): 3387 - 3396. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. T. Choat, J. A. Paterson, B. M. Rainey, M. C. King, G. C. Smith, K. E. Belk, and R. J. Lipsey The effects of cattle sex on carcass characteristics and longissimus muscle palatability J Anim Sci, July 1, 2006; 84(7): 1820 - 1826. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. M. Cho, H. Choi, I. H. Hwang, Y. K. Kim, and K. H. Myung Effects of 25-hydroxyvitamin D3 and manipulated dietary cation-anion difference on the tenderness of beef from cull native Korean cows J Anim Sci, June 1, 2006; 84(6): 1481 - 1488. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. J. Platter, J. D. Tatum, K. E. Belk, S. R. Koontz, P. L. Chapman, and G. C. Smith Effects of marbling and shear force on consumers' willingness to pay for beef strip loin steaks J Anim Sci, April 1, 2005; 83(4): 890 - 899. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |