As part of their Six Sigma projects, practitioners often must assess the capability of the measurement system used. In the Service Quality/Transactional arena, there are many similarities with standard measurement system analysis (MSA) procedures. Some subtle differences, however, also require consideration.
Several types of data exist for use in MSA studies:
The emphasis of this article, however, is on continuous data such as cycle times in minutes and financial loans in dollars. In the four sections that follow, we examine continuous data in relation to MSA studies in the Service Quality/Transactional arena, beginning with the theoretical background of the standard (manufacturing) approach and finishing with a practical example:
MSA with Continuous Data: The Standard (Manufacturing) Approach
The following widely used model estimates the contribution of the measurement system to the overall variation in a process:
(1) Yijk = μ + Operatori + Partj + (Operator * Part)ij + εk(ij), i = 1,2,,a; j = 1,2,,b; k = 1,2,,n
Typically, a operators and b parts are randomly selected from wider populations. The experiment is run by allowing each operator to measure each part n times. The type of model considered in (1) is a two-factor random effect model, where Operator, Part, Operator * Part, and ε are assumed to be normally distributed with zero mean and variances of σ2Operator, σ2Part, σ2Operator*Part, and σ2ε, respectively. Then, the total variation in a measurement (Y) can be decomposed into:
(2) σ2y = σ2Operator+ σ2Part + σ2Operator * Part + σ2ε
As suggested by (2), the total variation in Y contains two parts: the variation due to the measurement system (gage), σ2Gage, and the Part-to-Part variation, σ2Part, where
σ2Gage = σ2Operator+ σ2Operator * Part + σ2ε
Among the gage variation component, the variation across operators plus the variation for the Operator by Part interaction is called gage reproducibility. The variation due to the random error term ε is called gage repeatability.2 The ratio of the gage variation and the total variation or functions of that ratio are widely used to assess a measurement system.3
A Frequently Encountered Issue for a Service Quality/Transactional MSA
Consider a system that requires analysts to enter information into a computer database and then determine how much money should be lent to a customer. In practice, analysts may have different interpretations of the protocols involvedpossibly through variation in the training they received.
Though this may lead to different dollar amounts calculated for a particular loan between analysts, the amount "within" each analyst typically remains identical. In other words, an analyst would arrive at exactly the same amount each time for a particular loan request. The analyst would be conforming to his or her own interpretation of the protocols each time.
Consider a model for this system as:
(3) Yijk = μ + Analysti + Loanj + (Analyst * Loan)ij + εk(ij), i = 1,2,,a; j = 1,2,,b; k = 1,2,,n
Note that with this scenario, since the analysts always provide the same amounts for the same loan, the variance for repeatability is zero. The measurement system therefore only computes variance for reproducibility (which may still have merit for analysis purposes). However, when analyzing the data using ANOVA functionality in a statistical software package, it may be impossible to compute certain F-tests to assess significance.
Occasionally, practitioners unwisely "fudge" one or more responses to create an artificial error term, simply to obtain statistical output. Clearly, this is wholly inappropriate. In such a situation, it may be more beneficial instead simply to assess the level of agreement:
MSA with Continuous Data: Service Quality/Transactional Scenario
Using the previously discussed scenario and model (3), consider the response variable of interest (Y) to be instead the length of time (in seconds) for a loan request to be assessed and placed into the database. Though the actual dollar amounts reached are obviously of more concern for analysis, useful insights may still be obtained.
First, note certain important features of the designed experiment:
Though the experiment itself does not directly address the resulting loan values themselves (those responses could be assessed using an attribute agreement analysis study), it may still provide useful process information. For example, with regard to the processing times, the study may exhibit:
A Practical Example
Three analysts - Jack, Renee, and Rossrandomly selected for the study each enter data from twenty randomly selected loans. They complete this task twice, the loans being assessed in separate random sequences. The time taken between the cycles of data entry is considered large enough to negate any memory recall effects.
Figure 1 shows the ANOVA output from the experimental results using the two-way random effects model in (3).
Fig. 1 Gage R&R Study - ANOVA Method
Two-Way ANOVA Table With Interaction
| Source | DF | SS | MS | F | P |
| Loan | 19 | 134774 | 7093.36 | 122.236 | 0.000 |
| Analyst | 2 | 282 | 140.93 | 2.429 | 0.102 |
| Loan * Analyst | 38 | 2205 | 58.03 | 8.597 | 0.000 |
| Repeatability | 60 | 405 | 6.75 | ||
| Total | 119 | 137666 |
Clearly, most of the variation in processing times is attributable to variation between loans. Importantly, there is a statistically significant interaction effect (i.e., reject H0: σ2Loan * Analyst = 0 in favor of H1: σ2Loan * Analyst> 0). The presence of an interaction effect can seriously impact conclusions drawn from solely assessing the main effects.
With regard to the variance components, similar to (2), we may consider:
(4) σ2y = σ2Loan + σ2Analyst + σ2Loan * Analyst + σ2ε
Regarding certain ratios of these variance components, as shown in Figure 2, only 2.86% of the variation of entry times in the study can be attributed to the measuring system. Of this small percentage, it is the Analyst * Loan interaction effect that has the most importance (2.12%).
Fig. 2 Variance Components
Gage R&R
| Source | VarComp |
%Contribution (of VarComp) |
| Total Gage R&R | 34.46 | 2.86 |
| Repeatability | 6.75 | 0.56 |
| Reproducibility | 27.71 | 2.30 |
| Analyst | 2.07 | 0.17 |
| Analyst * Loan | 25.64 | 2.12 |
| Part-To-Part | 1172.56 | 97.14 |
| Total Variation | 1207.02 | 100.00 |
Considering the results in Figure 3, we see that the repeatability (i.e., the differences in the two times recorded for each analyst/loan combination) is relatively consistent with an average range of 3.03 seconds. Renee appears to exhibit slightly more variation in the recorded times than Jack and Ross.
Fig. 3 Gage R&R (ANOVA) for Time
As the interaction plot in Figure 4 (reproduced from Figure 3) clearly indicates, a marked difference occurs in the mean entry times between the operators for loan 13. Renee takes longer than Ross, and especially more time than Jack. A continuous quality improvement initiative would address this issue by investigating why the difference occurred for loan 13, and not to the same extent for the others.
Fig. 4 Interaction Plot: Analyst * Loan

Summary
This article has sought to discuss certain nuances when Six Sigma practitioners address MSA procedures in the Service Quality/Transactional arena. Admittedly, the practical example illustrated cannot be considered a traditional MSA per se. The cycle time may be considered a "by-product" of the final recorded value (the loan amount in dollars). That is not to diminish, however, the practical importance of a well-defined and executed study.
References
© Keith M. Bower. All rights reserved.