JMPer Cable

tonya_mauldin · Jul 9, 2018 10:52 AM

You learned about CUSUM control charts in my colleague @AnnieDudley’s post “How to detect small shifts in Control Charts.” We saw that the CUSUM control chart was better at detecting small shifts than the IMR (Individual and Moving Range) chart. How much better is the CUSUM chart than the IMR chart at detecting small shifts? What is considered a small shift? In other words, how does the CUSUM chart perform compared to the IMR chart?

If this were a single hypothesis test, we could measure the α (Type I) and β (Type II) error rates. This method can’t be used directly for control charts because a control chart is a series of sample-by-sample hypothesis tests. Instead of the traditional α and β errors used for hypothesis tests, the Average Run Length (ARL) is typically used to measure the performance of control charts.

Run Length is the number of samples (subgroups) taken before there is a signal (point beyond the control limits). In the image on the left, the run length is four because there were four samples taken before a point went beyond the upper control limit. Run length is a random variable that can only take on integer values. We need to use the expected value or Average Run Length to measure performance. ARL (Average Run Length) is the expected value of the run length, i.e., the expected number of subgroups until a control chart first signals.

You will hear about two types of ARL discussed in literature: in-control ARL (ARL₀) and out-of-control ARL (ARL_Δ). ARL₀ is the expected number of samples until a control chart signals, given that the process is in control. You can also think of this as sending a false alarm. Since the process is in control (on target), you want ARL₀ to be as large as possible. ARL_Δ is the expected number of samples until a control chart signals, given the process is out of control (the mean has shifted). You would like for ARL_Δ to be as small as possible. If the process has shifted to an unsatisfactory level, you want to learn about it sooner rather than later.

How do you calculate ARL?

Let’s consider the example used in Annie’s blog post. This example examined data from a car engine thermostat. The engine was designed to operate in a temperature range of [90° C, 110° C]. First, let's look at the IMR chart that was used in her post, using only Test 1. We are only considering Test 1 because the addition of run rules makes the determination of a false alarm probability more difficult. With run rules, the probability of a false alarm at a given observation depends on the values of previous observations. This means that the value of α is not constant.

In-Control ARL (ARL₀)

α=Pr[Shewhart control chart signals for a given sample | the process is in-control]
1-α=Pr[Shewhart control chart does not signal for a given sample | the process is in-control]

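For any Shewhart chart with the usual 3-sigma limits, α=0.0027 is the probability that a single point falls outside the control limits when the process is in control. Because successive points are independent and each one signals with probability α, the run length follows a geometric distribution with mean 1/α. Therefore, the ARL₀ is 1/0.0027≈370. If the process is in control, a false alarm will be given about once every 370 samples, on average.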
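As a quick check of the arithmetic above, here is a minimal Python sketch that computes α and ARL₀ for 3-sigma limits. It assumes normally distributed, independent points; the variable names are illustrative and not taken from JMP.

```python
from statistics import NormalDist

std_normal = NormalDist()  # standard normal: mean 0, standard deviation 1

# Probability that a single in-control point falls outside the +/- 3-sigma limits
alpha = 2 * std_normal.cdf(-3)

# With independent points and a constant signal probability, the run length is
# geometric, so its expected value is 1 / alpha
arl_0 = 1 / alpha

print(f"alpha = {alpha:.6f}")
print(f"ARL0  = {arl_0:.1f}")
```

The result is roughly 370, in line with the value quoted above.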

Out-of-Control ARL (ARL_Δ)

1-β=Pr[Shewhart control chart signals for a particular sample | the process is out-of-control]
β =Pr[Shewhart control chart does not signal for a particular sample | the process is out-of-control]

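β=φ(z_α/2-k)- φ(-z_α/2-k), where φ is the standard normal cumulative distribution function and k is the size of the mean shift in units of σ₀
With 3-sigma limits, α=0.0027, so z_α/2=z_0.00135=3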
k=|µ₁- µ₀|/σ₀=|110-103.9632|/ 9.730≈.60204
β=φ(3-.60204)- φ(-3-.60204)=0.9916
ARL_Δ=1/(1- β)=1/(1-.9916) ≈119

Although ARL₀ is large as we desired, ARL_Δ is also large, which we did not want. On average, it would take 119 samples to signal when the mean has shifted from 104 to 110.
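The out-of-control calculation can be sketched the same way. This is only an illustration: it takes the shift k=0.60204 as given in the post and assumes independent, normally distributed points.

```python
from statistics import NormalDist

std_normal = NormalDist()

z = 3.0      # 3-sigma control limits
k = 0.60204  # shift in units of sigma, value taken from the post

# Probability that a single point stays inside the limits after the shift
beta = std_normal.cdf(z - k) - std_normal.cdf(-z - k)

# Expected number of samples until the first point outside the limits
arl_delta = 1 / (1 - beta)

print(f"beta      = {beta:.4f}")
print(f"ARL_delta = {arl_delta:.0f}")
```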

CUSUM ARLs

Unfortunately, the ARL calculations for CUSUM charts are not as straightforward. The plotted points are cumulative deviations and are therefore not independent. You can find tables of ARL calculations in textbooks. Lucky for us, JMP can perform these calculations for you. Click on the red triangle next to CUSUM Control Chart – Y and choose “Show ARL.”

Let's take a look at the in-control ARL: ARL₀=465.44, which is good (even higher than the IMR chart). If the process is in control, we expect a false alarm about once every 465 samples on average. Now let's look at the out-of-control ARL: ARL_Δ=10.38. This number is low, which is good (much lower than the ARL_Δ for the IMR chart). If the process mean has shifted by 10° C, we expect a signal about 10 samples after the shift, on average.
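To see where a number like 10.38 comes from, here is a rough Monte Carlo sketch of a two-sided tabular CUSUM on standardized data. It is not how JMP computes the ARL. The function names are hypothetical, and it assumes independent N(shift, 1) observations with k=0.5 and h=5.

```python
import random

def cusum_run_length(shift, k=0.5, h=5.0, rng=random):
    """Number of samples until a two-sided tabular CUSUM first signals."""
    c_plus = c_minus = 0.0
    n = 0
    while True:
        n += 1
        z = rng.gauss(shift, 1.0)                # standardized observation
        c_plus = max(0.0, c_plus + z - k)        # accumulates upward deviations
        c_minus = max(0.0, c_minus - z - k)      # accumulates downward deviations
        if c_plus > h or c_minus > h:
            return n

def simulated_arl(shift, runs=20000, seed=1):
    """Average run length over many simulated sequences."""
    rng = random.Random(seed)
    return sum(cusum_run_length(shift, rng=rng) for _ in range(runs)) / runs

# A 1-sigma shift (10 degrees C when sigma is 10)
print(simulated_arl(1.0))
```

The simulated value should land close to the ARL_Δ reported by JMP. Calling simulated_arl(0.0) estimates the in-control ARL in the same way, but takes noticeably longer because each run lasts several hundred samples on average.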

The table below illustrates the ARL performance of a Shewhart control chart and the CUSUM chart with k=0.5 and h=5 (JMP’s defaults).

Shift, in multiples of σ	Shewhart ARL	CUSUM ARL
0	370	465
0.25	281	139
0.5	155	38
0.75	81.22	17
1	43.89	10.4
1.5	14.97	5.75
2	6.3	4.01
2.5	3.24	3.11
3	2	2.57
4	1.19	2.01
5	1.02	1.69

You can see that the CUSUM chart detects a shift sooner than a Shewhart chart when the shift is 2.5*σ or smaller. For larger shifts (3*σ and above in the table), a Shewhart chart detects the shift sooner.
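The comparison can be reproduced approximately in a few lines of Python. The Shewhart ARL uses the β formula from earlier in the post. The CUSUM ARL uses Siegmund's closed-form approximation, which is swapped in here for convenience and is not the method JMP uses, so it will differ somewhat from the tabulated CUSUM values, especially for large shifts. Function names are hypothetical.

```python
from math import exp
from statistics import NormalDist

def shewhart_arl(shift, z=3.0):
    """ARL of a Shewhart chart with +/- z sigma limits (Test 1 only)."""
    nd = NormalDist()
    beta = nd.cdf(z - shift) - nd.cdf(-z - shift)
    return 1 / (1 - beta)

def one_sided_cusum_arl(delta, h):
    """Siegmund's approximation for one side of a tabular CUSUM."""
    b = h + 1.166
    if delta == 0:
        return b * b
    return (exp(-2 * delta * b) + 2 * delta * b - 1) / (2 * delta * delta)

def cusum_arl(shift, k=0.5, h=5.0):
    """Approximate two-sided CUSUM ARL, combining the upper and lower sides."""
    upper = one_sided_cusum_arl(shift - k, h)
    lower = one_sided_cusum_arl(-shift - k, h)
    return 1 / (1 / upper + 1 / lower)

# Shift (in sigma), Shewhart ARL, approximate CUSUM ARL
for shift in (0, 0.25, 0.5, 0.75, 1, 1.5, 2, 2.5, 3, 4, 5):
    print(f"{shift:>4}  {shewhart_arl(shift):8.2f}  {cusum_arl(shift):8.2f}")
```

Under this approximation the CUSUM ARL is smaller than the Shewhart ARL up to a shift of 2.5σ and larger from 3σ onward, so the crossover falls between those two values.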

How h and k affect the ARL

JMP also provides a nifty ARL Profiler. You can find this option under the red triangle beside CUSUM Control Chart – Y.

ARL Profiler, Two-Sided

This graph tells us that when h=5 and k=0.5, the ARL at a shift of 2*k/σ = 2*0.5/10 = 0.1 is ARL_0.1 = 348.6502. As you move the slider for h around, you see that decreasing h decreases the ARL and increasing h increases the ARL. Conversely, if you move the slider for k around, you find that decreasing k increases the ARL and increasing k decreases the ARL. You can play with these h and k sliders to determine how altering these values together affects the ARL.

Conclusion

ARL can be used to view the performance of a particular control chart or to compare the performance of different control charts. If the shift is small (about 2.5*σ or less), a CUSUM chart will, on average, detect the shift before a Shewhart control chart. If the shift is larger, a Shewhart chart will detect the shift sooner than a CUSUM chart.

Clau · 08-23-2019

Hello,

I have a problem when trying to calculate k. I don't understand which number to use for µ₁.
Can you help me?

AnnieDudley · 08-26-2019

Generally, the target or µ is defined by your specific data. JMP defaults to the sample mean as an estimate of µ; however, if you have more information about an "ideal" value for the target, you can change it accordingly.

It's this value, summed together with the shift (k), that is subtracted from each individual point to form the cumulative sum of the differences.

Clau · 09-23-2019

@AnnieDudley thank you so much for your help!

Can you tell me which function is used as φ when we calculate β?

I've tried to calculate the following in Excel, but using the PHI function the result is not the same.

β=φ(3-.60204)- φ(-3-.60204)

Thank you so much in advance.

tonya_mauldin · 09-23-2019

This is the standard normal cumulative distribution function. You can run the following in JMP to see.

normal distribution(3-.60204)-normal distribution(-3-.60204)

This yields

0.991598802362608
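The same check can be sketched in Python using only the standard library. Excel's PHI function returns the standard normal density rather than the cumulative distribution, which would explain a mismatch; NORM.S.DIST(x, TRUE) is the cumulative version. The variable names below are illustrative.

```python
from statistics import NormalDist

nd = NormalDist()  # standard normal

# Cumulative distribution function: this is what the beta formula needs
beta_cdf = nd.cdf(3 - 0.60204) - nd.cdf(-3 - 0.60204)

# Density function: a different quantity, so the result does not match
diff_pdf = nd.pdf(3 - 0.60204) - nd.pdf(-3 - 0.60204)

print(beta_cdf)
print(diff_pdf)
```

The first value agrees with the JMP result above to several decimal places, while the second is far smaller.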