Bootstrap Interval Calibration Using New “Save Simulation Formula” Feature in JM...

Hadley Myers, JMP Systems Engineer, SAS
Chris Gotwalt, JMP Director of Statistical Research and Development, SAS

The calculation of confidence intervals of parameter estimates should be an essential part of any statistical analysis. Failure to understand and consider “worst-case” situations necessarily leads to a failure to budget or plan for these situations, resulting in potentially catastrophic consequences. This is true for any industry but particularly for pharmaceutical and life sciences. Previous work has explored various methods for generating these intervals: Satterthwaite, Parametric Bootstrap and Bias-Corrected (Myers and Gotwalt, 2020 Munich), and Bias-Corrected and Accelerated (Myers and Gotwalt, 2020 Cary), which were all seen to have error rates that were too high for the small samples typical in DOE situations. Therefore, we make use of the new “Save Simulation Formula” feature in JMP Pro 16 in an add-in that improves upon these by allowing users to perform a “Bootstrap Calibration” on the Satterthwaite estimates. The add-in also includes the ability to do this for linear combinations of random components, taking advantage of another addition to JMP Pro 16. Further, we investigate a new version of the fractionally weighted bootstrap that respects the randomization restrictions of variance component models, as an alternative to the parametric bootstrap, using the new “MSA Designer” debuted at this conference.

Auto-generated transcript...

Transcript

Hello, my name is Chris Gotwalt. My co-presenter Hadley

Myers and I are presenting an add-in for obtaining improved

confidence intervals on sums or linear combinations of variance

components. This is part of a series of talks we have given as

we work on improving and evaluating several approaches.

Obtaining confidence intervals on sums of variance components

is important in quality because it provides an uncertainty

assessment on the repeatability plus the reproducibility of our

measurement system. The problem is that when we ask for a 95%

confidence interval, there are approximations involved and the

actual interval coverage can be as low as 80%.

In our previous studies, we found that two methods have

improved coverage rates, parametric bootstrapping and

Satterthwaite intervals, but it was still less than 95% in small

samples. The earlier version of the add-in implemented the

parametric bootstrap as a stopgap and Elizabeth Claassen

implemented the Satterthwaite intervals in the Fit Mixed

platform natively in JMP Pro 16. I want to stop here to give

Elizabeth Claassen credit for making interval estimation of

linear combinations of variance components so much easier and

JMP Pro 16. She also greatly extended the Mixed Model output,

which has made this presentation vastly easier. I'm also hoping

that this presentation will serve as an inspiration to

others to check these new Save commands out so they can get

more from JMP Pro’s Mixed Model capabilities. Now we're going to

combine the two approaches using a technique called Bootstrap

Interval Calibration that was introduced by Loh in a 1991

Statistica Sinica article. Bootstrap Calibration is a very

general procedure for improving the coverage of confidence

intervals that can be applied to almost any parametric

statistical model. I'm going to introduce the basic idea of

Bootstrap Interval Calibration in the simplest terms that I

can, and hand the mic over to Hadley, who's going to demo the

add-in and discuss our simulation results. To make this

simple, let's make it specific. Consider a very small nested

Gauge R&R-type study where we want to estimate the total

variation. We collect the data and run a nested variance

components model with an Operator effect, a Part within

Operator effect, and a residual effect. The software reports a

Satterthwaite-based interval on the total. It's well known that

this is an approximation that assumes a “large”

amount of data is present in order for the actual coverage of

the interval to be close to 95%.

In small samples, the actual coverage, the probability that the

interval procedure generates intervals that actually contain

the true value of the estimated quantity, will tend to be less

than 95%. Thing is the actual interval coverage is a

complicated function of the design, true values of

the functions, and a long list of other assumptions that are hard

or impossible to verify. What we can do though is used the fitted

model and their parameters to do a parametric bootstrap. When we

do this, we know the true value of the quantity we are

estimating because we were simulating using that value.

We can do the simulation thousands of times. We apply the

same model fitting process to all the simulated samples. We

can collect the intervals from JMP and calculate how often

they contain the generating value of the quantity that you

were interested in. In this case we were interested in the sum of all

the variance components, so the true value is 4.515.

Suppose we took our original data set, took the estimates,

use the Save Simulation Formula that is comes from Fit Mixed,

and generated a large number of new data sets, and applied the

same model fitting process that we applied here to each of them,

and we collected up all of the confidence intervals that were

reported around the total. After having done this, suppose

that that...the estimated coverage, the estimated number

of times that these intervals actually contained the truth,

turned out to be 88%.

So we wanted that 95% interval, but the Bootstrap

procedure is telling us that the actual coverage is closer to

88%. Now we can play a little game and we can repeat the

Parametric Bootstrap using a 99% interval this time. So we go

through that process, we redo all the bootstrap intervals and

when we did the 99% interval we get an actual coverage of

approximately 98%. Now suppose we did this game over and over

again until we found an alpha

with actual coverage approximately 95%. So in this

case, suppose we did that and we ended up with finding that 97.6%

when we asked for a 97.6% interval, we actually got

something like a 95% coverage.

Then what we can do is set 1 minus alpha to 0.976 using the Fit

Model launch dialogue, set alpha option and will get an

interval that has been Bootstrap Calibrated to have

approximate coverage 95%. This is still an approximation. There

is still a simulation component to it, as well as a deeper

underlying approximation that is extraordinarily hard to analyze,

but it can be made easy to use, and this is where Hadley comes

in. Now I'm going to hand it over to him and he will demo the

add-in and go over the simulations that he did that

show that we are able to get better coverage rates than

before by applying Bootstrap

Calibration to Satterthwaite intervals on linear combinations

of variance components. Take it away Hadley.

Thank you very much, Chris, and hello to everyone watching online

wherever you are. So I'm going to start out by showing you how the add-in

works and how you can use it to calculate Bootstrap Calibrated

confidence limits for random components in Mixed Models in

JMP Pro 16.

And from there we'll take a step back. We'll see how the add-in

makes these calculations and I'll highlight some of the additions

to Mixed Models in JMP Pro 16 that allow it to do that.

From there, I'll show you the results of some simulation

studies to give you an idea about how accurate this

interval estimation method is, the Bootstrap Calibration

method, and how it compares to some of the other methods for

calculating confidence limits, as well as the situations where it's

more or less accurate and some of the limitations and things you

should be aware of if you're going to be applying it.

We’ll discuss possibilities for improvements in future work

just briefly, and from there, I'll conclude by showing you the

new MSA Designer, Measurement Systems Analysis Designer,

available from the DOE menu in JMP Pro 16 so that you can quickly

and easily design and analyze your own MSA Gauge R&R

studies. So let's start out by looking at this data set.

This is one that I pulled from the sample data files.

I'm going to run this Fit Mix script here that I've saved. So

what we've got here are our random estimates, estimates

for a random components.

Now.

it could be that you want to, for some reason, calculate an

intermediate total, for example Operator and Part nested with

Operator, or the three of these,

you know, Operator and residual.

So to calculate those is very simple, we simply add these

estimates, but what's not so simple is to determine those

confidence limits.

There's a new feature in Mixed Models that's been added in 16.

The linear combination of

The linear combination of variance components feature

right here, and so what you can do is you can click that.

You can choose the combination of variance components that you're interested in,

and you can press done. So now we have an estimate for those.

Components as well as their

confidence limits. So,

what I'm going to do now is I'm going to take this one step

further and I'm going to calculate the Bootstrap

Calibrated Satterthwaite estimates and I'm going to do that by

going to my add-ins and clicking the Bootstrap Calibrated

confidence intervals there. So from here we can estimate the

number of simulations.

2500 is a

recommended number to the default number. It's also the

default number in some of the other simulation platforms and

in JMP Pro. I'm going to choose

this one. But one thing to note is that it takes some

time to be able to do this, and so in the interest of time

what I'm going to do is I'm going to stop it early.

And here we have our

calibrated intervals, calibrated upper and lower confidence

limits added to the report.

So let's take a step back and see what happened there.

I'm going to go ahead and add this again.

Now, one thing that the add-in

does, as soon as you run it,

is it adds

this simulation formula to the data table, so you can see the

simulation formula here.

When the add-in is closed,

the simulation formula disappears.

The simulation formula there takes advantage of another

feature that's been added to...

to the Mixed Models platform in JMP Pro, and that is the Save

Simulation Formula feature

here. So what this would allow you to do is to save the

simulation formula and then to

use that, for example, to simulate

these values here. So, we can swap out our “Y” with our

new simulation formula,

and go ahead and run that. So when you run the add-in, this is

all done in the background. But this is how the add-in goes

about calculating these intervals. So I'm going to stop

this early, once again in the interest of time.

And now we see here the samples

estimated for each.

simulation. And so how the add-in works is

it takes all of these.

And it calculates new estimates for the upper and

lower Satterthwaite intervals from this

estimate and this standard error, swapping out different

values for alpha. So what we're aiming for 0.05, right?

So that we get 95% upper and lower limits, and what it

does is it finds

an alpha value that results in 95% coverage, that is 95% hits

and 5% misses,

swaps that in, that's how you get your calibrated intervals.

So I hope you enjoyed seeing that. I hope you find it useful.

We've done some simulation studies and what we found out

is that

the intervals, which you can see here for four operators

and 12 days as our random components,

we've achieved misses of about

7%, so a 92.8 hit ratio. Now this is better than all of the

others, including this, so the linear combination, which is

simply the standard Satterthwaite interval

calculated on the combination of linear components, as well as the

Bootstrap quantiles, the bias-corrected intervals in the

bias-corrected and accelerated intervals, but as you'll see

these intervals improve, all of them,

as you increase your number of Operators from

4 to 8 and the number of Days from 12 to 24. So increasing the

levels of these random components

result in much better, much more

accurate estimates for the confidence limits, and so much so

that we now have

a method here

that is equivalent,

just, to an

alpha value of .05.

So. this improvement in performance of course, comes at

a cost, and one of those costs is the length of the intervals.

And so you can see here,

that with our

Bootstrap calibrated, well with all of our intervals in fact,

that when we have increasing number of Operators,

that the length of the interval is much more

bundled closer to 0 than it is when you've got smaller

number of Operators. You can see that this tails out much

further, so that's this blue area here. That's true for

all of them, but it's especially true for the

Bootstrap Calibrated interval. You can see this

long tail here.

On average, you're going to get longer lengths using this

method, but you have a more accurate method.

Exploring that a little bit deeper, you can see here

that this increase in length is true for four Operators, as

well as eight Operators, and it is significant.

Statistically significant.

The other thing that I looked at,

is the effect of adding repetitions, so the difference

between two repetitions and five repetitions, and what you'll see

here is that there really is no

difference. So looking across

the different sets of combinations from

four Operators and two reps to four Operators and five reps, about 6

measurements total versus 3 measurements, we really don't

gain anything. All of these are equivalent to each other.

So that's something to be aware of, that you see improvements in

accuracy when increasing the number of Operators, and you

don't see improvements when increasing the number of

repetitions.

One thing that I'd like to mention as a

possibility to improve upon these results is the

Fractional Random Weight Bootstrap, which we would have

liked to have been able to implement for this in time for

this conference. We weren't able to do that, to take this

and to apply it to random variance components, and so we

hope to be able to do that in future work and perhaps even

see an improvement upon the Bootstrap Calibrated interval.

And then the other thing that I'd like to highlight before I

go is the new

MSA designer that's been added to JMP 16, and so

from here what we can do is we can very quickly

create our own design in order to be able to

perform our own MSA or

Gauge R&R analysis. And so let's see, I'll do this with three

Operators and Five parts.

I'll label these

A, B and C.

And we'll do one repetition of each. So that's two

measurements total.

So here we've got a table with our

design. What I can do is I can press this button to very

quickly send that to the different operators, have

them fill out their parts, send that back to me.

And then I can add

those results together.

So I'll just sort this because I've got another table over here

where I've done this ahead of time. So I'll just add these

values over there. And now from the scripts within the table we

can quickly and easily do our own Measurement Systems Analysis

and Gauge R&R. So I hope you found this useful. I hope you

continue to enjoy the talks at this conference. Thank you very

much for listening.

Presented At Discovery Summit Europe 2021

Presenter

Hadley Myers

Files

Bootstrap Calibrated Confidence Intervals.jmpaddin

Bootstrap Interval Calibration Using New “Save Simulation Formula” Feature in JMP® Pro 16 (2021-EU-30MP-744)

Presenter

Files

Advanced Statistical Modeling

Automation and Scripting

Basic Data Analysis and Modeling

Consumer and Market Research

Design of Experiments

Predictive Modeling and Machine Learning

Quality and Process Engineering

Reliability Analysis

Sharing and Communicating Results