frankderuyck
Level VI

Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects

I have a question on the setting of the Stage 1 P-value for the analysis of the 7-factor DSD in the attachment.

I need to set the Stage 1 P-value to 0.4 to detect the 5 active factors, while the default setting is 0.05; is there an explanation?

Is the Stage 1 P default of 0.05 not too low? How should the Stage 1 P-value be set; is there a rule of thumb?

7 REPLIES
Victor_G
Super User

Re: Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects (Accepted Solution)

Hi @frankderuyck,

 

I guess the example you show is for education/training purposes, since the response is based on a formula?

A good reminder from JMP Help to answer your question: "A minimum run-size DSD is capable of correctly identifying active terms with high probability if the number of active effects is less than about half the number of runs and if the effect sizes exceed twice the standard deviation" (Source: Overview of the Fit Definitive Screening Platform).

In your case, you have 7 factors and expect 5 to be detected as "active", so it's a very complex task to perform when using a minimum run-size DSD with relatively high response noise.

 

When I look at the "noise"/variability part of your response formula, I can see that the random uniform part is quite large (from -25 to 25) compared to the size of the estimates (from -20.8 to 5.2) for the factors you want to detect in this first stage. So for most factors with a specified effect size, it may be very hard to detect them and estimate their effects against the noise/variability you added to the formula. You also mention 5 "active" factors, but I think there is a confusion: factors are only considered "active" if their estimates are large enough relative to the noise/natural variability of your experiments. Specifying an estimate different from 0 does not mean a factor should be considered active/important. It is a question of signal-to-noise ratio, and of the difference between statistical significance and practical significance.

 

The significance level (p-value threshold) to use can be estimated from your design and assumed noise (anticipated RMSE) in the Evaluate Designs platform. There you can adjust the significance level and the Anticipated RMSE to evaluate the power (the probability of detecting an effect if it is active).
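As a rough illustration of what such a power evaluation computes, here is a hedged sketch using the textbook noncentral-t formula (power_main_effect and the toy 2^3 design are my own illustrative choices, not JMP functions, and this is not necessarily how Evaluate Designs computes power):

```python
# Hedged sketch of a power calculation for one coefficient, in the spirit of
# Evaluate Designs (textbook noncentral-t formula; not JMP's actual code).
import numpy as np
from scipy import stats

def power_main_effect(X, j, beta_j, rmse, alpha=0.05):
    """Two-sided power to detect coefficient beta_j of column j in model matrix X."""
    n, p = X.shape
    se = rmse * np.sqrt(np.linalg.inv(X.T @ X)[j, j])  # standard error of beta_j
    ncp = beta_j / se                                  # noncentrality parameter
    df = n - p
    t_crit = stats.t.ppf(1 - alpha / 2, df)
    return (1 - stats.nct.cdf(t_crit, df, ncp)) + stats.nct.cdf(-t_crit, df, ncp)

# Toy usage on a 2^3 full factorial (intercept + 3 main effects); with a DSD you
# would use its actual model matrix exported from JMP instead.
X = np.array([[1, a, b, c] for a in (-1, 1) for b in (-1, 1) for c in (-1, 1)], float)
for alpha in (0.05, 0.4):   # raising the threshold raises the power
    print(alpha, power_main_effect(X, j=1, beta_j=5.2, rmse=12.0, alpha=alpha))
```

The loop at the end also previews the point further down: raising the significance threshold from 0.05 to 0.4, with everything else fixed, increases the power.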

Perhaps not "exactly rigorous", but as an exercise, you can check whether the outputs of your analysis seem right, since you're working with a theoretical response formula. Adding a new formula column to look at the noise part of the formula, I can estimate a standard deviation of approximately 12:

Victor_G_0-1738844087617.png
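For readers without the data table, a minimal sketch of that check (assuming the noise term really is a single Random Uniform(-25, 25) draw; anything else in the noise part of the formula would change the number). In the long run the standard deviation of this uniform is (b − a)/√12 ≈ 14.4, and a sample estimate from only 17 rows can easily land near 12:

```python
# Sketch: standard deviation of the Uniform(-25, 25) noise term.
# Assumes the noise part of the response formula is exactly this uniform draw.
import numpy as np

rng = np.random.default_rng(2025)
a, b = -25.0, 25.0
analytic_sd = (b - a) / np.sqrt(12)                     # ≈ 14.43 in the limit
sample_sd = np.std(rng.uniform(a, b, size=17), ddof=1)  # 17-run estimate; varies by seed
print(analytic_sd, sample_sd)
```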

When using this value as Anticipated RMSE (probably a low estimate for the RMSE), fixing the significance level at 0.05, and using the theoretical estimates of the formula as Anticipated Coefficients (I left "1" for the null effects of your formula to see the power "baseline" of your design for main effects with this RMSE), the result matches the outputs of your analysis:

Victor_G_1-1738844283618.png

Only X1 has an effect size large enough to be detected with high probability. The other effects have effect sizes that are too low relative to the Anticipated RMSE of the response. If you use a higher significance threshold with the same expected noise, the power to detect the main effects increases:

Victor_G_2-1738844535244.png

 

Several options (alone or combined) could be used for your example, depending on your teaching objectives:

  1. You can reduce the variability (the random uniform part in the Y response formula) to make sure you can detect most of the active effects you want to detect (and estimate).
  2. You can increase the design size to enable easier detection of active effects. From JMP Help: "However, by augmenting a minimum run-size DSD with four or more properly selected runs, you can identify substantially more effects with high probability." (Overview of the Fit Definitive Screening Platform)
  3. You can increase the effect sizes (the factor estimates in the formula) so that effects become easier to detect despite the high noise/variability. (A short sketch quantifying options 1 and 3 follows this list.)
  4. You can also try different analysis platforms to detect the main effects. The Fit Two Level Screening platform may offer a different perspective, for example:
    Victor_G_3-1738844790012.png
  5. No matter what you choose, the Evaluate Designs platform (and the Compare Designs platform for comparing design choices) is really important to show and explain to students, as it enables a better assessment of the pros and cons of any design depending on the objectives, model complexity, ability to detect effects, aliasing properties, etc. It can also be used to assess the significance level to use if you have prior information about the anticipated RMSE and anticipated coefficients.
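To make options 1 and 3 concrete with the hypothetical power_main_effect sketch from above: halving the noise or doubling an effect size changes the same signal-to-noise ratio, so both raise power identically.

```python
# Reusing the illustrative power_main_effect sketch from above (same caveats):
# option 1 (lower noise) and option 3 (larger effect) act on the same ratio.
print(power_main_effect(X, j=1, beta_j=5.2,  rmse=6.0))   # noise halved
print(power_main_effect(X, j=1, beta_j=10.4, rmse=12.0))  # effect doubled: same power
```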

 

I hope my answer helps you understand what's going on in your example.

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)
frankderuyck
Level VI

Re: Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects

Hi Victor, thanks again for your detailed reply. Yes, it is a DOE training case based on a pilot case study where the results were blurred by considerable noise and the experimental budget was limited to a maximum of 25 runs. Unfortunately there was no preliminary knowledge, so a pragmatic approach was required. The main goal of this screening study is to identify the strong main effects with which the DSD can be augmented to get an RSM. That's why I did not include the 4 extra DSD runs and kept 17; that way there are 8 spare runs for augmentation (4 spare runs might be too few).

Interestingly, when increasing the threshold P to 0.4, the DSD analysis still detects the 5 significant factors! Only when the threshold P is as high as 0.7 do the 2 smaller, non-significant effects X6 and X7 come in and (as in ROC analysis) the model specificity decreases. Augmenting the 17-run DSD to 24 runs using the 5 strong effects results in a very good and useful model and successful achievement of the experimental goal (matching the Y target).

I very much like the Fit Two Level Screening platform you mention above; I would also select X4.

This case study shows that, with a pragmatic approach, a DSD is still a powerful screening tool even with noisy results and a budget constraint, as the "S" stands for screening. I still use it a lot.

frankderuyck
Level VI

Re: Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects

SVEM forward selection also does a good job! See the attachment.

Victor_G
Super User

Re: Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects

Hi @frankderuyck,

 

I have mixed feelings about the output and the (systematic) use of SVEM, particularly on noisy data.

 

SVEM forward selection does not "select" terms in the model (or filter them with a specific statistical criterion and threshold), but instead tries to estimate them as precisely as possible by resampling the data points with anticorrelated training/validation weights. By default, every term is considered and estimated with SVEM.
So if your objective is understanding and filtering out the main important effects through screening, SVEM might not help you further, as you're lacking a decision framework (statistical criterion, metric and threshold) for that. SVEM is, however, very helpful in a predictive mindset, for precise estimation of terms.
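For readers curious what those anticorrelated weights look like in practice, here is a rough sketch of the idea, assuming the published fractionally weighted bootstrap scheme (w_train = −ln U, w_valid = −ln(1 − U)); svem_forward and wls are my own illustrative names, and this is not JMP Pro's actual implementation:

```python
# Rough sketch of SVEM-style forward selection (fractionally weighted
# autovalidation). Illustrative only; not JMP Pro's implementation.
import numpy as np

def wls(X, y, w):
    """Weighted least-squares coefficients."""
    sw = np.sqrt(w)
    beta, *_ = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)
    return beta

def svem_forward(X, y, n_boot=200, seed=0):
    """Average forward-selection fits over anticorrelated-weight resamples."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    coefs = np.zeros((n_boot, p))
    for b in range(n_boot):
        u = rng.uniform(size=n)
        w_tr, w_va = -np.log(u), -np.log(1.0 - u)       # anticorrelated weight pair
        active, remaining, path = [0], list(range(1, p)), []  # column 0 = intercept
        while remaining:
            # greedy step: add the term that most reduces training-weighted SSE
            def tr_sse(cols):
                r = y - X[:, cols] @ wls(X[:, cols], y, w_tr)
                return np.sum(w_tr * r**2)
            j = min(remaining, key=lambda k: tr_sse(active + [k]))
            active, remaining = active + [j], [k for k in remaining if k != j]
            beta = wls(X[:, active], y, w_tr)
            va = np.sum(w_va * (y - X[:, active] @ beta)**2)
            path.append((va, list(active), beta))
        va, cols, beta = min(path, key=lambda t: t[0])  # best validation-weighted SSE
        coefs[b, cols] = beta
    # averaged coefficients, and the fraction of resamples each term entered
    return coefs.mean(axis=0), (coefs != 0).mean(axis=0)
```

Called on a model matrix X (intercept in column 0) and response y, it returns averaged coefficients plus the fraction of resamples in which each term entered the chosen model, analogous to the Percent NonZero metric mentioned below.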

 

However :

  • You could use the column/metric "Percent NonZero" as a statistical metric to refine the model, but you'll have the same difficulty as with p-values in defining a threshold: should a term be >50% non-zero to be included in the model and considered "active"? Or some other level? (A short thresholding sketch follows this list.)
  • By visualizing the data from the SVEM model, you can probably highlight some key effects and select them based on practical importance (for example, the absolute value of a term should be > 2): Victor_G_0-1738944726348.png
    But again, defining this "acceptance" threshold may involve a lot of domain expertise and thought, so as not to prematurely exclude terms that could be worth investigating further.
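As a toy continuation of the svem_forward sketch above (same caveats, and the 0.5 cutoff is deliberately arbitrary, which is exactly the difficulty just described):

```python
# Toy continuation: screen terms by Percent NonZero from the sketch above.
avg_beta, pct_nonzero = svem_forward(X, y)   # X, y from your design and response
active_terms = [j for j in range(1, X.shape[1]) if pct_nonzero[j] > 0.5]  # arbitrary cutoff
```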

 

Also, if you look at the residual pattern, something strange happens:

Victor_G_1-1738945013255.png

I would not trust a regression model that shows such a non-random residual pattern (bias). But this can be "improved" by lowering the model complexity, launching SVEM GenReg with main effects only:

Victor_G_2-1738945169405.png

There seems to be curvature in the residual pattern. In your example, the interactions and quadratic effects of your response formula are not investigated in this simple model, and that can explain the curvature.
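A quick synthetic illustration (my own toy data, unrelated to your actual response formula) of why a missing quadratic term shows up as curvature in the residuals:

```python
# Toy demo: fit a straight line to data with a quadratic component and the
# leftover curvature lands in the residuals.
import numpy as np

rng = np.random.default_rng(7)
x = np.linspace(-1, 1, 40)
y = 2 * x + 3 * x**2 + rng.normal(0, 0.2, x.size)    # true model contains x^2
X = np.column_stack([np.ones_like(x), x])            # main effect only
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ beta
print(np.corrcoef(resid, x**2)[0, 1])                # strongly positive: curvature left behind
```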

 

I don't think making model building and estimation more complex is the right move, particularly with noisy data like in this example.

Your model quality will always be linked to the quality (and precision) of your data. 

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)
frankderuyck
Level VI

Re: Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects

I agree, a regression model based on initial screening can't be fully trusted; that is not the objective of screening, which just gives a direction for how to proceed in a sequential workflow. All three analysis tools above (DSD analysis, two-level fit, and SVEM) filter out the same 5 strongest effects, which can be used in augmenting the DSD. I like the sequential approach; to build a house of knowledge one should start with the strongest building blocks to get a solid framework for studying the other, smaller important effects.

frankderuyck
Level VI

Re: Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects

In this 2-level screening there is a strong, significant quadratic effect; is this reliable? As this screening platform only checks 2 levels, how can it detect 3-level quadratic effects?

Victor_G
Super User

Re: Setting Stage 1 P value in Analysis of DSD at a high level to detect active effects

Hi @frankderuyck,

 

You can find guidelines and more info in the documentation: Overview of the Fit Two Level Screening Platform
You can also check the order in which terms/effects enter the model here: Statistical Details for Order of Effect Entry

 

This platform is not recommended for multi-level categorical or ordinal factors, as they will be treated as continuous.

For quadratic effects, most of the time these effects will be artificially orthogonalized to enter the model, so p-values and effect estimates may be imprecise. But the platform can still detect high-contrast quadratic effects for 3-level continuous factors.
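A small toy check of why curvature is estimable at all with 3-level continuous factors (my own example; the platform's exact orthogonalization may differ): for a balanced −1/0/+1 column, the centered x² contrast is orthogonal to the linear term, so it carries a separate signal:

```python
# Toy check: a centered quadratic contrast is orthogonal to the linear term
# for a balanced -1/0/+1 column (DSD-style 3-level factor).
import numpy as np

x = np.array([-1.0, -1, 0, 1, 1, 0, -1, 1, 0])   # toy 3-level factor column
q = x**2 - (x**2).mean()                          # centered quadratic contrast
print(np.dot(x, q))                               # 0.0: orthogonal to the linear term
```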

I used this platform in this topic: Découverte des plans OML (Orthogonal Main Effects Screening Designs for Mixed Le... - JMP User Commu..., and it helped me detect a quadratic effect that was not detected by the Fit Definitive Screening platform or by two estimation methods from the Fit Model (Generalized Regression) platform.

So it can be interesting to test the Fit Two Level Screening platform (following its guidelines and recommendations) to compare models from different platforms, understand which ones agree/disagree about term selection, and judge which models are most relevant.

 

Hope this answer helps you,

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)
