I'm running a DSD DOE in JMP18 Pro with 2 categorical factors (Factor A (level A1, A2), Factor B (level B1, B2) and 2 continuous factors (Factor C, Factor D). My goal is to select the better Level in each Factor A, and B (e.g., either A1 or A2, B1 or B2) including any two-way interaction terms for all factors and estimate the quadratic terms in Factor C, D.
I chose to add 8 more runs with blocks to add more power to my model and resulted in total of 22 runs.
After running the experiment, about 1/3 of my runs were below detection limit. I tried to use the "Fit model" --> "Generalized Regression/LogNormal" with censored data to analyze my data.
My questions are:
1/ What should I put in Construct Model Effects? Should I only include the Main Effects in the model or should I have all main effects + two-way interaction + quadratic terms (for Factor C, D)?
2/ In Generalized Regression, what is the best estimation method? By default, the Estimation Method was Two Stage Forward Selection. Should I try other methods and use AICc or BIC to evaluate which model works best? (I tried, and Ridge/Holdback gave me the lowest AICc, should i use it?)
3/ When working with DSD, I usually use the Fit Definitive Screening Platform as my go-to, which has 2 stages of evaluating terms in the analysis. Can I do this with Detection Limit or Left-Censored Data?
Appreciate for all your help!
Hi @D_P2025,
@statman already gave a lot of useful information about the use of Definitive Screening Design and model building regarding your topic.
I will try to provide additional comments and answer some of the other questions you had :
There are a lot of options you can try to build several designs according to a model, and compare the relative performances of the design easily thanks to an enhanced table with all informations :
Example of terms agreement between each estimation method :
(Visuals from my presentation in the french User group : Découverte des plans OML (Orthogonal Main Effects Screening Designs for Mixed Level Factors) pour le...
Hope this additional answer may help you,
There is not enough information provided and without any SME, there is little basis for critique. I do have the following general comments, IMHO:
1. DSD is a screening design methodology. You only have 4 factors in your experiment. This is NOT the intended use for DSD. Although you can run DSD with categorical factors, that is not the strength of DSD which is intended to provide quadratic information inside the design space.
2. Whenever you are running an experiment, the design you use is a function of your hypothesized model. That is the model is predicted from your SME and tested via the experiment. I always start with a saturated model and remove insignificant/uninteresting terms from the model (AKA subtractive method of model building). You should include all estimable terms in your initial model.
3. Which statistics you use to evaluate your model is a personal choice. I use several, NOT ONE, to evaluate the model as I remove terms (e.g., R-Square-R-Square Adjusted delta, R-Square adjusted magnitude, RMSE, residuals plots, p-values). Always with the assistance of SME.
4. I am always careful to diagnose the data before quantitative analysis. This is why I ALWAYS use the sequence: Practical significance, graphical analysis and lastly quantitative analysis (PGQ). If the data makes no sense or does not change of any practical significance, you can't fix it with quantitative analysis.
5. Blocking is not a strategy to increase the power of the design (while that may happen, that is not why it was invented, see Fisher). Blocking is used to increase the inference space while not decreasing the precision of the experiment.
Thank you very much for your input. I'm pretty new to DOE and I haven't too many types of design yet so I thought DSD would be a good option for my purposes. I actually tried to design with Custom Design: with same factors, and included all main effects, two-way interactions and quadratic terms, + blocking = 20 runs, which is pretty similar to DSD. At that point, I was debating which one to use, and I chose the DSD with a little bit higher runs thinking that would give me better power.
The reason I used blocking was because we can't run the entire experiment in one go (maximum about 12 runs each time), so I decided to break it into 2 blocks (and conveniently this is an option in DSD) when i designed the model
I, of course, am a huge supporter of experimental design. I applaud your efforts to run one. I do suggest you read some books and take some classes to get a better understanding of the different experimental situations and strategies. Pay particular attention to how you handle noise and restrictions in an experiment.
Hi @D_P2025,
@statman already gave a lot of useful information about the use of Definitive Screening Design and model building regarding your topic.
I will try to provide additional comments and answer some of the other questions you had :
There are a lot of options you can try to build several designs according to a model, and compare the relative performances of the design easily thanks to an enhanced table with all informations :
Example of terms agreement between each estimation method :
(Visuals from my presentation in the french User group : Découverte des plans OML (Orthogonal Main Effects Screening Designs for Mixed Level Factors) pour le...
Hope this additional answer may help you,
Thank you very much for your reply.
Hi @D_P2025,
Concerning your additional questions :
For these reasons, A-Optimality is the recommended optimality criterion used by JMP when using two-level designs for main effects and two-factor interactions.
Concerning the analysis of DSDs, there are other discussions dealing about situations where different estimation methods and model selection have been used :
Setting Stage 1 P value in Analyis of DSD at a high level to dedect active effects
DSD Interpretation and Next Steps
Significance of factors in Definitive Screening Design
Hope this complementary answer will help you,
For the modeling, I recommend starting with the full response surface model, and then use Two-Stage Forward Regression in Generalized Regression. You should still should be able to use the censoring of the data, but a simple approach is to use substitute the reporting or detection threshold for the results.
I have no experience with censoring but with regular, good quality data, 22 DSD runs with 4 factors should be enough to give you at least a direction to understanding and correct modeling. I am curious to know the result when, in the DSD fit model platform, you increase the treshold P-value from 0,1 to 0,4. Did you also try SVEM?