Hi @Thommy7571,
Dadurch wird die Attraktivität von DSDs vermindert, die ja schon ohne weitere Messungen in der Lage sein sollen, quadratische Effekte und Wechselwirkungen zwischen den Haupteffekten zu trennen. Andererseits hatte ChatGPT mir auch vorgeschlagen.mehrere Zentralpunktmessungen hinzuzufügen, so wie es MathStatChem geschrieben hatte. Bradley hatte das Thema nicht näher in seinem DSD Artikel behandelt. Der genaue Effekt und zitierfähige Gründe dafür liegen mir daher nicht vor. Gibt es einen Artikel, indem der Effekt unterschiedlicher Anzahlen von hinzugefügten Zentralpunkt-messungen und / oder Blöcken mit Zentralpunktmessungen untersucht wird?
Again, I would avoid using ChatGPT for precise answers, particularly on modern designs like DSD. The responses may be highly generic, and not particularly suited for your specific use case, or the use of DSDs.
Concerning the effect of adding centre points or extra runs, you can search for publications or try several scenarii in JMP, by creating several designs and compare them. I created 3 DSDs for this scenario :
- DSD for 7 continuous factors, minimum amount of runs required (17 runs).
- DSD for 7 continuous factors, 21 runs with 5 centre points (1 created by default, 4 added in the datatable after)
- DSD for 7 continuous factors, 21 runs with 4 extra runs.
Here are the main differences :
With the model specified here (assuming 3 interactions and 3 quadratic effects), you can see that both "augmented" DSD designs (with 21 runs) have increased power to detect main effects, interactions, and quadratic effects. The difference is :
- DSD with added centre points has a slightly higher power to detect quadratic effects, but the gain on detecting interactions is not very high compared to the situation with 4 extra runs.
- DSD with 4 extra runs has similar power to detect quadratic effects and higher power for detecting 2-factors interactions. This is due to the fact that extra runs are not replicates runs (unlike centre points which correspond to the same setting/treatment) and DSD runs have factors set to 3 levels, so extra runs do significantly improve power for main effects, interactions and are able to increase power for quadratic effects as well.
Looking at relative standard error of estimates for the assumed model, you have the same interpretation than with power analysis : adding centre points only improve significantly the precision to estimate quadratic effects (and intercept), whereas adding extra runs enable to improve the precision to estimate terms more uniformly.
In terms of prediction variance, centre points enable to have a lower and more stable prediction variance in the centre of the experimental space. This can be particularly useful if you think your optimum point may be in the centre of the experimental space. However, DSD with only centre points tend to have a prediction variance higher in the corners of the experimental, since all added points are centre points which only decrease prediction variance around the centre of your experimental space :
You can see here by maximizing prediction variance, the DSD with 5 centre points has a prediction variance close from the "non-augmented" original DSD, whereas DSD with 4 extra runs enable to reduce prediction variance more homogeneously at the borders of your experimental space.
- Efficiencies :
You can see that DSD with 4 extra runs has better D and G-efficiencies (the comparison reference is the original 17 runs DSD, so the lower the relative efficiency of the original DSD with another DSD, the better the gain for this other DSD). Little reminder about efficiencies here : Design Diagnostics (jmp.com) and https://en.wikipedia.org/wiki/Optimal_experimental_design
So having better relative D-Efficiency is useful in the screening stage, as this enable to have a better precision in the estimation of effect terms. G-Efficiency may also be useful, as it enables to minimize the maximum prediction variance, so to have a prediction variance more homogeneous between the centre of the experimental space and the borders.
Relatives A-efficiency, focused on minimizing the average variance of the regression coefficients, and I-efficiency, focused on minimizing the average prediction variance over the experimental space, are slightly better in the design with 5 centre points than in the design with 4 extra runs.
Sind DSDs für 7 Faktoren erfahrungsgemäß noch gut brauchbar?
If you have continuous factors, I would highly recommend using DSD for an efficient screening for 5+ factors. You can always check if this design seems useful for you, by creating DSD and other screening designs (classical screening designs with Hadamard matrices, D-/A-Optimal designs, ...) and use the platform Compare Designs (jmp.com). Take into consideration that most screening designs only have 2 levels for the factors, unlike DSDs.
Unklar ist, was jetzt "Fake factors" sein sollen.
"Fake factors" are factors added in the creation of the DSD to generate a higher number of runs in the design, but that have no practical interest: they are not linked to a specific experimental factor you want to change, simply here to increase the matrix size. Adding runs with "fake factors" is a trick to enable better effects estimation and increased power, and also increase degree of freedoms available to fit additional terms in the model.
Some ressources to help you on understanding DSDs :
Introducing Definitive Screening Designs - JMP User Community
Definitive Screening Design - JMP User Community
The Fit Definitive Screening Platform (jmp.com)
Using Definitive Screening Designs to Get More Information from Fewer Trials - JMP User Community
And more generally on DoEs :
DoE Resources - JMP User Community
I added the three designs generated as an example if you want to reproduce the analysis and comparison.
Hope this answer will help you,
Victor GUILLER
L'Oréal Data & Analytics
"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)