I'm about to prepare a DoE for a novel process which involves up to 11 continuous factors while one of these is hard to change.
Since I'm trying to combine this with an effort to make the ideas behind DoE a bit more transparent to colleagues, I've come across a few questions, which can be summarized like this:
What's the best design approach when domain knowledge suggests the existence of two-pair interactions and possibly quadratic effects, while I cannot assume strong heredity?
I'm hesitant to create a custom design involving "all" terms (i.e., main effects, two-way interactions and quadratic effects) as it looks like a brute-force solution and contradictory to an efficient experimentation. I would rather prefer to do an initial screening design to (hopefully) ignore non-contributing factors and run a subsequent augmented design. My major concern is that an improper screening design may lead to the loss of important factors due to a missed interaction or quadratic term.
It seems that there are a few threads with similar questions but touching only little aspects of this, so I'm posting this as a new thread here.
In order to study the design and analysis methods, I've setup a test case which I'm attaching. I've created a definitive screening design for five factors, including a central point. For the sake of simplicity, I've ignored the possible existence of hard-to-change factors. The target variable Y is generated through a formula, where you'll see the relevant terms.
I tried several approaches to find the proper model (since I have a priori knowledge about the true function, I'm always biased) and I don't want to explain in all detail what I've done but a brief summary is:
- From a pure screening for main effects, I might drop factors X2, X3 and X4 and might not include them in a subsequent augmented design.
- By fitting a definitive screening design, I would be mislead to interactions that aren't there and also quadratic terms would be confused. I'd have to drop the assumptions of strong heredity for both interactions and quadratic terms.
While I understand where this is coming from (strong correlations between interactions and also quadratic terms within the given design), the question is, how to best deal with this case?
I understand that my target function is made up arbitrarily and hence, it might well be a fairly pathological case. I'm wondering how often this issue arises in practice?
Looking forward to your feedback.
Björn