Hi All,
I need advice with setting up a DoE.
The goal of the DoE is to build a (logistic regression) model for a binary response (y) depending on five continuous (x1, x2, x3, x4, x5) variables. The five variables need to add up to 1, hence a mixture response design may be ideal. The response as mentioned is pass/fail depending on the variables combination. In addition, there are constraints on some of the continuous variables .... for example 0.4 < x4 < 0.5 etc.
I tried a mixture response design with these design limitations. And when I simulated responses and built a model to check the model diagnostics, I found that the VIF is too high for constrained variables suggesting confounding effects. I think the high VIF is because of high correlation between x5 and its cross terms with other variables. This can be confirmed by looking at correlation of estimates.
With that background I have couple questions.
1. Is mixture response surface design the best approach for my problem? Or should I try something like a sequential design approach. there is a bit of literature on sequential design application for binary response models.
2. How can I reduce the VIF or the confounding between a constrained variables and its cross terms?
I appreciate any help that will solve my problems. Please let me know if more information is required.
Regards
-Praveen.