Hi @Justin_Bui,
There may be several explanations to why you don't find the terms in the model statistically significant :
- Too narrow factors ranges (so it may be more difficult to find statistical differences between high and low levels),
- High variability of the response(s),
- Inappropriate model (and/or choice of terms in the model),
- Too small sample size for the design (so low power for detecting statistically significant effects),
- Missing factor(s) explaining statistically the variation of the response(s),
- Effectively non-statistically significant factors,
- Combination of the options above,
- Other considerations...
Considering your design and the assumed model, you can start evaluating you design to see how likely you would be to detect statistically significant effects if they are active (see script "Evaluate design" in the datatable). It seems you have quite low power for detecting main effects and interaction, so it's not a big surprise to not find any statistically significant terms for the response Y if you consider the high variability of this response (around 5% of chances to detect a statistically significant effect (if active) for a significance level of 0,05 and RMSE around 5 for the response Y with your 6 experiments) :
For a good starting point before modeling, it's always best to start simple, and do some graphs/visualizations to see the trends. That could further improve and inform your modeling options, and visualizations can also help you during the modeling to assess the relevance of your model (for example by looking at residuals plot, actual vs. predicted, etc...).
What is also interesting to consider is practical significance of the factor(s). In the example you provide for response Y and looking at the parameter estimates, does a variation of 20 in the response Y for one unit of factor B represents a practically significant increase/decrease ?
What you may find for response Y are trends, that could be further investigated by doing additional experiments (augmenting the design to add runs confirming this model, and/or replicating existing runs to better assess variability of the responses). The statistical significance helps you have more certainty in the interpretation of your results. For example in response Y, you can see that parameter estimates have relatively high standard error, meaning that the size of the effect may not be precisely estimated. So it's always best in those situations to interpret the trends with high caution.
For response X, it seems that a model involving the terms Factor A and its quadratic effect is sufficient to have an appropriate (or at least relevant enough) model. I let you assess the model for Response X in the datatable attached.
I hope this first answer may help you,
Victor GUILLER
L'Oréal Data & Analytics
"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)