Yes, we are aware that measurements could also be the problem. Each time we did at least 3 repetitions of measurement (which showed a relatively low coefficient of variance) for each sample. We had some problems with the lasser diffraction analysis, which probably raised RMSE in some responses, but RMSE in the responses which do not include laser diffraction analysis is also a bit high, but not as much.
Thank you for very good ideas. We decided to repeat the second run twice, so we will see if variation is high also in one run, or just in the whole model. Then we will add the new runs to the model and see if RMSE is any lower.
Does that makes sense to you?
This is a start. A couple of cautions on these repeat runs: you would be assuming that the variability is constant over the design space (in other words, run #2 is representative of all other experimental conditions). Does the variability on your repeat runs capture all of the variability sources encountered when running the initial design (did operators change? instrumentation changes? setup changes?). How comfortable are you in estimating a variance based on only 2 or 3 data points? That is essentially what you are doing.
One correction from my previous post: I meant we will repeat the central run twice, not the second run (I apologise for a little confusion).
Anyway, the central run should also cover all of the points (cautions) you mentioned.
Since we will be repeating central runs, we are hoping, that the variability of the central runs would be the best (the best of the 17 runs we can repeat) approximation of the variability of the whole design . Furthermore, we are hoping, that RMSE of our design will drop as well, since there will be more runs in the model. If the drop of the RMSE will be big enough to detect some new effects, we won't have to estimate a variance based on only 3 data points.
Just to tell you how we solved the problem in the end. We tryed different data transformations, so we could get more normal distribution, with more constant variance over the entire range of response. When we analysed transformed response, we could detect many more effects (main and second order effects).
You mean a normal distribution of the residuals, right?
Glad you found a solution!
I mean the distribution of response.
Why would you expect the response to be normally distributed if there are active effects?
There are no labels assigned to this post.