I have tried to follow your thought process, but, of course, I don't understand the "science" of your investigation. I will share some of my thoughts.
I see you ran a Res. V fractional factorial with 3 center points (although there is no CP for the one categorical factor). There is no mention of practical significance of the response? How much of a change in the response is of practical importance? Statistical significance is a bit of a challenge because you have 3 replicates of the CP. One of the DF's is used to estimate curvature and the others are unassignable and therefore may be the basis for estimating the MSE. How does the MSE compare with the variation you see in the process normally? E seems to be the most "active" with C second. I don't see much (again I don't have any context with respect to the response variable) with the rest. Why did you continue with A and B? No evidence of curvature. A model based on E and C looks like there are some issues (one unusual event and the residuals are certainly not normally distributed).
What model did you use for steepest ascent? Looks like you used insignificant factors in that model? Not sure why you continued so far past the best results in your SA method?
The most significant factor was set to the least optimum level...huh? I would want to investigate that factor more. Could it be more quantitative?
Not sure why the CCD? How does it relate to the other work you did? Also be careful here as statistical significance does not play as big a role in optimization. By the point of optimization, you already know the factors matter, so what are you looking for? Sweet spots (local maxima which are difficult to maintain) or flat spots that provide more robust results.
Lastly, I don't see any strategy to "handle" noise? No repeats/nesting (short term noise like measurement error) or blocks or split-plots (for long term noise like ambient, raw material lots)?
"All models are wrong, some are useful" G.E.P. Box