Some follow-up. Since you have been iterating, you should already know statistical significance. By the types of factors you are experimenting on (e.g., pressure or no pressure), I'm curious of previous DOE's. My suggestion is to develop a contour map of pH (think of it as a topographic map). Consider where you want to develop the map (region based on factor levels) geometrically and run those treatments.
If you are averaging the within batch pH, what is that standard deviation?
Of course, I don't know the specification. In understanding causal structure, specifications are not useful as they are derived independently. Specifications, however, do give possible insight into practical significance. Look at the data for practical significance before statistical.
If your measurement error is .1 (1 standard deviation), then you have too much measurement variation given the range of pH in your experiment is 7.1-7.58. (measurement error distribution is greater than the entire range of your data.)
"To simplify the problem, I don't put all those factors in the table, because my main issue is the pressure step. But yes you are right. All the factors that could impact the input product and the process itself are registered in my real data table."
I can't provide appropriate advice when the situation isn't fully understood. The question is over what conditions was the experiment conducted. How were noise variables handled during the experiment? They were either:
1. Held constant - this is a really bad idea as it restricts the inference space and therefore reduces the likelihood your model will work in the future.
The exact standardization of experimental conditions, which is often thoughtlessly advocated as a panacea, always carries with it the real disadvantage that a highly standardized experiment supplies direct information only in respect to the narrow range of conditions achieved by the standardization. Standardization, therefore, weakens rather than strengthens our ground for inferring a like result, when, as is invariably the case in practice, these conditions are somewhat varied
R. A. Fisher (1935), Design of Experiments (p.99-100)
2. Allow to vary randomly during the experiment - better, but still not optimal as this reduces the precision of the experiment
3. Partitioned (e.g., blocking) - Better yet as this can increase the inference space while increasing the precision of the experiment.
"All models are wrong, some are useful" G.E.P. Box