Sorry, a bit late to the discussion. I'm not sure I understand what you are doing? It would be also helpful to us if you can provide the data table, even if you want to code the factors and response variables for confidentiality. This way we can try to reproduce/understand what you're getting as outputs of the analysis.
I don't understand this statement:
We have a variation in the Lot starting material which we already know, we want to consider this extravariability but not for future investigation (there is no need to add the lot as X3 in the model even if it is significant).
My thoughts:
1. Seems to me you want/need to account for the incoming variation of lot-to-lot variation which you know may be significant?
2. If you know there is lot-to-lot variation, do you know whether it effects the yield? If it does, what are you doing to reduce the incoming lot-to-lot variation?
3. Can you measure the incoming lot? Is there some measure of the incoming lot that you think relates to yield? If so, why not treat that measure as a covariate?
4. I will provide a different view of the use of blocking...As you indicate there are 2 ways to treat the block effect, as a random effect or as a fixed effect. To me, what determines which you use is the amount of understanding of what noise is confounded with the block. If you have a thorough understanding of what noise variables are confounded with the block, a more effective use of the block is as a fixed effect. When you analyze the model, you want to include the blocks AND all block-by-factor interactions. In fact, the most important effects might be the block-by-factor interactions. Why, because you want your factor effects to be consistent over changing noise (this is the definition of robust), this, in essence, means the absence of block-by-factor interactions.
Now if you have not done the due diligence of identifying what noise factors are confounded with the block, then you are left with treating the block as a random effect and a means of lowering the mean square error (which may provide for a more significant F-test).
"Block what you can, randomize what you cannot" G.E.P. Box
Block what you can identify, randomize what you cannot identify.
"All models are wrong, some are useful" G.E.P. Box