Whether you use the individual observations or the mean observation determines the meaning of the errors. This interpretation is important in any hypothesis tests that you use such as the F ratio for the whole model in the analysis of variance or the t ratio for a parameter estimate. The errors represent the random variation in the response that occur when you do not change conditions or when you return to the same condition. They provide a benchmark for assessing effects when conditions change. How much of the effect is due to the new condition and how much of it is just random change that would happen anyway?
Since you are investigating the effect of mouse strain and treatment, the error term should ideally represent the accumulation of all the perturbations (sum) associated with any run. That is a single observation. If you repeat the entire run, not just make another measurement, that run is a replicate. If you do not repeat the run but make another measurement, then that is not replication.
It seems like you made multiple measures within each run due to concern about measurement accuracy (bias + precision). That is to say, you did not sacrifice a mouse more than once! In that case, I would model the mean response.