Thanks @thestrider,
Yes, there are multiple things that need to be addressed here. I can't tell you what you should do statistically (which analysis you would need to do or how to address the between run variability due to external factors, like the shifting of control and treatment values) since we are advised to not give statistical advice on how to treat each of these issues, but I can point you to which platforms within JMP that could help address some of them
Within run statistical comparisons cannot be made since there is only one control per run and no way to assess variability for within run variation to make that comparison (at least not by traditional statistical methods that I am aware of). Between run variation can be estimated since there are three controls, but it will be biased because you can have up to 9 possible measurements for the condition of interest. This would be considered unbalanced and possibly give you unequal variance. Thankfully, JMP has a means to deal with unequal variance in the Fit Y by X platform (gene X response in the Y, Response section and condition in the X, Factor section). Then choose t-Test from the red triangle which will give you the means for each group and a p-value for testing if the condition is different from control, assuming unequal variance. The censored data is basically missing data (not measurable due to the reason given) and makes sense why you have added technical replicates to accommodate such issues for each run.
Also, as you have mentioned, values shift from one run to the other, including the control, almost like a match pair situation (or paired t-test like situation) or a batch effect. So the above alone is not enough to see if there is a statistical difference because of this additional bias. There could be many ways to address this. One may be to do multiple regression where run and condition are effects (kind of like batch and condition looking for batch effects and accommodating for them).
In this case, Fit Model is the platform to use and choosing Standard Least Squares as the personality will get you to the type of test/analysis that might be most appropriate in your case. I would first test to see if there is a run effect by putting run, condition and the interaction of run and condition as model effects and then Gene X response as the Y. You could do this for all genes (and that would be a lot to look at for hundreds of genes). The small example data you have, I have done this and there is definitely an effect due to run to run variability, so that means it must be accounted for in the statistical tests you perform.
I have created a JMP table and saved the Fit Y by X and the Fit Model analyses to the the data table so you can see how I performed each test. It is attached to this post.
It is best to either contact a statistician in house or see if one of our gracious community members are willing to advise on how best to treat your situation. Maybe also take the free Statistical Thinking for Industrial Problem Solving course might help learn more about statistics that could help in this situation. Or take a look at the Statistics Knowledge Portal for more statistical upskilling.
Sorry I am not able to give the statistical advice you might need, but I hope I at least pointed to where in JMP you could go to analyze your data and assess the experiment and test for significance of the conditions.
Best,
Chris Kirchberg, M.S.2
Data Scientist, Life Sciences - Global Technical Enablement
JMP Statistical Discovery, LLC. - Denver, CO
Tel: +1-919-531-9927 āŖ Mobile: +1-303-378-7419 āŖ E-mail: chris.kirchberg@jmp.com
www.jmp.com