Hi @xaris19801001,
Looking at your Leverage Plots (jmp.com) for treatment, you can see that confidence curve is asymptotic to horizontal line, so it's borderline in terms of statistical significance (hence the p-value 0.0385 close to threshold 0.05 you have chosen). This "borderline" situation can also be seen when looking on the left at your "Effect tests", with "Treatment" in red showing its value close to 0.05.
Statistics is rarely a black or white situation, and p-values are helps to find best compromise in your decision/conclusion. Here the situation is not well defined, which might be due to data itself, or the choice of the test (Tukey-Kramer):
- Have you checked the assumptions are valid for parametric test (like Tukey-Kramer) : independence of observations, data are (quasi)-normally distributed, and homogeneity of variances for the groups ?
- Are you interested in all comparisons, or do you have a "control" (in this case a Dunnett's test might be more appropriate) or "best" group you want to compare to ?
Depending on the choice of the test, the significativity threshold (fixed BEFORE the experiment, not during/after to have the conclusion you would like to have), its assumptions and the number of comparisons, you may have different results.
Hope this first answer will help you, looking forward to other responses from members
Victor GUILLER
L'Oréal Data & Analytics
"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)