Discussions

billi · Mar 10, 2020 09:04 AM

I am looking at fit least square results and there are two factors in the model (after reducing the model two factors are left). The whole model is not significant but the leverage plot for factor 'A' is significant. I wanted to check if this is the correct interpretation of the leverage plot: 'Although whole model is not significant but leverage plot shows that factor 'A' has an impact on the response'. ?

Mark_Bailey · Mar 10, 2020 09:59 AM

Thanks for sharing. There are a number of ways to proceed.

You might accept the determination from the whole model test that none of the terms are significant. This approach is conservative. The risk is that you make a type II error.
You might relax the common standard of alpha = 0.05 for significance if this case is a screening study. That is, accept a higher chance of a type I error in order to reduce the chance of a type II error, which is more concerning when screening. This approach assumes that you will verify the conclusion with future observations. For example, make predictions of the response at new factor levels and empirically confirm them.
You might collect more observations, especially at the ends of the factor ranges, in order to see if the trends hold when repeating the regression analysis. You might consider extending the ranges at this stage to produce a larger effect, if that change is physically feasible. This approach is prudent and economical because such observations would have maximum leverage.

View solution in original post

Mark_Bailey · Mar 10, 2020 09:31 AM

A regression analysis often involves many hypothesis tests. It is possible, therefore, that you might get a false significance, a type I error. In this case, it is also possible that you have a high leverage observation for one of the factors. Can you show us a picture of the Actual by Predicted and the Leverage Plots?

billi · Mar 10, 2020 09:41 AM

@Mark_Bailey these are the plots.

Mark_Bailey · Mar 10, 2020 09:59 AM

Thanks for sharing. There are a number of ways to proceed.

You might accept the determination from the whole model test that none of the terms are significant. This approach is conservative. The risk is that you make a type II error.
You might relax the common standard of alpha = 0.05 for significance if this case is a screening study. That is, accept a higher chance of a type I error in order to reduce the chance of a type II error, which is more concerning when screening. This approach assumes that you will verify the conclusion with future observations. For example, make predictions of the response at new factor levels and empirically confirm them.
You might collect more observations, especially at the ends of the factor ranges, in order to see if the trends hold when repeating the regression analysis. You might consider extending the ranges at this stage to produce a larger effect, if that change is physically feasible. This approach is prudent and economical because such observations would have maximum leverage.

billi · Mar 10, 2020 10:25 AM

@Mark_Bailey Thank you for your response. One follow-up question: Do you think FDR p-value helps in this case to filter out the coincidence (that factor A is significant)? Because of this result (that I showed you before), the effect summary shows that factor 'A' is significant (shown below).

And if I check the FDR effect summary shows following results.

Mark_Bailey · Mar 10, 2020 10:41 AM

NO!

FDR is intended for situations with very many terms in the model. It is aimed at controlling the inflation of the type I error rate experiment-wise.

statman · Mar 10, 2020 08:24 PM

I'm confused by your initial post. Looking at the leverage plots, none of the factors or the whole model look significant. My understanding is that the confidence curve must cross the horizontal line. Also it looks like you might have 1 or 2 residuals that look unusual. I'm not sure how you selected these factors from your screening experiment, but you might want to revisit this. And, of course, Mark has some good advice.

"All models are wrong, some are useful" G.E.P. Box

billi · Mar 11, 2020 09:27 AM

@statman @Mark_Bailey So the part I am confused about is no factor is significant but still in the effect summary factor 'percent A' is significant.

Mark_Bailey · Mar 11, 2020 10:14 AM

I am not sure what you mean or why you say, "So the part I am confused about is no factor is significant but still in the effect summary factor 'percent A' is significant." I am guessing that you mean to ask is how can the ANOVA return a p-value above 0.05 when the p-value for one of the factors is less than 0.05.

The ANOVA uses the F ratio to compare the mean sum of squares of the model to the mean sum of squares of the errors. It is an omnibus test of any non-zero parameter. The parameter estimates use the t ratio to compare the estimate to the hypothesized value (zero) to the standard error of the estimate. The ANOVA and t tests are not directly connected in any way. These two tests are based on different hypotheses and methods. It is true that if one factor has a p-value that is much less than 0.05 then it is more likely that the ANOVA will have a p-value less than 0.05, but it is not directly proportional. In your case, the p-values for the ANOVA and the factor are both close to the arbitrary 0.05 significance level. Neither enjoys particularly strong evidence to reject the null hypothesis (zero).

Let me know if either I did not interpret your last question correctly or if I did not clarify the issue.

Mark_Bailey · Mar 11, 2020 10:17 AM

Regarding the decision of significance by the leverage plots, the horizontal reference line (no effect) is not contained within the confidence region for percent A if you extend the plot beyond the original scale. The visual assessment in the plot must agree with the numerical assessment of the p-value to the default significance level of 0.05.

Discussions

correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Re: correct interpretation of leverage plot

Recommended Articles