cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
JMP is taking Discovery online, April 16 and 18. Register today and join us for interactive sessions featuring popular presentation topics, networking, and discussions with the experts.
Choose Language Hide Translation Bar
Marc
Level II

Lack of fit - significant R square in two "almost similar x/y models"

Hello.

 

I know a similar question was asked before, but please help me to apply this to my model:

 

1. test two regressions - A versus B and A versus C (similar observations n=1242)

2. Model A versus B has an higher Rsquare and seems to have a better correlation - but a Lack of fit < 0.05

3. Model A versus C has a lower Rsquare and sems to have a bit less correlation - but a lack of fit > 0.05

4. The residual plot looks quite similar.

 For the actual full result please see also my attached PDF.

 

My question: Do I have to disregard Model A versus B and conclude that A versus C is better and more appropriate for my analysis ?

Screenshot (11).png

 

 

 

 

1 ACCEPTED SOLUTION

Accepted Solutions
David_Burnham
Super User (Alumni)

Re: Lack of fit - significant R square in two "almost similar x/y models"

Based on a brief glance of the models my feeling is that you are worrying too much about the detailed statistics.  The 2 models look almost identical to me (look at the bivariate plots not just the stats) - clearly there is a very high level of correlation between B and C. Does your scientific understanding favour one model over another?

What stands out to me are the high leverage points at a value of about 70 (for both B and C - strange that they are both on the same scale - are they different measures of the same thing?).  Anyhow, I would be concerned about the degree of leverage of those points and their overall influence on the regression.

-Dave

View solution in original post

2 REPLIES 2
David_Burnham
Super User (Alumni)

Re: Lack of fit - significant R square in two "almost similar x/y models"

Based on a brief glance of the models my feeling is that you are worrying too much about the detailed statistics.  The 2 models look almost identical to me (look at the bivariate plots not just the stats) - clearly there is a very high level of correlation between B and C. Does your scientific understanding favour one model over another?

What stands out to me are the high leverage points at a value of about 70 (for both B and C - strange that they are both on the same scale - are they different measures of the same thing?).  Anyhow, I would be concerned about the degree of leverage of those points and their overall influence on the regression.

-Dave
Marc
Level II

Re: Lack of fit - significant R square in two "almost similar x/y models"

Many thanks David.
All are physiological variables, all measurements are done with the same method - hence trying to find the best predictor for A. B and C are dependant to each other.
I will explore the outliers separately in a later, but before this I wanted to configure the best model to define outliers (>2SD or >90% percentile) - although they maybe the same in each model anyway (I didn't check this yet).
Thanks a lot, Marc