cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Choose Language Hide Translation Bar
lisat
Level I

Residuals analysis for multiple linear regressoin-normal quantile plot isn't normal. What to do next?

Hello again. This is as much a statistics question as a how-to-use-JMP question.

 

I've done a Fit Model multiple linear regression on my data and found that only 2 (of possible 6) predictors matter and they interact (So 2 predictors and 1 cross of them in the model).

 

However, my adjusted Rsquare is only 0.72, and my residual analysis looks like this:

lisat_0-1629084873697.png

lisat_1-1629084904473.png

lisat_2-1629084962870.png

 

Any suggestions for dealing with the non-normality suggested by the quantiles plot? There are no assignable causes for the tail observations on each end to justify excluding them (they are scattered throughout the data), and the highest Cooks-d on an outlier is 0.4.

1 ACCEPTED SOLUTION

Accepted Solutions
ih
Super User (Alumni) ih
Super User (Alumni)

Re: Residuals analysis for multiple linear regressoin-normal quantile plot isn't normal. What to do next?

You might take a look at this wikipedia article, specifically the 'in Regression' section. Transformations are easy in JMP: right click on a column heading and choose 'New Formula Column'.  Make a bunch of different transformations and then analyze each using the distributions platforms to decide which is best.  If you transform your target variable and want to predict it, you just need to 'undo' the transformation by using the inverse transformation on your predicted value.

View solution in original post

3 REPLIES 3
ih
Super User (Alumni) ih
Super User (Alumni)

Re: Residuals analysis for multiple linear regressoin-normal quantile plot isn't normal. What to do next?

Hi @lisat,

 

Are your variables normally distributed? Perhaps a transformation prior to linear regression would help.

lisat
Level I

Re: Residuals analysis for multiple linear regressoin-normal quantile plot isn't normal. What to do next?

The data are not normally distributed. For each of the categorical factors that are significant, there is a separate non-normal distribution.  (One set of data here is suppressed because it was increasing the bin size, hiding the detail in the other two categories. Basically the same story but with greater spread. Chart labels also suppressed for confidientiality.)

 

lisat_1-1629138693576.png

 

 

ih
Super User (Alumni) ih
Super User (Alumni)

Re: Residuals analysis for multiple linear regressoin-normal quantile plot isn't normal. What to do next?

You might take a look at this wikipedia article, specifically the 'in Regression' section. Transformations are easy in JMP: right click on a column heading and choose 'New Formula Column'.  Make a bunch of different transformations and then analyze each using the distributions platforms to decide which is best.  If you transform your target variable and want to predict it, you just need to 'undo' the transformation by using the inverse transformation on your predicted value.