cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
Jenny
Level II

ANOVA assumption test

Hi JMP community!

I run into a question when doing my data anlysis project. I want to test that whether differnt types of products are statistically different in prices. I use ANOVA to do the testing. Before that, I created a boxplot of prices for differnt types of products. I'm wondering whether I should exclude outliers indicated by the boxplot before doing ANOVA analysis.

It would be really helpful if you can provide me some insights. Thank you so much!

 

1 ACCEPTED SOLUTION

Accepted Solutions
txnelson
Super User

Re: ANOVA assumption test

In my opinion, outliers should not be eliminated, unless there is a causal effect unrelated to the analysis, which made the values what they ended up having.  If non can be found, then you should make the assumption the values are part of your valid distribution.  But that leads us to the next issue, ANOVA assumes the data are normally distributed.  With skewed data(outliers may have caused such), the data may not be normal in form.  When this happens, you should look to normalize the data through transformation.  The Distribution Platform in JMP can help you with the determination of whether or not the data are normal and if not, it may be able to provide you with a transformation you can use to convert to normal for the analysis.

Jim

View solution in original post

6 REPLIES 6
txnelson
Super User

Re: ANOVA assumption test

In my opinion, outliers should not be eliminated, unless there is a causal effect unrelated to the analysis, which made the values what they ended up having.  If non can be found, then you should make the assumption the values are part of your valid distribution.  But that leads us to the next issue, ANOVA assumes the data are normally distributed.  With skewed data(outliers may have caused such), the data may not be normal in form.  When this happens, you should look to normalize the data through transformation.  The Distribution Platform in JMP can help you with the determination of whether or not the data are normal and if not, it may be able to provide you with a transformation you can use to convert to normal for the analysis.

Jim
Jenny
Level II

Re: ANOVA assumption test

Thank you so much Jim! This helps a lot!

Re: ANOVA assumption test

I really like the Normal Quantile Plot option in Oneway. This plot overlays the normal distribution of each group in the same plot. The y-intercept is the mean and the slope is the standard deviation. You can check ANOVA assumptions (only population difference is the mean (vertical displacement of lines), populations have same variance (lines are parallel), and check for outliers) all at the same time.

Re: ANOVA assumption test

In addition to Jim's insight, you also want to check the assumption that the variance is constant across the groups because the test models variance this way and pools the estimates across the groups. So be sure to also click the red triangle next to Oneway and select Unequal Variances for this check of another important assumption.

Jenny
Level II

Re: ANOVA assumption test

Thank you so much for your reply!

I understand that I need to check whether dependent vairable is normally distributed and variance is equal. I tried the normal quantile and unequal variances in JMP. I also attached the result in this post. However, it seems that my data are not normally distributed and have unequal variances. I wonder how to deal with unequal variances. 

Re: ANOVA assumption test

You might try transforming the response. Heavily skewed data often benefits from the natural logarithm function. Alternatively, analyze the data with Fit Least Squares to determine the best power transformation:

  1. Select Analyze > Fit Model
  2. Select Openbid and click Y
  3. Select Item and click Add
  4. Click Run
  5. Click the red triangle next to Response and select Factor Profiling > Box Cox Y Transformation

Examine the plot of SSE versus lambda. If no transformation is helpful, the minimum SSE should be found near lambda = 1. Lambda = 0 is essentially the same as a log transformation. Click the red triangle next to Box Cox and select Save Best. Now repeat your analysis using Openbid X as the response.

See if this change helps meet the assumptions.