Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- JMP User Community
- :
- Discussions
- :
- ANOVA assumption test

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 18, 2017 8:14 AM
(1693 views)

Hi JMP community!

I run into a question when doing my data anlysis project. I want to test that whether differnt types of products are statistically different in prices. I use ANOVA to do the testing. Before that, I created a boxplot of prices for differnt types of products. I'm wondering whether I should exclude outliers indicated by the boxplot before doing ANOVA analysis.

It would be really helpful if you can provide me some insights. Thank you so much!

1 ACCEPTED SOLUTION

Accepted Solutions

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 18, 2017 8:32 AM
(3380 views)

Solution

In my opinion, outliers should not be eliminated, unless there is a causal effect unrelated to the analysis, which made the values what they ended up having. If non can be found, then you should make the assumption the values are part of your valid distribution. But that leads us to the next issue, ANOVA assumes the data are normally distributed. With skewed data(outliers may have caused such), the data may not be normal in form. When this happens, you should look to normalize the data through transformation. The Distribution Platform in JMP can help you with the determination of whether or not the data are normal and if not, it may be able to provide you with a transformation you can use to convert to normal for the analysis.

Jim

6 REPLIES

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 18, 2017 8:32 AM
(3381 views)

In my opinion, outliers should not be eliminated, unless there is a causal effect unrelated to the analysis, which made the values what they ended up having. If non can be found, then you should make the assumption the values are part of your valid distribution. But that leads us to the next issue, ANOVA assumes the data are normally distributed. With skewed data(outliers may have caused such), the data may not be normal in form. When this happens, you should look to normalize the data through transformation. The Distribution Platform in JMP can help you with the determination of whether or not the data are normal and if not, it may be able to provide you with a transformation you can use to convert to normal for the analysis.

Jim

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 18, 2017 8:36 AM
(1687 views)

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 18, 2017 2:26 PM
(1647 views)

I really like the **Normal Quantile Plot** option in **Oneway**. This plot overlays the normal distribution of each group in the same plot. The y-intercept is the mean and the slope is the standard deviation. You can check ANOVA assumptions (only population difference is the mean (vertical displacement of lines), populations have same variance (lines are parallel), and check for outliers) all at the same time.

Learn it once, use it forever!

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 18, 2017 2:21 PM
(1649 views)

In addition to Jim's insight, you also want to check the assumption that the *variance is constant* across the groups because the test models variance this way and pools the estimates across the groups. So be sure to also click the red triangle next to Oneway and select **Unequal Variances** for this check of another important assumption.

Learn it once, use it forever!

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 18, 2017 7:26 PM
(1636 views)

Thank you so much for your reply!

I understand that I need to check whether dependent vairable is normally distributed and variance is equal. I tried the normal quantile and unequal variances in JMP. I also attached the result in this post. However, it seems that my data are not normally distributed and have unequal variances. I wonder how to deal with unequal variances.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Jun 19, 2017 4:10 AM
(1605 views)

You might try transforming the response. Heavily skewed data often benefits from the natural logarithm function. Alternatively, analyze the data with Fit Least Squares to determine the best power transformation:

- Select
**Analyze**>**Fit Model** - Select
**Openbid**and click**Y** - Select
**Item**and click**Add** - Click
**Run** - Click the red triangle next to Response and select
**Factor Profiling**>**Box Cox Y Transformation**

Examine the plot of SSE versus lambda. If no transformation is helpful, the minimum SSE should be found near lambda = 1. Lambda = 0 is essentially the same as a log transformation. Click the red triangle next to Box Cox and select **Save Best**. Now repeat your analysis using **Openbid X** as the response.

See if this change helps meet the assumptions.

Learn it once, use it forever!