cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Submit your abstract to the call for content for Discovery Summit Americas by April 23. Selected abstracts will be presented at Discovery Summit, Oct. 21- 24.
Discovery is online this week, April 16 and 18. Join us for these exciting interactive sessions.
Choose Language Hide Translation Bar

Demonstration: Influence of Outliers

Started ‎06-10-2020 by
Modified ‎12-03-2021 by
View Fullscreen Exit Fullscreen

Learn more in our free online course:
Statistical Thinking for Industrial Problem Solving

 

To explore the impact of unusual observations and outliers on the correlation coefficient, we use the demoCorr script. This script is in the JMP Sample Scripts Directory.

 

The demo correlation script starts with several observations and allows us to drag points or add new points and explore changes in the correlation coefficient.

 

The initial correlation is 0.675. X and Y are positively correlated. So, in general, as X increases, Y also increases.

 

When I add a new observation at a low value of Y and a high value of X, the correlation drops.

 

When I drag this observation to a high value of Y and a low value of X, again the correlation drops.

 

In both of these cases, without looking at the graph, we might conclude that there is no relationship between the two variables.

 

What happens if this point is an outlier for both X and Y? I'll change the axes for X and Y and add this observation at extreme values for both variables.

 

Do you notice what happens to the correlation now? The correlation is now strongly positive. This one outlier has caused the correlation to be inflated.

 

Look how important it is to plot your data before you interpret the output!