Discussions

rshehadah · Oct 23, 2018 05:33 PM

Hi,

Is there a way in JMP to label outliers in a dataset? For example, I have 100 lots and I want to label the units that are observed on the lower side of the distribution. I don't care if the outliers perform better than the distribution; I am more interested in the lower ones.

In other words, I have a lot with 100 units, and 1 or 2 of those units are outliers. How can I label those units?

Thank you,

Rami

msharp · Oct 23, 2018 06:31 PM

The easiest way would be to just create a new "outlier" column and mark the rows that are outliers. You can then make that column a "label" column. Once you have that column, you can use it to color the points or to assign different marker styles (like x vs. dot). Since it's a label column, you can highlight the points and show the row label.

rshehadah · Oct 23, 2018 06:38 PM

Thank you for the reply, but I don't see how I can do that for over 100 lots. Also, how would you know those two points are outliers?

msharp · Oct 23, 2018 07:04 PM

Labelling outliers and finding outliers are two completely different questions. There are lots of statistical methods to determine outliers (Peirce, Grubbs, 3 sigma, box-and-whisker plots, etc.), all of which vary and can disagree. You can use the Analyze > Screening > Explore Outliers tool for this.

That said, I always put in a word of caution around outliers. For you, a lot is really only an outlier if it experienced processing different from the rest of your lots. Data shouldn't be thrown away just because it makes your "fit bad", it "looks high", or it makes your "P-value significant".
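To make the "100 lots, low side only" part concrete, here is a minimal sketch of the box-and-whisker rule applied per lot. It is written in Python rather than JSL, purely as an illustration of the logic and not as what the Explore Outliers tool does internally. The function name `flag_low_outliers`, the sample data, and the conventional 1.5 x IQR multiplier are all assumptions made for this example. It only flags rows (the equivalent of filling in the "outlier" column); it does not remove anything.

```python
from collections import defaultdict
from statistics import quantiles


def flag_low_outliers(rows, k=1.5):
    """Return one boolean per (lot, value) row.

    A row is flagged only if its value falls below Q1 - k * IQR of its
    own lot. High-side points are deliberately ignored.
    """
    # Group the measurements by lot so each lot gets its own fence.
    by_lot = defaultdict(list)
    for lot, value in rows:
        by_lot[lot].append(value)

    lower_fence = {}
    for lot, values in by_lot.items():
        if len(values) < 2:
            # Quartiles are not meaningful for a single point; flag nothing.
            lower_fence[lot] = float("-inf")
            continue
        q1, _, q3 = quantiles(values, n=4, method="inclusive")
        lower_fence[lot] = q1 - k * (q3 - q1)

    return [value < lower_fence[lot] for lot, value in rows]


# Hypothetical data: lot A has one low unit (6.0) and one high unit (14.0).
rows = [
    ("A", 10.1), ("A", 9.9), ("A", 10.0), ("A", 10.2), ("A", 6.0), ("A", 14.0),
    ("B", 5.0), ("B", 5.1), ("B", 4.9), ("B", 5.2), ("B", 5.0),
]
flags = flag_low_outliers(rows)
low_units = [row for row, is_low in zip(rows, flags) if is_low]
print(low_units)
```

With this made-up data, only the 6.0 unit in lot A is flagged; the 14.0 unit is left alone because only the lower fence is checked. Swapping in a different rule (3 sigma, Grubbs, and so on) would change which points get flagged, which is exactly why the methods can disagree.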
