- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Outlier Deletion
Is there a way to script the deletion of extreme outliers per column? My objective is once i run the script, it concatenates data tables, delete extreme outliers per column and display graphs but im having some sort of problem with the outlier part.
1 ACCEPTED SOLUTION
Accepted Solutions
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Outlier Deletion
Created:
Feb 16, 2020 10:25 AM
| Last Modified: Feb 17, 2020 8:13 AM
(2015 views)
| Posted in reply to message from viskovicz00012 02-16-2020
I use Jackknife Distances for outliers.
For extreme outliers I use upper limit = 6.
In the following example I use upper limit = 2.
Names Default To Here( 1 );
dt = Open( "$SAMPLE_DATA/Big Class.jmp" );
JackknifeObj = dt << Multivariate( Y( 4 ), Jackknife Distances( 1, Save Jackknife Distances( 1 ) ) );
JackknifeObj << Close Window();
dt << Select Where( :Jackknife Distances > 2 );
dt << Hide and Exclude << Clear Select << Delete Column( Jackknife Distances );
2 REPLIES 2
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Outlier Deletion
Created:
Feb 16, 2020 10:25 AM
| Last Modified: Feb 17, 2020 8:13 AM
(2016 views)
| Posted in reply to message from viskovicz00012 02-16-2020
I use Jackknife Distances for outliers.
For extreme outliers I use upper limit = 6.
In the following example I use upper limit = 2.
Names Default To Here( 1 );
dt = Open( "$SAMPLE_DATA/Big Class.jmp" );
JackknifeObj = dt << Multivariate( Y( 4 ), Jackknife Distances( 1, Save Jackknife Distances( 1 ) ) );
JackknifeObj << Close Window();
dt << Select Where( :Jackknife Distances > 2 );
dt << Hide and Exclude << Clear Select << Delete Column( Jackknife Distances );
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Outlier Deletion
@tom_abramov approach is a good method for identifying outliers. Are you looking for what method to use to identify outliers or do you already have a method of determining the outliers that you want to use?
Jim