cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Choose Language Hide Translation Bar
cweisbart
Staff
Need for Speed: Filter and subset data

c.jpgHello! Welcome to today's Need for Speed! It's all about filtering and subsets.

My name is 2-Click Clovis, and I am truly passionate about data analysis and most important, time efficiency! JMP was an integral tool for my previous work in the semiconductor and manufacturing industries. Since joining JMP, I have learned so many new tips and tricks. I’ll never forgive myself for not knowing what I now know, because I could have saved so many hours in my data analysis workflow! I see it as my current responsibility to share my newfound knowledge with data analysts to help them regain their precious time.

After I demonstrate how quickly JMP can perform your routine data manipulation and analyses compared to other tools, I’ll show you the quickest way to get it done within JMP.

Filtering data by one variable

One of my favorite features in the newer versions of JMP (beginning with JMP 15) is the histogram icon in the top left corner of your JMP table.

histogram icon (2).jpg

This histogram icon allows you to easily filter your selection of data by one variable with just two clicks! With your first click, click on the histogram icon. BOOM! Histograms appear beneath each of your column names. If your variable is continuous, you see its minimum and maximum values displayed on the histogram. If your variable is categorical, you see the number of values/levels for that variable and their relative count.

histograms (2).png

With your second click, select a value or range of values directly on the histogram. BOOM! Notice that all the rows associated with your selection are now highlighted. This is an example of JMP’s famous dynamic linking! In the animation below, watch as I click on the histogram icon, then click on the “Reject” value under the Lot Acceptance column. When I do this, all the rows associated with “Reject” are highlighted (14 out of 90 rows).

histogram1.gif

You can now easily create a new subset table based off this selection by going to Tables>Subset>OK. In the animation below, you can observe the creation of a new subset table with just the 14 “Reject” rows.

subset 2.gif

 

Filtering data by more than one variable

To filter data by more than one variable, I recommend using JMP’s Data Filter. All you need to do is go to Rows>Data Filter or just click on the following icon: 

data filter (2).png

You can then double-click on a variable of interest and select a desired value or range of values. To incorporate more variables, click either the “And” or “Or” functions at bottom. 

cweisbart_6-1649200497036.png

In the animation below, you can see how to create a subset based off the following conditions: Sugar Supplier equal to “Sour” and Mill Time between 10 and 20.

datafilter1.gif

 Local data filtering within a JMP graph or report

All JMP reports are going to have an option to add a Local Data Filter under the red triangle at top left of report. This tool is a great way to answer any “What would this look like if…” questions. The best part is that you do not need to recreate your reports or graphs! In the example of the animation below, we are looking at the Dissolution box plot. What would the box plot look like if we were only looking at “Sour” Sugar Suppliers and “Smooth” Talc Suppliers?

local data filter 1.gif

 

On that note,

c.jpg

 

 

 

is clicking out!

Last Modified: Apr 18, 2022 2:43 PM
Comments
gail_massari
Community Manager

Try this yourself using any JMP table or the Table Production data table Clovis used, and which is part of the JMP Sample Data library.  Thank you Clovis @cweisbart.

gail_massari_0-1649856482182.png

 

HenrietteK
Level II

Nice post - really helpful

jthi
Super User

One more nice thing with row selection (and with other options also!) is using Data View. You can create Data View (linked subset) by right clicking on different options on Rows section in bottom left corner of data table. This lets you quickly create Linked Subsets for Selected, Excluded, Hidden and Labeled Rows

jthi_2-1654335484476.png

jthi_1-1654335457383.pngCouple of links related to data view:

The Data View 

Rows Panel in Data Tables (help, jmp.com) 

cweisbart
Staff

@HenrietteK  Thank you! I’m happy you found it helpful! Here are some additional Need for Speed JMP blog posts: 

https://community.jmp.com/t5/tag/Need%20for%20Speed/tg-p/board-id/jmp-blog 

cweisbart
Staff

@jthi , thank you for adding your comment! Very helpful and well explained! I am a huge fan of Data View. It’s a great option and I think it actually saves you a click! 

GeneratingModes
Level I

Is there a way to select rows using multiple numeric variables by keying in a range rather than picking graphically?  This would be more efficient when a large number of variables are involved.

jthi
Super User

@GeneratingModes (Global) Data Filter is one option

jthi_0-1676103661739.png

You could also create a formula and select values based on that.

cweisbart
Staff

@GeneratingModes great question! The 3rd gif in the blog shows an example of how to key in a maximum and minimum filter value for Mill Time instead of manually using slider. Hope this helps!