cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
JMP is taking Discovery online, April 16 and 18. Register today and join us for interactive sessions featuring popular presentation topics, networking, and discussions with the experts.
Choose Language Hide Translation Bar
BSwid
Level IV

Filter Distribution Output by Count (N) of Data Points

I want to use the Distribution function to explore data.

The data is tagged by Region.  The Regions are of varying sizes.
There are too many Regions to analyze visually.
I'd like to be able to set a filter, say N=25, and only see the graphs for the where the Region has at least 25 data points.

Is that possible?

In the image "Store Cluster" is equivalent Region.
2018-09-25_DistributionSubsetByCount.png

1 ACCEPTED SOLUTION

Accepted Solutions
txnelson
Super User

Re: Filter Distribution Output by Count (N) of Data Points

The simple interactive way to do this is to create a new column, which contains the results of the Col Number() for each Store Cluster.   The formula for this column would be

Col Number( :units sold, :region)

Then you could use a Data Filter, or a Local Data Filter, and exclude all rows that do not have a value > 25 

Jim

View solution in original post

2 REPLIES 2
txnelson
Super User

Re: Filter Distribution Output by Count (N) of Data Points

The simple interactive way to do this is to create a new column, which contains the results of the Col Number() for each Store Cluster.   The formula for this column would be

Col Number( :units sold, :region)

Then you could use a Data Filter, or a Local Data Filter, and exclude all rows that do not have a value > 25 

Jim
dale_lehman
Level VII

Re: Filter Distribution Output by Count (N) of Data Points

You could also create a new column using the ColSum function and putting region in as a By variable.  You may have to create a column of 1s if you don't have a column with unique values first.  But that function will then show how many rows each region has and you can use that column as a filter (for >25 rows or any other value you want).