cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Submit your abstract to the call for content for Discovery Summit Americas by April 23. Selected abstracts will be presented at Discovery Summit, Oct. 21- 24.
Discovery is online this week, April 16 and 18. Join us for these exciting interactive sessions.
Choose Language Hide Translation Bar
Martin
Level V

Formula for Number of Unique Categories in Column

I am looking for the column formula to provide me with the count of unique categories in a column (maybe based on another columns categories). For instance, if a column has

A

A

B

B

B

C

then I would like to get back the value 3.

Thanks

1 ACCEPTED SOLUTION

Accepted Solutions
txnelson
Super User

Re: Formula for Number of Unique Categories in Column

Try this formula.......of course, swap out the :Flow for the column you want to count the categories for

Summarize( x = by( :FLOW ) );
N Items( x );

If the data table is really large, you may want to use this more efficient version

If( Row() == 1,
	Summarize( x = by( :Flow ) );
	y = N Items( x );
);
y;

 

Jim

View solution in original post

9 REPLIES 9
Martin
Level V

Re: Formula for Number of Unique Categories in Column

I think I remember the Summation formula being used for this, but my memory fails me.

txnelson
Super User

Re: Formula for Number of Unique Categories in Column

Try this formula.......of course, swap out the :Flow for the column you want to count the categories for

Summarize( x = by( :FLOW ) );
N Items( x );

If the data table is really large, you may want to use this more efficient version

If( Row() == 1,
	Summarize( x = by( :Flow ) );
	y = N Items( x );
);
y;

 

Jim
Martin
Level V

Re: Formula for Number of Unique Categories in Column

This is PERFECT! Thank you Jim!
aronwilson
Level II

Re: Formula for Number of Unique Categories in Column

I see how this would work for a script to create a column, but I've created a datatable that I update from a database and I'd like to have a column in that table with a formula to calculate the number of unique categories. Is there a way to do that directly in a column formula instead of though a script?

 

I found a hack using using Def() and Col Sum but it only works because i'm looking at a sorted field that's a unique id with numbrers. It doesn't work for actual text variables. Is there a better solution for that?

 

Here is the hacked formula:

Col Sum( Dif( :customer_id ) > 0 )

 

Thank you,

Aron

txnelson
Super User

Re: Formula for Number of Unique Categories in Column

The stated formula from my last response should do what you want.  

If( Row() == 1,
	Summarize( x = by( :Customer_ID) );
	y = N Items( x );
);
y;

Just cut and paste this into the formula editor

Jim
Martin
Level V

Re: Formula for Number of Unique Categories in Column

What I always forget, is that a column formula can actually be an entire mini script; with many semicolons and lines of code.

aronwilson
Level II

Re: Formula for Number of Unique Categories in Column

That worked great! I was thrown off by the red underline, I thought that meant it wasn't going to work.

image.png

 

I'm not actually sure I've ever tried pasting a formula directly in the column before, I thought I had to rebuild it using the explicit formula column functions... this is a huge improvement. Thank you so much for the quick reply!

 

Aron

Martin
Level V

Re: Formula for Number of Unique Categories in Column

Mr. Nelson,

 

Can you describe in detail what this function is doing? Not sure I understand the "by" function.

 

Thanks,

Martin

txnelson
Super User

Re: Formula for Number of Unique Categories in Column

"By" is a fairly global concept in JMP.  If you run the Distribution Platform, and specify a "By" variable it will generate a separate Distribution for each of the different values of the "By" columns.  The same is for the Summarize() function.  Here is a quote from the JSL Syntax Reference for the Summarize() function.

     Help==>Books==>JSL Syntax Reference

"Using a BY variable changes the output from single values for each statistic to a list of values for each group in the BY variable."

 

Good documentation along with an example can also be fount in the Scripting Index

     Help==>Scripting Index==>Functions==>Statistical==>Sumarize

or

     Help==>Scripting Index.........search on Summarize

Jim