cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Choose Language Hide Translation Bar

Top 10 Values in Many Columns

I have a data table with a few thousand columns. I am looking to find the top "X" number of values in those columns, by another variable. For instance, I want to know the columns with the 10 highest values, when another column has a value of "Large Cap Stocks". 

3 REPLIES 3
jthi
Super User

Re: Top 10 Values in Many Columns

Without having any idea of your data it might be easiest to first stack your data and then pick the top 10 values.

-Jarmo

Re: Top 10 Values in Many Columns

I attached an example of a small set of my data. 

Re: Top 10 Values in Many Columns

Here's a way to do it with just one column. You could put this in a loop for each of your columns. This post has more background.

/*JMP STATISTICAL DISCOVERY LLC (“JMP”) PERMITS THE USE OF THIS COMPUTER SOFTWARE CODE (“CODE”) ON AN AS-IS BASIS AND AUTHORIZES YOU TO USE THE CODE SUBJECT TO THE TERMS LISTED HEREIN.  BY USING THE CODE, YOU AGREE TO THESE TERMS.  YOUR USE OF THE CODE IS AT YOUR OWN RISK.  JMP MAKES NO REPRESENTATION OR WARRANTY, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO, WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, NONINFRIGEMENT, AND TITLE, WITH RESPECT TO THE CODE.
You may use the Code solely as part of a software product you currently have licensed from JMP, JMP’s parent company SAS Institute Inc. (“SAS”), or one of JMP’s or SAS’s subsidiaries or authorized agents (the “Software”), and not for any other purpose.  The Code is designed to add functionality to the Software but has not necessarily been tested.  Accordingly, JMP makes no representation or warranty that the Code will operate error-free.  JMP is under no obligation to maintain, support, or continue to distribute the Code.
Neither JMP nor its licensors shall be liable to you or any third-party for any general, special, direct, indirect, consequential, incidental, or other damages whatsoever arising out of or related to your use or inability to use the Code, even if JMP has been advised of the possibility of such damages.  Except as otherwise provided above, the Code is governed by the same agreement that governs the Software.  If you do not have an existing agreement with JMP or SAS governing the Software, you may not use the Code.
JMP and all other JMP Statistical Discovery LLC product or service names are registered trademarks or trademarks of JMP Statistical Discovery LLC in the USA and other countries.  ® indicates USA registration.  Other brand and product names are registered trademarks or trademarks of their respective companies.
*/
//init
n = N Rows();
lstCategories = Associative Array( :Stock Type << Get Values ) << Get Keys;
lstRows = {};
dt = Current Data Table();
dt << Clear Column Selection;
dt << Clear Select;

//loop through each category for first column; save top 10 for each category into lstRows
For Each( {i}, lstCategories,
	dt << Select Where( :Stock Type != i );
	dt << Hide and Exclude;
	dt << Clear Select;
	dt << select rows( Loc( Ranking( :"Value 1"n[1 :: n] ) >= n - 10 ) );
	t = As List( dt << Get Selected Rows );
	Show( i );
	Show( t );
	dt << Clear Row States;
	Insert Into( lstRows, t );
);

//do something with the rows
dt << Select Rows( lstRows );