cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
agneshb
Level III

Creating a validation column in JMP PRO

is there a quick way or addins to create the validation column when using JMP PRO (and not to have to go through the steps in new colunm>missing data>random...).

I'm aware of the save validation in the neural platform, but it only creates training and validation category (I like to also have a test category).

Thanks!

Agnès

1 ACCEPTED SOLUTION

Accepted Solutions
yvesa_roy
Level II

Re: Creating a validation column in JMP PRO

Hi,

There should be no distinguishible pattern in the selection criterion. One of the simplest formulas uses the RandomCategory function of JMP, like this :

7476_RandCat.PNG

(Sorry for the decimal commas but I work in France). You will have to insert a new line in the function definition in the dialog box as, by default, the function proposes only two.

Yves

View solution in original post

7 REPLIES 7

Re: Creating a validation column in JMP PRO

When you create the new column for the validation status, you can use the Initialize Data option near the bottom of the dialog to select Random > Random Indicator. From here, I would then enter a large portion for 0 and 1-portion for 1. The portion is up to you, perhaps 0.25-0.5.

agneshb
Level III

Re: Creating a validation column in JMP PRO

Thanks Mark and it is the way I'm doing now, usually using 0.6 for the training, 0.2 for the validation and 0.2 for the test, but I have to recreate the column each time I create a new subset of data and I was wondering if there was a simpler way to generate the column or a script that can be saved.

julian
Community Manager Community Manager

Re: Creating a validation column in JMP PRO

Hi AgnesHB,

Maybe this will help? I used Sequence() to count from 1 to 5 (with step size of 1, and repeating each value 1 time), and placed the sequence in a match function to recode 1, 2, and 3 as 0 (for training), 4 as 1 (for validation) and 5 as 2 (for test), which will fit your .6, .2, .2 setup.

JSL:    Match(Sequence(1, 5, 1, 1), 1, 0, 2, 0, 3, 0, 4, 1, 5, 2)

7454_Screen Shot 2014-10-20 at 11.38.48 PM.png

I hope this helps!

Julian

julian
Community Manager Community Manager

Re: Creating a validation column in JMP PRO

Alternatively, you could use a random integer function rather than sequence, but using sequence will ensure proportions more close to what you define (since a sequence is predictable and a random integer, well, isn't). But, the above could be seriously problematic if, for some reason, there is something systematically biased across the sequence (like every 5th observation being different in some way due to the measurement system, which would mean ALL your "test" data would be of that type). Probably unlikely, but not great. A quick random shuffle of rows would solve that problem.

Julian

yvesa_roy
Level II

Re: Creating a validation column in JMP PRO

Hi,

There should be no distinguishible pattern in the selection criterion. One of the simplest formulas uses the RandomCategory function of JMP, like this :

7476_RandCat.PNG

(Sorry for the decimal commas but I work in France). You will have to insert a new line in the function definition in the dialog box as, by default, the function proposes only two.

Yves

julian
Community Manager Community Manager

Re: Creating a validation column in JMP PRO

I wasn't aware of Random Category() until now; that's a much more elegant solution!

Julian

gianpaolo
Level IV

Re: Creating a validation column in JMP PRO

Another Alternative:

 

 

Names Default To Here( 1 );
dt = Open( "$SAMPLE_DATA/Big Class.jmp" );
dt << Make Validation Column(
	Training Set( 0.75 ),
	Validation Set( 0.25 ),
	Validation Column Type( Formula ),
	Go
);

 

ciao

Gianpaolo

Gianpaolo Polsinelli