cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
AnastasiaMan
Level I

Validation for Logistic regression model JMP 15

I saw many comments on how to run a validation for a jmp pro version, that unfortunately I don't have.

In jmp 15, when I open the Fit Model, there is no box to add the validation column.

 

So, what I did was to create manually a validation column, with 60% data for training, 20% for validation and 20% for testing.

I build the model based on the training dataset, and I can see the predictions for the validation and testing set.

 

How can I generate the R^2 and other fit details for the validation dataset on the model that is build on the training set? 

3 REPLIES 3
dale_lehman
Level VII

Re: Validation for Logistic regression model JMP 15

I am assuming you excluded the test data when fitting the model (I'm not sure what you did about the validation data - if you excluded that as well, then the test data is redundant - otherwise, I'm not sure what purpose the validation data served in your analysis).  Unexclude all the rows and use Fit Y by X, with Y as the predictions and X as the actual values - and put Validation in the By box.  You will get separate plots (and R-squared) for each subset.

AnastasiaMan
Level I

Re: Validation for Logistic regression model JMP 15

Hi and thank you, it does work.

But does it calculate 3 different models based on each subset?

What I initially did, was to use only the training set for building the model, and as I have more than one possible models, I want to fit them in the validation data to choose the best one. So, I need to apply the same model built on the training dataset, to the validation dataset and compare which fits the best.

 

Zhiwu
Level I

Re: Validation for Logistic regression model JMP 15

You can first fit your model using training dataset, the save prediction formula for each of models. Then using fit Y by X with your Y value in validation set and test set and X value is prediction formula from you models. Then by doing fit line or fit special with 45 degree line you will get all statistics number in output.