- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Cut dataset
Hello,
I would like to cut my 70% vs 30% dataset to have a learning dataset and a test dataset for my statistical models
I use this method:
And I change the 0.7 in 0.3 to have 30%
However when I do that, the dataset does not separate into two parts: 70% vs 30%, it takes 70% then 30%
This means that in the dataset 30% there may be data from the dataset containing 70%
Is there a way to cut into two parts the dataset: 70 vs 30% without having the same type of data in both parts?
Thank you
Accepted Solutions
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Cut dataset
Create a new column using formula:
Random Binomial(1, 0.3), it will give you 30% "1' and 70% "0".
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Cut dataset
You need to use the "Make Validation Column"
Analyze=>Predictive Modeling=>Make Validation Column
This will give you a new column that if you need to, you can subset the data table into 2 different tables.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Cut dataset
Hello,
I use the JMP version 16.1.0 and I can't find in the "Analyze" menu the "make validation column"
Thank you
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Cut dataset
Create a new column using formula:
Random Binomial(1, 0.3), it will give you 30% "1' and 70% "0".
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Report Inappropriate Content
Re: Cut dataset
It works very well !!
Thank you