I'm trying to do permutation tests and Monte Carlo Cross Validations with the Iris sample dataset as a MWE for our dataset. I'm not sure if I'm doing things and interpreting output correctly.
1) Create New Formula Column (Random->Sample Without Replacement) for the Species Column for Permutation Test.
2) Create a Validation Column (0.75/0.25 split) and a new Formula Column (Random->Sample Without Replacement) using this for Monte Carlo Cross Validation
data:image/s3,"s3://crabby-images/cc65b/cc65bc3459dd874d5a859b0c39245cfc52967ba8" alt="mjmg_0-1638371838261.png mjmg_0-1638371838261.png"
3) Run the Discriminant Platform with the Validation column and display the ROC curves. Use the Simulate Platform on the Area of the ROC Curve
data:image/s3,"s3://crabby-images/17240/17240dd8c99121f238f422144d65f4d844d1d766" alt="mjmg_3-1638372772720.png mjmg_3-1638372772720.png"
4.1) For the Permutation Test-Select Species as column to switch out and Shuffle[Species] as column to Switch In. Enter the desired number of random sampling and random seed and run the simulation.
data:image/s3,"s3://crabby-images/9af59/9af59bd91411d8111a10888039d64208574fcdb0" alt="mjmg_1-1638372281372.png mjmg_1-1638372281372.png"
4.2) View the Distributions script and take note of the empirical p-value.
data:image/s3,"s3://crabby-images/ed99a/ed99a2f91713580ae09c8be6eaa433fd2a22cc56" alt="mjmg_2-1638372528310.png mjmg_2-1638372528310.png"
5.1) For the Monte Carlo Cross Validation-Select Validation as column to switch out and Shuffle[Validation] as column to Switch In. Enter the desired number of random sampling and random seed and run the simulation.
data:image/s3,"s3://crabby-images/4dfba/4dfba6aa1893f2c6d5229fdf1dcecb8357f92c22" alt="mjmg_4-1638372900805.png mjmg_4-1638372900805.png"
5.2) View the Distributions script and take note of the empirical p-value.
data:image/s3,"s3://crabby-images/3845f/3845fa5fa43a4fd67827d926a97fd480913f43a8" alt="mjmg_5-1638373060746.png mjmg_5-1638373060746.png"
6) Assuming what I did is correct (testing the Area of ROC curve), the low p-value from the Permutation test is expected, but what about the large p-value for the Monte Carlo cross validation? How should this be interpreted?
Thanks