Video 1: Various ways to create models: model building, comparison and selection
Comments:
@Victor_G - Some visualization and propositions for model comparison and selection (raster plots) proposed by Goos : https://community.jmp.com/t5/JMP-Wish-List/Raster-plots-or-other-visualization-tools-to-help-model/i...
@martindemel - bootstrapping is what I thought of when saying tree based models, sorry fo rbeeing unprecise
SVEM - Self Validating Ensemble Models:
Discovery Summit Talk - SVEM: A Paradigm Shift in Design and Analysis of Experiments (2021-EU-45MP-779) - JMP User Community
Support page - https://www.jmp.com/support/help/en/18.1/index.shtml#page/jmp/overview-of-selfvalidated-ensemble-mod...
@Jonas_Rinne - SVEM might be a method which uses some kind of validation without violating the design structure
@Victor_G - Or use statistical analysis, or Machine Learning algorithms robust to overfitting (random forests, SVM, ...)
Replicates are more used for explainability (sort the random error/variation/statistical error from the factors influences), I wouldn't use them for predictive modeling, particularly for Machine Learning modelling
Video 2: Mixture design with constraints
Comments:
@Bill_Worley - Mixture models can be highly non-linear and it may be good to start off with a space-filling mixture doe to try and better understand the non-linearity.
@Victor_G Or do it sequentially : classical mixture design up to 2nd/3rd order, and then augment with Space-Filling to have points inside the experimental space.
Video 3: Number of trials : can we use the minimal advised by JMP in Optimal Design?
Comments:
@Bill_Worley - Try Augment Designs
@Paul_J - This makes me wonder ... How does JMP decide the recommended number of runs? ... Could that give us an idea of why to use something more than the minimum?
@Victor_G - Very often, 4 extra runs than minimum number of runs
Design Generation
@Jonas_Rinne -
It also depends on your choices. If you have chosen some terms as "if possible" it will try to build these effects in with the added runs coming from the minimum. Otherwise it will add replicates for instance. JMP is trying to distribute the additional runs as best as possible based on your choices.
@martindemel - consider that you will get also information about the error estimate
@Victor_G -
Not only correct model, but you can evaluate "natural" variability (random noise)
Without extra runs, you fit a model with the assumption that you have no experimental or measurmeent noise !
Completely agree with Brian, it all comes with a price / compromise
Video 4: Resources