cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
kjwx109
Level II

Preprocessing ahead of Predictor Screening analysis

Hello,

I have some characterisation data that I would like to subject to predictor screening.  The variables are on different scales.  Is any preprocessing like standardisation required prior to the predictor screening?

2 REPLIES 2
Byron_JMP
Staff

Re: Preprocessing ahead of Predictor Screening analysis

Tree methods, like predictor screening, partition, bootstrap forest, and boosted trees (and XGBoost) don't need pre processing.  They work well on dirty data too (missing, outliers, strong X correlation with Y...

JMP Systems Engineer, Health and Life Sciences (Pharma)
Victor_G
Super User

Re: Preprocessing ahead of Predictor Screening analysis

Hi @kjwx109,

 

As @Byron_JMP mentioned, the Predictor Screening platform is based on a Random Forest, which is a tree-based method robust to outliers, collinearity between variables, scales of the variables...
You can find more infos here : Predictor Screening 

 

If you need further infos or if you have more questions, don't hesitate

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)