Predictor explainer automates the screening of process variables using feature engineering and machine learning (known as AutoML). Parallel coordinate plots and trends will be automatically shown to interpret the results. If PyJMP is installed (optional), SHAP plots and UMAP will be automatically calculated as well.
For further details and applications of ML applied to industrial processes, you can have a look at our open-access review: https://pubs.rsc.org/en/content/articlelanding/2022/re/d1re00541c
1 – Download and installation
Download and click the .jmpaddin file to install the latest version of Predictor Explainer from:
https://github.com/industrial-data/predictor-explainer
If you have a previous version installed, it will be automatically removed.
PyJMP (Python for JMP) can be installed optionally.
https://github.com/industrial-data/pyjmp
2 – Distillation column (continuous process)
In this example will be illustrate how to screen tags (sensor data) to quickly identify correlated factors to the yield in a distillation column.
Go to the Add-ins menu and open Predictor Explainer.
Predictor explainer contains example files. To open the folder, click in the button on the bottom left corner (see A in image below). The following steps are using the file ‘distillation_column_na.jmp’.
(A) The add-in folder contains other example files and a python code
(B) If pyJMP is installed, additional options for SHAP and UMAP will be shown.
The main results are:
A hidden and temporal table with all the pre-selected predictors, target and time variables can be accessed via JMP home. The original analysis table won’t be modified.
If PyJMP is installed and the option to show SHAP plots is activated, an interactive violin plot will appear after the analysis.
Additional hidden tables containing SHAP, UMAP and clustering results will be accessible via the home menu.
3 – Batch data analysis
Predictor Explainer can also be used to screen sensors measuring batch processes. The file named ‘Fermentation_Batch_Data.jmp’ illustrates the challenge of combining unique values coming from a lab analysis with process data.
Predictor explainer will first create a table with summary statistics (also called fingerprints or landmarks).
If there is information about product (grade) or phase (stage) of the batch, these will be also used to generate more granular summary statistics. Using the noise contribution as a filter will eliminate all calculated features below it. When PhaseID is introduced as numeric and ordinal column (1, 2, 3…), an automatic aligning of the batch will be performed and shown.
In the example, Predictor Explainer identified the strongest sensor (tag).
Creating your own version of the add-in
If you want to distribute or modify a new version of the JMP addin, there are two important things to consider.
As well as add the Notebook and other files you want to distribute with the add-in.
In case you have suggestions, errors, or success stories (!) contact us
https://github.com/Industrial-data/predictor-explainer
Authors: Francisco Navarro, Carlos Perez-Galvan, Juline Gilliard
License: BSD-Clause 3
Code: github.com/industrial-data/predictor-explainer
Latest Version: 2022-09-12 (JMP 16)