Building Predictive Models for Spectral Data
Created:
Jul 30, 2020 2:54 PM
 Last Modified: Jul 31, 2020 12:01 PM
How it's Done:
 Examine data using Graph Builder to get idea of what different spectra look like

Use Multivariate analysis to examine all wave lengths and resulting Correlation Coefficients to confirm multicolinearity
 Use ModelDriven Multivariate Control Charts to examine all wave lengths variables and drill into runs that are out of control
 ID how many PC you need to build model
 Take spectra of the desired samples. No need for output (Y) information at this point.
 Identify most prominent dimensions in the spectra by Row with Functional Principal Components, f(PC’s), from Functional Data Explorer – not for use in the traditional sense
 Use Functional Principal Component Profiler to get an idea how your spectra are changing as variable of interest changes
 Save Functional Principal Component scores to data table to use in experimental design
 Create an experimental design using the f(Principal Components) as factors (covariates)
 Run the experiment – gather the output of interest
 Model the results via PLS, and/or Generalized Regression (or other methods able to handle correlated factors)
 Determine the overall optimum solution
 Use this sustainable model to determine the “outcome” for all future sample* the model will hold true for samples analyzed using the same calibrated instrument)
 11. Use Score plot to examine Categorial Data
Summary:
 Compress the available information regarding spectral wavelengths or mass of different options via functional principal components
 Use covariate DOE to select the “corners of the box” for testing representative sample spectra based on Design of Experiments
 Model the data via PLS – Generalized Regression is an excellent alternative option or you may need to use more sophisticated techniques including Neural Nets
 Find the overall optimum solution
 As new samples are tested the spectral data can be input into the data table to determine level of active or desired component.
 Highly efficient experimentation
 Sustainable empirical model based on spectral data/wavelengths