PCA Model Validation Techniques

Report Inappropriate Content · Nov 6, 2017 04:53 AM

PCA can be used to build a model that describes a set of observations. Using the multivariate platform it is possible to perform jacknife and T2 outlier analysis to check if any of the observations are extreme relative to the multivariate mean.

In what situation might a weights column be used to build the model?

If you use a PCA model how do you maintain it?

If I have a new observation(s), I can apply the saved principle components & T2 to assess if it is consistent with my model or not.

Are there any methods in JMP to assess how robust the PCA model is? I was thinking for example of creating a script which deletes 10 % of cell values at random from each column and then uses impute missing to recover them from the covariance matrix. By repeating this many times and comparing the eigenvalue table and the imputed versus actual could give useful infomation about the model robustness.

Byron_JMP · Nov 13, 2017 04:48 PM

I like where you're going with this idea.

First, Weight and Frequency, I'm pretty sure they do the same thing in Principal components. Weigh can use fractions where frequency uses integers.

It looks like a lot of the functionality you're thinking of is pretty easy to script.

Names Default To Here( 1 );
dt = Open( "$SAMPLE_DATA/Tiretread.jmp" );
obj = dt<<Multivariate(
	Y( :ABRASION, :MODULUS, :ELONG, :HARDNESS ),
	Estimation Method( "Robust" )
);
obj << T Square( 1 , "Save T Square");
obj<<Close Window;


r=dt << Select Where( :name("T Square") >= Col Quantile( :name("T Square"), 0.9 )  );
r<<exclude;

JMP Systems Engineer, Health and Life Sciences (Pharma)

Bill_Worley · Nov 14, 2017 08:22 AM

DModX might be another option for you.

From the Multivariate Methods book in Help From the Multivariate Methods book in HelpSave the DModX formula to your data table as you would other formula columns.

Peter_Bartell · Nov 14, 2017 09:53 AM

Another thought wrt to PCA validation is to create a validation column ala JMP Pro's "Make Validation Column" capability. Here's the link to the JMP online documentation describing this utility:

http://www.jmp.com/support/help/13-2/Make_Validation_Column_Utility.shtml#

If you're not running JMP Pro, you can create one by using the workflow shown by my colleague @julian in this video:

https://www.youtube.com/watch?v=M5_mECc4NAg

Then use the validation column as a By variable. Then you can compare the various PCA visualizations and statistics for reasonableness and such.

PCA Model Validation Techniques

Re: PCA Model Validation Techniques

Re: PCA Model Validation Techniques

Re: PCA Model Validation Techniques

Recommended Articles

Get Going with JMP: Essentials for Using JMP

Multiple-Group Analysis in Structural Equation Modeling

Manage Limits for JMP 17