turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- JMP User Community
- :
- Discussions
- :
- Discussions
- :
- Principal component analysis questions

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 1, 2015 2:47 PM
(1724 views)

Hi, I'm fairly new to PCA. I've read up on it a bit and watched several YouTube lectures on the subject. I think I have a so-so handle on it. I understand the multi-dimensional orthogonal nature of it, that I can use it for variable reduction and categorizing, and it is looking for linear relationships.

I'm more curious about finding trends with parameters of interest using JMP. For example, if I've got 1 or 2 parameters of interest (Y, response), and 10 or 1000 other variables (X, factor), and am looking for a trend, I might run a script to calculate Rsquare of Y1 for all X, and Y2 for all X, and only list or plot those with an Rsquare > say 0.8, or just use the native y by x platform and plot all Y by all X. Either method works well because they are always focusing on my chosen responses, but there can be a lot to sift through.

Can PCA help here? I realize I can just throw my Y1, Y2 as well as all of the X's in the analysis. I'm assuming I understand the interpretation of the output plots, but is there anyway to have JMP focus on parameters of interest? With 1000 parameters, the output plots are information-dense and parameter names don't seem to be highlighted / un-highlighted when selecting columns. Also, please let me know if I'm way off track here with what I'm trying to do with PCA.

10 REPLIES

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 1, 2015 3:42 PM
(1358 views)

You might want to try the new modeling utility in JMP12 to screen your x's to narrow down to a smaller subset before going to PCA.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 2, 2015 1:29 PM
(1358 views)

Thanks Karen, that looks promising. I'll see if I can install JMP 12.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 2, 2015 12:58 PM
(1358 views)

The key word in your original post I'm focusing in on is 'trends'...to me this implies a time series element to your evaluation of both x and y. Is this the case? Perhaps a multivariate time series modeling approach is called for?

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 2, 2015 1:39 PM
(1358 views)

Hi Peter,

I think maybe "relationships" would have been a better word choice on my part. My analyses aren't typically in the time domain. It is semiconductor electrical test data, and sometimes we may have a parameter that we want to understand more about it (for example if the distribution is shifted or skewed for a production lot or new product, or even just a few units in a lot). In this case I might want to check it against other parameters to see if the distribution or units of interest correlate to anything else that may shed light on it.

The general 'multivariate' platform is nice. It's been a long time since I've used it, but I just ran it again and I will add it back into my typical toolbox. It does get a little hairy with more than around 15 parameters. I have a multivariate report on my screen now with 20 parameters. It works but there's a fair amount of scrolling... maybe I should ask for a larger screen. :-)

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 2, 2015 1:51 PM
(1358 views)

Have you tried the color map in the multivariate analysis when you have more than 15 parameters?

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 2, 2015 3:39 PM
(1358 views)

Have you tried the color map in the multivariate analysis when you have more than 15 parameters?

Thanks, I didn't know that was available! It definitely condenses the results.

-Mike

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 3, 2015 6:13 AM
(1358 views)

Mike,

No problem,

Yes ever since we added that feature it is my "go-to" visual. My brain seems to assimilate the data more readily.

Lou V

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 3, 2015 6:12 AM
(1358 views)

When you are checking one lot against others isn't there a time component to that sort of evaluation? From your second post it sounds like you might also be looking for multivariate outliers...the Outlier analysis sub platform under the Multivariate -> Correlations has a couple outlier analysis techniques there. Also if you are looking to build a model, perhaps a PLS or Generalized Regression and Model Comparison approach might yield some insights? You need JMP Pro for Generalized Regression and Model Comparison...but these are just ideas...

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jun 4, 2015 3:30 PM
(1358 views)

When you are checking one lot against others isn't there a time component to that sort of evaluation?

That is true for some of the analyses I do... either lot by lot in time, even unit by unit sequentially within a lot could be thought of that way as well, although I normally don't treat it that way. You mentioned multivariate time series earlier. Is there a particular platform you had in mind? I don't have JMP Pro and I didn't see something like this.

From your second post it sounds like you might also be looking for multivariate outliers...the Outlier analysis sub platform under the Multivariate -> Correlations has a couple outlier analysis techniques there.

Thanks. I hadn't noticed these but I will look into them.

-Mike