cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
  • Learn how to build custom Python data connectors and further customize JMP’s Data Connector Framework with the Python Data Connector Demo, available now in the JMP Marketplace!
  • See how to move from signal modeling to system modeling at the first JMP Aerospace Analytics webinar. Register. June 18, 1 p.m. US Eastern Time.

Discussions

Solve problems, and share tips and tricks with other JMP users.
Choose Language Hide Translation Bar
craigwb
Level I

How to organize multidimensional spectral data for partial least squares (PLS)?

I have a large data set that has multiple independent and dependent variables.  For smaller systems, I have organized the data such that the independent variables are defined in the Column Name.  However, for much larger sets, where more than a few Xs and multiple Ys are involved, this becomes an extremely tedious exercise. Moreover, it makes for very wide, unwieldy tables and is hard to visualize graphically.

Does anyone in the community have any suggestions on a more straightforward method of organizing large spectral data sets for PLS?

Thanks.

2 REPLIES 2
Jeff_Perkinson
Community Manager Community Manager

Re: How to organize multidimensional spectral data for partial least squares (PLS)?

I'm not sure what kind of suggestions you're looking for but perhaps creating column groups could be helpful.

Select a set of columns and choose Col -> Group Columns.


-Jeff

-Jeff
craigwb
Level I

Re: How to organize multidimensional spectral data for partial least squares (PLS)?

Jeff,

Thanks for the response.  I know about organizing data via column groups;  what I'm looking for is a more organized way to input multi-spectral input to PLS. For PLS input, I currently convert all the independent variables to columns using 'split' operations and then have the dependent variable(s) as the column value. It would be great if PLS could accept x,y,z, ... as values in their respective columns, and then have the response (dependent) variables as separate columns.

As for this kind of data being hard to visualize, it would be great to (at least) have a graphics capability which would allow topographical maps to be generated from 3D data.  3D scatterplot is just okay for this kind of visualization.

Recommended Articles