Hello,
I have several bioreactor data. Each bioreactor has an ID (1 to 5) and several time-course profile variables (Dissolved oxygen, pH...). I have 20 variables for the time-course data, and 10 timepoints.
When I do a PCA analysis, using the 20 variables as Y columns, I obtain a score plot of the bioreactor data, which is good. Meaning for each bioreactor I get one time-course profile considering the 20 variables, with 10 datapoints per batch. But I would also like to obtain a "Batch ID PCA", in which the data for each bioreactor would be condensed into one datapoint only, and I could identify which bioreactors are similar to each other. SO I would like to get a PCA plot with only 5 datapoints, in which these datapoints would consider the (20*10 data for each bioreactor)
The closer I have been to achieve this was through the K-means cluster platform.
Any help? Preferentially using JMP menus rather than scripting.
I know that one alternative would be to split my matrix. Because the reason I get a time-course data in my PCA analysis is because I have my data table as column 1= ID; Column 2=time; Columns 3-20=Variables.
If I could make only one row to each batch, the PCA analysis would give me what I want. I have tried to split my data, but I have been unsuccessful. If I could convert my data table to column 1= ID; column 2-201 - datapoints, I believe the PCA analysis would give me what I want.
How do I convert my current data table into the 200 new columns? (21 first columns would be time 1, values for the 20 variables for time 1; 21 next columns would be time 2 and the values for the 20 variables for timepoint 2, and so on).
Many thanks