Hello everyone!
I'm reaching out to see if someone can guide me through the basic steps for performing PLS-DA.
Below is an example of the raw data table I'm using in order to construct the classification method. I'm working with spectral FTIR data, and my objective with the classification method is to see if I can classify samples according to the treatment they have received.
When I proceed with the model, I invariably get that 0 factors are the minimizing number of factors, and hence I cannot classify anything.
I thought this is puzzling, since I've already performed curve fitting analysis of the spectra in question, and with that analysis, there are clear differences between the samples treated and those untreated. Furthermore, I've graphed the treated and untreated spectra and there are clear visual differences... hence, I suspect that I'm doing something wrong.
My questions are:
For the classification variables, I'm using binary code, 0 when samples didn't receive that treatment and 1 for those that did. Is this ok, or should another setting be used?
Any other suggestions on how to improve the classification performance of this dataset? (I forgot to say that all spectra has been SNV normalized previous to the analysis to account for noise).
Thanks in advance! Any suggestion would be greatly appreciated
