Hello,
I am running a PCA on an NMR dataset that contains some missing values. This isn't due to an error—it's simply the nature of the data, as some samples do not show certain peaks.
I have noticed that when I include these parameters as Y variables in the PCA and color the score plot by sample, some samples are missing entirely from the plot. It's not consistent: some samples with missing values are included, and others are not.
Has anyone experienced something similar? Why would certain samples be excluded from the PCA score plot while other samples that also have missing data are included? Is there any way to control which samples are included?
Thanks in advance for any insights!