I have datasets with many response variables (~15K) and many samples (~7K). Based on prior knowledge, I have grouped the samples into 10 clusters, and I need to identify the variables that are characteristic of each cluster. I want to generate a table/matrix of variables vs. clusters that gives, for each pair, the fold change (ratio of mean response values, each cluster vs. all other samples combined), along with P values (corrected or not) and, if possible, the FDR.
I am hoping there is a simple way to generate this. ANOVA or negative binomial options would be helpful.
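To make the desired output concrete, here is a minimal sketch of the kind of table I am after, written in Python with NumPy and SciPy. It is only an illustration under assumptions: the data are taken to be a samples x variables array, the function names `one_vs_rest_table` and `bh_fdr` and the `pseudocount` parameter are hypothetical, and a per-variable Welch t-test stands in for the ANOVA or negative binomial model mentioned above (for raw count data a negative binomial model would likely be more appropriate). The FDR column uses the Benjamini-Hochberg adjustment.

```python
import numpy as np
from scipy import stats


def bh_fdr(pvals):
    """Benjamini-Hochberg adjusted p-values for a 1-D array of p-values."""
    p = np.asarray(pvals, dtype=float)
    n = p.size
    order = np.argsort(p)
    # Scale each sorted p-value by n / rank.
    ranked = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working down from the largest p-value.
    ranked = np.minimum.accumulate(ranked[::-1])[::-1]
    adjusted = np.empty(n)
    adjusted[order] = np.clip(ranked, 0.0, 1.0)
    return adjusted


def one_vs_rest_table(X, labels, pseudocount=1e-9):
    """Compare each cluster against all other samples, per variable.

    X           : array of shape (n_samples, n_variables)
    labels      : cluster id for each sample, length n_samples
    pseudocount : small constant (an assumption) to avoid division by zero

    Returns {cluster: {"fold_change", "p_value", "fdr"}}, where each value
    is an array with one entry per variable.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    out = {}
    for c in np.unique(labels):
        inside = labels == c
        mean_in = X[inside].mean(axis=0)
        mean_out = X[~inside].mean(axis=0)
        fold = (mean_in + pseudocount) / (mean_out + pseudocount)
        # Welch t-test (unequal variances), one test per variable (column).
        _, p = stats.ttest_ind(X[inside], X[~inside], axis=0, equal_var=False)
        # Constant variables give NaN p-values; adjust only the finite ones.
        finite = np.isfinite(p)
        fdr = np.full(p.shape, np.nan)
        fdr[finite] = bh_fdr(p[finite])
        out[c.item()] = {"fold_change": fold, "p_value": p, "fdr": fdr}
    return out
```

Calling `one_vs_rest_table(X, labels)` returns, for each cluster, three arrays of length equal to the number of variables, which can be stacked into the variables vs. clusters matrix described above. The FDR here is corrected within each cluster separately; correcting across all clusters at once would be a different design choice.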