JMP > Identification of predictive signature from large data set > Issue with obvious overfitting
Hi JMP community, I have a large data set with ~4,000 biomarker measurements taken at baseline in patients enrolled in a clinical trial. I need to identify which combination of these biomarkers may predict the clinical outcome 12 weeks later. I have used a rather naive approach that appears to yield grossly overfitted results: Run all 4,000 biomarkers through the Screening > Response Screening...
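To illustrate why screening this many candidates can look overfitted, here is a minimal sketch in plain Python (not JSL, and not the Response Screening platform itself). It screens 4,000 pure-noise "biomarkers" against a pure-noise outcome and counts how many pass a nominal p < 0.05 correlation cutoff. The number of patients (50), the continuous outcome, and the function names are assumptions for illustration only; none of them come from the actual trial data.

```python
import random
import statistics


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def screen_noise(n_patients=50, n_biomarkers=4000, seed=0):
    """Return |r| between a random outcome and each random biomarker.

    Every biomarker is generated independently of the outcome, so any
    apparent association is a false positive by construction.
    n_patients=50 is an assumed sample size, not taken from the post.
    """
    rng = random.Random(seed)
    outcome = [rng.gauss(0, 1) for _ in range(n_patients)]
    corrs = []
    for _ in range(n_biomarkers):
        biomarker = [rng.gauss(0, 1) for _ in range(n_patients)]
        corrs.append(abs(pearson(biomarker, outcome)))
    return corrs


corrs = screen_noise()

# Approximate |r| needed for two-sided p < 0.05 with n = 50 (df = 48).
r_crit = 0.279
n_hits = sum(r > r_crit for r in corrs)

print(f"noise biomarkers passing p < 0.05: {n_hits} of {len(corrs)}")
print(f"largest |r| among pure-noise biomarkers: {max(corrs):.2f}")
```

Under these assumptions about 5% of the noise biomarkers are expected to pass the unadjusted cutoff, so a model built from the screened "hits" and evaluated on the same patients can look predictive even when nothing is. That is the pattern a multiplicity adjustment or a held-out validation set is meant to catch.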