Dear colleagues,
Currently, I am studying at "Statistical Thinking for Industrial Problem Solving" course and during the Quiz a question was set with four possible answers. I am afraid that the answer accepted by the quiz interface is correct.
The question is:
In predictive modeling, why should you use model validation?
Select one:
- Use it to make sure that you have fit the correct model.
- Use it to make sure that your model generalizes well to new data.
- Use it to make sure that you can identify the most significant variable.
- You don’t need to use model validation in predictive modeling.
I selected 1, as validation is required to understand if a model is adequate (no overfit, for example) before introduction new, test data for the final decision. However, the right answer is 2. It looks quite doubtful because 2 is related more to test data, which is the next step after validation.
Please, correct me, if I am wrong and give a line of clarification to me.
Thanks!
Reaching New Frontiers