cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
jlrouquette
Level III

Prediction Interval for New Observations Not in the Data Set

I’ve built a linear regression model and been asked to create a tool in excel, using the prediction expression, for a team of stakeholders to use.  The will apply the tool to new observations not seen in the original dataset.  I’d like to include a prediction interval but I’m not sure how to use the JMP outputs to give a prediction interval on data not already seen in the model.       

 

1 ACCEPTED SOLUTION

Accepted Solutions
dale_lehman
Level VII

Re: Prediction Interval for New Observations Not in the Data Set

The red arrow on the regression output will permit you to save the predicted values and either a mean or individual confidence interval - which to use depends on whether the predictions will be for individual observations or an average of many observations with those characteristics (note that the difference between the two is likely to be very large, so be careful deciding which is appropriate).  However, it is also important to note that these predictions and intervals only apply to new observations drawn from the same population.  Usually, the new observations are from a later time (such as the present or future) than from the past (where the data came from).  So, there is always the question of whether the sample you based your analysis on (even if it was the population at a point in time) is still representative of the population about which the new predictions are to be made.  There is really nothing to be done about making predictions about a changed population unless you can specify the ways in which it may have changed.

View solution in original post

2 REPLIES 2
dale_lehman
Level VII

Re: Prediction Interval for New Observations Not in the Data Set

The red arrow on the regression output will permit you to save the predicted values and either a mean or individual confidence interval - which to use depends on whether the predictions will be for individual observations or an average of many observations with those characteristics (note that the difference between the two is likely to be very large, so be careful deciding which is appropriate).  However, it is also important to note that these predictions and intervals only apply to new observations drawn from the same population.  Usually, the new observations are from a later time (such as the present or future) than from the past (where the data came from).  So, there is always the question of whether the sample you based your analysis on (even if it was the population at a point in time) is still representative of the population about which the new predictions are to be made.  There is really nothing to be done about making predictions about a changed population unless you can specify the ways in which it may have changed.

jlrouquette
Level III

Re: Prediction Interval for New Observations Not in the Data Set

Perfect.  Thanks, Dale!  I hadn't thought about noting that we can only apply the data to observations from the same population.