turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- JMP User Community
- :
- Discussions
- :
- Prediction formula null values

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

Highlighted

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Feb 7, 2018 11:27 AM
(489 views)

Hello,

After saving my prediction formula from GenReg, I'm seeing about 80 percent of my dataset has null values (.) in the predicted values column. I suspect it has something to do with several "levels removed" in some of my significant predictors. Any advice on how to handle this or how to pivot from this in order to arrive at a higher yield (of predicted values) would be appreciated. Thank you :)

- Tags:
- regression

4 REPLIES

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Feb 20, 2018 4:41 AM
(381 views)

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Feb 20, 2018 4:44 AM
(379 views)

@samalar,

Can you share a reproducible example - so people can try and step through your workflow ?

You don't have to share any confidential data - you can either anonymize your data or use the sample data sets in JMP if possible.

Best

Uday

Uday

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Feb 20, 2018 5:12 AM
(375 views)

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Feb 20, 2018 6:15 AM
(359 views)

A categorical factor with *k* levels will require *k-1* parameter estimates. You have many levels for each of your categorial variables. That translates into a model with many parameters to estimate. You don't have enough data to estimate your model. There are only 98 observations in the training set and 43 in the validation set. There is no way to validate your model since you don't observations with each level of all of those categories. JMP does its best to provide a fit to the data, but you need more data. You should rethink what model you wish to fit and the format of your data.

Dan Obermiller