cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
The Discovery Summit 2025 Call for Content is open! Submit an abstract today to present at our premier analytics conference.
Get the free JMP Student Edition for qualified students and instructors at degree granting institutions.
Choose Language Hide Translation Bar
View Original Published Thread

Assessing for colinearity for categorical variables and/or binary outcome

gstins
Level I
I am seeking to assess for colinearity among categorical variables to be included in a multi variable nominal logistic regression. I understand that VIF is useful for this, but I’m unable to find VIF for categorical variables or a binary outcome. Other methods have been less initiative like a LASSO. Is there a way to, either with or without LASSO, to evaluate for colinearity? Thanks!
2 REPLIES 2


Re: Assessing for colinearity for categorical variables and/or binary outcome

From the Fit Model dialog, have you tried switching to Generalized Regression? From there you can say the response has a binomial distribution. This should give you a fit that is very similar to the nominal logistic regression and it offers the ability to look at the Correlation of Estimates table. You can use that to understand the collinearity. Otherwise you can study the collinearity of the main effects by using Multivariate Methods > Multivariate.

Dan Obermiller
Victor_G
Super User


Re: Assessing for colinearity for categorical variables and/or binary outcome

Hi @gstins @gstins,


Welcome in the Community !

 

In addition to the solution proposed by @Dan_Obermiller, one possible "trick" to evaluate collinearity using VIF values is to create a random "dummy" numeric column, fit a multivariate regression model, and display VIF values of the model's variables. See How to examine VIF in a generalized regression report?  and the response from @cwillden.
VIF are calculated only using values from input data/factors, (see equation here : Parameter Estimates for Original Predictors), so no matter the response type, you can always calculate and analyze VIF values.

 

Hope this workaround may help you,

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)