cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
KD
KD
Level II

Transforming skewed count data for analysis

Hi Everyone,

I have some heavily skewed count data that I'd like to use to build a model.  The data looks much better with a log transform shape wise but being count data, there's an abundance of zeros that won't work with a log model.  From reading online, I should be using something like a Poisson distribution but I am not sure how to do so with JMP?  

 

Any guidance would be appreciated, thank you in advanced!

1 ACCEPTED SOLUTION

Accepted Solutions
Peter_Bartell
Level VIII

Re: Transforming skewed count data for analysis

A couple thoughts for you:

 

1. You might want to try the Generalized Linear Model personality in the Fit Model platform. You can specify the response distribution (Poisson is an option) and link function (log is one of the options) as part of the model specification window. Here's the link to the JMP online documentation for the Generalized Linear Model personality:

https://www.jmp.com/support/help/14-1/generalized-linear-models.shtml#

 

2. If you have JMP Pro, within the Fit Model, Generalized Regression personality, there is a zero inflated Poisson distribution option for situations such as you describe...a predominance of zeros within the response population. Here's the link to the JMP Pro online documentation for the Generalized Regression personality distribution notes:

 

https://www.jmp.com/support/help/14-1/distribution.shtml

 

View solution in original post

3 REPLIES 3
Paul_J
Level III

Re: Transforming skewed count data for analysis

I often use the Gamma-Poisson to fit the a count distribution with peak at zero.  I like the JMP help on this as well.

Peter_Bartell
Level VIII

Re: Transforming skewed count data for analysis

A couple thoughts for you:

 

1. You might want to try the Generalized Linear Model personality in the Fit Model platform. You can specify the response distribution (Poisson is an option) and link function (log is one of the options) as part of the model specification window. Here's the link to the JMP online documentation for the Generalized Linear Model personality:

https://www.jmp.com/support/help/14-1/generalized-linear-models.shtml#

 

2. If you have JMP Pro, within the Fit Model, Generalized Regression personality, there is a zero inflated Poisson distribution option for situations such as you describe...a predominance of zeros within the response population. Here's the link to the JMP Pro online documentation for the Generalized Regression personality distribution notes:

 

https://www.jmp.com/support/help/14-1/distribution.shtml

 

KD
KD
Level II

Re: Transforming skewed count data for analysis

Thanks Peter! I will check out the GLM link.