Discussions

zimmerj · Oct 21, 2019 05:04 PM

I'm doing a discrete fit between an observed count distribution and a Gamma-Poisson mixture. However the Pearson Chi-Squared Goodness-of-fit statistics is returning a very high value for X2, but indicating an acceptable p-value.

In the example provided, X2 is 1222.58, which if inserted into a Chi Squared distribution would return a p-value of 7.552193e-268.

JMP returns a p-value of 0.1021.

Is there some adjustment that is done in JMP's calculation that would increase the value so much over a direct calculation?

tonya_mauldin · Oct 29, 2019 4:59 AM

The expected Chi Square under the null hypothesis is equal to the degrees of freedom. Your degrees of freedom are high for this example, you have an N of 1162. Therefore I am not surprised by the results.

Most issues with the Pearson Goodness of Fit tests comes from the bin sizes.

The Pearson Goodness of Fit test has changed for Version 15. The problem with this test is that the bin size it not well defined. For version 15, JMP makes sure there are at least 5 expected observations in each bin. This way we satisfy that rule of thumb (the assumptions are towards the bottom). https://en.wikipedia.org/wiki/Pearson%27s_chi-squared_test

The last bin is the exception. If it's expected count is less than 5, JMP does not worry about it.

Previously, JMP treated every integer between the min and max as a bin. This means there can be bins with zero observations and expections essentially zero.

View solution in original post

tonya_mauldin · Oct 29, 2019 4:59 AM

The expected Chi Square under the null hypothesis is equal to the degrees of freedom. Your degrees of freedom are high for this example, you have an N of 1162. Therefore I am not surprised by the results.

Most issues with the Pearson Goodness of Fit tests comes from the bin sizes.

The Pearson Goodness of Fit test has changed for Version 15. The problem with this test is that the bin size it not well defined. For version 15, JMP makes sure there are at least 5 expected observations in each bin. This way we satisfy that rule of thumb (the assumptions are towards the bottom). https://en.wikipedia.org/wiki/Pearson%27s_chi-squared_test

The last bin is the exception. If it's expected count is less than 5, JMP does not worry about it.

Previously, JMP treated every integer between the min and max as a bin. This means there can be bins with zero observations and expections essentially zero.

zimmerj · Oct 29, 2019 03:17 PM

Thanks Tonya, that helps a lot. Is there a way to control the bin size for ver. 15 and above? In the example I have provided, I believe I wasn't giving the correct degrees of freedom for the Chi Squared Test in the alternate methods I was using.

tonya_mauldin · Oct 30, 2019 07:48 AM

No, JMP Version 15 does not give the user control of the bin sizes that are used for the Pearson Goodness of Fit test.

Discussions

How does JMP adjust for p-value of Pearson Chi-Squared Value

Re: How does JMP adjust for p-value of Pearson Chi-Squared Value

Re: How does JMP adjust for p-value of Pearson Chi-Squared Value

Re: How does JMP adjust for p-value of Pearson Chi-Squared Value

Re: How does JMP adjust for p-value of Pearson Chi-Squared Value

Recommended Articles