Discussions

Thierry_S · Mar 23, 2026 02:22 PM

Hi JMP Community,

I am working with cell counts derived from tissue biopsies that exhibit an unusual distribution: some biopsies have no cells (zero counts), while others have low percentages (see example below). A simple cube-root transformation yields a decent distribution of the non-zero counts, but the overall distribution of the transformed data is still dominated by a spike at zero.

As expected, if I use these data in a Standard Least Squares Fit Model, the residuals are not normally distributed (see below).

What would be your recommendation to test if these cell counts are associated with multiple covariates, including interactions?

Notes:

The absence of cells (zero counts) is scientifically meaningful.
I cannot easily share the actual data due to sensitivity.

Thank you.

Best regards,

TS

Thierry R. Sornasse

Victor_G · Mar 23, 2026 1:12 PM

Hi @Thierry_S,

Do you have JMP Pro? Using Generalized Regression models, you can specify one of the zero-inflated distributions.
I think a zero-inflated Poisson distribution could work on your raw count data. If you're using standard JMP, maybe you could split your modeling into two parts:

1. Determine when the result is 0 or different from 0 (binomial distribution).
2. For the cases where the result is different from 0, fit a standard least squares model. You can still apply a transformation to your raw data (if needed!), such as a Box-Cox transformation.
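
To make the zero-inflated Poisson idea concrete, here is a minimal Python sketch of its probability mass function. This is not JMP code and not how Generalized Regression is implemented; the function name `zip_pmf` and the parameter values are assumptions chosen only for illustration. The model mixes a point mass at zero (probability `pi`) with an ordinary Poisson distribution (mean `lam`).

```python
import math

def zip_pmf(k, lam, pi):
    """P(Y = k) under a zero-inflated Poisson.

    With probability pi the observation is a structural zero;
    otherwise it is drawn from Poisson(lam).
    """
    poisson = math.exp(-lam) * lam ** k / math.factorial(k)
    # (k == 0) is True/False, so the point mass only contributes at k = 0.
    return pi * (k == 0) + (1.0 - pi) * poisson

# Hypothetical parameters: 30% structural zeros, mean count of 4 otherwise.
lam, pi = 4.0, 0.3
p_zero = zip_pmf(0, lam, pi)          # zero probability under the mixture
plain_poisson_zero = math.exp(-lam)   # zero probability under plain Poisson
```

With these made-up values the mixture puts far more probability on zero than a plain Poisson with the same mean, which is the behavior a plain Poisson fit cannot reproduce when many biopsies contain no cells.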

Hope this suggestion may help you,

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)

Thierry_S · Mar 23, 2026 04:11 PM

Dear Victor,

Thank you for helping me with this problem. Unfortunately, I do not have access to JMP Pro at this time. Still, I have access to the Generalized Linear Model in JMP 19 "Basic", but the Poisson Distribution does not seem to fit all the data. Indeed, when the cell type is more abundant, the non-zero sub-population distribution tends to approximate a normal distribution.

I wonder if binning the data as an ordinal variable could help here?

Best,

TS

Thierry R. Sornasse

Victor_G · Mar 23, 2026 04:16 PM

Yes, I would split the task into two parts:
1. Determine when the result is 0 or different from 0 (binomial distribution).
2. For the cases where the result is different from 0, fit a standard least squares model. You can still apply a transformation to your raw data (if needed!), such as a Box-Cox transformation.
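
The data preparation behind these two steps can be sketched in Python as follows. This only builds the two responses; it does not fit either model, and it is not JMP code. The helper names `split_two_part` and `box_cox`, the example counts, and the choice of lambda = 1/3 (echoing the cube-root transformation mentioned earlier in the thread) are all assumptions for illustration. In practice lambda would normally be estimated from the data.

```python
import math

def split_two_part(counts):
    """Return (is_nonzero, positives).

    is_nonzero: 0/1 indicator for every observation (response for part 1).
    positives:  the non-zero values only (response for part 2).
    """
    is_nonzero = [1 if c > 0 else 0 for c in counts]
    positives = [c for c in counts if c > 0]
    return is_nonzero, positives

def box_cox(y, lam):
    """Box-Cox transform of a strictly positive value y for a given lambda."""
    if y <= 0:
        raise ValueError("Box-Cox requires strictly positive values")
    return math.log(y) if lam == 0 else (y ** lam - 1.0) / lam

# Hypothetical cell counts, made up for illustration only.
counts = [0, 0, 3, 0, 12, 7, 0, 25]
is_nonzero, positives = split_two_part(counts)

# Transform only the non-zero part; the zeros are handled by the indicator.
transformed = [box_cox(y, 1 / 3) for y in positives]
```

The indicator column would then be the response for the binomial model, and the transformed non-zero values the response for the least squares model, each with its own set of covariates and interactions.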

Best,

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)

statman · Mar 23, 2026 05:45 PM

Without subject-matter expertise (SME), it is difficult to answer. First, what questions are you trying to answer? Second, what is the data source? Third, how confident are you in the measurement system? Are there other measures you can take? How confident are you that the process for taking the biopsies is consistent? What is the model you are trying to fit? If you split the data (as suggested by Victor), the question you must ask is whether the factors that affect the cell counts differ from the factors that determine whether any cells are present at all. If so, it would make sense to model two Y's: a binomial response (two categories, cells present or absent) and the continuous cell counts.

"All models are wrong, some are useful" G.E.P. Box

Windows 11 > JMP 19 > Fit Model > Standard Least Square > How to handle unusual data distribution?