Discussions

Doraemon214 · Mar 28, 2025 10:00 PM

Hello,

I am new to the JMP life distribution plot.

I think it is a basic understanding issue.

The below is the test table I made.

According to the description: https://www.jmp.com/support/help/en/18.1/index.shtml#page/jmp/event-plot.shtml#ww1386833

I think it meas there are total 100 samples.

And exactly at 4 readings: 50, 100, 150 and 200, there are 25 new failures happened.

So the failure rate I expected will be:

50, 25%

100, 50%,

150, 75%

200,100%.

Then I lauch the life distribution with setting as below:

However, the dots in the Weibull plot are very different from the failure rate I expected.

For example, the first read point at time 50 has failure rate as ~13%.

I don't understand why.

Could you help me with this?

Thank you very much for your help!

I am using JMP17

hogi · Apr 1, 2025 9:10 PM

Sorry, my first guess was wrong.

I just checked the probability values in Life Distribution for tables with different numbers of rows. *)
Seems that life distribution uses Hazen (Midpoint) plotting positions - independent of the number of data points:

prob = (rank - 0.5) / N

aka "Weibull" is what you get when you use the right click "new formula column" menu in the data table:

Here is an overview of different plotting positions which are available in MatPlotLib (via the "postype" argument)
from: https://matplotlib.org/mpl-probscale/tutorial/closer_look_at_plot_pos.html

seg = current report () ["Compare Distributions",FrameBox(1)] << find seg (Marker Seg( 1 ));
data= seg <<Get Y Values;

col = current data table() << new column ("prob");
col << set values(data)

*) get Y positions from a Marker Seg

View solution in original post

hogi · Mar 30, 2025 4:29 AM

For large N the estimated percent is

rank/ ( N + 1)

For N=4 you would get 0.2, 0.4, 0.6 and 0.8.
(instead of the 0.25, 0.5, 0.75, 1 which you expected)

For small N the estimated percent is "corrected". There are several approaches for the correction.
I have seen an overview in Wikipedia - I will try to find it ...

hogi · Mar 29, 2025 02:02 PM

It's kind of surprising that "N" is just 4 - because of 4 rows?
And not 100 - because of Freq = 25+25+25+25.

So, users should not be lazy, an use multiple lines instead of aggregating results via Freq : )

the same input, using 100 rows:

Doraemon214 · Mar 30, 2025 12:19 AM

Hi Hogi,

Thank you for your reply.

I think you meant the probability is an estimate instead of a measured value, so the it won't be the same as the actual failure rate in the test.

Is that correct?

In that case, I wonder if there is a document explains the thoughts and math behind it? I tried the JMP document but had no luck.

And if I want to do the plot as my acual failure rate, is there a way to do it in JMP?

Thank you.

hogi · Apr 1, 2025 9:10 PM

Sorry, my first guess was wrong.

I just checked the probability values in Life Distribution for tables with different numbers of rows. *)
Seems that life distribution uses Hazen (Midpoint) plotting positions - independent of the number of data points:

prob = (rank - 0.5) / N

aka "Weibull" is what you get when you use the right click "new formula column" menu in the data table:

Here is an overview of different plotting positions which are available in MatPlotLib (via the "postype" argument)
from: https://matplotlib.org/mpl-probscale/tutorial/closer_look_at_plot_pos.html

seg = current report () ["Compare Distributions",FrameBox(1)] << find seg (Marker Seg( 1 ));
data= seg <<Get Y Values;

col = current data table() << new column ("prob");
col << set values(data)

*) get Y positions from a Marker Seg

Doraemon214 · Mar 31, 2025 10:26 AM

Thank you Hogi.

I will read into it.

hogi · Apr 2, 2025 01:03 PM

@Doraemon214 wrote:
I think you meant the probability is an estimate instead of a measured value, so the it won't be the same as the actual failure rate in the test.

So, we can take the measurement values of the measured samples and guess / estimate the actual failure rate of the whole population. There are just readouts at 50, 100, 150 and 200 which add another error.

I use the Life Distribution platform to fit Distributions - mainly Normal Distributions. Sometimes: get a visual impression for the deviation from normal distributions. like a Q-Q Plot fith a fit line and nice looking confidence intervals.
In addition:
a) little chance to get 2 samples with the same value

b) concerning curve fitting it feels somehow "right" to place the fit curve at the center of the points

But when you think ...
especially about lifetimes - and "Survival" rates - indeed surprising that
"25 out of 100 units fail at 50 (or earlier)" is implicitly converted to

"cumulative probability up to the value 50 is 12.5%".

and with

it sounds even more surprising:
if 25 of the 100 units fail at 50, "probability" of survival at 50 is close to 90%:

Still many open questions ...
I asked JMP support: TS-00196267
I hope there is a white paper which tells us the details : )

NB:
according to Life Distribution plot points should default to Kaplan-Meier or Turnbull estimates with option to se...
there was a change between JMP16 and JMP17 how Life distribution calculates the probabilities.
Maybe there is a documentation{motivation which can add more insights ...

TomH · Apr 3, 2025 09:57 AM

Your data set is set up with exact fail times. It essentially says that 25% of the population fails at exactly t=50. Perhaps this is correct, but in my experience a more accurate description is that 25% of the population fails sometime between t=0 and t=50. To represent this in JMP you need to use interval censoring. If I modify your original table so Start time = [0, 50, 100, 150] and End time remains the same [50, 100, 150, 200], then I think the Weibull from Life Distribution matches your expectations:

hogi · Apr 4, 2025 12:23 PM

It helped me to consider non.equal frequency values, e.g. bunched data like

When you think of a (e.g. normal) distribution with maximum weight at 50 -> midplint.

and when you think of "100 out of 111 entries failed at 50 or below" -> Kaplan Meier.

Doraemon214 · Apr 7, 2025 06:09 PM

Thank you TomH.

I think you are right. I have to tweek the way my data is formatted so JMP can process it as I expected.

I wondering if it's statistically incorrect to do the weibull fit in the way I understood.

From Hogi's reply, I think that is true.

Feels like the measured data is not a direct representation of the true cumulative distribution function, but a math transform.

Is that why JMP does not allow weibul fit and probability plot using the Kaplan-Meier estimate?

Discussions

Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Re: Life distribution: The failure rate/probability plot does not match with the table

Recommended Articles