turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- JMP User Community
- :
- Discussions
- :
- Discussions
- :
- Non-normal data Capability Analysis

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

Highlighted

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 11, 2018 2:15 PM
(382 views)

Hi everyone,

I am trying to __ calculate CpK__ for two different parameters. Data for both does not have a normal distribution. I followed the tips provided in the following post:

https://community.jmp.com/t5/JMPer-Cable/Process-Capability-Analysis-for-nonnormal-data/ba-p/38112

Based on this analysis Mixture of 3 normals looks the best (AICc). Then I plotted the individual probability plots.Showing below are the top 3 probability plots including "Normal".

Based on this analysis the calculated CpK (Mixture of 3 normals) was 0.65 as opposed to the one calculted with "Normal" which was 0.9. Is this approach correct?

The second paramter is even more complicated as the data is skewed mostly due to values below LOQ of the assay reported as 109.

Any help is appreciated!

Thanks

7 REPLIES

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 11, 2018 2:41 PM
(375 views)

Well, I certainly wouldn't trust the normal estimate, but I'm not sure I'd trust any of those. Stability is fundamental assumption for the capability indices. They don't have to be normal, but the data needs to maintain a constand mean and variance. In the other thread, Mark and Mike talked about understanding why you have multiple modes. That doesn't absolutely mean you don't have a stable process because sometimes there are perfectly reasonable reasons to have multi-moded data and they can't be "fixed." However, you really need to make sure you do your due diligence. If the assumption of stability is not met, then your capability estimates are probably not indicative of the capability you will have in the future.

That being said, if you feel confident your process is stable, then by all means base the capability calculations on the most appropriate fitted distribution. An alternative is non-parametric capability, which doesn't assume any distribution.

-- Cameron Willden

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 11, 2018 2:49 PM
(371 views)

Thanks Cameron for the quick response! Is there a way to identify these multiple modes in JMP 14 and analyze them?

Also I forgot to mention that some of the data points highlighted below were determined to be due to sample prep error. Can these be elimintated from the process capability or sample prep or mixing errors are considered part of your process?

Also the goal is not only to calculate CpK but change the specs (tighten or broaden) depending on the results from Capability analysis.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 11, 2018 2:56 PM
(367 views)

I think you can eliminate the those samples that you know were prepped incorrectly since you have an assignable cause. Your objective is to generalize to unsampled product, and unsampled product is not prepped at all for testing, so I would say that it's not part of the process.

JMP can't really figure out which samples belong to which mode; there's no way to determine that precisely. The best you could probably do is assign each individual to the closest mode, but you are likely to get alot of them wrong. If you can look at samples that come from different parts of the distribution and see if you can figure out how they are different. We sometimes see multiple modes in our strength testing and the modes align with different failure modes.

It might also help to look at a control chart. If the different modes of your distributions correspond to periods of time along the X-axis, you may be able to find a root cause.

-- Cameron Willden

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 12, 2018 9:42 AM
(309 views)

What could be the justification of shrinking the spec limit in the following case where data is heavily skewed?

Basically most of the data is at 109 which is the LOQ of the assay.

Again the Norm 3 mix looks the best and the new CpK calc with Norm 3 is lower at 3.1.

Is there a way to deal with the near LOQ data? Is the CpK calc this way accurate?

Thanks for all the previous replies!

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 12, 2018 10:45 AM
(306 views)

I don't know what LOQ means. I'm not sure I could be of much help with these questions.

-- Cameron Willden

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 12, 2018 11:46 AM
(301 views)

LOQ is limit of quantitation. Basically thats the lowest value that an assay can reliably report.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Jul 12, 2018 1:00 PM
(295 views)

You have chosen Normal 3 mixtures for your distribution, but SHASH and Johnson Su are better choices based on their lower AICc scores. The lower the AICc the better.

The **SHASH distribution** is also known as the sinh-arcsinh **distribution**. This**distribution** is similar to Johnson **distributions** in that it is a transformation to normality, but the **SHASH distribution** includes the normal **distribution** as a special case. This **distribution** can be symmetric or asymmetric.

Also, you have two outliers one is way below your LOQ and the other way above and beyond your USL. You may want to hide and exclude those points and refit your distributions to see how much those points leverage the overall fit and Cpk.