cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
Agustin
Level IV

How to implement variance decomposition

Hi, I'm following a guide on how to evaluate precision of measurements and they show this figure (image 1) for a 20x2x2 study (20 days, 2 runs each day, 2 replicates each run).

Annotation 2020-06-11 114516.png

 

Followed by the explanation (image 2):

Annotation 2020-06-11 114917.png

 

What is the best way to do this in JMP? Make the plot with how much of the total variance is explained by each of the variables, and also get a numerical table with this information.

Thanks

 

 

1 ACCEPTED SOLUTION

Accepted Solutions

Re: How to implement variance decomposition

I will assume that you have a data table with a row for each measurement. There are data columns for the measurement and the grouping variables. The grouping variables are named Day (1-20), Run (1-2), and Replicate (1-2). There will be a total of 80 rows in your table.

 

There are a couple of platforms that can compute the variance components for you. Both of them will give you the estimates but each offers unique additional information about the estimates and about the assumptions of the model. We will start with the Fit Least Squares platform.

 

  1. Select Analyze > Fit Model.
  2. Select Measurement and click Y.
  3. Select Day and Run in the list of columns on the left.
  4. Click Add.
  5. Select Day in the list of columns and select Run in the list of effects.
  6. Click Nest.
  7. Select Day and Run[Day] in the list of effects.
  8. Click the red triangle next to Attributes and select Random Effect.
  9. Check your dialog against the picture below and when they agree, click Run.

Screen Shot 2020-06-11 at 7.20.45 AM.png

 

The variance components are found in the Var Component column in the REML Variance Component Estimates report in the Fit Least Squares window.

 

Screen Shot 2020-06-11 at 7.22.25 AM.png

The variability chart is another good way to get the estimates.

 

  1. Select Analyze > Quality and Process > Variability / Attribute Gauge Chart.
  2. Select Measurement and click Y, Response.
  3. Select Day and Run and click X, Grouping.
  4. Click Decide Later and select Nested for Model Type.
  5. Check that you dialog agrees with the picture below and click OK.
    Screen Shot 2020-06-11 at 7.26.33 AM.png
  6. Click the red triangle next to Variability Gauge and select Variance Components.
  7. Click OK.

 

Screen Shot 2020-06-11 at 7.29.07 AM.png

 

The results from the two platforms differ. First, the model in the Variability Gauge platform includes an interaction term that is not part of the model used in the Fit Least Squares platform. Second, the estimation method is different. The first platform used REML and the second platform used a Bayesian technique. The reason is that the Day and Run variances are 0 in my simulated measurements, so the estimates can vary about 0. The first platform allows for negative estimates but you can change that behavior. The second platform detects the negative estimate and switches methods to give only positive estimates, but you can change that behavior.

View solution in original post

8 REPLIES 8

Re: How to implement variance decomposition

I will assume that you have a data table with a row for each measurement. There are data columns for the measurement and the grouping variables. The grouping variables are named Day (1-20), Run (1-2), and Replicate (1-2). There will be a total of 80 rows in your table.

 

There are a couple of platforms that can compute the variance components for you. Both of them will give you the estimates but each offers unique additional information about the estimates and about the assumptions of the model. We will start with the Fit Least Squares platform.

 

  1. Select Analyze > Fit Model.
  2. Select Measurement and click Y.
  3. Select Day and Run in the list of columns on the left.
  4. Click Add.
  5. Select Day in the list of columns and select Run in the list of effects.
  6. Click Nest.
  7. Select Day and Run[Day] in the list of effects.
  8. Click the red triangle next to Attributes and select Random Effect.
  9. Check your dialog against the picture below and when they agree, click Run.

Screen Shot 2020-06-11 at 7.20.45 AM.png

 

The variance components are found in the Var Component column in the REML Variance Component Estimates report in the Fit Least Squares window.

 

Screen Shot 2020-06-11 at 7.22.25 AM.png

The variability chart is another good way to get the estimates.

 

  1. Select Analyze > Quality and Process > Variability / Attribute Gauge Chart.
  2. Select Measurement and click Y, Response.
  3. Select Day and Run and click X, Grouping.
  4. Click Decide Later and select Nested for Model Type.
  5. Check that you dialog agrees with the picture below and click OK.
    Screen Shot 2020-06-11 at 7.26.33 AM.png
  6. Click the red triangle next to Variability Gauge and select Variance Components.
  7. Click OK.

 

Screen Shot 2020-06-11 at 7.29.07 AM.png

 

The results from the two platforms differ. First, the model in the Variability Gauge platform includes an interaction term that is not part of the model used in the Fit Least Squares platform. Second, the estimation method is different. The first platform used REML and the second platform used a Bayesian technique. The reason is that the Day and Run variances are 0 in my simulated measurements, so the estimates can vary about 0. The first platform allows for negative estimates but you can change that behavior. The second platform detects the negative estimate and switches methods to give only positive estimates, but you can change that behavior.

Agustin
Level IV

Re: How to implement variance decomposition

Thank you this is exactly what I wanted.
Although I don't understand why I have to choose nested, I assumed day and run were crossed? For every day you have run 1 and 2?
Thank you

Re: How to implement variance decomposition

It has to do with the modeling and the correct computation of the variance components. It has to do with the meaning of SAME thing. You label Run as 1 and 2 for every day, but the run changes from day to day. Run 1 on Day 3 is not the same as Run 1 on Day 1. If they were crossed, you would be able to bring the SAME run to every day, which is physically impossible. Another way to think about it is in terms of the degrees of freedom. If you used a cross design, then you would have only 2 degrees of freedom for Run. But you have 20x2 = 40 degrees of freedom for Run. That is because Run is nested in Day.

 

Think of it this way. What if the factor was Sample = 1 or 2 to be tested on Day = 1 to 20. If the same sample were tested each day, then it is a crossed design. If the two samples were different each day, then the design is nested.

Agustin
Level IV

Re: How to implement variance decomposition

I think I understand that, but if instead of run 1 and 2 it was Operator Bob and Operator Alice, would that also be nested?
Is there an easy way to determine whether the factors are nested or crossed, especially if I'm thinking of more than 2 factors?

Re: How to implement variance decomposition

I don't know of an 'easy' way. It is a very common point of confusion, so do not feel bad if it is not entirely clear yet. I will say, though, that once you 'get it,' it will be very clear to you.

 

If it was always Operator Bob or Alice every day, the Operator is crossed. But if it was different operators each day and you merely called them Bob and Alice, then it is nested. It is not what you call it, it is what it is!

 

Here is a link to a good source (of many!) to hopefully clarify crossed and nested factors.

Agustin
Level IV

Re: How to implement variance decomposition

Thank you
statman
Super User

Re: How to implement variance decomposition

To add to @Mark_Bailey 's comments, I will say sometimes it is obvious and sometimes not.  Here is a simple diagram that shows the continuum between nested and crossed with systematic in the middle.  For systematic sampling, you will find different opinions as to how this should be handled.  

crossnest.png

"All models are wrong, some are useful" G.E.P. Box
P_Bartell
Level VIII

Re: How to implement variance decomposition

To supplement @Mark_Bailey 's recommendations, here is a JMP On Demand webinar on the very topic of interest here, hosted by my former colleague @JerryFish :

 

https://www.jmp.com/en_us/events/ondemand/mastering-jmp/evaluating-and-monitoring-your-process-using...