Re: Custom DOE - best approach for categorical and mixture factors

SarahD · Jun 8, 2023 2:14 PM

I’m doing a custom DOE and would appreciate any tips or pointers on creating optimal recipes.

Is there a better DOE approach than the one constructed, or other factors to bear in mind?

Responses:

Maximise different nutritional values
Minimise quantity of salt
Reach target of taste and smell score

Factors:

Categorical –> Texture 2 levels : T1 / T2
Categorical --> Aroma 3 levels: A1/ A3/ A3
Mixture
- Ingredients to vary: C1 / C2 / C3 / C34
- Ingredients that are constant (flavoured liquid). I have noted it two different values, even though it will be constant in the model, as JMP only accepts two different values
- Water – this factor will be included when the sum of the constant ingredients and those to be varied equal less than 100 gr.

Blocking: Difference between days of panel testing (6 recipes to be tested / day)

Defining linear constraints:

Two lines are added which maintains proportions between the constant ingredients (liquid = 69.6%) and the variable ingredients (C1 /C2 /C3 / C4 =30.4%)

Model:

2nd level interactions were also selected. To avoid a model singularity one of the ingredients in the ‘model’ parameters must be arbitrarily removed.
- Water and Flavoured liquid were removed. 2nd level interactions involving either water or base were also removed (to decrease number of run and also as we are more interested in effets of the main ingredients)

Alias – all second level interactions
Maximum number of runs possible = 50 and with the above model the recommended number = 48 (so all good).

Example of runs:

Questions:

How is it best to determine the number of central and repeated runs ?
How do I interpret aliasing and the diagnostics ?

Is there a better DOE approach than the one constructed, or other factors to bear in mind?

Many thanks for any help or insight for my first DOE !!

statman · May 5, 2023 7:39 AM

Sarah, welcome to the community. Unfortunately there is not enough context to provide specific advice. I have way more questions than answers for you and some general advice...There is NO best experiment to run. "The best design you'll ever design is the design you design after you run it". I suggest developing multiple experiments and predicting what could you learn from each and contrast this with the resources required for each. I also suggest an iterative approach to experimentation. KISS (Keep It Simple and Sequential). Don't try to learn everything and optimize in one experiment. Instead grow your knowledge by first identifying what is important and what is not and ensure your measurement systems are adequate. Consider the hierarchy of effects. Then you can start to think about where important factors should be set.

Some questions/comments for you:

1. Question the measurement systems. How precisely can they measure the nutritional values and salt? You will have sensory measures for taste and smell (ordinal data sets like some hedonic scale). How will you account for within and between assessor variation? How will you account for assessor bias?

2. I don't understand the 2 categorical factors? What is the factor "Texture"? Are you adding something to the batch? What? What is "Aroma"? Texture and Aroma sound like response variables, not X's. Why 3-levels for your first experiment?

3. Mixture designs are what I consider to be optimization experiments. Are you ready to do this? Let's say the current process is performed in batches. Which components of variation are most significant, within batch, batch-to-batch or measurement variation? If you have significant batch-to-batch variation, you should figure out why before optimizing ingredients for 1 batch.

4. I don't understand the second bullet under mixture? If there are 2 values, it can't be constant.

5. Do you understand noise in the process? For example: water quality, homogeneity of the batch, ambient conditions, lot-to-lot variation of the ingredients, day-to-day variation, etc.

6. Do you understand the difference between repeats and replicates?

"All models are wrong, some are useful" G.E.P. Box

SarahD · May 9, 2023 5:42 AM

Thank so much for your answer and pertinent questions which I have responded to below (italics). Apologies for not being precise enough with the information!

For more context : we want to investigate the different quantities of various chunks in soups. There are three different flavoured soups which all have the same chunks (chicken, beans etc). We have already some chef-created versions with a certain recipe and now we want to investigate how far we can push the nutritional aspect without decreasing acceptable sensory characteristics.

1. Question the measurement systems. How precisely can they measure the nutritional values and salt? You will have sensory measures for taste and smell (ordinal data sets like some hedonic scale). How will you account for within and between assessor variation? How will you account for assessor bias?

The assessors will undergo training to reduce this bias. We will include a blocking factor for the difference between sensory sessions (days). We will accurately calculate nutritional (using an external laborotary).

2. I don't understand the 2 categorical factors? What is the factor "Texture"? Are you adding something to the batch? What? What is "Aroma"? Texture and Aroma sound like response variables, not X's. Why 3-levels for your first experiment?

Texture category refers to the fact that we will either leave one version with the the ingredients in chunks and the other version will be mixed. The content will be identical.

Aroma category - three different flavour of liquid bases (same quantity) - chicken / vegetable / coconut

3. Mixture designs are what I consider to be optimization experiments. Are you ready to do this? Let's say the current process is performed in batches. Which components of variation are most significant, within batch, batch-to-batch or measurement variation? If you have significant batch-to-batch variation, you should figure out why before optimizing ingredients for 1 batch.

We have already 'optimised' our recipes to a certain extent through focus groups, testing and chef creation. We now want to optimise the nutritional component by seeing how far we can increase(protein) or decrease (salt) certain ingredients without affecting sensory characteristics.

We have 'reference' versions of these recipes which will serve as a guideline for desired level of various sensory characteristics (so we aim to investigate how much extra protein and how little salt can we use without decreasing acceptable taste and texture).

We have three categories of soup with a different liquid base and want to test how the ingredient chunks interact with the different liquids, hence the three aroma categories.

4. I don't understand the second bullet under mixture? If there are 2 values, it can't be constant.

I have removed this element. All mixture factors are now variable (and I have also subsequently removed the linear contraint).

5. Do you understand noise in the process? For example: water quality, homogeneity of the batch, ambient conditions, lot-to-lot variation of the ingredients, day-to-day variation, etc.

We are trying to minimise noise - training sensory assesors, blocking for the variation of days, one production of all the recipes to be tested etc.

6. Do you understand the difference between repeats and replicates?

I didn't but I do now!!

Following your comments I have adjusted the custom DoE (in attached pdf). In the model evaluation part it states aroma 1 and aroma 2 - is this because I have three categorical levels?

statman · May 9, 2023 6:48 AM

I will offer some further thoughts in blue, you may choose to ignore them.

For more context : we want to investigate the different quantities of various chunks in soups. There are three different flavoured soups which all have the same chunks (chicken, beans etc). We have already some chef-created versions with a certain recipe and now we want to investigate how far we can push the nutritional aspect without decreasing acceptable sensory characteristics.

I'm confused, on one hand you indicate you have already optimized ("We have already 'optimised' our recipes to a certain extent through focus groups, testing and chef creation.")....IMHO, this is why you want to evaluate multiple Y's as you iterate (multivariate). Going after different Y's sequentially can be challenging ineffective and inefficient.

1. Question the measurement systems. How precisely can they measure the nutritional values and salt? You will have sensory measures for taste and smell (ordinal data sets like some hedonic scale). How will you account for within and between assessor variation? How will you account for assessor bias?

The assessors will undergo training to reduce this bias. We will include a blocking factor for the difference between sensory sessions (days). We will accurately calculate nutritional (using an external laborotary).

IMHO, training may help consistency, however it will likely not impact sensory bias. The key to mitigating the sensory bias is to remove emotion from the evaluation. For example, don't ask which one do you like?, instead, using your expert chef's, create samples representing some ordinal scale (at least 5 categories) and ask which one does it match. Sending samples to an external lab does not guarantee the measurements will be useful. And you care more about precision than you do accuracy for experimental design purposes (you want less variation in the measurement system). I would require identical samples be tested multiple times by the lab.

2. I don't understand the 2 categorical factors? What is the factor "Texture"? Are you adding something to the batch? What? What is "Aroma"? Texture and Aroma sound like response variables, not X's. Why 3-levels for your first experiment?

Texture category refers to the fact that we will either leave one version with the the ingredients in chunks and the other version will be mixed. The content will be identical.

Aroma category - three different flavour of liquid bases (same quantity) - chicken / vegetable / coconut

OK, so the names for the factors reference what you are actually changing (chunk size and flavor). I think you might consider blocking on base (flavor) as these might create rather different sensory responses.

3. Mixture designs are what I consider to be optimization experiments. Are you ready to do this? Let's say the current process is performed in batches. Which components of variation are most significant, within batch, batch-to-batch or measurement variation? If you have significant batch-to-batch variation, you should figure out why before optimizing ingredients for 1 batch.

We have already 'optimised' our recipes to a certain extent through focus groups, testing and chef creation. We now want to optimise the nutritional component by seeing how far we can increase(protein) or decrease (salt) certain ingredients without affecting sensory characteristics.

Hind sight is 20/20. I would have recommended these be included in your initial recipe experiments.

We have 'reference' versions of these recipes which will serve as a guideline for desired level of various sensory characteristics (so we aim to investigate how much extra protein and how little salt can we use without decreasing acceptable taste and texture).

Use these "reference versions" to create your ordinal scale.

We have three categories of soup with a different liquid base and want to test how the ingredient chunks interact with the different liquids, hence the three aroma categories.

I'm not sure I understand why the consistency of the soup (chunkiness) would interact with flavor (or why you would care), but you can estimate block by factor interactions if you block on flavor. I guess I'm thinking more about what would you do with knowledge of a flavor by chunk interaction...to me you would need to have a different model for each flavor...that would be the advantage of blocking on flavor...it would be like 3 separate experiments on the same factors that can be compared.

4. I don't understand the second bullet under mixture? If there are 2 values, it can't be constant.

I have removed this element. All mixture factors are now variable (and I have also subsequently removed the linear contraint).

5. Do you understand noise in the process? For example: water quality, homogeneity of the batch, ambient conditions, lot-to-lot variation of the ingredients, day-to-day variation, etc.

We are trying to minimise noise - training sensory assesors, blocking for the variation of days, one production of all the recipes to be tested etc.

This is NOT what you should do. You want to create a design space that is representative of future conditions. Consumers' sensory perception varies, production processes vary, raw ingredients vary, etc.

The exact standardization of experimental conditions, which is often thoughtlessly advocated as a panacea, always carries with it the real disadvantage that a highly standardized experiment supplies direct information only in respect to the narrow range of conditions achieved by the standardization. Standardization, therefore, weakens rather than strengthens our ground for inferring a like result, when, as is invariably the case in practice, these conditions are somewhat varied.

R. A. Fisher (1935), Design of Experiments (p.99-100)

6. Do you understand the difference between repeats and replicates?

I didn't but I do now!!

Following your comments I have adjusted the custom DoE (in attached pdf). In the model evaluation part it states aroma 1 and aroma 2 - is this because I have three categorical levels?

I don't follow your logic here? Although I notice you use French and I don't.

"All models are wrong, some are useful" G.E.P. Box