cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Try the Materials Informatics Toolkit, which is designed to easily handle SMILES data. This and other helpful add-ins are available in the JMP® Marketplace
Choose Language Hide Translation Bar
emanuelx
Level I

definitive screening, modeling help / feedback

Hi Everyone!
I am realtively new to JMP and would like to ask if I could get some feedback / answers to a few question. I am quite sure some of them are based in a lack of understanding on my side, so please be patient.... I have of course tried to find answers first in documentation and training videos but I'm still left with this.

At work, I recently took over the development of growth media for our cells (biotech). The medium consist of about 80 different components as well as some additional factor (for example pH setting). There are about 14 different stock solutions / groups of ingrendients with are mixed at different ratios. We currently have an issue with this formulation to keep the cells alive as long as possible which means viability on the last day (D10_viability) should be as high as possible.
To figure out which component components might influence this without too much bias from previous obsevations, my idea was to perform a definitiive screening design with 12 of these stock solutions as factor. I would assume that quadratics will be present for almost all. Obviously it would be nicer to do this for each individual ingredient in a way I can also asses the (certainly present) interactions, but that's not feasible by hand.
So what I did was using the current formulation as center point and go from there. Since I assume that for some ingredients (for example trace elements) a much bigger change is expected to see something, I chose different "intervals". What I did however is that I changed the values from the setpoint using different factors which is supposed to mean for exmaple: for ingredient 1 (let's just imagine this was glucose), the factor was 5, so the center point (0) is for example 5 g/L, the low point (-1) is 1 g/L and the high point (+1) is 25 g/L. In this particular case 0; 5 and 10 would probably also have worked, but in case of trace elements, just doubling for +1 probably wouldn't do much. Also for pH I need to use the "pH expected" and not what it turned out be in reality (which was along those lines, but of course not 100% the same).

I attached the file also containing some responses and the real values and the ones used in the DoE as well as some responses.
So my questions(s):
Does this approach make any sense / is this correct? If I use the real values, I cannot assess / fit a definitive screening and the expected performance of the DoE is terrible.
If I perform a fit anyways (not using the definitve screening fitting tool) using mains and quadratics I get something that looks like a decent model but I have no idea if this makes any real sense....
Can I put in interactions that I believe exist despite the "evaluate model" dialog telling me these can't be estimated?

It turned out that one of the ingredients (nr 6) was extremely deadly for the cells on already by D03. I can identify this quite nicely (although it's also obvious by eye) using the definitive screening tool.
When I use the same platform to look at viability on D10_viability as response I get some weird results including very strange residual by row plots. When I kick out all the conditions with the high ingredient_6 (marked in red in the file) which is about 40% of the conditions, I cannot use the definitive screening platform anymore, but can still fit models that look good to me. Obviously the evaluate model platform tells me this this is no longer great.
Does this still make any sense??
Also, this only works if I remove the original high 100 from the recoding for factor 6...
Sorry for all these somewhat messy and long questions, but I would really appreciate any kind of input!

2 REPLIES 2
mlo1
Level IV

Re: definitive screening, modeling help / feedback

Isn't it useful to put your model and the exploratory analysis graphs you did so far in the table so other can check what you did?
What is the reason of having coloured Cells and cells which are set as "hide and exclude" ?
Is the factor 100 in changing the factor level (0.01 - 1 - 100 for low, centre, high) normal in your field and reasonable?

emanuelx
Level I

Re: definitive screening, modeling help / feedback

thanks for the feedback - I tried to create some of the models I encountered which a short description
the observtions marked in red are those that had a viability of 0 on day 3 which conincides with the high values for ingredient 6 (+1 / 100). When I keep them in the and try to model days after this (day 10), everything looks pretty weird to me - however I have no idea if this is a valid approach
green is used for the center points (unfortunately, 1 of them is quite different than the other 2... )
concering the factor levels
generally, using something like 1, 10, 100, 1000,... as the values in a dose response in assays using cells would be pretty common; however those response curves are usually plotted on a log scale and EC50 (those at 50% response) are compared - this doesn't really make it easier for me to understand if this approach is fine here
my thinking was: if 100 is the center point then if I just go with the classical approach, the lowest possibe value would be 0 (which is okay, I guess), but the highest would be 200 which for most ingredients would not be expected to have a very strong effect 
in retrospect, I certainly chose the ranges too wide, but I was still wondering if I could use this approach at all for having 3 factor levels...