Wednesday, November 15, 2023
※ Note: If this site is displayed in a language other than English or Japanese, please select "日本語" (Japanese) in the language settings at the top right of the screen, or from "私の設定" (My Settings) > "Preference".
This site publishes the handout materials for Discovery Summit Japan 2023. Clicking any of the files at the top of the site opens the material in a pop-up; click the down-arrow (↓) icon to download it.
If you would like to download all materials at once, download "DSJ2023-handsout-all.zip" and unzip it. Please note that printed handouts will not be distributed at the venue this year, so please view them on a PC or mobile phone, or print them yourself.
[File names and session information for the published handouts] *Some sessions have no handouts.
A-1_HOSOJIMA: Improving the persuasive power of multivariate analysis with drill-downs using Graph Builder and data filters (東林コンサルティング, 細島 章)
A-2_TAKYU: Streamlining the generation of query scripts from JMP to a MySQL server (国士舘大学, 田久 浩志)
A-3_KAWAGUCHI: Tips for building applications with JMP/JSL (日本ゴア合同会社, 河口 雅彦)
A-4_KUROSAWA: Statistical verification of the improvement in hazard sensitivity produced by safety education in machine-shop training (筑波大学, 黒澤 拓未)
A-6_NORIO: Learning manufacturing, data science, and problem solving at Toyota Motor Kyushu through a clay-production exercise (トヨタ自動車九州株式会社, 則尾 新一)
A-7_OTA: Initiatives to spread statistics and data analysis with JMP in an R&D division (株式会社レゾナック, 太田 浩司 et al.)
B-1_Y-TAKAHASHI: Exploratory variable selection that accounts for interactions in multifactor experimental data with covariates (BioStat研究所株式会社, 高橋 行雄)
B-2_SAEKI: Factors associated with financial toxicity in breast cancer patients in Japan: a comparison of patient and physician perspectives (がん研究会 有明病院, 佐伯 澄人)
B-3_ODAI: Sensory evaluation and the use of JMP (キリンホールディングス株式会社, 小田井 英陽 et al.)
B-4_HONDA: Examples of using JMP in the pharmacovigilance field (小野薬品工業株式会社 本田 主税 / イーピーエス株式会社 小柳 将)
B-5_OGASAWARA: Ready to use tomorrow! JMP tips drawn from inquiries to JMP Technical Support (JMP, 小笠原 澤)
Despite the remarkable advances with AlphaFold2 in predicting protein structure and structure determination by cryogenic electron microscopy (cryoEM), protein crystallography is still more likely to provide higher-resolution data and more reliable structures for drug discovery. The bottleneck in crystallographic studies is the search for the chemical cocktail that promotes crystallization. Each lead condition contains two to seven chemicals and is discovered by sampling hundreds of solutions from sparse matrix screens. Once a good lead's composition is optimized, protein crystals are grown in large numbers for structural studies. Protein crystal growth often gives quadratic responses to rising levels of some factors, and two-way interactions are abundant. Two-level designs miss these features. New two-level modified one-factor-at-a-time (MOFAT) experiments can detect two-way interactions in simulation studies (Yu 2022). The extension of MOFAT to three levels would interest crystal growers. We hypothesize that a three-level extended MOFAT (EMOFAT) design can detect two-way and quadratic effects. This design retains the baseline run of standard OFAT designs and adds runs with all factors at their lowest or highest levels (2m + 3 runs, where m is the number of factors). We conducted simulation studies using JSL scripts to explore the ability of these designs to detect main effects and higher-order interactions, including three-way interactions. These designs may be attractive to protein crystal growers because the three levels can detect nonlinear responses and promise to reveal two-way interactions before moving on to more sophisticated optimization experiments.
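To make the run structure concrete, here is a minimal Python sketch of the 2m + 3 layout as described in the abstract (an illustration only, not the authors' JSL simulation code), coding each factor as low, baseline, or high:

```python
# Illustrative sketch: build a three-level EMOFAT-style design matrix with
# 2m + 3 runs, coding factor levels as -1 (low), 0 (baseline), +1 (high).
import numpy as np

def emofat_design(m: int) -> np.ndarray:
    """Return a (2m + 3) x m design matrix: one all-baseline run, one low and
    one high run per factor (other factors at baseline), plus all-low and
    all-high runs."""
    runs = [np.zeros(m)]                      # baseline run
    for j in range(m):
        low, high = np.zeros(m), np.zeros(m)
        low[j], high[j] = -1, +1              # vary one factor at a time
        runs += [low, high]
    runs += [-np.ones(m), np.ones(m)]         # all-low and all-high runs
    return np.vstack(runs)

if __name__ == "__main__":
    X = emofat_design(4)                      # 4 factors -> 11 runs
    print(X.shape)                            # (11, 4)
    print(X)
```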
Cerba Research is a globally renowned company that specializes in delivering top-notch analytical and diagnostic solutions tailored for clinical trials worldwide. At Cerba Research Montpellier, our dedicated team customizes immunohistochemistry protocols to detect specific target expressions within patients' tissue sections. To address the escalating demand for protocol development and enhance process profitability, we recognize the vital need to streamline development timelines and reduce costs. Given the diversity of custom protocols to be developed, the conventional OFAT (one factor at a time) approach is no longer sufficient. We have therefore undertaken an in-depth evaluation, comparing various design of experiments (DOE) methodologies, including custom design and space-filling design, using JMP. These DOE approaches are evaluated against previously developed OFAT protocols. We present data illustrating the comparative advantages of OFAT and DOE approaches in terms of cost-effectiveness and quality.   Hello. I'm Marie Gérus-Durand, and I'm working at Cerba Research as a validation engineer. Today, I will show you how we can cut costs and elevate our quality by using design of experiments when setting up immunohistochemistry clinical protocols. As an introduction, Cerba Research is a worldwide company with activities on all continents. I highlighted in yellow the department I'm working for: the pathology and IHC department. The main research department for IHC is based in Montpellier, where I am working. What is immunohistochemistry? Immunohistochemistry is the study of proteins or various targets in tissue. When we have biopsies, we cut thin sections of the tissue onto slides and use antibody-based technology to detect different targets of interest on the tissue, which we can see under a microscope or, with a scanner, on digitized images. Here, for example, you have a skin tissue where you see a nice target stained in red. More than a quarter of our activity at Cerba Research Montpellier is protocol setup for our clients. We have more than 50 clients, each with their own favorite targets. It's crucial for us to have a fast, effective development strategy in order to stay competitive in the whole area of protocol setup for clinical trials. Here is an example of another staining, and this is the staining I will use in the first example I will show you. It's the detection of a target, in brown here, by IHC, so this protocol detects only one target. We call it a simplex, and it's a lot of what we are doing. Our current approach is one factor at a time: we first evaluate the antigen accessibility of the tissue for the targets to be recognized, then we do a titration of the antibody to find the condition giving the best signal-to-background ratio, and we do further optimization if required. Each arrow here is a step in the development protocol, each step is at least one automaton cycle, and each cycle lasts at least four days from the request to the test results. To improve our profitability and the number of protocols developed, we need to reduce the time and the number of tests needed to arrive at the optimized protocol. Here I schematize our strategy: we test two antigen accessibility conditions with no antibody or a defined antibody concentration, and so we have these four points.
Then we choose the best one, so here it's condition two, to do a titration of the antibody with more points. But what if, in fact, the optimal point was somewhere with condition one, outside of the range tested? When we look at this, it reminds me a lot of the design of experiments webinars that I saw with JMP. This is an example slide from one of those webinars, where you compare the OFAT approach, one factor at a time, which is the one we are using now, to the DOE, and you see that you get better coverage of your experimental area. That could be a great improvement in our setup, so I decided to give it a try. First, I started with the simplest thing, which is simplex protocol development as I showed you, detecting only one target on the tissue. We first have to define our constraints and parameters. The automaton I'm using for this experiment has independent positions, which means that one test from the design of experiments is one slide in the automaton, and we have to define the responses and the factors. The response we want to analyze is the signal intensity, which we want to maximize, and we want to minimize the background. The factors we can play with at this stage are the antigen retrieval, which is a categorical factor, and the primary antibody concentration, which is a continuous variable. Let's go with the design. I chose to compare different designs; I will show you the Custom Design and the Space Filling Design. Let's set up this design together. I just go to the DOE platform and Custom Design, and we have to enter the responses and the factors. The first response is the signal intensity that we want to maximize. I assume an arbitrary evaluation of it between zero, which is no signal, and three, which is a strong signal, and I add another response here: the background, which we want to minimize, so I choose minimize this time and put the background here. In the same way, zero is no background and three is a very high background. Then we add the factors. I'll just show you, because it was a bit hidden here. As factors, we have the categorical factor, which is two-level: the antigen retrieval. It's quite easy to change, and we have two pHs, usually pH 6 and pH 9, which are named differently depending on the automaton, but that doesn't really matter. We have the continuous factor, which is the antibody concentration. Usually we test 10 as a maximum, so I make it vary from 0 to 10. Once we have set this, I have no covariates; as I told you, I chose the simplest case. We continue with the model. I want to see all the possible interactions, so I do an RSM. I don't add replicates or center points; I already went for the simplest option, and you see that by default it says you should do 11 runs. Eleven runs fits with what we usually do, so it's fine. To take advantage of the Design Explorer that is in JMP 17, I just click here and say, okay, let's explore some different designs. You see that when you click, you have the factors again, the model here, and different options: on the left, you explore design by design; on the right, you can do a combination. For an RSM we usually use I-optimality, and let's say I want to try different numbers of runs: a minimum of five, going up from the 11 we would start with to, say, 15, with a step of one. Center points, I don't really care about, but we can try and see what they do, and the same for replicates.
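As a rough cross-check of that default run size (a Python sketch under my own assumptions, not part of the JMP workflow shown in the talk), the RSM for one two-level categorical factor and one continuous factor has five model terms, so around ten or eleven runs leave a few degrees of freedom for error:

```python
# Sketch of the RSM model behind the custom design: one two-level categorical
# factor (antigen retrieval, coded -1/+1) and one continuous factor (antibody
# concentration, 0-10), with main effects, the interaction, and the quadratic.
import numpy as np

def rsm_model_matrix(ar, conc):
    """Columns: intercept, AR, conc, AR*conc, conc^2."""
    ar, conc = np.asarray(ar, float), np.asarray(conc, float)
    return np.column_stack([np.ones_like(conc), ar, conc, ar * conc, conc**2])

# Five model parameters, so roughly 10-11 runs leave a handful of degrees of
# freedom for error, which matches the default run size mentioned above.
ar   = [-1, -1, -1, -1, -1, +1, +1, +1, +1, +1]
conc = [ 0,  5, 10,  0, 10,  0,  5, 10,  0, 10]
X = rsm_model_matrix(ar, conc)
print(X.shape)                          # (10, 5)
print(np.linalg.matrix_rank(X))         # 5 -> all RSM terms estimable
```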
Let's also see whether it's better if we do replicates. I click Generate All Designs, and it builds the designs with the different parameters. I chose I-optimality, so it's I-optimality everywhere. I chose run numbers five to 15, and then you have the replicates. Here there is a replicate for five; I don't know why, but it's okay. You see for seven, for example, I have two replicates, one, or zero, and it's the same for all of them. For center points as well, I have one or two center points when possible. At the bottom, you can make a table of all these designs, and it adds a nice column; it just says Custom Design for all of them. If you choose a design in this table and click Custom Design, it opens that design, which is nice. From the table you can do some graphs and say, okay, let's look at the efficiency depending on the number of runs. You see that there is a strong correlation between the two of them. We can play and add other variables to look at with the column switcher: the runs, and the center points and replicates as well. Sorry, I forgot to select the runs, but anyway, here we have the runs. Center points don't seem to have a big impact, and replicates neither, so it looks like we should mainly play with the number of runs. If I look at the number of runs, I come back to this, exclude the center points and replicates, and look just at the run numbers. I didn't remove everything; you should remove everything first, otherwise it just adds all of them. I removed everything and just selected; sorry, it didn't take. So let's say I choose what was selected, 12 runs. I go back here and ask, is the default 11? In fact, I want to be sure to have my two negative controls, which are no antibody at all or an isotype control, so I say 10. I make a design with 10 runs and then manually add my two controls. Here you see it's a small design; it takes only 10 seconds, and here is the table. In fact, each time we run it, it will give different numbers here, so I will use the one I did before by just running its script, and I will have exactly the same data as the one I'm showing you. Here is the table; you see it was different. I have only 0, 5, or 10 for the concentration, but let's keep this table. In the table, you see you already have the model, the evaluation of the design, and you can go back to the DOE dialog box. I will not do the evaluation now, because I want to compare two designs; I will do it at that point. I just plotted the design I have: it has duplicates for some points at the end, even though I did not ask for duplicates, and it covers this area of the experimental space. Then I try a space-filling design. It's the same: go to DOE, but this time Special Purpose, Space Filling. What I want to show you is that I need to go back to my Custom Design, because I want to compare the designs, and the responses and factors should be the same, so I can just save my responses and my factors: save responses, and in the same way, save factors. Then, if I go back to the Space Filling Design, I can load the responses. Sorry, you have to have the window selected so you find it. The same for the factors: I load the factors. No constraints. Once again, the default number of runs is 20, but I want to compare the two designs, so I will do 10 as well. I have no choice of method here; it's just Fast Flexible Filling.
It generates this design, and you see that this time the concentration is not restricted to values like 0, 5, or 10; it's a whole range of concentrations, and you can make the table in the same way. I can close all of this. As for the Custom Design, I rerun the one I did so I have exactly the same numbers, and then I wanted to compare the designs. How do you compare the designs? Oh, first, sorry, I forgot: you see that the variation in antibody concentration is not the same. I don't have any duplicates except for this control here, and it covers a broader range of antibody concentrations, which is actually good for what we are looking for. I wanted to compare the designs. First, I can do it graphically, because the area covered by each design is different, and from my point of view the space filling will let me try more antibody concentrations, which may be a good point. But I can also compare the designs formally: in DOE, you have Design Diagnostics, Compare Designs, and you see I have both of them. I already have the Fast Flexible open, so I add the Custom Design. The column names are the same, so I don't need to match many columns; it does it for you and recapitulates the factors. For the Fast Flexible, it doesn't start exactly from zero and doesn't go up to 10, but it's nearly there. For the model, I cannot have the categorical factor squared, but I have antibody by antibody and antibody by antigen retrieval. I cannot get antigen retrieval to the power of two, but that's okay, and we go for the design evaluation. That's why I didn't do it one by one; I wanted to compare. In blue here, if you look at the power plot, is the Fast Flexible Design, and in orange is the Custom Design. You see that [inaudible 00:17:23] the Custom Design looks better in terms of power for determining the model than the Fast Flexible Design. If you go to the fraction of design space plots, it's the same: the Custom Design seems to fit better. If we compare the design diagnostics, they are relative to the Custom Design, meaning the Custom Design has a value of one, and if I look at the efficiency, the Fast Flexible Design is at 0.7, so it's roughly 30% less efficient, let's say, than the Custom Design. These are just the three diagnostics I'm showing you here, and it seems like the Custom Design is better than the space-filling one [inaudible 00:18:18], but let's see what the results tell us. The profilers are obtained after entering [inaudible 00:18:26], so just to show you an example of the data: I have my DOE table here. I just added the signal intensity and background responses, so I look at my images and say, okay, I have no intensity, I have some, I have more, and so on, and I did the same for both designs. These were two runs, so two different data sets. At the end, I fit the model; just click on the model script. You see it's standard least squares with effect screening, with all the interactions here, and I fit everything together. I have some factors that I could remove, but I decided to keep everything. Then I go to the profiler. If we look at the best condition, I maximize the desirability as it was defined, maximizing my intensity, and you see it goes to the high point here, and minimizing background, and it found this condition. After this, I realized that maybe I don't want to maximize signal intensity. I just want to match a target, because maybe the sample I'm using here is not a sample with low expression, and I want to be able to detect low expression of my targets.
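For readers without JMP at hand, here is a small Python sketch of the two ideas just used: a space-filling sample (via a Latin hypercube, a common space-filling method, not JMP's Fast Flexible Filling algorithm) and a relative efficiency number. The toy designs and the resulting ratio are illustrative stand-ins, not the designs or the I-efficiency computed in the talk.

```python
# Illustrative only: compare a space-filling-style design against an
# RSM-style design for the same model using relative D-efficiency.
import numpy as np
from scipy.stats import qmc

def rsm_matrix(ar, conc):
    """Model matrix: intercept, AR, conc, AR*conc, conc^2."""
    ar, conc = np.asarray(ar, float), np.asarray(conc, float)
    return np.column_stack([np.ones_like(conc), ar, conc, ar * conc, conc**2])

def d_criterion(X):
    n, p = X.shape
    return np.linalg.det(X.T @ X / n) ** (1.0 / p)

# Space-filling-style runs: Latin hypercube over concentration 0-10,
# alternating the two antigen-retrieval levels (coded -1/+1).
lhs = qmc.LatinHypercube(d=1, seed=7)
conc_sf = qmc.scale(lhs.random(n=10), [0], [10]).ravel()
ar_sf = np.tile([-1, 1], 5)

# RSM-style runs: concentrations pushed to 0/5/10, crossed with both AR levels.
conc_rsm = [0, 5, 10, 0, 5, 10, 0, 10, 5, 5]
ar_rsm   = [-1, -1, -1, 1, 1, 1, 1, -1, -1, 1]

ratio = d_criterion(rsm_matrix(ar_sf, conc_sf)) / d_criterion(rsm_matrix(ar_rsm, conc_rsm))
print("relative D-efficiency (space-filling vs RSM-style):", round(ratio, 2))
```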
I change it a bit and say, okay, I want to match a target, which is a middle intensity, not the highest one but the middle one. I still want to minimize background, because background is not good. Now, if I maximize desirability, you see that it changed the concentration of antibody to use. This is nearly what we obtained here; maybe the number is a little different, but it's okay. I did the same for the Fast Flexible Design. I obtained two different conditions, one at nearly 3.8 of antibody in CC2 and one at 2 in CC2. Here it's not pH 6 and pH 9, because on this automaton it's called CC1 and CC2, but it's the same. Then I say, okay, I have these two conditions, and I compare them to the initial protocol, meaning our reference, which is our standard approach, which gave the images I showed you before. Here you have the Custom Design conditions and the Space Filling Design conditions. You see that these two conditions give data that is at least as good, I would say even better, than the one we defined with our standard approach. For me, the Space Filling Design allowed testing more antibody concentrations, which is useful when you have difficult targets. I will go for this because, in addition, these two conditions you see here were in fact present in the Space Filling Design, so I didn't have to rerun the conditions; I had the data in the Space Filling Design and the images to double-check that it works well. I chose a Space Filling Design to test on another protocol, just to see if it's working. I won't go through everything; I just take the same setup and change my responses for the new target. I obtained this model where, to match the target of two for signal intensity and to minimize the background, it said you should use 2.8 micrograms per milliliter of antibody in the CC2 condition. If I compare it to the protocol we developed, that one said CC1 and 5 micrograms, so it's not the same; both changed. But as you can see on the image, the protocol defined by the Space Filling Design is at least as good as the one we usually develop. Now, what is missing to convince operations and managers is, for sure, the cost and time of our OFAT approach compared to the designs here. First, I compared the Custom Design and the Space Filling Design, including the time it took me to design everything, to our standard approach. Our standard approach is in blue in this graph, Space Filling in red, Custom Design in green, and I compared the number of cycles, which is the time-consuming part, the number of slides, the time from design to protocol, and the time from technical request to results. If you look at the time from technical request to results, even having to design everything, and even though it's a new way of working for the technical team, we shortened the time to results. Now, with the second example, I did not have to design everything again. It's more like what we will do at the next step if we take this approach as our new standard approach, so I compared the standard approach used for this project, in blue, to the DOE setup plus the comparison cycle, in striped red. Because we have a reference protocol, I did a comparison cycle, but if this becomes the new approach we are using, then we will not do a comparison cycle, because we will not have a reference; we would just do the DOE setup, and that's in red. As you can see, it decreased the number of cycles, the number of slides, and the amount of antibody we used, and the time to final results in days is shortened by half, which is quite nice.
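To make the match-target desirability step from the profiler discussion above concrete, here is a minimal Python sketch. The response curves below are invented for illustration, not the fitted JMP models from the talk; the point is only the mechanics of combining a target-matching desirability with a minimizing one.

```python
# Minimal desirability sketch: target-matching for signal intensity,
# minimizing for background, combined as a geometric mean and scanned
# over antibody concentration.
import numpy as np

def d_target(y, low=0.0, target=2.0, high=3.0):
    """Desirability = 1 at the target, falling to 0 at the limits."""
    y = np.asarray(y, float)
    d = np.where(y <= target, (y - low) / (target - low), (high - y) / (high - target))
    return np.clip(d, 0.0, 1.0)

def d_minimize(y, low=0.0, high=3.0):
    """Desirability = 1 at no background, 0 at the worst background."""
    return np.clip((high - np.asarray(y, float)) / (high - low), 0.0, 1.0)

# Hypothetical response curves versus antibody concentration (0-10 ug/mL)
conc = np.linspace(0, 10, 101)
intensity  = 3.0 * conc / (conc + 2.0)          # saturating signal (toy model)
background = 0.02 * conc**2                     # background grows with concentration

overall = np.sqrt(d_target(intensity) * d_minimize(background))
best = conc[np.argmax(overall)]
print(f"best concentration under this toy model: {best:.1f} ug/mL")
```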
In conclusion, I like this quote from Steve Jobs: "Start small, think big." At the beginning, when I saw all these [inaudible 00:24:53], I thought, oh, it's nice, but can I apply it? I struggled a bit, but I said, okay, I will start the simplest way, and I set it up for this simple IHC, and I convinced my manager even more rapidly than I thought. Then the operational team, and eventually other leaders, were convinced that the approach can be applied to IHC. We cannot guarantee that we won't need optimization, as we sometimes do now, because you don't know at the beginning whether it will be an easy development or not. The next step I would like to introduce to the lab is the multiplex setup: detecting multiple targets on the same tissue, starting with two targets instead of the simplex one and going up to five targets. This is ongoing. Finally, I want to say thank you. I want to thank the steering committee for having selected my abstract. I want to thank the Cerba Research Montpellier team, including the lab technicians, because it's sometimes hard to follow my crazy ideas; my manager, who is always supportive of new strategies and new ways of seeing things; and all of the conception and validation team I am a part of, for following me in this test too. I want to thank the JMP French team, and Stephane for being supportive, answering all my questions, and allowing me to load the new JMP 17 version when I was stuck with 16, so I could use the nice Design Explorer platform. And I thank you all for your attention.
In biopharmaceutical process development, the characterization of critical process parameters (CPPs) and controllable process parameters is crucial. As in many industries, process development and characterization studies start at the laboratory scale, and the production process is subsequently upscaled to the final production facility. JMP plays a central role as a tool to support these studies, from DOE to process modeling. The use of online sensor data is a potential source of information that is currently underutilized in process development and characterization studies. In particular, the comparison of microbial metabolic profiles between production scales is mostly performed exploratively. The statistical analysis of this data is an option to understand differences and can help to enable better process control strategies. In this talk, we explore: how Functional Data Explorer in JMP Pro can give a better understanding of scale differences; the current usage of JMP for process characterization studies at Lonza; and the utilization of Functional Data Explorer in JMP Pro to support process understanding and optimization.   Hello and thanks for joining my talk. I am Anne-Catherine Portmann, a Project Leader in the Microbial Process Development Department at Lonza. Today, I am here to talk about using microbial metabolic profiles to improve scale down model qualification and process characterization studies. I want to briefly start with this disclaimer. I will not read out the whole text, but I want to mention that I believe all the information I share with you today is correct. For confidentiality reasons, I specifically want to mention that all the data is normalized and anonymized. Here is the agenda of today's talk. We'll start with an introduction to Lonza, and I will give you some background information on bioprocesses and process characterization studies. This will help you understand the three following examples where I use JMP. In the first example, I will show you how I applied JMP Functional Data Explorer to compare data from batches at different scales. In the second example, I will use JMP in a very standard way to determine the proven acceptable range of a parameter. The third example will be on the utilization of Functional Data Explorer to minimize product-related impurities. Finally, I will conclude this talk and answer your questions. I'd like to start by giving you a quick introduction to the company I work for. Lonza is a multinational manufacturing company for the pharmaceutical, biotechnology, and nutrition sectors. On the right, you have a picture of the site of Visp in Switzerland, where I am working. Let's have a look at some numbers about Lonza. You most probably already know that Lonza is a global company of about 18,000 employees with a long history of more than 100 years. We have more than 35 sites worldwide, supporting our customers in manufacturing innovative medicines. Visp is the biggest site of Lonza. In Visp, the microbial production capabilities range from 70 liters to 15,000 liters. When a new process arrives at Lonza, it is first transferred from the customer to the process development department. The building where I am working, in process development, is really central to the site, and we have all the manufacturing production around us. That helps a lot to have good collaboration between process development and manufacturing scale.
In the process development department, we test and adapt this new process to derisk the upscaling to the target manufacturing scale. I know that many of you come from different kinds of industries and are not familiar with microbial bioprocesses. Therefore, I will describe in the next couple of slides a typical microbial bioprocess and the typical steps to characterize it. A bioprocess is composed of an upstream part and a downstream part. During the upstream part, the protein of interest, here in orange, is produced in microorganisms in the fermentation. Then the cells are broken down, and the cell debris is removed during the separation. During the downstream part of the process, we remove other proteins and purify the protein of interest until the final product, which is very pure. The fermentation is an important step, as it produces the protein of interest. The amount of product is defined by this step, as well as the product quality. Any mistake in the protein chain cannot be corrected later in the process. Therefore, it's very important to correctly regulate the input parameters, such as temperature, pH, dissolved oxygen, or other kinds of parameters, and to monitor the output attributes really well, which are, for example, the concentration of the product, the titer, the purity of the product, or the biomass, which corresponds to the amount of cells in the bioreactor during the fermentation. During a PC study, a process characterization study, we want to understand the impact of the input parameters on the output attributes and define the parameter ranges where the attributes are within specification. Understanding the dependency of input parameters and output attributes is key to comprehending bioprocesses. What is a process characterization, the PC? The process characterization is part of the process validation, and it contains four parts. In the first part, we have the risk assessment, where we select the parameters to investigate further. We select these parameters based on process knowledge, on historical data, and on the experience and expertise of the people involved in these risk assessments. The second part is the qualification of the scale down model. We need a scale down model because we cannot perform all the experiments to investigate these parameters at scale. It would be too expensive, and the efficiency of those experiments would not be very high. At the laboratory scale, we have the possibility to run multiple reactors in parallel, and that helps with having efficient experiments in the lab. However, the instrument at the lab scale has to perform in exactly the same way and give the same results as we obtain at the manufacturing scale. If it does not, we have to evaluate the difference between these two scales, look at these scale-dependent differences in more detail, and be able to explain them. The next step during a PC study is preparing the design of experiments using JMP to optimize the number of runs, and the experiments are then performed in the lab. From these experiments, we collect all the output data, generate a data table in JMP, and analyze the data to evaluate the interactions between the parameters and the attributes, and to define the parameter ranges and impacts. Why should a process be characterized? A process should be characterized to ensure that we deliver a constant product quality and a reproducible yield during manufacturing.
When you are taking a medicine, you always want to have the same effect. If you take a painkiller, you always want to have no more pain after you take it. You don't want it to be that once you take it, you don't have any more pain, but two days later, when you take the same medicine again, it has almost no effect, and the day after that, when you take it, the effect is too big. That is really the kind of example of why you want your medicine to have constantly the same quality and the same efficiency. To do that, we perform a process characterization and a process validation. Now, I will show you some examples. I will start with the qualification of a scale down model. As I told you, the scale down model qualification is a very key step during the PC study to ensure that the experiments are performed in a representative instrument. The fermenter, or fermentation reactor, has many sensors connected to it, which record a lot of data continuously during the experiment. This is a large source of information that is really valuable for comparing differences between scales. I will show you how we explore these scale differences using this kind of data. As I have said many times now, it's really cheaper to use an instrument in the laboratory. We can use a high-throughput device at laboratory scale, and that allows us to run a lot of experiments more cheaply than doing all these experiments at manufacturing scale. By qualifying the scale down model, we are also able to determine the scale-dependent differences. Until now, we have really been using the offline data. This is how we do it: we have the data at lab scale and the data at manufacturing scale in a one-way analysis, and we use an ANOVA or a mean comparison to determine whether these groups of batches at lab scale and manufacturing scale are comparable or not. The thing is, when we have online data, we cannot use this kind of graph to make a comparison. The data look like the right part of the graph. Some people just compare these curves by expert eye and say, okay, that is comparable, or these data are completely different. The problem with this kind of approach is that the people comparing the curves could leave the company, or some colleagues look at the data and say, "Oh, no, I don't see a difference," while you do see it. It's not really a good statistical way to compare data. Thanks to JMP Functional Data Explorer, we are able to compare these data statistically in an appropriate manner and even to determine whether we are getting some clusters. Qualifying the scale down model is key to translating the ranges from the PC study to the manufacturing control strategy. In case we have a difference, we are able to translate back this difference. I will show you very quickly how the JMP Functional Data Explorer works and how we use it for the example that will come later. The Functional Data Explorer is a very good tool to analyze continuous data. I will not explain in great detail how it works; I think a lot of talks have already been given, and are being given currently, about how it really works. I will focus on the main parts that are useful for my analysis. The first step, when we have our data, is to fit a model. This model has to be chosen between the B-spline, the P-spline, the Fourier, or the wavelet basis. Then we check which one of the models best fits our data. As a second step, we have a functional principal component analysis.
If you have already worked with principal component analysis, it's the same approach, but with functional data instead of data points. The idea is to transform the space to decorrelate the functions, giving the shape functions that we have here. These shape functions explain the variability of the batches' functions around the mean function. For example, here, the first shape function explains 56% of the variation in the attribute trend between the batches. Another graph that we obtain is the score plot. In the score plot, we can choose which components we want to compare. In this case, we have the choice of five components, the five shape-function components, which we can plot on the X- and Y-axes. Here we can look at whether the data group along the X-axis and the Y-axis, so along the first or second component in this case. By looking at them, we can already see that along component 1 we have, with high probability, two groups, one at the left part of the X-axis and one at the right part. When we want to confirm this, we can also use the control chart. With the control chart, we generate one control chart per functional principal component and then add the different scales at the top. We can look at the means generated for the scales and evaluate the scale differences and whether the means are really the same. Functional Data Explorer allows us to cluster comparable batches and to compare scale means. It's exactly what we were looking for to compare these attribute data. Let's have a look at an example. These data are real, but for confidentiality reasons, we anonymized and normalized the data in this presentation. In this example, we would like to compare the two lab scales, lab scale 1 and 2, which are in purple and in blue, with the manufacturing scale for a specific attribute, which was a time course at startup. We would like to know which of these two scales would be the better scale down model of this manufacturing scale. We ran the Functional Data Explorer and got some results. If we look at the eigenvalues of these functional principal components, we see that the first one already explains more than 78% of the variability between batches. The second one is more than 9%, so we will concentrate the rest of the analysis on these two components. We look at the score plot. In the score plot, we directly see, along the X-axis for functional component 1, which was the one explaining most of the variability between batches, that we have two clusters: one on the left having only the lab scale 1 data and one on the right having the lab scale 2 data and the manufacturing scale, the blue and green dots. We have some outliers. Along component 2, we cannot really define two groups. To confirm these groups, we look at the control chart. Here, we really see that the green lines corresponding to the mean values of the batches are really similar between lab scale 2 and the manufacturing scale, the blue and green dots. Lab scale 1 had a really different mean, and so this lab scale was not really optimal for us. Lab scale 2 is identified by Functional Data Explorer as the more representative scale down model for this attribute. It's important to say that this is for this attribute; for another attribute, it could be another scale down model. We have to check that for all the attributes that we have decided to explore in our analysis.
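For readers who want to see the underlying idea outside JMP, here is a small Python sketch of grid-based functional PCA (an illustration only; it is not the Functional Data Explorer implementation, and the simulated curves and scale labels are invented): each batch's trace is sampled on a common time grid, the mean curve is removed, and an SVD yields shape functions and per-batch scores whose clustering by scale mirrors the score plot just described.

```python
# Conceptual FPCA sketch on a common grid (simulated data, illustrative only).
import numpy as np

rng = np.random.default_rng(0)
t = np.linspace(0, 1, 200)                          # common time grid

def batch_curve(scale_shift):
    """Simulated attribute trend for one batch."""
    return np.tanh(6 * (t - 0.4 - scale_shift)) + 0.05 * rng.standard_normal(t.size)

curves = np.array([batch_curve(0.00) for _ in range(6)] +    # "manufacturing"
                  [batch_curve(0.10) for _ in range(6)])     # "lab scale 1"
labels = ["mfg"] * 6 + ["lab1"] * 6

centered = curves - curves.mean(axis=0)             # subtract the mean function
U, s, Vt = np.linalg.svd(centered, full_matrices=False)
explained = s**2 / np.sum(s**2)                     # variance explained per FPC
scores = U * s                                      # FPC scores per batch

print("variance explained by FPC1:", round(explained[0], 3))
for lab, sc in zip(labels, scores[:, 0]):
    print(lab, round(sc, 2))                        # FPC1 separates the scales
```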
Now that we have defined the scale down model, we move to the next part of the PC study: designing the experiments, performing them, looking at the interactions between the parameters and the attributes, and analyzing the data to determine the parameter ranges and impacts. For these two steps, we are not using Functional Data Explorer, but JMP with a very standard DOE approach. Why do we use a DOE? DOE allows us to relate parameters and attributes and to see the correlations between them. Indeed, in the design of experiments, we can identify different kinds of effects: the main effects, the quadratic effects, and the interaction effects. We are also able to optimize the attributes. For example, we can maximize a titer, minimize some impurities, maximize some purities or quality attributes, or reach a certain target. We have many options that can really help us. In the report at the end, we also have the effects of the parameters on the attributes, and thanks to the p-values, we can determine whether they are significant or not. To design the right model, we can use the JMP DOE menu and choose the design that we want. If we have doubts about how to do it, we always have the Easy DOE option, which is super convenient and helps us go step by step in designing our model. In our case, we use the classical response surface with a central composite design. In this design, we have a center point that corresponds to our set point. Let's say we have 35 degrees as a set point. Then we have what we call the operating range. It's the range including the accuracy of the probe as well as the small variations that can occur in a fermentation. For example, the 35 degrees will not be a straight line; you will have small oscillations due to the accuracy of the probe, the nutrients we are adding, and how the fermenter maintains the temperature. It is known that within this operating range, we will be within specification. Now, we want to know what happens if we enlarge the range: in case something happens during the process that causes, for a short time or maybe for the whole process, a higher or a lower temperature than the set point, are we, during this change of temperature, still within the specification for our product quality? JMP DOE helps us to optimize the number of experiments performed in the lab, but also to understand the interactions between the parameters and the attributes. Then we perform the experiments in the lab, collect all the data in a JMP data table, and build a model. When we have all the data in the table, we organize it following the tidy data principle: one row per experiment, one column for each parameter or each attribute. A very standard way to use JMP. Then we can fit the model with the different options. For example, we can put the different parameters in a response surface model. We use the Y here to add the attributes that we want to explore. We can choose different personalities, for example, the standard least squares approach or the stepwise approach, and so on. After running this model, we get a report. In the report, we have the tendency to scroll down and see the result in the prediction profiler, to see the interactions between the parameters and the attributes.
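As a concrete picture of the central composite layout described above, here is a short Python sketch. The factor count, set point, and range are placeholders for illustration, not Lonza's actual PC-study parameters.

```python
# Sketch of a central composite design: factorial corners, axial (star)
# points, and replicated center points around the set point.
import numpy as np
from itertools import product

def central_composite(k, alpha=None, n_center=3):
    """Rotatable CCD in coded units for k factors."""
    alpha = alpha or (2**k) ** 0.25                  # rotatable axial distance
    factorial = np.array(list(product([-1, 1], repeat=k)), float)
    axial = np.vstack([a * np.eye(k)[i] for i in range(k) for a in (-alpha, alpha)])
    center = np.zeros((n_center, k))
    return np.vstack([factorial, axial, center])

coded = central_composite(k=2)
# Map coded units to, e.g., a hypothetical temperature set point of 35 C +/- 2 C
temperature = 35 + 2 * coded[:, 0]
print(coded.shape)                  # (4 factorial + 4 axial + 3 center, 2) = (11, 2)
print(np.round(temperature, 2))
```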
But to do that, and to ensure that we have the right model for it, it's very important to concentrate first on the evaluation of the model. That is the first step, and it's why JMP gives us a lot of output in this report. We have to really understand the results of this evaluation. We have, for example, the lack of fit, the studentized residuals, the summary of fit, and so on. I really advise you to take a deeper look at these outputs, really understand what they mean, and get the best model for your data. Performing a proper analysis in JMP and verifying the quality of the fitted model ensures a reliable PC study and drug product. The relationships between attributes and parameters are in the prediction profiler, as I told you before. We have a set point in the middle; normally, it sits near the top of the prediction curve. We have the operating range, which is where the data stay within the attribute range. You get a larger range at the edges, where the attribute range intersects the prediction curve; those are the extreme points where we are still within specification. That is the proven acceptable range that we want to find with a PC study. The PC study also gives us the impact, and that depends on the shape of the curves in the prediction profiler: the flatter the curve, the lower the impact; the narrower the curve around the set point, the higher the impact. The attribute specification is only met within a parameter's proven acceptable range. Now, I will show you an example where we used that. In this example of PC study data from a fermentation, we created a response surface model with a stepwise approach and the all-possible-models option. We met quite a lot of criteria in our model evaluation; we were just not passing the lack of fit. We had a lack of fit, the model was not fitting well, and we were wondering why. We discussed internally with a lot of experts and looked really deeply into our data. What we found was that we probably had a plateau effect that was not captured by a response surface model, because it's a second-degree model. We had to add a third-degree polynomial term for the parameter. That is what we did, and we reran the analysis. There, we met all our criteria. Here you have the first model and the second model. You see at the end that the parameter that changed was the first parameter, and we have a slightly different curve in the prediction profiler. Indeed, we had a plateau effect here at the end. At this point, we had found a model that was correct and met all our criteria. We were able to define the proven acceptable ranges and the impacts. The impact, as you see for the first two parameters, was higher than for the third parameter. This is now applied in our production, and some runs are performed at production level with these ranges. At the end of fermentation, some product-related impurities can be detected in the analytical method's chromatogram as a post-peak shoulder, and that is really something that is complicated to remove in the downstream process. I will show you how, by using Functional Data Explorer, we were able to minimize these product-related impurities. As you see here, we have the chromatogram. What we were expecting is that the curve that goes up here would go down with the same shape on the right side. But we have this bumpy side going down.
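To illustrate how a proven acceptable range can be read off a fitted prediction curve (a toy Python sketch; the coefficients and the specification limit are made up, not PC-study results, and a real study would also consider the upper limit and the operating range):

```python
# Toy PAR calculation: find where a fitted quadratic prediction curve
# crosses the lower specification limit.
import numpy as np

b0, b1, b2 = 1.00, 0.08, -0.020       # fitted quadratic: y = b0 + b1*x + b2*x^2
spec_lower = 0.90                      # attribute must stay above this value

# Solve b2*x^2 + b1*x + (b0 - spec_lower) = 0 for the parameter (coded units)
roots = np.roots([b2, b1, b0 - spec_lower])
par_low, par_high = sorted(roots.real)
print(f"proven acceptable range (coded units): [{par_low:.2f}, {par_high:.2f}]")
```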
That means we have impurities in this part of the product, product-related impurities, which are hard to remove in the downstream process. This issue could become even bigger: if we don't remove them, or don't find a way to remove them, it could be that at the end of the process we are not within the quality specification of the product. We now have a curve that contains some impurities. We know that with JMP Functional Data Explorer we can better understand the curve shape. We want to understand how some parameters, from the DOE where we tested different parameters as you saw before, can maybe be used to explain the impurities and perhaps reduce them. The problem for us was that the Functional Data Explorer gives us principal components that are not real parameters; they are not concrete, real parameters that we can change in the lab, not a temperature that we can change by a few degrees. But in JMP Functional Data Explorer, we have an option that is super useful, and it converts the functional principal components into real parameters: the functional DOE. By adding, when you are building your model in the Functional Data Explorer, the DOE analysis that you performed, JMP is able to translate the FPCs back into real parameters in the profiler. So we were able to convert them back, and our hope was that, with this approach, we would be able to remove or decrease the impurities. Here is the example of how we concretely did it. We have this shoulder; here I zoomed in on that part of the peak to have a better look. We have this post-peak shoulder of 0.11, which was rather high. That is the amount when all the parameters are at their set points. Then we optimized it by moving each parameter in the functional DOE profiler. We found that by decreasing the first parameter and increasing the next two parameters, we reduced the amount of these impurities by half. That is really a great result for us, because it's so difficult to remove these impurities in the DSP part that if we can do it really early in the process by optimizing some parameters, that saves us a lot of time and money. It is really a beneficial solution for us and for our customers. That said, it is a result that we still have to test in the lab; it's not yet done. We also have to evaluate whether it would not create other issues, reducing too much other attributes that we don't want to reduce, or increasing other aspects. During this presentation, you probably realized the importance of statistics to support a process characterization. Today, I presented how we can improve process characterization and process transfer from small scale to manufacturing scale thanks to JMP. As take-home messages, you can remember these four aspects of my presentation. First, a process characterization ensures a constant product quality and reproducible yield during at-scale production. Second, with JMP Functional Data Explorer, we are able to cluster continuous data to detect scale differences. Third, JMP offers the option to identify the best fitting model, even for complex data such as PC study data. Finally, JMP functional DOE translates the FPCs into operational parameters, which supports process understanding and optimization. With that, I would like to warmly thank all my colleagues who helped me prepare this presentation.
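To illustrate the back-translation idea behind the functional DOE step, here is a conceptual Python sketch with simulated curves and made-up factor settings (not JMP's implementation and not Lonza's data): FPC scores are regressed on the DOE factors, and a predicted curve at a new factor setting is rebuilt from the mean curve plus the predicted score times the shape function.

```python
# Conceptual "functional DOE" sketch: link FPC1 scores to DOE factors,
# then predict a whole curve at a new factor setting.
import numpy as np

rng = np.random.default_rng(1)
t = np.linspace(0, 1, 100)
factors = rng.uniform(-1, 1, size=(12, 2))          # hypothetical DOE settings

# Simulated curves whose shape depends on factor 1 (illustrative only)
curves = np.array([np.exp(-t / (0.3 + 0.1 * f1)) + 0.02 * rng.standard_normal(t.size)
                   for f1, _ in factors])

mean_curve = curves.mean(axis=0)
U, s, Vt = np.linalg.svd(curves - mean_curve, full_matrices=False)
fpc1, scores1 = Vt[0], (U * s)[:, 0]                 # first shape function and scores

# Linear model: FPC1 score ~ intercept + factor1 + factor2
X = np.column_stack([np.ones(len(factors)), factors])
beta, *_ = np.linalg.lstsq(X, scores1, rcond=None)

new_setting = np.array([1.0, -0.8, 0.2])             # intercept, f1, f2
predicted_curve = mean_curve + (new_setting @ beta) * fpc1
print(predicted_curve[:5])
```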
A very special thank you to Sven, who had the idea for this presentation topic, but also to Claire and Ludovic, who shared their statistical knowledge; to Jonas for his support in preparing this presentation; to Romain for sharing some data with me; and to all the colleagues for their input and thoughts. Finally, a big, big thank you to Florian, who has supported our department with any JMP question, from the most complicated one to the most stupid one, over the last years. Thanks also to you for your attention. I hope you found this informative, and I am ready to answer any questions you have. Also, in Manchester, if you have more, we can still discuss them. Thank you.
Innovating more sustainable, higher-performing products is the foundation of our ambition for a Clean Future in Home Care at Unilever. Scaling up new technologies from laboratory to factory brings considerable and exciting challenges, so how do we approach innovation to deliver for our consumers and our planet? In Process Development, we believe the most value is created if DOE and modelling are established as key skills in every process engineer. This is why we have built a globally active community of practice through a 70:20:10 approach to digital upskilling, delivering impactful innovations through DOE and modelling on high-value projects. Embracing a "digital mindset" has empowered engineers to deliver impact and value as individuals, developing deep technical expertise in new-generation technologies through structured data capture and statistical modelling. This approach has enabled the introduction of sustainable biosurfactants and low-CO2 formulations straight to factory, with cost and complexity reductions across supply chains. New efficient process routes, optimised through modelling, have resulted in double-digit million-euro savings and product performance improvements throughout our Home Care portfolio. From formulation to factory, our approach to process development is helping to deliver the Clean Future revolution through a digital approach to innovation.   Hi, I'm Ewan, and I'm a Process Development Engineer working in Home Care at Unilever. Today, I'll be talking you through how we built an active global community for a digital-first approach to innovation and sustainability. Now, whether you know the brands or the company itself, Unilever is one of the world's largest consumer goods companies, with a portfolio of leading purposeful brands: home care brands like Persil or OMO and Comfort, and other brands you'll know, like Ben & Jerry's and Dove. We have an unrivaled presence in future growth markets. As a business, we have a determinedly commercial focus to be sustainable. Now, this focus as a sustainable business helps drive a real impact on the planet through the 3.4 billion people that are using our products every day across over 190 countries. The reach we have as a sustainable business resulted in over €60 billion in turnover in 2022. Now, our purpose and ambition as a business really shine through in our Clean Future strategy. This is the delivery of products that are unmissably superior in terms of product performance, products that provide great value across all price tiers in all of our brands, and products that are sustainable. It's really the combination of all three in every product that we deliver to our consumers, who are at the heart of our strategy, that makes this such a progressive strategy. There are over 50 proof points of the real impact and business power that this strategy has enabled. In Home Care, it's Process Development that delivers our Clean Future strategy at factory scale, taking the ambition of formulation scientists and marketeers and scaling up from lab concepts all the way to optimized factory-scale production around the world. Typically, we do this in four key steps, the first of which is building an understanding of how the formulation and the process interact.
Whether that's the effect of temperature making the product thicker and harder to pump, or effects on product clarity, so whether it looks clear or hazy, we really need to build an understanding of these interactions up front so they can feed into all the subsequent steps. The next step is process route design: starting to understand and build an idea of how we want to process this, both at pilot scale, so small scale, and also at large scale, in our factories around the world. From these two steps, we can then start building scale-up rules. This is the pilot plant scale-up work. Whether we're making product in 50-kilogram batches, 100-kilogram batches, 200-kilogram batches, or even more, this is where we really start to develop rules that will apply at factory scale, where we're producing on the order of tons per batch. Resulting from these is the main plant trial: testing our product in live factories around the world. In each of these steps, we generate a lot of data and a lot of understanding to build into the next step. We want to be as exploratory as we can to really push the innovation process, to innovate products that are conceptually new and that our consumers want. Our ambition here is: can we link these data and our ambition to be exploratory into a digitally driven innovation pipeline throughout the whole process? This is what we tried. We wanted to pilot a hypothesis-driven approach to design of experiments, DOE, and we had a new formulation that we needed to launch for one of our popular home care brands. We needed to launch this in over eleven factories worldwide, comprising over 25 different types, sizes, and scales of mixer. Now, this is a big challenge, and it's essentially a global rollout. How can we optimize this for unmissably superior product performance, to provide great value to our consumers, and also to provide a sustainable product? Because every optimization we do at this step can benefit our 3.4 billion consumers around the world, and essentially our planet as well. We started by forming a hypothesis, which consisted of which process parameters we thought might influence our product, whether the product quality parameters themselves or the process. We trialed a design of experiments approach at pilot scale. Doing this at smaller scale before we scale to a factory allows us to minimize cost and raw materials, and actually expedite the process as well. From these data, we were able to start modeling some of our quality parameters. An actual example is shown here. This enabled us to identify the actual critical control points, not just in the process itself, but in some of the formulation parameters as well. What we see here is the prediction of our product viscosity, depending on temperature, another quality parameter (parameter 2), and materials A and B. Now, parameter 2 and material A don't really have much of an impact on viscosity, but the really interesting part is in the temperature and material B. If we reduce the temperature even further, we remain in the green zone, within specification. If we reduce material B, it's the same. There's a potential for energy savings through temperature reduction and a potential for cost reduction through reducing material B. Both of these will actually result in a greenhouse gas reduction. This was the business benefit that we managed to provide through this approach: double-digit million euros in material savings and a global energy reduction for sustainability, all for a Clean Future.
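To make the "green zone" reasoning above concrete, here is a toy Python sketch. The viscosity model and specification limits are invented for illustration; they are not Unilever's fitted model. The idea is simply to scan temperature and material B levels and keep the settings whose predicted viscosity stays within specification.

```python
# Toy "green zone" scan over temperature and material B (hypothetical model).
import numpy as np

def predicted_viscosity(temp, mat_b):
    """Hypothetical fitted model: viscosity falls with temperature, rises with material B."""
    return 1200 - 8.0 * temp + 150.0 * mat_b

spec_low, spec_high = 800, 1100          # invented specification limits
temps = np.linspace(20, 50, 7)
mat_b = np.linspace(0.5, 2.0, 7)

for t in temps:
    for b in mat_b:
        v = predicted_viscosity(t, b)
        if spec_low <= v <= spec_high:
            print(f"T={t:4.1f}  B={b:3.1f}  viscosity={v:6.1f}  within spec")
```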
Now, this was the work of one engineer or a small team. If this is what we can achieve, why wouldn't we want to make these key skills in every engineer? That's what we set out to do. We wanted to make design of experiments and modeling key skills for every engineer in Home Care process development around the world. These engineers have different first languages, live in different countries and time zones, and are at different stages in their development. The aim here was to ensure that everybody could benefit from one program. Different teams work on different products, so they develop specializations; maybe they want to use certain features of JMP. We created a global community of practice where engineers can get together, share their learnings, and upskill together. The approach we took for this was the 70:20:10 approach. Ten percent of the time is spent on structured training, led by JMP champions, engineers with a higher proficiency in the use of JMP within our team, working with other engineering teams to upskill them. Twenty percent is shared learnings: regular community of practice sessions with engineers as participants and their mentors, so we can share our learnings, our struggles and challenges, and how we've progressed with JMP, with the aim of upskilling everybody. The 70% is the most crucial part of this program. That's delivering impact through hands-on work on key technologies in high-value projects. These projects are business big bets that we want to launch. Using DOE can help expedite timelines, can help optimize products, and can help us understand the behavior of the product to a level we never had before. If we made DOE and modeling optional, nothing would happen. We're trying to change ways of working here, so we have to integrate this as the approach for our high-value projects. We started by trying to cement good, structured data capture, forming a foundation for all of our future modeling work. We wanted, again, to cement a DOE-first approach to our exploration. We typically used a custom design in JMP for this, using a response surface. This was complex enough to accurately represent the formulations and the processes, but not so complex that it took a very long time or would be impossible for our engineers around the world to understand. Building on this, we wanted to build data analysis and modeling skills in our engineers, so they can start developing insights about the formulations and the processes. Again, this is straightforward and simple: linear regression using standard least squares. We wanted to ensure that there is no multicollinearity in our models. If we see an effect of temperature, we want to be 100% sure that it's temperature and not an artifact of another interaction. The real ambition of this journey is that we can build expert engineers that can create value-adding technology insights, moving on from custom designs and basic regression to other functionalities like simulations, whether we can simulate product specifications, for example, using desirability functions to optimize our products for batch cycle time or cost, and moving on to more complex ways of modeling, away from numerical modeling for viscosity and starting to consider other factors like foam. This is what we managed to do.
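As a small, hedged example of the multicollinearity check mentioned above (the factor names and data below are invented; this is not Unilever's data, and it may not be their exact diagnostic), variance inflation factors close to 1 indicate that an estimated temperature effect is not an artifact of correlation with the other factors:

```python
# Multicollinearity check via variance inflation factors (illustrative data).
import numpy as np
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(42)
X = pd.DataFrame({
    "temperature": rng.uniform(20, 60, 30),
    "material_B":  rng.uniform(0.5, 2.0, 30),
    "parameter_2": rng.uniform(0, 1, 30),
})
X_design = np.column_stack([np.ones(len(X)), X.values])   # add intercept column

for i, name in enumerate(X.columns, start=1):
    print(name, round(variance_inflation_factor(X_design, i), 2))
```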
I'm going to talk you through a couple of case studies where we've managed to develop superior products, where we've managed to provide great value products and deliver the savings to our consumers, and where we've managed to deliver sustainable products to market, all three at the same time in our products. The first is an example of a product that's one of the most complex products we've ever developed. It all started with characterization through DOE. Modeling the formulation space helped us really build an in-depth understanding of the formulation. From this understanding, we were able to optimize the formulation. This is key, because with our approach to clean future, we want to reshape our formulations, not just for liquids, but powders, gels, capsules, creams, and bars. We're having to learn how we can produce the most sustainable formulations possible and deliver these with great performance and great value. This is why characterization is critical. This product was very complex, and we have to be able to make it in a factory. What we did is work with the factory teams to incorporate factory data into our model, and we were able to simulate how the formulation behaves in our factories, building confidence not just in the formulation team and the process development team, but also in factory teams and marketing teams. The results of our modeling enabled us to save multiple millions of euros in capital investment. The situation we're in now is that we can de-risk the formulations in the formulation development phase, before scale-up work and before factory launch, because we can build confidence in right-first-time scale-up of higher-performing-than-ever, better-value-than-ever products. This product was 100% sustainable. Through DOE and modeling, we were able to produce superior products, great value products, and sustainable products. The second example is extremely similar. This is a product you probably use very frequently. It's very well known. Again, it started with characterizing not just the formulation, but the process as well. Investigating the interactions here led to new understanding, understanding we did not have before, on both the formulation and the process. With this understanding and with this model, we were able to reduce the time it takes to make every single batch by 26% at factory scale, a 26% batch cycle time reduction. This is not just a time saving, because all that saved time is now time in which we can make other products, this product and all of our other brands, that we can deliver to consumers. We're unlocking factory make capacity. Building on our model here, beyond batch cycle time, we were able to incorporate cost and other quality parameters to start modeling and profiling multiple parameters at once. This enabled us to maintain a high performance, reduce cost that we can pass on as savings to the consumer, and improve product sustainability through a 21% reduction in our polymers, which are typically non-biodegradable. Again, we were able to maintain performance, provide great value to our consumers, and deliver sustainable products, not just for our consumers, but for our planet as well. To share a few key learnings from our journey: it was really the focus on high-value projects that helped us to create the real business impact we see here. It enabled us to bring stakeholders on board with the digital-first approach to innovation and really helped us start to expand this, not just in formulation, but to process development, supply chain, and other teams. 
We've moved on from process development, and we're expanding this now. The second is that frequent presentations, the shared learnings that made up 20% of our journey, helped maintain the team's development. They helped maintain the pace of value creation for the business. Having written these presentations and given these presentations, it then became much easier to transfer this understanding to other teams as well, and also to stakeholders. It helped stakeholder buy-in as well. The third learning is that one-to-one mentoring was a huge investment of resource at the beginning, but it was by far the most valuable for us. That early-stage investment really helped quickly upskill engineers, and having engineers that are proficient with JMP work with other engineers helped us maintain our focus on what really mattered from a formulation and process perspective. Overarching all of this is a focus on the journey, and long-term commitment is key if we want to successfully change our ways of working. This is not overnight. It's a long journey, and it has some sacrifices as well. Your productivity is slightly lower when learning new tools; when starting to work in different ways, your output is lower. But the results really come when that productivity begins to increase, because every technical insight you can pull out of your models results in a new space to explore, which then results in more insights. You start here to develop a circular innovation process. That circular innovation process is what we are now doing, not just in process development, but all across Home Care. I'm Ewan, and I've just taken you through how we built an active global community for a digital-first approach to innovation and sustainability at Unilever. Thank you very much.
Thursday, March 7, 2024
Whitworth 2
It is not unusual for individuals unfamiliar with how to properly create and analyse pass/fail experiments to treat the data as if they were continuous. Incorrectly assuming binary responses can be handled in such a fashion can lead to disastrous results. Failing to consider an underlying model consistent with these types of data, it is easy to create a design with too few or too many runs, not knowing how to properly estimate the sample size needed to achieve a certain power. Analysing results as if they were continuous fails to consider the fundamental nature of the data and how it might affect model assumptions. Doing so might produce unrealistic results, such as probabilities below zero or above one. This session focuses on designing, evaluating, and analysing experiments for binary responses such as pass/fail. Using two of the most common functions for modelling binary data, the logit and probit, various functionality in JMP and JMP Pro is illustrated, including the use of simulation to estimate power and building prediction formulas. Analysis options for fitting these types of models will also be explored.   Okay, so let's talk about designing and analyzing experiments when we have a pass/fail response. I'm going to talk about pass/fail responses throughout this session. However, this really applies to any case where we're dealing with a response that has two categories, that is binary, where you can be in one of the two categories or the other. They're mutually exclusive and exhaustive: pass/fail, go/no-go, yes/no, as long as that response has got two mutually exclusive and exhaustive categories. We'll break the session up into four different parts. We'll start with an example. Hopefully, it'll look familiar, something you might have seen in the past in terms of dealing with an experiment with pass/fail responses. We'll get into a little bit of things that can go wrong. After that, we're going to talk about the model, because everything depends on the underlying assumptions I'm making about the model, and that drives the direction in terms of the analysis that I pick and in terms of sizing the experiment. Once we're through with the PowerPoint slides, we're going to actually see how I would analyze the experimental data, the example data that I'm going to present. We'll talk about some of the options that are available in both JMP and JMP Pro. Then finally, we'll wrap things up with how you size an experiment. As it turns out with pass/fail experiments, it's not just the number of experimental runs, it's the number of trials per experimental condition that matters. We'll talk about how we set up the simulation in JMP Pro that will allow us to determine whether this number of trials is good enough. Let's talk about our example: we're making widgets. We have a process where we're not particularly happy with the failure rate. We've identified four different factors that we think are going to help us improve our failure rate for these widgets. We've got an additive, and we've got a compression step where there are three factors that we're going to vary: temperature, pressure, and hold time. The question is, does the widget have a defect? We process the widget; yes, it has a defect, or no, it doesn't have a defect. If it has a defect, we're calling it a failure. The current defect rate is 15%, and we're going to use that in the end to size our experiment. 
Our goal is to bring the failure rate down to five percent. However, we would be happy if we were able to identify a difference of about three to five percent. If we were able to detect that our defect rate goes down to, let's say, ten percent, then we would be happy. We would want to be able to detect that. The other thing I want to note is that each run takes about 10 minutes. I'm not unlimited in terms of how many runs I can do; I can do about six runs per hour. All right, so let's say we put these factors into JMP. We open up the custom designer. We decide we want a response surface design. We're going to add three center points, and this is the design that we come up with. So far so good. We take the design, we send the design to the operator who's going to run the experiment, and they return this. We've got 24 runs, about 4 hours to run this experiment. We're told that those ones and zeros represent an observation with a defect or an observation without a defect. Now, I go to analyze the data, and I put it into JMP. Now, I know that those ones and zeros aren't continuous data. It's really categorical data. It's binary data. I'm going to treat it as such. Maybe I know enough that what I'm dealing with is called logistic regression. Because I've designed this experiment in JMP, the response surface model comes over into my Fit Model dialog box. I go to fit the analysis, and right off the bat, I should be worried. Next to those parameter estimates, I get this message saying that the parameter estimates are unstable. Let's say I talk this over with somebody who's a little bit more knowledgeable about these types of experiments, and they tell me, well, the problem is you only did one trial per set of treatment conditions. You really need to do more than one to be able to get a good estimate. Let's say I go back and talk this over with the operator, because they don't like the idea of bumping up the size of my experiment from one trial to five trials; now I've got an experiment that's going to be about 20 hours. But I convince them that we really need to do this, and we decide to rerun the experiment with five trials for each set of experimental conditions. I get my data back. I've got two new columns that came back with my experiment. I've got that Y5 column that corresponds to the number of times a failure was seen, and I've got the column with the N in it that corresponds to the number of trials for a set of experimental conditions. Now, this time, rather than working with the count data, I think maybe what I'll do is divide the number of failed trials by the total number of trials, and I'll treat that as continuous data. Here I've got the proportion of failures for a given set of experimental conditions, and I'm going to treat that as continuous data. It makes me more comfortable because I'm more familiar with modeling data this way. It's the same model that we saw last time; however, rather than using logistic regression, I'm just going to use standard least squares. All right, so far so good. I use my favorite modeling technique, maybe I'll use my favorite model reduction technique, and then I come up with this final model. Again, so far so good. The next thing I want to do is optimize. I want to know what conditions are going to drive that error rate down as far as possible. 
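Before moving on to the optimization, here is a minimal JSL sketch of that proportion step, assuming columns named Y5 (failure count) and N (trials per condition) as described above. It runs fine, but as the talk goes on to show, treating this proportion as continuous ignores the binary nature of the data.

    // Hypothetical column names: Y5 = number of failures, N = number of trials per condition.
    dt = Current Data Table();
    dt << New Column( "Proportion Failed", Numeric, Continuous, Formula( :Y5 / :N ) );
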
I use the optimizer within the Profiler, and it tells me to set the hold time at 45 seconds and the temperature at 45 degrees C, and that should give me a predicted error rate below zero. I might be willing to dismiss this because it's close enough to zero to say, well, maybe it's just really close to zero, not below zero. But I know that technically this is not possible. I cannot have error rates below zero. Even though I might run with this, I might ask myself, well, how accurate is that estimate? How close am I really to zero? Have I built a bad model? Everything gets back to the model: the assumptions I've made when I treat the data as continuous, and the assumptions that I should probably be making when I've got pass/fail data. This is our typical model. On the left-hand side, we've got our response, our observations; in our case, it's the probability of a failure. On the right-hand side, we've got our linear model, the relationship we're trying to build between our inputs and our outputs. This is usually what we consider. The typical assumptions in this case are that the relationship is linear, that my response range is unbounded, and that my residuals are normal. My residuals are whatever is left over after I fit a model to the data. I assume that they're normally distributed, which means they're symmetrical around zero, and they've got all these nice properties. That's a part of the problem. If we're dealing with probabilities, our response really looks something like this, where I've got something that's more of an S-shape than a linear relationship. That linear relationship really doesn't hold. The unbounded response range doesn't hold either, because since I'm dealing with probabilities, I can't have anything below zero, and I can't have anything above one. Even the final point, the assumption of normal errors, doesn't work. If you think about it, as my predicted values get closer and closer to zero, my residuals have less and less space to operate; the gap between my fitted value and zero starts to shrink. I cannot have symmetrical errors close to zero. Likewise, I can't have symmetrical errors as I get closer and closer to one. Something has got to give in this case. Now, I would hope that there would be a simple fix to this problem: a simple transformation, probably, and maybe the use of a different distribution. Can we do this in JMP? The answer is we've got three different options. The first option, which we've already seen in the example, is logistic regression. That is, using the logit transformation on the data. In this case, we're not looking at the probabilities as a response, we're looking at the ratio of our two different response levels, the probability of a failure divided by the probability of a pass, and we're taking the log of that. We're taking that log to make the relationship a linear relationship. Another way to look at this, since we've only got two different levels, is as the ratio of the probability of failure divided by 1 minus the probability of failure. Now, if you've ever used these kinds of models before, and you've heard the terms odds and log odds, this is exactly the definition. That very first part of the equation is the definition of what odds are: it's the ratio of the probability of one level to the probability of the other level. 
That's my odds, and then I'm just taking the log of that. That's one option. Another option is to use what's called probit analysis. With probit analysis, I'm just using a different transformation. In this case, I am using the inverse of the normal distribution, or if you've ever built this in the formula editor or using JSL, that's just the Normal Quantile function. Now, there is a third option that's possible in JMP. It takes a little bit of work, and we're not going to go into a whole lot of detail in terms of its use, but I do want to present it because you do see it a lot in the literature, and that's the complementary log-log function. It's not available directly in JMP; we need to use something like the nonlinear platform to build it ourselves. It can be approximated with a distribution called the smallest extreme value distribution, but we'll have to leave that for another day. Really, we're going to focus on the stuff that's available out of the box in JMP. Now, there's a second big assumption that we need to make here, and that is, whichever transformation we pick, whether it's logit or probit, we're going to assume that the error has a binomial distribution. So it's two parts: not only am I doing the transformation, I'm also assuming a different distribution. As I mentioned, the first two are available directly in platforms in JMP, so they're easy. We'll focus on those. The last one can be modeled using the nonlinear platform. The nice thing is that everything is available from the Fit Model platform. I've got three different options within JMP. I can use logistic regression, which, again, we've seen in the example. There's an option called generalized linear models, and there's a strong relationship between nominal logistic regression and generalized linear models. Then for JMP Pro users, we have generalized regression, which gives me a whole bunch of other options as well. Now, there is another platform where you can do simple logistic regression, and that's Fit Y by X. But because we're dealing with an experimental situation, and we usually have more than one factor, I'm not going to talk about that. For simple logistic regression, though, Fit Y by X works as well. Before we get into how I implement this in JMP, we need to talk about data organization, because how we organize the data determines how we set things up in the Fit Model dialog box. Let me give you a quick, simple example. Let's say we have a two-factor experiment, time and temperature. We've done a full factorial, so we have all possible combinations of time and temperature. Let's say we plan to do five trials per experimental treatment. By treatment, I'm talking about a row in this data table, a unique set of factor combinations. I've got three different ways I can set up the data. The first way I could set up the data is just as raw data. Every row in my data table corresponds to one trial. You'll notice that the very first row in the raw data is treatment combination one, with a given time and temperature. In this case, my binary response is whether my response is green or red. That first response was green. The second row is treatment combination one again, and that response was red, and so on. That's the raw format. All three of those approaches can use the raw format. Now, I can summarize. Sometimes it makes more sense to summarize the data. 
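As a quick aside, both of those link transformations are easy to check in JSL, where Log() is the natural log and Normal Quantile() is the inverse of the standard normal distribution.

    p = 0.15;                        // example failure probability
    logit = Log( p / (1 - p) );      // log odds, about -1.73
    probit = Normal Quantile( p );   // about -1.04
    Show( logit, probit );
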
When I summarize the data, I am summarizing over both the treatment condition and the response level. You'll notice that, for the first row in the summarized stacked format, I've got treatment condition one, I've got all the greens, and I've got two cases where I saw green. The second row is my second treatment combination, three observations of green, and so on. I've just aggregated over my treatment combination and my response levels. All right, all three of the approaches can use that summarized stacked format. The third organization, and probably the most natural for a lot of people because it looks a lot like the organization I would use for a linear model, is the summarized split format. In this case, I've aggregated, again, over treatment combination, but only for one of my response levels. You'll notice that in this case, I've counted the number of times I saw a green observation for each treatment, and I've also counted the number of trials. If I were to, let's say, build a DOE using the custom designer, this is the format that I would expect to use if I were inputting data. If I wanted to use the summarized stacked format, I would have to actually duplicate these runs. I would have one for my green, one for my red, and just reorganize things a bit differently. Now, logistic regression cannot use this format. I've got to use one of the other two formats. Generalized linear models and generalized regression can use this format. That's something to note if you plan on using logistic regression. Okay, so let me do this. Enough with the PowerPoint. Let me go into JMP, and let's talk about how I would have analyzed each one of those examples with the experimental data that we saw. Okay, let's do logistic regression. We'll start with the first example, logistic regression. I notice that I've got my summarized stacked format here. I've got a column that has my passes and my fails in it, and the count. This is aggregated data; I've got the count in another column. All right, so pretty straightforward. Everything we're going to talk about is going to be under Fit Model. Because I designed this in JMP, it comes along with my model. This is my response surface model from when I designed the experiment. It has already pre-populated my dialog box here. I just want to point out that my response in this case is nominal; that's pass/fail. I also need those counts in there because I've aggregated my data. I'm going to pick my target level, and this really depends on the orientation in which I want to see my output, whether I want to do things from the pass orientation or the fail orientation. Let me just set this. Let's put it back to pass. Now, I've got a couple of different choices here. Both of them are logistic regression. I've got nominal logistic regression. If I know a little bit about stepwise regression, I can also perform stepwise regression on my logit model, on my logistic regression model. That's an option as well. All right, so let's just stick with the original nominal logistic. I'm going to run my model. Again, I'm going to use some model reduction technique. I might just go in there and manually remove items from the model and so on, or I could have used stepwise regression for this. It's really not important, because I got the right model to begin with. Let me remove one more term from my model. There we go. Let's say this is our final model; we're happy with this model. Let's go into our Profiler. 
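A rough JSL equivalent of that nominal logistic launch is shown below. The column and factor names (Pass Fail, Count, Additive, Temperature, Pressure, Hold Time) are assumptions based on the example, only main effects are listed for brevity, and the exact arguments of a saved script may differ slightly.

    Fit Model(
        Y( :Pass Fail ),                  // nominal two-level response
        Freq( :Count ),                   // counts from the summarized stacked format
        Effects( :Additive, :Temperature, :Pressure, :Hold Time ),
        Personality( "Nominal Logistic" ),
        Run
    );
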
Let me scroll down to the bottom to the Profiler and make this a little bit bigger, so we can see better. Now, one of the benefits of using this model is that my profile of passes and fails is bounded; we have to have probabilities between zero and one. Let me turn on my desirability functions. Now, this is going to look a little bit different because it really is split into two groups. I've got the proportion of passes I want to target and the proportion of fails. I'm just going to drop those failures to zero because I don't want to optimize for any failures whatsoever. I'll increase the desirability on my passes, and let's go ahead and maximize desirability. It tells me, again, the same hold time and temperature. However, I get a better estimate of what that predicted failure rate is, or the predicted pass rate in this case. This is telling me that with those settings, I should have about a 96% pass rate, so approximately a four percent failure rate. Certainly not below zero, and a little bit further away from zero than I might have expected. Okay, so let's move on to generalized linear models, and we're going to use the summarized split format. Again, if you've used linear models before, this format should look familiar. These are likely the observations that came from my designed experiment. I've counted the number of observations in one of my levels, in this case the number of failing items, and then I've got the total number of trials. All right, again, I'm going to go under my Fit Model dialog box. This is set up a little bit differently. For this particular organization, I have two continuous variables. The first one has to be the count of my individual level, and the second has to be my total count. I've already defaulted here to Generalized Linear Model. I pick my distribution; in this case, it's defaulted to Binomial. Here's where I have the option of choosing Logit or Probit. We get slight differences in terms of the probabilities. Let's just go ahead with Logit. Now, by default, the logit link and logistic regression are going to give me the same results. What makes generalized linear models a bit of a benefit is that I have the ability to relax some of the assumptions that we saw earlier. One of those assumptions is that the error term follows a binomial distribution. With binomial errors, the variance is fixed; it's determined entirely by the probability. Turning on overdispersion says, well, the variability might be a little bit bigger than I expect it to be, or the error might be a little bit bigger than I expect it to be. It relaxes that assumption of binomial errors a bit. I can also adjust my estimates for potential bias. I'm just going to turn on my overdispersion. Again, I'm going to run my model. Now, one of the disadvantages of this approach is that you don't have the stepwise ability here. You need to go in there and either do this manually or have some model assumed before you decide which final model you want to work with. Again, I'm just going to do this manually. I'm going to come up with my three important factors. You'll notice, if you were to compare this to the logistic regression, that because I had that overdispersion turned on, my variability is estimated to be a little bit larger, and my p-values are a little bit larger. 
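The corresponding generalized linear model launch can be sketched in JSL roughly as below. The column names are again assumptions, and the option names here simply mirror the Fit Model launch dialog; the exact saved-script syntax may vary by JMP version.

    Fit Model(
        Y( :Failures, :N ),               // count of one level, then total trials (summarized split format)
        Effects( :Additive, :Temperature, :Pressure, :Hold Time ),
        Personality( "Generalized Linear Model" ),
        GLM Distribution( Binomial ),
        Link Function( Logit ),
        // the Overdispersion Tests and Intervals option can be checked in the launch dialog
        Run
    );
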
With those larger p-values, I need to ask myself whether I really believe that these terms, even though they're not significant, are worth keeping in the model, and so on. But the nice thing is that I've got the option of being able to relax that particular assumption in the model. Again, if I want to turn on my Profiler and do my optimization, that's available as well. I've got the same nice abilities. In this case, the organization is a little bit different because I'm not modeling pass/fail, I'm modeling probability. It's the profiler that we're more accustomed to seeing. Obviously, in this case, I don't want to maximize this, because I would be maximizing my failures. I'm going to minimize that. Let's go ahead and optimize that. Again, it's telling me hold time, 45 seconds, and temperature, 45 degrees C. It's going to give me a slightly different estimate, although they're very close. Again, the nice thing is that I am bounded here. It doesn't matter where that slider goes, I am bounded between zero and one with those probabilities. All right, so for the final example, let's talk about generalized regression. Now, this is a JMP Pro only feature. It gives me a couple of additional options. I get the same place to start, Fit Model, and the same setup, because I've got my summarized split data. However, I am going to pick generalized regression. Now, I am going to pick a distribution. In this case, because my response is set up with two different variables here, I've got a binomial response, or I can pick beta-binomial. Beta-binomial is a technique analogous to using that overdispersion option in my generalized linear model. It allows me to account for a variance that changes as a function of where the probability is. We'll stick with the default in this case. Let's go ahead and run the model. If you've used generalized regression before, you know that all of your model reduction techniques are available from this platform. What I would probably do in this case is use one of these techniques. Let me make sure I'm going to enforce heredity. Let's use forward stepwise regression. I've got a whole bunch of different options here. If you've never seen generalized regression before, there are a whole bunch of nice options in terms of model reduction techniques. I'll use forward stepwise, which is the default. Click Go, and here's my final model. Again, very consistent. All three of these models are showing me very consistent results. The parameter estimates are going to be slightly different, and the probabilities are going to be slightly different. Okay, so now the question is, have I run enough trials to be able to really make this experiment worthwhile? That gets us into how I right-size my experiment. Let me switch back to my PowerPoint slide deck for a couple of short slides on setting up the experiment. By the way, the way we're going to size the experiment is a feature that's only available out of the box in JMP Pro. For those non-JMP Pro users, you might be able to write a JSL script for this, but it would be very cumbersome. JMP Pro makes this extremely easy to set up. We're going to start with the custom designer. What we're going to do is build our intended design using the custom designer. Before we generate that final table, we're going to turn on Simulate Responses. That's in the hotspot in the upper left-hand corner. 
Once we create the data table, we're going to get a dialog box where we can change the coefficients. On the next slide, I'll talk a little bit about how we set those coefficients. We're going to change those coefficients to our desired magnitudes. Again, I'll go into a little bit more detail on the next slide. Once we change the coefficients, we're going to set the error to be binomial, we're going to select the sample size, the number of trials that we want to see, and then we're just going to click Apply. Because I picked binomial errors, the Simulate Responses option is going to give me two columns: one column with the number of simulated successes and the other with the total number of trials. That's what we're going to use to do our overall simulation. Let's talk about how we set these coefficients, because this is a crucial step in terms of sizing the experiment. How we set these coefficients is going to depend on the underlying model, that is, whether I pick a logit or a probit model. My example is going to use the data that we just saw, the example with the logit model. That's going to determine how we set those values, starting from the baseline probability. If you remember from our example, we said that at baseline, we've got about a 15% failure rate. That's going to be our baseline. Then we have to ask, at what probability do I want to be able to see a difference? Again, recall, we said that if we were able to see a change to 10%, then we would be happy. That's the sizing that we want for our experiment, the ability to see a change from 15% to 10%. What we're going to do is use the transformation that we saw to calculate values at our baseline and at the probability we want to detect, 10%. We're going to take the difference of those two values, and we're going to use that as our coefficient. If we're using the logit, the baseline is just the log of that 15% divided by 1 minus the 15%. Our baseline is -1.73. If we're interested in seeing a change to a 10% failure rate, again, I'm just going to plug that into the equation: the log of 0.1 divided by 1 minus 0.1. That gives me about -2.2. That makes the coefficient just the difference between those two values. I'm going to use approximately 0.5 as my coefficients. Enough PowerPoint. Let's go see how that is done in practice. I have my experiment up, my experimental factors. I'm just going to use the custom designer. Let me go ahead and load my factors there. I'm going to build the design I want. If you recall, I said I have a response surface design. Let's add three center points. Let's do this in 24 runs, say. I'll just go ahead and click Make Design and let that crank through and determine the best design from the algorithm. Before I generate that very last table, I'm going to go to the hotspot, and I'm going to say Simulate Responses. Now, I'm going to create my table. That table gives me my design, and I'm going to change my coefficients to the values that I calculated. Let me just go ahead and reset those. I said my intercept was -1.73. I'm going to set each one of my coefficients to 0.5. Let me just copy this, so I can paste these. Finally, I'm going to set my error distribution to Binomial. Now, I'm going to click the Apply button. I want you to keep an eye on that table. 
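As a side note before the demo continues, the coefficient arithmetic described above is easy to reproduce in JSL (Log is the natural log):

    p0 = 0.15;                         // baseline failure rate
    p1 = 0.10;                         // failure rate we want to be able to detect
    baseline = Log( p0 / (1 - p0) );   // about -1.73, used as the intercept
    target = Log( p1 / (1 - p1) );     // about -2.20
    coef = target - baseline;          // about -0.47, roughly 0.5 in magnitude
    Show( baseline, target, coef );
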
Clicking Apply is going to generate those two columns that I'm going to use for my analysis. Let me back up one second. I wanted to set my sample size; I can't forget to do that. If you recall, the second time I ran the original experiment, I had five trials per condition. I'm going to reapply that. Now I've got the correct equations in there. These are the columns I'm going to use for the initial analysis and for doing the simulation. Now, if you wanted to, you could actually sign these effects. By signing, I mean making them negative or positive, depending on the relationship you expect to see. For the sake of time, I haven't talked about that. I have a slide that goes over how I might go through these different effects and ask, should that be positive or should that be negative? That can be done as well. Certainly, it will affect the results, though to a lesser extent than the actual magnitudes of the effect sizes will. But inasmuch as most designs, unless they're completely orthogonal, have some correlation among effects, whether an effect is positive or negative might have some influence over my power values. Now that I've actually generated my initial design, I'm going to go into Fit Model. I'm going to analyze the design. I've got to make sure to put in my simulated successes and my trials. With this particular situation, I will typically use generalized regression, and again, I'll use Binomial, because of all of those options in terms of model reduction. The nice thing is that with the simulation, those all carry through. I don't have to worry about writing any scripts to be able to do that. Let's go ahead and run that. Again, I'm going to use forward selection. I'll just click Go. Let me just scroll down to my forward selection model. It's right here. Now that I've got my single observation, what I can do is right-click. I will typically hover over one of the probability values. I right-click, and at the very bottom, there's an option, Simulate. I have to make sure that the formula column with the simulated number of successes is selected in both of these, and by default, it should be. I'll specify the number of simulated samples I want to run. Now, once I click Go, what's going to happen is that on every iteration of the simulation, it's going to recalculate that formula column and give me a new set of responses from my underlying model. It's going to do that for 2,500 runs, and then it's going to aggregate that data into a nice report. Now, rather than spending time watching that crank away, it's quick; it usually only takes about half a minute, maybe a minute. I've actually run the results, and this is what they look like. Here I've got my 2,500 runs, less that very first initial run. I've actually added some additional formulas here, which I'll be happy to share. But these are the two scripts that get added when this report gets generated. The Power Analysis is just a subset of the distribution. Let's take a look at the Power Analysis. What this does is give me a whole bunch of information about the distribution of those p-values across all of those runs. If you recall, in the underlying model in this case, everything was significant. What this tells me is the distribution of the p-values that I saw through my simulation; my predicted value is about 1 out of 10,000, so that's good. I get a calculation of the confidence interval around my predicted p-value. 
Actually, I'm not sure whether to call that a confidence interval or a credible interval, but either way, it is going to bound where that probability value is. An interesting and important piece of information is the number of times I have rejected the null hypothesis. This tells me that if I were to pick an alpha level of 0.01, I would have rejected that null hypothesis about half the time, even though I know that, in fact, I modeled it to be a real effect. If I go to 0.05, I've rejected it about three quarters of the time, and so on. This is the information that I use to ask the question: have I sized my experiment correctly? Will I be able to detect my main effects, my two-factor interactions, and any other effects in my model a sufficient proportion of the time to make me happy with the number of runs that I've picked? What I have often found myself doing in these cases is not only relying on some of the built-in reports, but also counting, for example, the number of correctly identified significant effects. I'll put in a cutoff value and calculate the number of correct responses. Again, this will all be packaged together with the material, with a little bit of explanation and the formulas for how I would build all of these models. Again, I really need to use my own definition of risk to determine whether I have captured a sufficient number of those significant effects to make the experiment worth running at 5 trials or 10 trials or 20 trials and so on. All right, so that wraps up what I wanted to show you in JMP. Let's go back to our summary. To summarize, when I am modeling this data, I'm going to use the Fit Model platform, regardless of which technique I use. I've got logistic regression, in which case I could use either logistic regression or stepwise regression with logistic regression in the background. I've got generalized linear models, which by default will give me the same output as logistic regression, but with the additional ability to relax the assumption on my errors and to correct for bias. Then I've got generalized regression, if I have JMP Pro, which allows me to do what I would see in logistic regression or a generalized linear model, plus the additional benefit of allowing me to do some model reduction as well. When I size the experiment, I'm going to build my experiment using the custom designer and then use JMP's built-in Simulate function to run my simulation-based power analysis. That wraps it up for what I wanted to talk about. There will be additional information in the supplementary slides, which I unfortunately don't have time to go through, but I hope you've enjoyed what you've seen, and hopefully, it's something that you can use going forward, and it's going to be beneficial. Thanks a lot.
Fujifilm Diosynth Biotechnologies is a contract development and manufacturing organisation (CDMO) that has a dedicated Process Characterisation department focused on performing process characterisation studies (PCS). The aim of PCS is to demonstrate that our customer processes are robust to changes in parameter settings or their normal operating ranges. PCS commonly employ design of experiments (DOE) to investigate the effects that process inputs have on quality attributes (QA) and process performance indicators (PPI). DOE analysis is a useful tool to identify the inputs that have an effect on the QA/PPIs, but it is mainly quantitative. In addition to the traditional DOE analysis, calculation of the impact ratio (IR) for each input provides a quantitative and qualitative assessment and can aid in the assignment of a parameter as being critical or not. The IR provides a measurement of the effect size relative to an acceptable range. Doing the calculations manually is time-consuming and prone to error. We will present an automation tool that can extract the required information from a DOE model and compute the IR. An interface allows the user to customise how the results are calculated.   Hello, everyone. I'm really happy to be here today with my colleague Sam. Today, we'll be talking about impact ratios: basically, what is the impact of your DoE factors on your DoE outputs or responses? I'll introduce the concept, the calculations, and examples in JMP, and then I'll pass things on to Sam, who will take you through automating that process using JSL and show you how he's done this in an add-in. First of all, confession time. I really wanted to call this talk, What Do Process Characterization Scientists Have in Common with NASA Scientists? But the link was too tenuous, and maybe it would have been a bit mystifying. Nonetheless, I still use the concepts here. This is why we have meteor pictures on the first slide. This is my metaphor: I'm looking at impact craters, and I'm comparing them with impact ratios, because we need to have a little bit of fun. What is happening here? We have a very similar meteor, and it could fall on the Earth or Mars or the Moon, and the conditions would be different. Here you have, first of all, an atmosphere, very little atmosphere, or no atmosphere at all, which changes the blast, which changes the behavior of the meteor. If you are on Earth, the impact crater that forms could have a really massive effect on what's going on around it, because on Earth, we have people living nearby. With this in mind, what are we going to go through today? We'll look at yet another statistical ratio; statisticians like their ratios. Then we'll look at what the impact ratio is specifically measuring. Then we'll answer this question, "Can I calculate this in JMP?" The answer is, obviously, you can, and you can do that in a number of ways. You can do it the painful way, and I will take you through this manually. We have also created a workflow to make things a little bit easier with table management, but I will skip over that step because we don't have time. Then I'll pass things on to Sam, who will show you the happy way of calculating the impact ratio by clicking on his JMP add-in. Here we have a bad hand drawing that I did myself to show you something that you have probably seen many times, because that's a control chart, and they are quite omnipresent in the statistician's world. Here we have an output, and it's changing over time as we create more batches, for example. 
We have zones on this chart. In the bluish purple zone, we have where our output falls. Because this is normally distributed data, we can make a prediction about where it will fall as time goes on. Here we have statistics giving us a prediction of how wide this blue zone might become. Then in green, between two specifications, we have a customer-type safe zone. Customers tend to give us specifications, or we might have calculated an acceptance criterion from developmental work, for example. What we want to do is make sure that our process is sitting in a zone that is well contained within the safe zone when we are doing process control. One ratio that's used for this is the capability index, like the Cpk or the Ppk, which is basically ratioing this blue zone against this green zone. This is the most similar thing I found to the impact ratio. Here we have almost the same graph. You can tell I have reused the same drawing. The only thing that changes here is that this time, instead of having statistics predicting the spread of the data, we have statistics predicting the shape of the data. We're modeling what happens to one specific output when you change the input and move its value from low to high. This is what you would do if you were doing a DoE and you changed the operating condition systematically. More particularly for process characterization, this low to high would be a range that you want to prove is acceptable. Here we have an equation at the end, and we have a blue zone, the minimum to maximum predicted for our response, and we still have a green zone, what's acceptable to give us quality product at the end of the process. We are checking, again, whether the deviation occupies an acceptable proportion of our safe or play area. This impact ratio can help us compare the impact of different factors, because we will get such a model for all of our factors. It could also be a criterion for classifying those factors if this is part of a DoE. What is this impact ratio really measuring? You had a clue on the slide before, but we're back to meteorites. Our NASA scientists have calculated two things: the place where our meteorite will impact the planet and the radius of the impact crater. They have said, as long as we are well within this green safe area, whoever lives outside it will be fine. Of course, that would not be true for a meteorite, but just stay with me here for a second. What we're ratioing here is basically the radius of the crater to the radius of the safe area. We're hoping that this number is much smaller than this one, that the safe area is very big compared to the impact crater. Now, what could happen is that we are off target, so the NASA scientists predicted that the meteorite would fall here, but it fell here. Even though the safe area was probably big enough for a centered meteorite, or process, in this case here we actually have the impact crater well outside of the safe area. I hope you don't live around here. Another possibility, and there are lots of other possibilities, would be that we are on target here, but we have miscalculated the crater. The crater is much bigger than what we thought it would be. Again, it's outside of the safe area, and the radius of the safe area is actually smaller than that of the crater. This is just a picture, so let's see how this would look on a graph. Same graph again, with some extra arrows here and some equations. 
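For comparison, the conventional capability index Gwen mentions can be written as the smaller of two one-sided ratios, and the impact ratio described next has a very similar structure. Here is a minimal JSL sketch with made-up numbers, purely to show the form of the formula:

    // Standard Cpk calculation; the mean, sigma, and specification limits are made-up values.
    mean = 10.2;
    sigma = 0.5;
    lsl = 8;
    usl = 12;
    cpk = Min( (usl - mean) / (3 * sigma), (mean - lsl) / (3 * sigma) );
    Show( cpk );   // 1.2 with these numbers
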
In plain English, your impact ratio is ratioing the effect size over the distance to your specification or acceptance criterion. The effect size here is those skinny blue and orange arrows: the distance from the minimum prediction to the center, or from the maximum prediction to the center. We are ratioing this to the distance from the center to your specification on either side. In this case here, this distance occupies 70% of our safe zone. That is not a good impact ratio. Here, it's only 40%, which is still a pretty high figure. But to be fair, we're going to have to take the maximum of those two values, so we would consider 70% here. This is my hand drawing; let's go to an actual JMP chart. This is a JMP chart, and it's just one part of a profiler that you would get at the end of a DoE analysis. In this case, we have a bit of curvature, but the zones are the same. If we want to calculate our impact ratio toward the minimum here, the prediction is that the minimum-to-center distance takes up about 42% of the center-to-specification distance. On the other side, the center-to-maximum prediction distance takes up only about 7% of our safe zone. Nonetheless, we're going to have to report the larger number, so it's still a pretty high number here. The question we're trying to answer is: are we on target, with a small impact compared to what is acceptable? Now, I will exit from here and go into JMP to show you how to carry this out manually, the painful way. Here we have a bog-standard DoE, five factors. We've only shared four responses here. It's all anonymized, so I'm very sorry, but the numbers look a bit funny because they're all between minus one and one. What we're looking for here is that we have response limits specified so that we can play around with the goal for those responses: maximize for most of them, while something like impurities might be minimized, or we might not even be interested in how high they go. I'm not going to go through the modeling because that's not what we're here for. Sorry about that. Wrong script. There we go. We've already fitted this, and this is the example I had in the notes, so I'm going to keep with this one. Here we are using the Profiler to get all of the numbers we need for our equation. We're setting this at the center point, and we're going to record that number. There we go. Those are the center point conditions. Everything is kept at the center point here. I'm going to ask JMP to remember this setting. I'm going to call this Load Center Point because I'm interested in the load here. That's my first one. Then, manually, I wouldn't really need to ask JMP for help here. Everything is at the center point for those two. I could just move this to the minimum, and I can record this again as the minimum value. That's my load min here. For the maximum, it's a little bit trickier. I could try to do this: I could make this bigger and try to make sure I get to the maximum here. But I'm going to trust JMP for this one. I'm going to press Control + Alt and click on the other two graphs here, and I'm going to lock the factor settings at the center point, because I'm only interested in what the load is doing and what the maximum is for the load when everything else is kept at the center point. Then I'm going to use the maximizer in JMP. There we go. I could have reached that by hand, but now I know it's exactly at the maximum. I'm going to save this one as well. We have the max. Now I have almost everything I need. 
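With those three predictions in hand (center point, minimum, and maximum), the ratio itself is simple arithmetic. Here is a JSL sketch with made-up numbers chosen to reproduce the 42% and 7% figures from the Profiler example above:

    cp = 0.50;     // prediction with every factor at its center point
    pMin = 0.29;   // minimum prediction over the factor's range
    pMax = 0.55;   // maximum prediction over the factor's range
    lsl = 0.00;    // lower acceptance criterion
    usl = 1.20;    // upper acceptance criterion
    irLower = (cp - pMin) / (cp - lsl);   // 0.42, i.e. 42% of the lower half of the safe zone
    irUpper = (pMax - cp) / (usl - cp);   // about 0.07
    irOverall = Max( irLower, irUpper );  // the value that gets reported
    Show( irLower, irUpper, irOverall );
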
If you're doing this manually, you're doing it for every factor or equation parameter on this list, for every response. If you have fairly big models, and you have maybe 5–10 responses, this becomes a lot of work very quickly. Then you would have to export all this data into a table. You would use this option if you had many remembered settings, but I only have one, so I'm going to do this. Here you have all the info you need, in the wrong format. What we need to keep is the settings here, and then we want this number here. Now, you would use Transpose under Tables here, but I'm not going to, because I've already done it ahead of time. I just want to quickly share that with you. Why is it not happy? Here we have the transposed data. We have all the labels from the table here. Manually, we would have to add our lower and upper spec. From these, the min, the max, and the CP, we can calculate the numerator for the ratios, the denominator for the ratios, and the ratios themselves. The small distance over the large distance gives you your lower impact ratio. Then you simply use a formula to get the maximum of the two, and you'd have to repeat that for every one of them. I'll leave the floor to Sam now, who will show you how much easier this is to do once you have an add-in for it. Sam? Thanks, Gwen. As Gwen mentioned, we decided to create an add-in to automate this task. I'll just give a quick demonstration now of how the add-in works. I'm using the same data table that Gwen has just been working with. The reason why we decided to make an add-in was just because it makes things easier than running a script directly from a script window. The nice thing about having an add-in as well is that when you hover over the add-in name, you can add a tooltip. In this case, the tooltip just tells you the correct window that the add-in has to be run from. In fact, if I try clicking it, you'll see that nothing happens, because I'm not in the right place. If I now open up the report with those models that we made earlier, you can see that I can run the add-in, and it will run properly. First of all, the user is presented with some windows which have some instructions and then, later on, some areas for user input. The first window just has some instructions about requirements for the underlying data table. For example, the factor columns have to be coded, and any units have to be input as column properties as well. If that's not the case for any of the columns, you can just hit Cancel and go back and make those changes. But in this case, I know that they are all coded correctly. I'll hit Run again and then click OK. The next window has the area for inputting the factor settings. You'll notice that we have an input here for categorical factors. It's not possible to calculate an impact ratio for a categorical factor. However, any categorical factors contained within the models have to be fixed at a particular setting, so this just allows the user to input that here. For the remaining factors, the continuous factors that were evaluated in the DoE, you can see that we have an area for inputting those settings. You can see that the script has automatically read in the minimum and maximum values from the table. What it has also done is calculate the center point as the midpoint between the minimum and maximum. However, in some cases, the center point might not be at the exact center of the range. That was the case for load here, so I'll change that to 20. 
It's possible to edit any of these values. The benefit of being able to change the input here is that if we run the add-in and get impact ratios that are too high for some of the factors, we can just run the add-in again, change the values here, and try evaluating a reduced factor range, for example, and then see if the impact ratio looks any better over that new range. But I'll keep the rest of the values the same and click OK. Then the final window has the input for the response acceptance criteria. I'll put those values in now. If there is only one criterion, then you can just put one in and leave the other blank. If you have no criteria, you can leave both boxes blank, and it will still be able to calculate the impact ratio. In that case, it's just the percentage difference between the set point response and the minimum or maximum prediction. It doesn't quite offer the same measure of practical significance, but it will still be calculated by the script. Now, if I click OK, you can see that the summary table has been generated. But if I go back to the window where I ran the add-in from, you can see that underneath each prediction profiler, the settings have been remembered. These are the settings that were used to obtain the minimum and maximum prediction for each factor in the model. This is useful so that you can go back and see how the calculations were made. It's good to be able to review that. Now I'll just give a very general overview of how the script actually works. First of all, the script loops through each response model in the window. It then sets the desirability to maximize the response. The script then loops through each term in the model. For the first term, which is load in this case, it unlocks that factor's setting, so it is free to move. All of the other factors are then locked at their set point settings. The script then executes the Maximize and Remember function. You can see that the setting has now been saved underneath the Profiler, and you can see that we've maximized this response by changing only the load factor in this case. It continues that operation for each term in the model, then the desirability is set to minimize the response, and that process is repeated: each term is evaluated again to get the minimum prediction. Finally, all of those remembered settings are updated so that the names are meaningful; we show the factor name and whether the goal was to minimize or maximize the response. Then this entire process is repeated for each response model contained in the window. That's how the script works. I'll just return now to the summary table that was output. You can see here that all the data gets collected and output into this table. Each row contains one particular factor for each response model that was evaluated. We have columns for the acceptance criteria that were input by the user. There are then columns showing the prediction at the set point settings, and then the minimum and maximum prediction for each factor contained within that response model. Then the remaining columns are just formulas. Here we've got the difference between the minimum prediction and the set point prediction, and then, similarly, for the maximum prediction and the set point prediction. Then the last two columns are just the calculated impact ratios: we have the lower impact ratio and the upper impact ratio. 
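Those formula columns amount to very little JSL. A minimal sketch, assuming the summary table has columns named Set Point, Min Pred, Max Pred, LSL, and USL (hypothetical names, not necessarily what the add-in uses), could look like this:

    dt = Current Data Table();
    dt << New Column( "IR Lower", Numeric, Formula( (:Set Point - :Min Pred) / (:Set Point - :LSL) ) );
    dt << New Column( "IR Upper", Numeric, Formula( (:Max Pred - :Set Point) / (:USL - :Set Point) ) );
    dt << New Column( "IR Overall", Numeric, Formula( Max( :IR Lower, :IR Upper ) ) );
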
You'll notice that where we only have one criterion, as for impurity in this case, since we only have an upper acceptance criterion, we only calculate an upper impact ratio. Then the final column is just the overall impact ratio, which is simply the maximum of the lower and upper impact ratios. I'll just finish off this part of the talk by summarizing the benefits of using an add-in to automate this task in JMP. Firstly, using the add-in is much quicker than doing it manually. Secondly, it allows the task to be repeated much more easily. As I mentioned, if you run the add-in and get impact ratios that are considered to be too high, you can then just rerun the add-in, change the factor settings, and see if you can obtain acceptable impact ratios. Then finally, there's much less chance of any errors occurring because you don't need to do any data transcription or manipulation of the data. The script collects all that data together and then puts it all into this table that gets output at the end. Okay, thank you. That concludes my section of the talk. I'll hand over to Gwen to summarize things. Thank you. Just in addition to what Sam said, a big advantage of the add-in is that you can repeat the calculations and change what you input in the first couple of input windows. You could change the factor settings if you wanted, for example, to show where your proven acceptable range is, which presumably sits a little bit outside of your normal operating range. The other thing you could do is revise your specification or acceptance criteria. If your impact ratios were really high, then your safe zone would have been maybe a bit too small to be comfortably operating in. You could push the specifications out and check whether all your impact ratios become a bit smaller. You could actually probably use this as a justification for changing those acceptance criteria. Another thing that you can do with those impact ratios is use them collectively as a criterion to classify your tested process parameters. A process parameter could be classified as highly critical, critical, or non-critical at the end of the DoE, for example because its impact turns out to be very small. I think that concludes what we had to say about impact ratios today. We're both available to answer questions if you have any. Thank you very much.
In the pharmaceutical development of tablets, most active substances are difficult to process or dissolve. There are also many process steps and functional components that need to be included to solve all the issues that appear along the way. To narrow the focus, it is important to recognize which of the many potential factors are the most important for the responses of interest. Definitive screening designs are often considered to be most appropriate for experimentation with four or more factors. Whenever there are available results of experiments that are not part of a specific design, it is good to use tools such as advanced predictive modelling techniques to help capture valuable information. The aim of this project was to apply different analytical techniques to evaluate the effects of input factors on responses. Another goal was to find the balance between the factors that contribute to tablet appearance and mechanical resistance and the factors that enable quick active substance dissolution, which is important for product in-vivo performance. By using a combination of analytical tools, valuable insights were obtained regarding the effect of formulation and process factors on tablet characteristics. Optimal settings were then defined to maximise dissolution.   Hello, my name is Tijana Miletić, and I work in Hemofarm, part of STADA Group, in product development. Today, I'm going to show you how we used a definitive screening design and advanced predictive modeling as useful tools in product development. The main goal of product development here was to find optimal formulation and process settings for several quality attributes. At the beginning, we suspected what the potential effects of formulation and process variables on tablet properties could be. This is how we selected the factors for our experimental study. But we did not know what the actual relationships between these variables would be for this specific system. Here we used experimental design as a way to extract the most information from a limited number of experiments. Our main challenge in this study was to achieve maximum dissolution while maintaining the mechanical resistance of the tablets at the same time, and, of course, to decide which ranges and which factors we were going to use and which experimental design we were going to select. Because we suspected that we would have some significant quadratic effects and interactions, we selected the definitive screening design. Overall, the impact that we achieved with this study was positive for our product development because we managed to identify the most important factors and the optimal values to achieve the desired responses. Here, the main response was dissolution because it is an important in vitro result, which is considered prior to going into costly clinical studies. Tablet hardness, on the other hand, is a good indicator of the mechanical resistance of tablets, which tablets need in order to withstand the manufacturing and packaging process. All in all, moving ahead in product development with the right decisions being made is something that saves time and resources in all stages of product development. Here we were happy with the value we achieved because we got some direction in product development. Instead of performing a full factorial design with 64 runs for six factors, we managed to execute the definitive screening design with 13 runs, in about a quarter of the time it would have taken for the 64 runs.
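For context on those run counts: a two-level full factorial in six factors needs 2^6 = 64 runs, while a definitive screening design needs roughly 2m + 1 runs for m continuous factors, which for m = 6 gives 2·6 + 1 = 13 runs.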
Besides that, we also used some additional experiments and advanced modeling techniques to get even more insights into the factor effects and their significance, which in the end resulted in having our development goals achieved. Our main data analysis was executed with three main activities, which we present in our poster. We used JMP 17 for this data analysis. Here we present visual data exploration. We also used the Fit Definitive Screening platform and the Model Screening platform. At first, we used a scatterplot matrix, which provided a quick assessment of the relationships between multiple variables at the same time. In order to better understand the nature of the relationships within our variables, we looked into our models in more detail. From the definitive screening, we recognized that the most significant factor for hardness was compression force, and for dissolution, the amount of disintegrant and compression force. We did not observe significant quadratic effects, and the only significant interaction was between the amount of binder and compression force for the disintegration response. Going back to our visual data exploration, we saw that there is not that much of a connection between disintegration and dissolution, meaning that this result is not that important until we see whether we like our dissolution results, so we did not focus too much on this interaction at the moment. We used run 14 to evaluate the predictability of the model created here, and we obtained 78 versus 80% and 93 versus 94% for dissolution, which is considered a good match for this type of test. We obtained an exact match for hardness of 54 newtons. Being happy with this, we then used the prediction profiler and maximized desirability, meaning we wanted to see how we could maximize dissolution, and we learned that we need to use 8% of disintegrant, 3% of binder, and compression force at the lowest level. Here we were worried that with the lowest compression force we would compromise the mechanical resistance of the tablets, knowing that it could lead to lower hardness. So we used a bigger data set of 27 runs to graphically evaluate whether there could be a trend there, so that we could rely on the amount of binder to get a positive effect on tablet hardness. We also performed model screening for this response, and we picked the Fit Least Squares method to develop the best models to evaluate possible effects (a generic sketch of such a model launch follows after this paragraph). We reduced the models so as to get the highest possible significance of effects. Here it was confirmed that binder could have a positive effect on hardness. We also performed model screening for the dissolution response, and here we present, with two methods and two models, the highest possible dissolution prediction, which in both cases is about 87%, and we also confirmed the significance of disintegrant amount and compression force for this response. Here we saw some difference in what the optimal level of lubricant could be, and this is not surprising because we know that lubricant can have a potentially negative effect on dissolution and hardness. But here it seems that this effect is not significant. We could go ahead with a higher level in order to decrease the risk of sticking, or perform additional experiments to better explore this effect. Looking at all these insights, we were satisfied with the conclusions that we made. Definitely, the key ingredient for the desired dissolution was the disintegrant.
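As an illustration of the kind of reduced least squares fit described above, and not the exact script used in the study, such a model for hardness could be launched in JSL along these lines, with hypothetical column names:

// Hedged sketch: hypothetical column names, reduced model for hardness
Names Default To Here( 1 );
dt = Current Data Table();
dt << Fit Model(
	Y( :Hardness ),
	Effects( :Compression Force, :Binder ),
	Personality( "Standard Least Squares" ),
	Emphasis( "Effect Leverage" ),
	Run
);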
We were happy to learn that the amount of binder did not have a negative impact on dissolution, but could have a positive impact on tablet hardness. We also learned that the glidant does not have a significant impact on the evaluated responses. Of course, compression force is a key process parameter, and it should be carefully set. Overall, the definitive screening design provided direction in our formulation development, which led to the desired results in less time, and we were happy with that. With advanced modeling techniques, we managed to get additional insights. Of course, in order to have better predictability of the models, or to investigate the relationships between variables in more detail and with more precision, we would need to generate more data. But based on this experimental study, we got results that made us feel confident enough to go ahead with our product development. That is all that I prepared for today, and thank you for your attention.
908 Devices released a JMP add-in tool that facilitates the direct analysis and trending of amino acid and vitamin concentrations generated at-line by the REBEL media analyzer. In media development and adjustment, various parameters are tested over time, which leads to a high number of samples and generated data. The REBEL analyser, in combination with the JMP add-in function, allows data sets to be visualized immediately in a customized view. Alvotech presents a case study demonstrating how JMP enables a fast comparison of amino acid levels in different bioreactor runs with different media formulations, leading to improved process understanding. In a complex experimental setup, various media formulations were tested over 13 days of multiple bioreactor runs by analysing amino acid and vitamin concentrations at-line with REBEL in three different dilutions in duplicates to evaluate batch performances. The results of this large data set of all 21 amino acids and six vitamin levels were visualized with JMP in a simple way that still provided various setups for comparing data and determining measurement accuracy.   Hi, my name is Ildiko. I'm representing Alvotech Germany at this meeting together with my colleague, Thomas. We are both members of the Process Innovation Team at Alvotech, and we are working mostly on media development projects. We are active JMP users, mostly using different DOEs, complicated DOEs, but this time, I would like to present a small, smart tool for data visualization, which is for REBEL users. REBEL is an at-line spent media analyzer, manufactured by 908 Devices. Last year, they released a JMP add-in tool which facilitates the direct analysis and trending of amino acid and vitamin concentrations. With these measurements, really large data sets are generated, which can be immediately visualized in a customized view with this JMP add-in. We would like to present the workflows to compare different results, different media formulations, and different bioreactor runs, and to show trending over time, to show how this smart add-in tool can lead to improved process understanding and how it can help us in our everyday work. A few words about the REBEL. What is REBEL? It's a spent media analyzer, a capillary electrophoresis-based small mass spectrometry instrument working with kits, and it has the capacity to analyze 33 different metabolites in one sample in approximately 7–10 minutes, so it's really fast. We use this platform, as I mentioned, in media development for different purposes. We compare vendors for the formulation of a specific base medium, and we define which vendor or medium performs the best. As I mentioned, we are doing complex DOEs, and we need to follow the analyte trends in these different media conditions. We have to make fast decisions day by day to define the feeding strategy, or we just need to see these conditions, the analyte trending, across a time course. We also use this tool to compare different conditions in bench-scale bioreactor runs, for example if there is a product lifecycle management project and we would like to implement a new medium and compare it with the other setup. When we generate the large data sets, we have this JMP add-in tool for really fast data processing and visualization. I would like to note here that it works only with JMP 16 or above.
Going back to the full design and the methods, as I mentioned, I would like to present three small comparison workflows on this poster through three case studies. I would like to show some short demo videos about how these workflows go from data import to the customized dashboard view, which can then be further processed, saved, sent, or exported according to our needs. The analysis really takes just 1–2 minutes for a large data set with the prepared data filters. This is a really useful tool for REBEL users. How does it work? First, the installation of the add-in is really easy; you just have to install the add-in file. We at Alvotech have a remote version of JMP because this is a GMP-validated environment with many restrictions. After the add-in is installed, we see the three workflows: the sample comparison workflow, which I would like to show in study one, and the two time course workflows, for analyte trending and condition trending, in study two and, briefly, study three. The result files that the machine generates are CSV files coming from a so-called REBEL batch, which is based on the batch run sheet. This is an Excel template in which the sample labels are well defined. When the result sheet is generated, it can be directly imported into JMP. There is an optional file, the so-called sample label file, which is very useful because it has the option to correlate samples to custom metadata. This is important in the trending workflows when we would like to map, for example, a bioreactor name to a sample label name to see which conditions are in which bioreactor. JMP recognizes this sample label and its components. I will show that later in the demo videos; it will be clearer there. Finally, when the results are imported into JMP, it first generates a default report dashboard, which can be customized later with data filters. This is the REBEL workflow. Now we go to the first comparison workflow. I would like to show how this customized view was generated, and I would like to show that in a short video. First we open the sample comparison workflow. We select the folder where this file is, which is a little bit complicated for us as we are working with a remote JMP. We find the folder. It looks empty at first when we select it. We have to select the sample label file and open it. We adjust the header, which starts at row number one. This is the imported file, from which JMP generates a data table. It takes a little time. We go directly to the sample comparison dashboard view. We can see here all the analytes in different bioreactors with different media. The tabulate shows the mean values of the analytes because we are doing duplicates or triplicates. On the left side, in the filter panel, we can select the sample label because in this case we would like to show only the medium types. It also takes some time to open. This is the medium A composition. All the other media should appear one after the other. The other workflows are much faster; somehow, the media comparison workflow is really, really slow. These are the same media prepared by different vendors, and you can see the differences. We can also see the concentration pattern of the analytes, and we can decide whether the dilutions of the samples were good or not. Also in the control panel, we can select confidence intervals.
We can select the standard deviation for the values to figure out whether these are in the defined range that we would still accept. Once we have the final customized dashboard, we can save it. After saving, it should appear when we click on the data table. Yes, it's in the control panel on the top left. The modified dashboard is there; we just click on it and it appears again. Of course, we need to save the data table to make it appear again next time. We can also export the data in any format, of course. This was the first workflow. The other workflows, which we use the most in media development, are the analyte trending workflow and the condition trending workflow. Going to the analyte trending, I also have a video for that. But before that, I would like to mention a very important action that needs to be taken: defining the time course component. It works in such a way that in the original results table, the Excel table, we have the first entry of the sample label column. In this case, it's culture station one, vessel number two, on day three. JMP needs to know which is the first time component, which here is the number three. It works with delimiters to define the time course component. A delimiter, as shown here, is the character that separates the strings of text. In this case, it's a double space, based on this naming convention (a tiny JSL illustration of this parsing follows after this paragraph). The time course component I want to set as the first time point is the number three, which is the third component of the sample label. We can just update the preview here, and it shows that three will be the first time component, and the sample label is that one, which is the same as what I defined in the optional sample label file, corresponding to the batch record position that I would like to see in the analyte trending data set. Once we have this, the JMP import shows the dashboard. It's better to show that in the video as well, step by step, which makes more sense to see. We open the second workflow, analyte trending. We pass the information from a sample label, as I mentioned before, because I created an optional sample label file. The time course sample label file is there; JMP recognizes it immediately and imports it. This is how I define the time course component with the double space and the number 3, which shows that the time starts with 3, which is day 3. Now the data table appears, and also the analyte trending dashboard view, which shows all the analytes. It's important to say that this analyte trending shows the mean of everything over a time course. All the days for which we have samples are defined here. We can set the standard error range, and we also have the filters to show only one culture station, because otherwise it won't show the right trending. This analyte trending shows only one condition. For all analytes, we can see the outliers here. We can see any unusual behavior of an analyte. We can select analytes to show just one in a customized view. We can also see if values were not measured. We can add the confidence intervals, and we can add the standard deviations if we need them, and check the different analytes one by one if we would like to. And we save the dashboard to the data table. Note that we can save any number of modified dashboards that we would like to export later. Here, there is an unusual thing. Actually, a bug was identified in this data set.
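To illustrate what the delimiter logic is doing, here is a tiny JSL sketch using a simplified, hypothetical sample label rather than the real naming convention. Note that JSL's Words() function treats every character of the delimiter argument as a separator, so this sketch assumes the individual label components contain no internal spaces.

// Simplified, hypothetical label: culture station, vessel, day, separated by double spaces
label = "CS1  V2  3";
parts = Words( label, " " );     // expected result: {"CS1", "V2", "3"}
day = Num( parts[3] );           // the third component is the time course component (day 3)
Show( day );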
I'm showing that bug on the next slide with the condition trending workflow, which is generated from the same data set, at least the first one. Here, I'm not showing the video this time because the data import would be the same as last time. As I mentioned, this is a DOE for media conditions. It's an AMBR15 small bioreactor setup. This is my favorite workflow, the condition trending workflow, where we select one analyte throughout the time course in all the conditions we have. Here, there is this bioreactor position with question marks, which was not in the sample label file. This is a bug in the system. In this case, when checking the data table, the time course components and the name of the sample label were not correctly separated. This was identified as a bug. We can see here that values are missing on specific days, like here and here, day 11 and day 6. We can also see the missing values in the data table. This is a bug which can be reported to 908 Devices; the investigation of what happened is still ongoing. The reason why I find this time course condition trending workflow really good and useful is that here we can see, for example, how unbalanced the different vessels and different conditions are, because it's a DOE with all kinds of conditions, and we have no idea about the outcome. The media are not balanced. Amino acids really show different trending. Alanine, for example, shows a completely different pattern in how the cells are growing and dying after a specific day. It shows really different trends, which is very valuable and important information for us when concluding the study. Also, if we would like to make an emergency feeding strategy change day by day, the REBEL can measure all the analytes in all conditions every day, and we can see that really fast with this workflow. This is also condition trending, but the last study I would like to show is the lifecycle management of a mAb product, where we would like to show how a new media formulation performs compared to the original media formulation. We just follow the cells growing and the product produced throughout this 14-day time course. We followed, of course, the analyte trends. A large data set was generated, and I created the sample label file, which means that after the time course component was defined, the same way as last time, the batch record positions were added with the same names to make JMP able to identify the batch records. I'm showing you this last video. These are concatenated files: two different batches under the same folder can be visualized by JMP and imported into JMP. This is also a really good feature of this add-in tool: different result files for the same study can be opened at the same time, and the data are concatenated. The sample label file is defined and read by JMP immediately. Here the definition is a little bit different because the first sample label column is also different; I identify it as day zero here. Now we see that, by default, all analytes under all conditions appear. That's why we need to apply the local data filter here. Let's see arginine, for example, which is on the slide as a screenshot. We can see clearly how balanced the media is; it is an already defined condition. The trending is the same between the two different media types, at least for this analyte. The [inaudible 00:22:57] positions are perfect here. No bug in this case. Once it's done, we can save the dashboard as before.
Of course, it can be further customized with the Graph Builder options for more advanced JMP users. This is how you can export it: by selecting an export folder, it can be exported as a picture file, or the result table can be exported into Excel. There are many export options here already. Now let's go back and conclude about this really useful tool. We can say that these complex data sets can be visualized in just a few clicks by adding a time course component and identifying the sample label components, which is really easy. It is really useful for non-advanced JMP users, because I don't believe that much JMP knowledge is needed at all to use this add-in tool. The benefit is great because it allows really fast data-driven decisions in a very short time frame, which is really important. I have a notification from 908 Devices that the upgraded statistical analysis tool, version 2.6, is about to launch. In this improved version, the separate sample label file is not needed; the sample label can be a column in the original result file, and there is no need to add an optional sample label Excel file when defining the time course and opening the trending workflows. That's all. I hope you liked it. Thank you so much for your attention.
Thursday, March 7, 2024
Ballroom Ped 4
Pharmaron has developed a platform process to generate adeno-associated virus (AAV) gene therapies with a highly adaptive toolbox to manage varying AAV products and serotypes. Our toolbox can rapidly assess a product's compatibility with our platform through a manufacturing feasibility assessment and finely tune a number of parameters for targeted process optimisation. One essential tool for Pharmaron's approach to optimisation is DOE (design of experiments). We show how a central composite DOE approach can maximise the recovery of monomeric AAV by identifying the optimal residence time, loading density, and load pH for the initial AAV purification process's capture step. The optimal loading conditions were measured using titre by Capsid ELISA and multi-angle dynamic light scattering (MADLS) and monomer percentage by DLS. DOE analysis showed a strong link between loading density and monomer content, whereby a higher loading density resulted in a higher yield of monomeric virus. Load pH and residence time had negligible effects on recovery and monomericity. Since it facilitates the analysis of multiple parameters in a fraction of the time, DOE has enabled Pharmaron to rapidly identify the optimal conditions for affinity capture. It significantly improves process performance and drives generation of a highly pure, monomeric virus.   Hello, my name is Damon Ho, and I am a Scientific Associate III at Pharmaron Biologics UK, based in Liverpool. Thank you for tuning into this talk. I'm really proud to have completed this work during my student placement via Liverpool John Moores University at Pharmaron Biologics, and even happier to announce that JMP was a huge part of the success of the optimization of our capture chromatography step. Moving on to the first section of the poster. To introduce Pharmaron to individuals who may not have heard of us before, we are a leading pharmaceutical research and development services provider with worldwide operations. In Liverpool, we currently work on viral vector-based gene therapy development and clinical manufacture, currently focusing on adeno-associated viruses, or AAVs. At Pharmaron, we have an impressive platform process that can manufacture multiple AAV serotypes and products, which is illustrated in Figure 1. This process utilizes depth filtration, followed by capture chromatography, then intermediate polishing chromatography, followed by polishing chromatography, on to our formulation steps, and finally, sterile filtration. Now, I'll introduce you to our robot and the use of JMP. We are immensely proud to share that we have a state-of-the-art high-throughput process development robot called the Beckman Biomek i7, as you can see in Figure 2 and Figure 3, that can rapidly assess a product's fit with our platform and finely tune parameters for its seamless integration into our process. Utilizing this robot for HTPD alongside the use of JMP for DoE, we can generate a high-yielding, high-purity drug product in a fraction of the time compared to conventional methods. This poster focuses on optimizing the initial capture chromatography step in our platform process for an AAV product. A DoE was designed using JMP software to determine the optimal loading conditions for processing. Three factors were chosen to be optimized, which were the residence time, loading density, and loading pH, with viral titre and viral aggregation to be measured as outputs. Now, for some quick information on how the DoE was designed in JMP.
A central composite design, or CCD, was chosen to create the DoE for a number of reasons. Firstly, a minimal number of factor combinations is required to estimate main effects, and it is able to analyze two-factor interactions and quadratic effects. The CCD also has good lack-of-fit detection, which can easily show which factors affect the chosen outcome, and graphical analysis is possible through various tools available in JMP, as you can see throughout the poster. The CCD was created in JMP by first selecting a response surface design, entering the parameter names of loading pH, loading density, and residence time, and then inputting the predefined high and low ranges. The goal was to maximize the monomeric virus content and viral capsid recovery, so these were input in the responses section. The software is very flexible, and you can always add more responses in the finished model. In this instance, as this was a conventional CCD, on-face was selected. Triplicate measurements were selected for the center point. The CCD was then ready to be generated. In total, 17 different experimental conditions were generated via the CCD model with varying low, medium, and high parameters; the breakdown of that run count is shown after this paragraph. Center points are defined as all parameters at their medium settings, and they form the foundation of the CCD model, with the low and high parameters acting as probes to test how the factors interact and influence each other. This forms a 3D response surface that can quite accurately predict the interactions of factors. Using JMP, optimal conditions can then be hypothesized and tested in a subsequent confirmation run. Our HTPD robot was utilized to perform the capture chromatography at a very small scale, allowing multiple conditions to be run at the same time, which would have taken considerably longer at lab scale. This system allows for a highly accurate and reproducible process due to its automatic nature. Once all of the conditions were run on the Biomek i7 platform, we employed our world-class analytics to measure AAV capsid titre by an enzyme-linked immunosorbent assay, shortened to ELISA, and by multi-angle dynamic light scattering, also known as MADLS. Monomeric virus content was measured by conventional dynamic light scattering, or DLS. Not only is JMP used in the design of an experimental study, but also in the analysis of the results. Once analytical data was available, it was entered into JMP, with the data able to be visualized in a number of different ways, including the contour plots displayed in Figures 4a, 4b, and 4c. A simulation was run to generate tens of thousands of virtual results, which helps build an optimized model and also allows comparison against real-world experimental data to build a confidence level in the model. This is shown in Figure 5. The contour plots highlight the impact of a higher loading density, which increases viral monomer content, shown in Figure 4a in brown. However, upon analyzing capsid ELISA recovery, which is highlighted in Figure 4b, it showed that a lower loading density, also shown in brown here, returns the highest capsid yield. This was challenged by the MADLS data shown in Figure 4c, which provided the evidence to support the impact of a higher load density in increasing viral monomer content, shown as the brown, red, and orange here in the bottom right corner. An important consideration is that MADLS does not measure aggregates, thus confirming the DLS findings.
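To unpack that run count under the face-centered layout described above: a three-factor CCD has 2^3 = 8 factorial corner runs plus 2·3 = 6 axial (on-face) runs, and adding the triplicate center point gives 8 + 6 + 3 = 17 conditions.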
pH and residence time did not have a significant impact on capsid titre, MADLS, or DLS results, so those contour plots are not included in this poster. Now, we can move on to the interpretation of the findings. Of the three loading conditions that were analyzed, loading density had the most significant impact on capsid recovery and the monomeric nature of the AAV, which is visualized by the steep gradient of the line in the left graph of the prediction profiler shown in Figure 6. Residence time, shown in the central graph, did not have any notable impact on either monomeric virus or capsid ELISA recovery, which is indicated by the near-horizontal line of the prediction profiler. Loading pH, shown in the right graph, also did not significantly impact the capsid recovery or monomeric virus, which also has a near-horizontal line. The optimum condition suggested by the DoE is a higher loading density with a shorter residence time and loading at pH condition A. This is an ideal outcome of the model, as with a higher loading density, less capture chromatography media is used to achieve the same target loading density. A shorter residence time also results in a faster process, which streamlines the AAV loading stage. Loading at pH condition A is also ideal as it does not require pH adjustment of the load material, thus saving time during load material preparation. Using these optimized loading conditions determined by the DoE, a confirmation run can then be performed to verify the DoE output. Once confirmed, these optimum conditions can then be used in a scaled-up run, which is specifically designed to achieve a high-quality, high-purity, and low-impurity AAV product. This approach led to a rapid, low-resource-demand, and low-cost assessment to optimize the capture chromatography step for our Pharmaron platform process. In total, the optimization experiments took place over only three days. This would have taken many, many weeks to complete if it were at lab scale, along with using much more AAV material and incurring the much higher costs associated with increasing the scale, as only one condition can be run at a time. The use of JMP software in tandem with our HTPD capabilities allows us to perform robust optimization of our platform process to suit specific gene therapy product needs. This greatly speeds up the feasibility and optimization stages, which are often the most time-consuming phases of the process development pathway. Ultimately, this drastically reduces the time taken from product development to the successful delivery of a gene therapy product to a patient in the clinic. I would like to end by thanking you for listening to my poster presentation, and I hope you have learned a thing or two about what we can do at Pharmaron, plus the importance of JMP in our product development pathway. Thank you very much.
An experimental design was created to study the formation of an unwanted byproduct in an esterification reaction. Four mixture component factors plus one process-based factor were used to generate a 26-run space-filling experimental matrix, specifically for analysis using Self-Validated Ensemble Modelling (SVEM). This approach was selected over a traditional mixture design intended for a polynomial Scheffe model. The resulting predictive model was an excellent fit to the data, clearly identifying the impact of each factor on the level of byproduct formed. This information was used to accelerate the development of a kinetic model and scale up the process.   Hello, I'm Andrew Fish. I'm a Senior Principal Researcher at Johnson Matthey. I'm a chemist by background, and I've been working with JMP since 2016. My poster today is a mixture-process experimental design and SVEM analysis for an esterification reaction. As a way of introduction, within the catalyst technologies business of Johnson Matthey, we sell catalysts and flowsheets for a range of different technologies, and that lends itself quite well to design of experiments and to advanced data analytics. We develop catalyst formulations, and we optimize process conditions for chemical reactions. We work at small scales in the laboratory, and we translate that to commercial-sized reactors. The reaction which we're going to focus on for this poster is an application of the Fischer esterification reaction. I don't want to dwell on the chemistry because that's not the focus of this poster. But just by way of background, what we have is a reaction where we are taking an acid and reacting it with an alcohol in the presence of an acid catalyst to form an ester product and water. This is a reversible reaction, so it can go both ways. If we don't get the conditions right, we're not going to end up maximizing the amount of our ester product. The added complication we have here is that we have a side reaction. This is reaction 2, where the alcohol can react with the same acid catalyst to produce an ether (both reactions are written out generically after this paragraph). That reaction is an irreversible reaction, which means that if the ether is formed, we can't get our alcohol back. We've consumed our reactants, and it means the reaction just isn't as efficient as it can be. What we're trying to do in this process is minimize the amount of this ether byproduct and try to maximize the amount of the ester that's formed under these conditions. To do that, we used a design of experiments. Normally, we would tend to use a generic mixture design with one process factor; we're going to take a slightly novel approach of using a space-filling design here. To summarize the factors, we have our continuous factor, which is temperature. We then have four traditional mixture components, which are the amount of alcohol, the amount of acid catalyst, the amount of ester, and the amount of water. The final component is the original acid, but we're going to fix that value at 25% of the mixture, which means the remaining four mixture components have to sum to a total of 75%. Then what we're looking for is the amount of ether that we're producing, and we're going to try to minimize that. This is a homogeneous equilibrium reaction, so all of these components are going to be present at the same time. We're going to heat it up to temperature, and then we're going to measure after 30 minutes the amount of ether that's formed.
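Written generically (the actual acid, alcohol, and catalyst are not named in the poster), the two competing reactions are: R–COOH + R'–OH ⇌ R–COO–R' + H2O (esterification, reversible), and 2 R'–OH → R'–O–R' + H2O (ether formation, irreversible).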
In a normal, traditional setup, and I'm just going to come out of JMP here, we would have a normal mixture design, which is one I've prepared before; I'll just reload this. In a traditional design, we would introduce the temperature as a continuous factor. We would introduce our four mixture components as mixture components. Then we would use a Scheffe cubic type model, so we'd add all the terms in to make that occur. That would suggest that we need around about 40 experimental runs to be able to create a data set large enough to analyze. If we look at how that looks in a traditional ternary plot, we can see here the factor combinations of the different mixture factors and the experiments we would run over those 40 runs, and you can see it's exploring the space quite well. The issue that we have, potentially, is that with the temperature we're only really looking at high and low temperatures, and we're still at the extremes of the mixture factor settings. What I'm going to do now, in the way we design this experiment, is use a space-filling design instead, and we're going to use SVEM to analyze the results of that. To build the space-filling design, I'm going to go into DoE and Special Purpose and Space Filling Design. I'm going to load in my factors, which I've prepared earlier. I'm also going to load in my response. Okay, so I've got my ether as a response, which I'm trying to minimize. I've got temperature as a continuous factor. The difference here is that instead of introducing these four mixture components as mixture factors, I'm going to leave them as continuous. I'm still going to specify the range. What I'm going to do this time is specify some constraints, so I'm going to load in my constraints. What that says is that the sum total of the four mixture components must be less than 75%, or 0.75 as a mole fraction. I'm going to put in the equivalent negative constraint as well, just to make the maths work in the background; the two inequalities are written out after this paragraph. When that's done, I can then… I'm not restricted by a polynomial model in terms of the design space and how that's followed through. I can specify how many runs I want. In this case, I've only got enough time to do 25 runs, so I'm going to select 25 runs, and then we're going to make the design. We have a 25-run design here to a lot of decimal places, which isn't going to be achievable exactly, but we're going to target these values in our experiments. If we look at the results of this versus what we did before in the ternary plot, we can see it's very similar. We're covering a lot of the space, but instead of being at the edges, there's a lot more in the middle. Again, in our multivariate plot, we can see that for temperature we're now covering a lot more of the middle of the space rather than the edges, versus the original traditional mixture, Scheffe cubic type design, and we're doing this in 15 fewer runs as well. The aim of this, really, is that once we've collected this data, we can then apply a machine learning type neural network algorithm to it instead of a traditional polynomial model, and hopefully increase the resolution and the understanding that we get out of this system. I'll head back to the poster. As I've already said, there are going to be 25 experiments. We actually made this 26, as we included a repeat. The red dot that you now see in these plots is the repeat test, and that was just to ensure that there was good reproducibility in our measurement of the ether.
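Spelling out the constraint pair mentioned above, with the mole fractions of the four free components written generically as x_alcohol, x_catalyst, x_ester, and x_water: x_alcohol + x_catalyst + x_ester + x_water ≤ 0.75, and −x_alcohol − x_catalyst − x_ester − x_water ≤ −0.75. Together, the two inequalities force the sum to equal 0.75, which is how the fixed 25% acid is respected without declaring the factors as formal mixture components.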
As I said before, the mixture factors have been treated as continuous, and the mixture sum is 75%. We're carrying out these experiments in mini autoclaves. We're going to get them up to temperature, leave them for 30 minutes, and then use an analytical technique called gas chromatography to measure the amount of ether after those 30 minutes, and that's going to be our response for these 26 experiments. We carried on and did those experiments; it didn't take too long. We then did some slight modifications to the data set. Up to this point, I've talked about these factors in terms of percentages. It's easier to work with later on if we transform this to a mole fraction. Essentially, I've just divided the percentage by 100, so we get a number that sums to 0.75 instead of to 75%. The second problem, and I'll come out of JMP again here and go to my results, is that these are our components, and in some cases the sum total adds up to more than 0.75, or more than one if we add the fixed concentration of the acid. The reason for that is that these factors have been measured as part of the experiment; we can't fully achieve what we wanted to achieve. What I've done in this case is a bit of a manual tweak. I've taken the largest component in the mixture, which happens to be the ester, and I've just adjusted that, as you can see here. I've adjusted it by 0.01 just so that everything sums to one, and that just helps the maths work in the background. For the 26 experiments in the data set, that wasn't a big issue to do. With those small adjustments made, we also confirmed that the repeat run gave us a result within experimental error in terms of the ether concentration. All good to go and progress. We then started looking at the modeling of the data set. As I've mentioned before, the reason we use a space-filling design over a traditional Scheffe cubic polynomial one is that it potentially needs fewer runs, and we can apply neural networks to it and hopefully increase the resolution, not being restricted by quadratic or cubic terms, which are very limited functions. Now, these neural networks are generally applied to really large data sets; you need a lot of data. You can't really apply them to small DoE-type data sets. That's because every run in a DoE is important. You can't afford to discount certain runs as part of a validation or a test set, because every run in the DoE counts and you would violate the design structure. Whereas you can if you use SVEM, and I don't have time, unfortunately, to go into the background of SVEM; I'm just going to show how you apply it. But a little bit of background: SVEM stands for self-validated ensemble modeling. The self-validation part: normally, when you're fitting a neural network, you would divide your data into a training set, a validation set, and a test set, so you partition your data. Within self-validation, what we're going to do is replicate the data set, and we're going to use something called paired fractionally weighted bootstrapping with a gamma distribution. Effectively, you get anti-correlated pairs of weights for each data point, one with a high weight, one with a low weight; one is used in the training set, one is used in the validation set, and you build your neural network using that. Then, where the ensemble modeling comes in, is that you build lots of those models using different weightings from the gamma distribution.
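As a purely illustrative sketch of the fractional weighting idea just described, and not the Predictum add-in's actual implementation, one common formulation draws a uniform number u for each row and uses -ln(u) as the training weight and -ln(1 - u) as the validation weight; both are gamma-distributed (shape 1) and anti-correlated, as described above. In JSL, with hypothetical column names:

// Illustrative only: one way to generate anti-correlated fractional weight pairs
Names Default To Here( 1 );
dt = Current Data Table();
dt << New Column( "u", Numeric, Continuous, Set Each Value( Random Uniform() ) );
dt << New Column( "Training Weight", Numeric, Continuous, Formula( -Log( :u ) ) );
dt << New Column( "Validation Weight", Numeric, Continuous, Formula( -Log( 1 - :u ) ) );
// A neural model would then be fit with the training weights as a weight column,
// assessed against the validation weights, and the whole procedure repeated
// (e.g., 50 times) before averaging the saved prediction formulas.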
Then you average the final models. That's effectively how SVEM works. In theory, it gives you a nicer model with higher resolution than a traditional polynomial fit. It can be really applicable for mixture designs in particular. That's what we did. This is our resulting SVEM model. It was a neural network algorithm; I used 50 models, which were bootstrapped, and averaged those. It was quite a simple neural network: there was only one layer with three hyperbolic tangent functions. Again, I'm going to come out of the presentation and just go into JMP and show you how this works. This was all done, the SVEM, via an add-in made by a company called Predictum. This is a licensed add-in. We're going to build a neural network. What I'm going to do is select my factors, which are the temperature, the acid catalyst, the alcohol, the water, and the ester, which I've transformed with the adjustment of 0.01 to make everything sum up. I'm going to select my ether as my response. I'm going to click Run, and I'm going to end up with a dialog to launch the SVEM. I can select how many bootstrap models are going to be averaged; I'm just going to leave it at 50. I'm going to leave it fairly simple and have three tanh functions in the first layer. You can modify these as much as you like and bring in linear and Gaussian functions; I'm just going to leave it as it is. It's going to go ahead and run the SVEM, and this will be different every time you do it because the fractional bootstrapping will be different each time. We should get something similar to what I had before. Here is my actual-by-predicted plot, where this is the actual value of my ether response, and this is the predicted value from the SVEM. You can see I've got a really fantastic R² of 0.99 with quite a low error associated with it. I would then save my columns to the table, which I've already done. This is here just to show you what this formula looks like versus a standard formula. Effectively, the ether concentration, or response, is a combination of these tanh functions multiplied by coefficients, multiplied by our different mixture and process factors. This here is one model, this is the second model, this is the third model, and so on, down to a total of 50 models, and then all of those models are averaged, and that gives us our prediction formula. That's effectively how SVEM works, and it gives us that really nice prediction. Now I'll go back to the poster. By way of comparison, even though it wasn't particularly designed for it, what I did was build a least squares regression model using the Scheffe cubic terms, and I used stepwise parameter selection to narrow down the terms in the model. I built that model and compared it directly against the SVEM model. Here you can see the model comparison: the SVEM predictions are in red, and the least squares regression predictions are in blue. There is a higher R² value for the SVEM model, with a much lower error associated with it as well. Also, on the residuals plot, again with the same colors, the least squares regression model has much higher residuals for certain data points. So SVEM produces a better model from what is effectively a 25-run data set; normally it's very, very difficult to apply neural networks to a DoE. I'm just going to go back and look at how that impacts the overall purpose of this work, which was to minimize the ether formation, and we can do that by looking at the prediction profiler. I just exported that prediction formula into the prediction profiler.
Again, I'm comparing the SVEM model versus the least squares regression model, with SVEM colored in red at the top. What you can see here is that there are differences between the two. We have much higher resolution with the SVEM model, whereas the least squares regression is limited to polynomial curves. Essentially, for a low amount of ether byproduct, we need a low-temperature reaction, a low level of acid catalyst in the mix, mid-range alcohol, a high level of water, and a low level of ester. There are some surprising results there for us, but we're fairly confident, after looking at that actual-by-predicted plot, that we've modeled the system quite well, and you can see differences between the two models. For example, here the acid catalyst hasn't been picked out as an important factor in the least squares regression model, and the direction of the trend versus ester is completely different. That explains the differences in the R² value and the predictions of the least squares regression. You can also see some nice features here, for example on the alcohol, where we've got a dip in the middle, versus a standard polynomial, which doesn't really pick that up as much in the least squares regression. To summarize, instead of a traditional mixture-type design, we've used a space-filling design of experiments. We've treated the mixture components, or the mixture factors, as continuous factors with constraints built in, and that's how we've accounted for the sum total of the mixture. We applied SVEM as a modeling technique to maximize the information we've got from a very small data set. We've increased the resolution of that prediction versus a traditional polynomial type model, and that's really helped us to understand the conditions to minimize ether formation in this chemical reaction. It's also accelerated the time for us to start building a kinetic model for this whole system. In terms of future work, we do also have time series data from these experiments. Instead of just having the data point at 30 minutes, we also have data points at 5, 10, 15, and 20 minutes. We can do a bit more processing and try to integrate that to calculate actual chemical rates and build a more developed kinetic model. That's where our attention is going to focus in the future. Also, many thanks to the scientists and engineers at Johnson Matthey on the technology team who contributed to this work, and finally to Predictum, who provided training in the use of SVEM and also the licensed use of the add-in. Thank you for listening to this poster presentation.
My research aims to enhance the efficiency of early-stage process development with mammalian cells in the biopharmaceutical industry by applying an intensified design of experiment (iDoE) approach. Unlike classical design of experiment, iDoE involves intra-experimental variations of critical process parameters (CPPs). This approach not only increases data-generation efficiency but also enables the consideration of temporal process dynamics through stage-wise optimization. However, a potential limitation is that previous CPP settings may (irreversibly) impact the cells and affect their response behavior to subsequent CPP set points. To address this issue, my research focuses on developing guidelines for planning and evaluating iDoEs robustly, considering the impact of process history. The focus of the presented simulation study is to investigate the impact that different effect sizes of interaction terms associated with the process history have on our regression models. Subsequently, the beta estimates and variance components of these models are compared to evaluate the impact of not explicitly considering the process history. This research has the potential to significantly impact the biopharmaceutical industry by innovating the way process optimization in early-stage development is performed, considering the dynamic nature of these processes.   Hello and welcome everyone to my short talk on the simulation study that I performed using JMP, where I simulated the impact of the process history on our iDoE regression models. Let me start with a really brief introduction. We are working with bioprocesses, and those are usually performed within bioreactors. Within these bioreactors, cells are grown, and these cells then produce our final product, which is usually an antibody. These processes depend on many different process parameters, such as temperature, dissolved oxygen, and many others. Usually, we would use design of experiment to identify ideal process parameters to maximize our response, for example the yield of our antibody or the viable cell density that we obtain within our process. These processes can be divided into different classical stages that are typical for such a cell growth process, namely the growth, the transition, and the production phase. Here on this next slide, I show two different bioreactor growth curves, the green one and the blue one. As you can see, the blue bioreactor, in this case, has three times the titer of the green bioreactor. How do they differ? They differ, as you can see now, based on the temperature profile that has been executed in the respective processes. As you can see, the blue growth curve is a lot higher; the viable cell density is a lot higher than in the green process, which leads to this increased titer. This just highlights that by understanding the process dynamics, we can potentially vastly improve the process performance of our bioprocesses. This is exactly the idea behind intensified design of experiment. In intensified design of experiment, we would divide our process into separate stages, here stage one, stage two, and stage three, and perform intra-experimental parameter shifts over time.
We would change the process parameters from the growth to the transition to the production phase, and thereby we are able to optimize the parameter settings within each of these separate stages instead of having one set of parameters that we would use for the whole process, so we can optimize these process dynamics. A certain challenge in this approach, though, is how to properly consider the effect of the process history on our regression models. For example, how do the process parameters of stage 1 influence the response behavior of our cells to the process parameter shifts in stage 2 or in stage 3? Exactly this is the aim of this simulation study: to investigate the impact of process history on our regression models. How did I simulate this? I simulated the data for an intensified design of experiment with two process parameters, in this case temperature and dissolved oxygen, and three stages, stage 1, stage 2, and stage 3. The effect of each process parameter within each stage is described explicitly, as you can see here: temperature 1 and dissolved oxygen 1 for the first stage, then for the second stage, and for the third stage. All of these process parameters are modeled as hard-to-change factors, and at the same time we are also investigating the culture duration as an easy-to-change factor, which is the temporal component. We also have the bioreactor, which is modeled as a whole plot to accommodate random offsets between the bioreactors. We model this as a linear mixed model, as sketched after this paragraph. Where can we find the process history within this? The process history can be found within these across-stage interactions, meaning: what is the impact of the process parameter settings of the first stage on the response behavior of our cells to the process parameter changes in the second or the third stage, here highlighted for the temperature in stage 1. These across-stage interactions can be used to get an idea of the influence of the process history. This brings me to the setup of my simulation. I used the Simulate Responses platform in JMP. I used exactly the design that I just showed to you on these last slides to create a model and to get an idea of what the beta coefficients in this model could potentially look like. I used knowledge from historic regression models to create this base model. Within this base model, I explicitly simulated the across-stage interactions, which I used to model the impact of the process history. These across-stage interactions are what gives us an idea of the impact of the process history on the regression models. To also get a visual idea of what the data looks like, I have plotted here the viable cell density over the culture duration for this simulated base model, colored by the effect of temperature in stage 1. But since we are interested in the impact of process history, I derived variations of this base model, creating alternative models where I varied the magnitude of these across-stage interactions. You can see here that, in this case, I doubled the coefficients of the across-stage interactions, then I halved them, and I quartered them to simulate different scenarios with a different impact of the process history. As you can see in those plots, these also result in perfectly valid-looking viable cell density curves. I used these four different models that I just showed to you as the fixed effects in my linear mixed models. I also introduced a random whole plot error and a random residual error. What I did then was to fit these models.
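As a rough sketch of what such a linear mixed model launch can look like in JSL, with hypothetical column names and only a few example across-stage interaction terms shown (not the exact model from the study):

// Hedged sketch: REML fit with the bioreactor as a random whole plot effect
Names Default To Here( 1 );
dt = Current Data Table();
dt << Fit Model(
	Y( :VCD ),
	Effects(
		:T1, :DO1, :T2, :DO2, :T3, :DO3, :Duration,
		:T1 * :T2, :T1 * :T3, :DO1 * :T2   // example across-stage interactions
	),
	Random Effects( :Bioreactor ),
	Personality( "Standard Least Squares" ),
	Method( "REML" ),
	Run
);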
First, the full model: here you can see all the eligible terms that I had in my model, all the terms that I simulated, including these across-stage interactions, which I use as a proxy for process history. The second model that I fit was a model that did not contain these across-stage interactions, so it had no way of accommodating the process history. Those two models I then compared based on their beta estimates, that is, how well they are able to estimate the beta coefficients which I knew from my simulation, as well as on the variance components. I repeated this fitting 1,000 times and afterwards looked at the respective distributions. This is what I would like to show to you next in the results section. To give you an idea of what the data actually looks like, here, for one beta, I visualized the beta estimate distribution for the full model with across-stage interactions, considering the process history, and for the reduced model without across-stage interactions, not considering the process history. As you can see, for big across-stage interactions, here times two, we have a rather big difference between the means of the beta estimate distributions. Whereas for really small across-stage interaction effects, we have a really small difference between the mean estimates of both beta estimate distributions from these 1,000 fittings that I performed. If the difference between the mean estimates of both beta distributions is zero, it means that both models' mean estimates are the same. On the next slide, I calculated the mean difference between the beta estimate of the full model, considering the process history, and the reduced model without across-stage interactions, which doesn't consider the process history. As we can see, for big effect sizes of these across-stage interactions, there's quite a big difference in the mean beta estimates between the full model and the model without across-stage interactions. Whereas for small effect sizes of these across-stage interactions, there's a really marginal difference. What I would like to show to you on the next slide are the variance components. Again, the comparison starts with the full model, considering process history, considering these across-stage interactions, and we can see that the variance components are estimated correctly here at 0.25, independent of the effect size of these across-stage interactions. Whereas for the reduced model without across-stage interactions, we can see that we have a very inflated whole-plot variance compared to the residual variance. The smaller the effect size of these across-stage interactions, the closer these variance components get to the actual simulated values of the variance components. This inflated whole-plot variance is probably due to the model compensating for the missing interaction terms by inflating the whole-plot variance term. This brings me to the conclusion, so to the final part of my presentation. What we saw was that for very small across-stage interactions, meaning a small effect of process history, the model without across-stage interactions behaves very much like the full model, which accommodates the process history, meaning that these across-stage interactions could potentially be neglected in modeling. Whereas when we have big across-stage interactions, or a high impact of process history, we can see quite a big impact on the beta estimates, visible as the difference between the mean beta estimate of the full model and that of the model without across-stage interactions.
This is partly compensated in the model without across-stage interactions by an inflated whole-plot variance. To give a really brief outlook, we would like to generalize these results, perhaps by quantifying the magnitude of across-stage interactions relative to non-across-stage interactions, to get a way to compare them to actual experimental results that we will hopefully obtain in the future. Furthermore, we would like to extend this benchmarking to further modeling approaches such as stage-wise regression models, as well as using the Functional Data Explorer for analyzing iDoE data. That's the end of my presentation. I want to thank you all very much for listening, and I want to especially thank Verena and Beate for the supervision and the guidance during my PhD project.
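For readers who want to replay the comparison described in this talk, the sketch below is a simplified Python stand-in (using statsmodels rather than the presenter's JMP/JSL setup): it simulates a small iDoE with one across-stage interaction and a random bioreactor offset, then repeatedly fits a full and a reduced mixed model and compares the beta estimates and whole-plot variance. All factor settings, coefficients, and variance values are invented for illustration and do not reproduce the study's results.

# Minimal sketch: full vs. reduced linear mixed model on simulated iDoE-like data
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)
n_reactors, n_times = 12, 5
T1 = rng.choice([-1.0, 0.0, 1.0], n_reactors)   # stage-1 setting (hard to change)
T2 = rng.choice([-1.0, 0.0, 1.0], n_reactors)   # stage-2 setting (hard to change)

def simulate(beta_hist):
    rows = []
    for r in range(n_reactors):
        wp = rng.normal(0, 0.5)                  # random bioreactor (whole-plot) offset
        for t in np.linspace(0, 1, n_times):     # culture duration (easy to change)
            y = (10 + 2 * T1[r] + 1.5 * T2[r] + 3 * t
                 + beta_hist * T1[r] * T2[r]     # across-stage interaction = process history
                 + wp + rng.normal(0, 0.5))      # residual error
            rows.append(dict(reactor=r, T1=T1[r], T2=T2[r], time=t, y=y))
    return pd.DataFrame(rows)

full_b, red_b, full_wp, red_wp = [], [], [], []
for _ in range(100):                             # the talk repeated the fitting 1,000 times
    df = simulate(beta_hist=2.0)                 # a "big" process-history effect
    full = smf.mixedlm("y ~ T1 + T2 + time + T1:T2", df, groups=df["reactor"]).fit()
    red  = smf.mixedlm("y ~ T1 + T2 + time", df, groups=df["reactor"]).fit()
    full_b.append(full.params["T1"]); red_b.append(red.params["T1"])
    full_wp.append(full.cov_re.values[0, 0]); red_wp.append(red.cov_re.values[0, 0])

print("mean beta(T1)       full vs reduced:", np.mean(full_b), np.mean(red_b))
print("whole-plot variance full vs reduced:", np.mean(full_wp), np.mean(red_wp))

Comparing the whole-plot variance of the two fits mirrors the talk's observation that omitting the across-stage interactions (which are constant within a bioreactor) tends to push their contribution into the whole-plot variance term.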
Thursday, March 7, 2024
Ballroom Ped 7
Aerospace-grade formulations are often composed of several ingredients whose ratios and interactions will impact one or more properties of the final component. Theory and experience can help with the design of these formulations, but sometimes there are interactions or synergies that have not been discovered yet. Therefore, it can be useful to explore a wide experimental space to discover the unexpected. In this presentation, I share the results and insights obtained after running a mixture design, including how to visualize, normalize, and analyse the data. I also discuss ternary plots, how to communicate technical information to a nontechnical audience, the challenges encountered, and what could have been done better.   Hello, my name is Carlo Campanelli, and this poster is about the use of a mixture design to optimize an aerospace formulation. Resin systems for aerospace applications typically comprise multiple components that need to be balanced to simultaneously meet thermomechanical, safety, and regulatory requirements. The testing process can be long and expensive, with multiple sources of variability. Furthermore, there are often time constraints which limit the number of formulations, tests, conditions, samples, and repeats that can be done. The objective of this work is to improve specific material properties while keeping all other product characteristics and performance unaltered. This is often challenging as it can be a zero-sum game, and generally it is a matter of finding the best compromise rather than finding the best formulation. The best compromise can change depending on the specific customer or the specific application. The approach for this work is to use a mixture design with 3 variables, 15 runs, and 1 repeat. Here on the top right, we can see a picture showing the experimental space of the mixture design containing the 15 formulations tested. The components, or variables, are X1, X2, and X3, and their sum is always 100%. Here on the right, we can see an image showing how to read the ternary plot and how the sum of the three components X1, X2, and X3 is indeed 100%. For this work, I've tested several properties, but I've reported three of them in the form of color-coded ternary plots, where green and red mean good and bad values, respectively. Starting from property one, we can see how the formulations at the bottom of the ternary plot have a good value, but this property tends to decrease by going up in the ternary plot. Going up means that we are increasing the amount of the X1 component, while going from left to right, so changing the amounts of the X2 and X3 components, doesn't have an impact on this specific property. This is an example of a property that is dominated mainly by one component, X1 in this case. Now, looking at the second property, we can see a similar but opposite trend. By going up, this property gets better: by increasing the amount of X1, we're improving this specific property. But in this case, going from left to right, we have a decrease in value. For this specific property, we can see that it is influenced mainly by X1 and X3. This causes us to have three sections where we have good properties, average properties, and bad properties. Regarding the third property, according to theories extrapolated from the literature and direct technical experience, property three should be better for formulations at the bottom of the ternary plot. This is generally true, as we can see here, but there is an exception.
It is this top-right formulation, which is a little bit of an outlier. This highlights the importance of screening a wide space and not having a bias toward certain parameters, because if we were to follow the theory and only test or analyze the bottom formulations, we would likely have missed this top-right formulation. Finally, the Prediction Profiler is useful to compare multiple properties and trends at the same time, to use the desirability functions to maximize the most desired properties, and to visualize the confidence intervals. In this case, I've plotted seven properties against the three variables, and we can observe all the trends at the same time. We can also see that for some properties the confidence intervals are very narrow, while for some other properties they are very wide. We should be careful when we draw conclusions from this data. Now, about the learnings and future work. This mixture design has highlighted several trends and some unexpected results. This will help with the optimization and tailoring of several products. It would have been beneficial to have more repeats, as some properties have quite wide confidence intervals, and it is important to identify and understand the main sources of variability. A more in-depth study of the data generated is needed to find the correlations between the measured properties, explain outliers, and make connections with previous studies to confirm or disprove theories and see the bigger picture. In fact, I believe that it is important to make full use of the work and the data that we have generated. This is everything for this work. Thank you for your attention.
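As a companion to the ternary-plot discussion above, here is a small, self-contained Python sketch of the kind of model that typically sits behind such plots: a Scheffe quadratic mixture model fitted without an intercept, used to predict a property over candidate blends. The data, coefficients, and component behavior are entirely made up; the real study's properties and formulations are not reproduced here.

# Minimal sketch: Scheffe quadratic mixture model for a three-component blend (X1 + X2 + X3 = 1)
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
# hypothetical mixture runs (each row sums to 1) and a measured property
X = rng.dirichlet(alpha=[2, 2, 2], size=15)
y = 60 * X[:, 0] + 40 * X[:, 1] + 30 * X[:, 2] + 25 * X[:, 0] * X[:, 2] + rng.normal(0, 1, 15)

def scheffe_terms(X):
    # pure-component terms plus pairwise blending terms; no intercept in a mixture model
    x1, x2, x3 = X[:, 0], X[:, 1], X[:, 2]
    return np.column_stack([x1, x2, x3, x1 * x2, x1 * x3, x2 * x3])

model = LinearRegression(fit_intercept=False).fit(scheffe_terms(X), y)
print(dict(zip(["b1", "b2", "b3", "b12", "b13", "b23"], model.coef_.round(2))))

# predict the property over many candidate blends to locate a sweet spot in the simplex
grid = rng.dirichlet(alpha=[1, 1, 1], size=2000)
pred = model.predict(scheffe_terms(grid))
print("best blend:", grid[pred.argmax()].round(3), "predicted property:", pred.max().round(1))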
Design of experiments (DOE) has always had an intrinsic contribution toward sustainability. Simply by minimising the number of experiments to reach the target desired, significant savings in resources can be obtained. However, it is not only about using DOE, but also combining it with the Prediction and Contour Profilers. These profilers enable scientists and engineers to reach optimal products and processes, generating some “secondary” contribution to sustainability. For example, by making a process more efficient, it's possible to generate less waste and/or spend less energy. Achieving a better or more efficient product could save resources in the application of that product or enhance its lifetime by again generating less waste. In this paper, we show how at Johnson Matthey, a global leader in sustainable technologies, we consider sustainability, not only in relation to the application of our products but from the very beginning at their formulation. We explain how we use DOE and the profilers to enable us to formulate taking not only performance into account but also trying to minimise the footprint of the formulation itself, and therefore including sustainability in multiple aspects of the product.     Okay. Hello, everyone. Today, I'm going to go through our contribution to the Discovery Summit. The title is Utilizing Design of Experiments and the Prediction Profiler for Creating Sustainable Formulations. But first of all, who are we? My name is Pilar. I'm a research scientist at the speciality chemicals company Johnson Matthey, and I'm the lead of a group called the Statistical Thinking Team, whose aim is to extend the use of statistical tools around Johnson Matthey. Here I have my colleagues, Patricia and Jenny; they are going to introduce themselves. Hello, I'm Patricia Blanco. I also work at Johnson Matthey. I'm based at the research center. I am a chemist by background, and I've been working in the field of coatings and formulations for about 20 years. Hello, I'm Jenny. I work at the Technology Center, and I work on LCA within the sustainability team, helping scientists to make sustainable decisions during their research projects. For those that don't know Johnson Matthey, I'm going to introduce it a little bit. As you know, the world is facing some of its biggest challenges yet, and we need to change, to transition to net zero and ensure a more sustainable future. That's where Johnson Matthey comes in. Through our inspiring science and continued innovation, we are catalyzing the net zero transition for millions of people every day. We have a 206-year history and over 12,000 employees all around the world. Our technology is based on advanced metals, chemistry, catalysis, and process engineering. This expertise has been established over many years and underpins all our businesses. In JM, we are committed to sustainability, and Jenny is going to tell us all about it. One of our core values at Johnson Matthey is to protect the planet and the people. The way that we're looking at achieving this is by looking at the way that we develop our portfolio of technologies, at the operations we use to produce those technologies, and at how we can work with customers as well as suppliers to create a sustainable product within our portfolio. We're looking at protecting the climate. We want to protect nature and advance the circular economy, and we're looking at promoting a safe, diverse, and equitable society.
We're using ESG ratings to show that we're tracking how we are becoming a more sustainable company. Thank you. Today, we are going to talk about how we use Design of Experiments and sustainability. For us, Design of Experiments has an intrinsic contribution to sustainability, since just by reducing the number of experiments, you are saving resources, materials, and energy. Also, if you're applying Design of Experiments, for example, to make one of your processes more efficient, then you are producing less waste and using less energy. If you are applying your design of experiments to make a more efficient product, then you might be saving resources in the application of that product, or you might be increasing its lifetime and therefore decreasing the amount of waste. All these applications of DoE to sustainability are more directly related to the application of the products. In this presentation, we wanted to go further and talk about DoE and sustainability through the formulation of that product, going backwards to before that product even exists. Formulations in JM. Product formulation is one of JM's core capabilities, and it requires a multidisciplinary approach where chemistry, modeling, manufacturing, processing, and engineering need to work together to be able to deliver JM science in a form that is suited to the customer's needs. As I said in the introduction, I'm part of the Formulations team at JM's Technology Center in the UK. As a team, we provide support to all of the JM businesses in a range of projects that go from fundamental long-term understanding to fast critical support for product manufacturing. We also build and extend our science base by developing new techniques and skills in our labs, and also by collaborating with external experts. We use statistical tools to design our experiments and to get the most value out of our data. Most of JM's products are formulated, and I'm just showing a few pictures here as examples. Starting from the left, we have catalytic converters, which are coated with a layer that contains an active catalyst. The black picture in the middle is a fuel cell ink; both fuel cell and electrolyzer catalysts are deposited to produce coated layers from inks. Then powders are really important as well, because they are used to produce granulated or pelletized catalysts that are used in the production of chemicals. In all of these examples, it's crucial to understand the materials and how they interact with each other. Formulations are complex systems, and using JMP tools like Mixture Design of Experiments is important; they help us extract the most value and understanding when studying formulated products. In the demonstration that we are going to do, we have run a Mixture DOE on a formulation, looking at the performance of the formulation, which will be introduced into the Profiler in JMP. But we will also introduce a model related to sustainability, based on life cycle analysis concepts. Both of them are looked at together in a Profiler to obtain an optimized formulation that takes into account not only the performance, but also the sustainability of the formulation. If you haven't heard much about life cycle analysis before, Jenny will go through it. Life cycle analysis is a method we can use to quantify any direct or indirect environmental impacts that may be associated with a full production chain, so that's from Cradle-to-Gate or Cradle-to-Grave.
What LCA does is help to promote the design of sustainable products or processes, and identify any red flags that may appear, so we can do something about them sooner. This will lead to us having less environmental impact and fewer toxic materials released into the environment. LCA specifically focuses on three major targets: human health, ecosystem health, and resource availability. What we're doing at the moment is focusing on the carbon dioxide equivalent, so that we can look at the carbon footprint of products from Cradle-to-Gate with Pilar in the DOE. I'm going to leave the slides here and do a demonstration directly in JMP. We have a mixture of three components: a main component, a modifier, and a solvent. Let me open the ternary plot and see how it looks. It's this blue one. This is our ternary plot, with the main component, modifier, and solvent. Obviously, formulation systems are a lot more complex than that, but we have done this little example with just three components so they can easily be visualized in the ternary plot. As is very typical in formulations, the experimental space is very small compared to all the possible combinations, so we normally need to work within a very small space. But although it looks like a very small space, very small changes in the formulation within this space can cause very drastic changes in terms of performance. We designed a mixture design, in this case based on a space-filling design with constraints, as you can see, which gives us a good distribution of the data points around the experimental space. The data that we collected is a property related to the performance that you can see, and the data was modeled. We now have a model of how different formulations give us different performance properties. At the same time, we also have a model for the carbon footprint, the kilograms of CO₂ equivalent produced per kilogram of formulation, taking into account the carbon footprint of each individual ingredient. We are going to visualize each of these models in a ternary plot. If we start with the carbon footprint, for example, here I've zoomed in on the experimental space, shown in white. You can see, through the contour lines in red, how the carbon footprint increases going towards the left. If we want to minimize the carbon footprint of this formulation, of this product, we need to move towards the right, in this direction. If we look at the model of the performance property, you can see that the model is more complex, and the values move between 20 and 60. You can see the 20 and the 30, and we also have the areas for the 40 on the dotted lines. If we want a performance property of 40, then we need to work there. We also have the 50: if we want to work around 50, we need formulations that are around this area, and we also have a sweet spot of 60. Say we are looking at maximizing this property; then we need to choose formulations within these spaces, and we will be especially interested in this area for the formulations. If we look at it all together, we have the contour lines for the performance property in red and the carbon footprint in blue.
Seeing it all together allows us to choose formulations that sit as close as possible to the area of interest to maximize the performance property, while also trying to minimize the carbon footprint. We will probably choose a formulation on the right side of this area to be as close as possible to lower values of carbon footprint. This is obviously just visual, and the visualization gets more complicated when you have more components. But you can do it all through the Mixture Profiler, and you can also use other tools in JMP such as the Prediction Profiler or the Design Space Profiler to help you optimize the system, taking into account different outputs, as in this case the performance but also the carbon footprint, and find a solution that is a compromise between the two aspects we are looking into for the formulation of a product. In summary, it is highly important to provide scientists with tools to be able to consider sustainability in early-stage research; otherwise, these decisions are very difficult to make. JMP, with design of experiments and the profilers, has proved to be a very useful tool for scientists to optimize the performance, the sustainability, and many other aspects we might consider in a product or a formulation, such as cost, for example, in a simultaneous way to obtain the best overall solution. This is all from us. Thank you very much for your attention.
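To make the performance-versus-footprint trade-off concrete, the following Python sketch mimics the idea on a toy three-component mixture: a hypothetical fitted performance model, a carbon footprint computed as a linear blend of assumed per-ingredient kg CO2e values, and a simple search for the lowest-footprint blend that still meets a performance target. None of the numbers come from the Johnson Matthey study.

# Minimal sketch: trade off a performance model against a carbon-footprint model over a mixture
import numpy as np

rng = np.random.default_rng(3)
# kg CO2e per kg of each ingredient (assumed values for illustration)
footprint_per_kg = np.array([4.0, 9.0, 1.5])    # main component, modifier, solvent

def performance(x):
    # hypothetical fitted Scheffe-type performance model (coefficients assumed)
    x1, x2, x3 = x[:, 0], x[:, 1], x[:, 2]
    return 30 * x1 + 55 * x2 + 20 * x3 + 40 * x1 * x2 + 15 * x2 * x3

# candidate blends inside a narrow region of the simplex, as is typical in formulations
cand = rng.dirichlet([8, 3, 6], size=5000)
perf = performance(cand)
co2 = cand @ footprint_per_kg                   # linear blending of ingredient footprints

ok = perf >= 50                                 # performance target (assumed)
best = np.argmin(np.where(ok, co2, np.inf))     # lowest footprint among in-target blends
print("chosen blend:", cand[best].round(3),
      "performance:", perf[best].round(1),
      "kg CO2e/kg:", co2[best].round(2))

This is the same logic the Mixture or Prediction Profiler supports interactively: overlay the two responses and pick the compromise point, rather than optimizing performance alone.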
In this presentation, we introduce a novel design of experiments (DOE) methodology developed in-house. Named "Dynamic DOE," it is specifically tailored for time-dependent DOEs in chemical development using kinetic reaction data. The development of this innovative approach addresses the challenges faced in traditional DOE methods when dealing with time-sensitive chemical processes. We present benchmark data comparing different DOE designs and their performance in combination with various regression techniques. This comprehensive analysis demonstrates the advantages of the Dynamic DOE methodology in terms of accuracy, efficiency, and adaptability. Furthermore, we showcase real-life application examples from late-stage chemical development at Boehringer Ingelheim. These case studies illustrate the successful implementation of the Dynamic DOE technique in combination with high-throughput automated lab reactors, highlighting its practical benefits and potential for widespread adoption in the industry. Join us to learn more about chemical development advancements through the Dynamic DOE methodology, an innovative technique that seeks to change the way we utilize time-dependent experiments in the field.   Welcome, everybody, to our JMP Discovery Summit talk with the title Time-Dependent Design of Experiments in Chemical Development. Before I start, some words about the company we're working for, Boehringer Ingelheim. We are a global pharma company, founded quite some time ago, in 1885, by the Boehringer family, and we are, to this day, family-owned, so not listed on the stock market. Our business focuses on three areas: first, the human pharma business, which is our biggest business, then animal health, and we also have a small biopharma business. We have more than 50,000 employees distributed over 16 sites worldwide. In 2022, we made a revenue of nearly €25 billion. With that, on to what Jonas and I are doing: we both work in chemical development, and we develop chemical processes for supplying clinical studies and, later on, the market. In more detail, that means we develop new chemical routes to synthesize our active pharmaceutical ingredients. We develop robust, scalable, and sustainable processes, first in the lab, and later on we scale them up to pilot plant scale and then to plant scale. Doing that, we supply drug substance for all clinical phases from one to three, and we generate all data necessary for later market submission. As we always try to accelerate timelines to meet patient needs faster, we actively develop new technical and digital solutions to speed up the development process. With that, let's come to our main topic, chemistry. Chemistry is more or less the science or art of combining or changing molecular fragments to build up bigger and more complex molecular fragments, which then finally become our active ingredient. To do so, we have chemical processes which consist of a large number of parameters we can change, for example, reagents, solvents, the stoichiometry of all those things, and then physical parameters like temperature, time, et cetera. As you can see, it is a multifactorial problem with quite a large range of target variables we want to influence. Most prominent is always the process yield: we want to get as much as possible out of our processes. At the same time, we want good purity, so we want to decrease any impurities that might be formed. Furthermore, we are highly interested in reducing the environmental impact of our processes.
So we want to use less solvent and fewer reagents, we want to utilize green solvents and reagents, et cetera. And last but not least, we have to develop robust processes, meaning we have to make sure that we know in which range we can vary our parameters without affecting product quality, and we want to be able to set up a design space. Statistically speaking, this can be considered a multifactorial, multi-response black-box optimization problem, which we want to optimize as efficiently as possible while making sure we find our global optimum. Obviously, a perfect use case for DOE. We're actively using DOE in multiple settings. In early phases, when it comes to screening, we use screening DOEs to efficiently find, within a large range of parameters, our influencing parameters and a possible sweet spot. Later on, we go to optimization DOEs, Response Surface Designs, to really hit the sweet spot and the global optimum. Later still, when it comes towards market submission, we characterize our processes using, again, quite efficient DOEs to screen all parameters with respect to a possible influence on drug substance quality. DOE is a great methodology. We really like it, but it has some challenges we came across. First of all, design robustness, especially when it comes to screening DOEs. What we quite often see is that the high/low settings for influencing parameters are not set correctly by our experts in the labs, meaning low was set too low, or high was set too high, so at those settings our reactions fail completely, or we are operating in an unstable region where we don't get a stable response. Secondly, DOEs are highly efficient; however, if you want to optimize a large range of parameters, you still tend to need quite a high number of experiments, and that's a problem especially for late development, as there we're working on quite a big scale in the labs, meaning we can't perform many reactions at once, and furthermore, reactions are quite costly on that scale. That's sometimes a problem. We wanted to tackle those problems, and we wanted to address them with multiple things. First of all, the problem of design robustness. If performing all experiments at either the low or the high setting means that 50% of your reactions might fail, the obvious solution is simply to distribute your experiments more evenly across the complete parameter range, so not only at low and high, but also in the middle of your design space. Maximizing efficiency: there we got inspiration from the literature. Chemical processes are governed by chemical rate laws, so physical differential equations, meaning there is a lot of information in the time course of a reaction, and the idea was to incorporate this time information better into our DOEs by sampling multiple time points for one reaction, thereby utilizing the full information of each reaction and getting the most out of every reaction. Last but not least, automation. DOEs are highly repetitive and therefore the perfect candidate for a high degree of automation. We wanted to optimize the execution of DOEs in the lab as much as possible. With that, let's go to our project scope. In this project, we wanted to investigate: can we use different DOE types to overcome this problem of design robustness? Furthermore, are different DOE types more efficient when it comes to using this new kinetic data? Second, regression methodology.
Obviously, we wanted to incorporate what is for us a new kind of data, and we wanted to see whether some regression methodologies work better than others. Third, how much data is needed: can we reduce the number of experiments needed for conducting one DOE? Last but not least, obviously, how to bring this approach to our labs in an automated fashion. When we wanted to evaluate all those possible combinations, we quite early came across the problem of benchmark data. Obviously, we need data to try different combinations. If you just look at one DOE, and you want to consider nine different design types, 18 reactions each, with 10 time points per reaction, you get quite a high number of DOEs to perform, with an even higher number of reactions. We came to about 25,000 reactions. That's obviously not feasible for us; we can't conduct that many experiments. Therefore, we had to come up with a different approach. We went back to the fact that chemical reactions are governed by the underlying rate law, a differential equation with which you can describe the chemical process more or less exactly, and we used that. We took a chemical reaction, measured kinetics, and solved this rate law, so we had a more or less in silico representation of one specific reaction. Now, using this rate law, we were able to simulate the outcome of different DOEs. We just performed many different DOEs, screened different DOE types, and used the rate law to compute the outcome of the reaction in a time-dependent fashion. We calculated responses for multiple time points for each reaction. The outcome you can see here: we get nice kinetic profiles. They vary quite significantly, from reactions that perform well to reactions where nothing happens. This data was then transferred to a normal JMP table, and just to give you a feeling of what those tables look like: we just stacked all the experiments. Experiment one consists of three time points; our influencing parameters are the same, only time changes, and our response, in this example the yield, varies. With this approach, we were now able to pursue an in silico approach. Obviously, we had to use some experimental reactions as a basis. We selected two different reaction types, depicted here. The first reaction is a reagent-mediated coupling. What does that mean? We have two substrates, shown in green. Substrate A is activated by the reagent, and an activated complex is formed. This complex then reacts with our starting material two, forming the product. We also have the formation of an impurity, here depicted in gray. In this reaction, we investigated five different parameters: temperature, concentration, and stoichiometries, plus time. The second example is an enzymatic reaction, where, again, we couple two substrates. Substrate A is activated by the enzyme, forming an activated complex. Substrate two is activated by a chemical reagent, and we form product and impurity. All in all, we generated four different responses, two where the response is the product and two where the response is the yield: all in all, four different data sets. Then we set up a workflow in JSL. The workflow consisted of two parts. The first part was the selection of different DOEs, compiling those DOEs with different numbers of reactions. There we screened everything from six reactions per DOE to 24 reactions per DOE. We also investigated different numbers of time points sampled for each individual reaction.
There we tried everything from three samples per reaction to 12 samples per reaction. Those DOEs were then set up. We simulated the responses using our pre-fitted kinetic rate laws, and then we took all that data and regressed it with different regression techniques available in JMP Pro. There we not only tried different regression techniques, but we also added different levels of normal noise to our responses to check whether noise has an influence on the combination of regression methodology with DOE type. The obtained models were then validated using an external validation set consisting of 10,000 samples, and we calculated the RMSE for each model. Then, to be able to compare the RMSEs between our four different data sets, we normalized each RMSE to the RMSE of a mean-prediction model. Finally, we calculated the mean normalized RMSE over all four data sets. That's what we show in the next slides. First, we started by investigating which regression methodology works best with which type of DOE with respect to prediction performance. For this exercise, we always consider 18 reactions per DOE, with 12 samples per reaction and no noise. The results can be seen in this heat map. What can you see in this heat map? Here, the mean normalized RMSE is plotted, the mean of the normalized RMSE over our four different reactions. On the Y axis, you can see the different DOE types we have been investigating, and on the X axis, you can see the regression techniques. Let's first come to the DOE designs. In the bottom part, the more classical design types are shown, D-optimal and I-optimal. You can see a lot of gray or dark orange, meaning we don't get good performance from the final models there. However, coming to Space Filling Designs, where you have a more even distribution of your points across the complete parameter space, we see much better and much more consistent performance. Especially Fast Flexible, Uniform, and Latin Hypercube designs show a lot of green or light yellow. Coming to the regression techniques, we see more or less three different techniques that work quite nicely. First of all, Functional Data Analysis, and that's not really surprising, as this type of analysis has been specifically made for analyzing dynamic data; that was something we expected. Second, Boosted Neural Networks perform quite nicely in combination with IMSE Optimal designs. However, the consistent top performer across all four data sets is Gaussian Process regression. We investigated two different types, the fast version and the normal version. They perform significantly better than all other regression techniques. That quite surprised us, but on the other hand, it is quite nice, as GPs are fitted really easily in JMP Pro. You can see they work best with Fast Flexible, Latin Hypercube, or Uniform designs. We identified a combination of regression technique and design type which seems to work best, and we now wanted to analyze that in more detail. First, we wanted to know how many reactions are really needed to get predictive models, and what the influence of sampling multiple time points for each reaction is. Let's first cover how many reactions are needed. In this plot, you can again see the mean normalized RMSE versus, in this case, the number of experiments per DOE. What can be seen is not really surprising: the more reactions you perform, or the larger the DOEs are, the better the model performance will be.
However, what can be seen is that for GPs, shown in yellow, we see a plateau starting at around 20 reactions for those four data sets, and that's something we observe in reality as well. Normally, four times the number of parameters being investigated is a good indicator for the number of experiments you should perform to get well-performing models. For Functional Data Analysis, shown in gray, and Boosted Neural Networks, shown in green, we don't see any plateau, so here we should perform more experiments than plotted here. The next question was: does time point sampling really improve our model performance? If you look at the yellow line for Gaussian Process, the bold line shows the use case with 12 samples per reaction versus the dotted line where only three samples per reaction were taken, and you can't see any difference, meaning time point sampling doesn't improve the performance of our model, which was quite disappointing for us. However, going to Functional Data Analysis and Neural Networks, there you see a difference: there, 12 samples increased model performance. In the second step, we wanted to see if noise influences this analysis, as up to now we had always looked at noiseless data, which is not realistic for our lab use case, and we wanted to see, does that influence the results? And yes, it does. Now we see a clear difference between 12 samples per reaction, in bold, versus only three samples per reaction, as a dotted line. Meaning that by using more time point samples per reaction, we can boost the performance of our models, or dampen the influence of noise on the final performance of our model, which is really great news for us, as taking more samples per reaction doesn't cost anything. We get that more or less for free. That's an easy way to boost model performance without having to pay anything. The same holds again for Boosted Neural Networks and Functional Data Analysis. However, in all analyses, GPs were always much better than any other combination. Okay, so we could show in this first analysis that GPs in combination with Space Filling Designs seem to work well. However, the question was, do they really work better compared to conventional approaches? That's why we simulated those approaches as well. We set up two different approaches. First, No Sampling but Time Prediction, meaning a classic DOE approach, considering time as a normal influencing factor, meaning we only take one time point sample per reaction. The second approach would be No Sampling and Fixed Time Prediction, meaning setting up a DOE without considering time at all; we set up a DOE model for one specific time point during our reaction. Again, we simulated those approaches for all combinations of regression methodologies with design types, again, in this case, plotted for 18 reactions with 12 samples and no noise. What can you see? On the very left, we show our new Dynamic DOE approach, and yes, GPs in combination with Space Filling Designs work best. The second approach, time as a normal factor, so only one sample per reaction, works comparably well. Interestingly, GPs and Space Filling Designs work best here as well. Last but not least, the approach with no time prediction at all, a DOE model for one specific time point, works best. That's not really surprising. What is surprising is that here as well, GPs in combination with Space Filling Designs seem to work best, which is surprising as this is a standard DOE approach.
In the second step, again, we wanted to investigate the influence of noise, as that is what we ultimately have to deal with. What we observed here was quite nice: now our Dynamic DOE approach works best compared to all other approaches. Time as a normal factor doesn't perform at all; we don't get any predictive models that outperform a mean model. Even a DOE model specifically made for one time point is worse than our Dynamic DOE approach, clearly showing that using multiple time points for one reaction really improves model performance, especially when it comes to noisy data. With that, I'm at the end of my part, and I want to hand over to Jonas. Great. Thanks, Robert. Now, I would like to illustrate the implementation of this Dynamic DOE approach within chemical development at BI from a technical and practical perspective, as well as show the performance of this approach on one example reaction from BI's development portfolio. First of all, the experimental realization started in a conventional wet lab in our laboratories, which is the most conventional setup for an organic lab, where reactions are conducted in a typical glass reactor or glass flask and only one reaction at a time can be conducted. Every step is operated manually, including the sampling of the reaction. This approach not only limits the throughput of reactions, but also creates deviations in the generated data due to varying operators and equipment. Therefore, we opted for a semi-automated system that automates single operations like dosing or sampling of the reaction. Although some of the steps are still done manually and only one reaction at a time can be conducted, this approach still accelerates the overall throughput and increases reproducibility of the generated data due to harmonization of some operations. To further increase turnover and reproducibility, and to decrease development timelines to meet patients' needs quicker, we opted for a fully autonomous system that can conduct chemical reactions completely autonomously, without human or manual interactions, and is accelerated even more due to parallelization. It also ensures reproducibility due to harmonized procedures and equipment. How this system works is shown in the next slides. A bit of technical insight into this platform: it is a fully autonomous system that can conduct parallel experiments at a 100 milliliter scale, with different reaction conditions for all parallel experiments. This system allows us to record reaction kinetics in the way the specific reaction requires. Therefore, the whole system enables reliable Dynamic DOE conduction and reliable data that is highly reproducible. The system features six 100 milliliter stainless steel reactors, here in the front, that are heatable, coolable, and can be stirred under an inert atmosphere to conduct the reactions in. To set up the reactions and operate those reactors, we have liquid and solid handling tools to dispense reactants, solvents, and reagents directly into the reactors. The liquid handling system can also be used to take samples, prepare the samples, and inject them into analytical instruments. Both tools directly dispense all necessary substances into the reactors, where the reactions then take place. This is the operative setup; now to the realization of the whole Dynamic DOE workflow with an example from BI's development portfolio. First of all, how is the Dynamic DOE set up?
First of all, the design for the factors of interest is created based on the results Robert just showed you. We opt for a Latin Hypercube screening that allows us to cover the parameter space sufficiently within a reasonable number of single runs. As Robert said, we usually opt for three to four experiments per parameter. In the example reaction, those would be 10 factors with these limits. We started with 30 runs and set up a Latin Hypercube design. Next, a separate design for the sampling times is created. The sampling times are distributed pseudo-randomly over the whole reaction duration to avoid sampling every reaction at the same time points, which would lead to overrepresentation of those time points and, later on, to overfitting. We rather opt for an evenly distributed data landscape over the whole reaction duration. In this example case, the reaction duration was 185 minutes, which was split into 10 samples with windows of 18.5 minutes. Additionally, we defined constraints to avoid sampling times being too close together, and we set it up as a Fast Flexible Filling design; a small sketch of this sampling-time idea follows below. Afterwards, both designs are combined, and the single runs are stacked by means of their sampling times, as Robert already showed. Because this is very tedious and click-intensive work, we created a JMP app that takes over all these steps and allows us to enter factors like temperature, add further factors, set the low and high factor limits, specify the design for the parameters of interest, specify the number of time points, the number of runs, and the minimum and maximum reaction time, with all these steps now done in the background. We end up with two tables: one which includes the runs and their conditions, and one that spreads out every run, including the sampling times. Both can now be used to conduct this DOE on the robotic platform that I just showed you. The results are then filled into the tables. The time points suggested or defined previously are replaced with the actual time points, which can vary slightly for technical reasons in conducting the reactions; the same is true for reaction temperature. After those tables are filled completely with all the results, including conversion of starting materials to product and formation of side products, we can move on to the regression of this whole process to obtain models that predict or describe the underlying process sufficiently, and to obtain prediction formulas which allow us to do a Design Space Analysis. I would like to show this using the example of a Suzuki reaction, which was part of one of BI's current development projects and is a very common reaction in pharmaceutical chemistry processes. This reaction involves the catalytic transformation of two starting materials with a palladium catalyst to product. In this case, one side product was formed from one starting material, and under those catalytic conditions, two further side products were formed from both starting materials. The Dynamic DOE approach was applied to screen for optimized reaction conditions and investigate process robustness. As shown previously, 10 factors were investigated within 30 runs, and 10 samples per reaction were taken, which were analyzed via HPLC; the respective area percent values were used for the analysis. Those are the factors and their limits, as I already showed in the JMP window.
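The sampling-time idea just described (one pseudo-random time per window, with a minimum spacing constraint) can be sketched in a few lines of Python. The talk itself uses JMP's Fast Flexible Filling design and a custom JMP app, so this is only an illustrative stand-in; only the 185-minute duration and the 10 samples per run come from the example, the minimum gap is assumed.

# Minimal sketch: pseudo-random sampling times, one per window, with a minimum gap
import numpy as np

def sampling_times(total_min=185.0, n_samples=10, min_gap=5.0, seed=0):
    rng = np.random.default_rng(seed)
    window = total_min / n_samples              # 18.5-minute windows in the example
    while True:                                 # rejection sampling until the gap constraint holds
        times = np.array([rng.uniform(i * window, (i + 1) * window)
                          for i in range(n_samples)])
        if np.all(np.diff(times) >= min_gap):
            return np.round(times, 1)

# a different pseudo-random schedule per DOE run, so runs are not all sampled at the same times
for run in range(3):
    print(f"run {run + 1}:", sampling_times(seed=run))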
After setting up this whole Dynamic DOE, conducting the reactions, and analyzing the experiments, we end up with a data table that looks like this: we have the factors, we have all the time points, and we also have the final responses and the analytical data. With these, we can do a Gaussian Process regression, which allows us to investigate the model quality, study factor significance and, very importantly, study the influence of the single factors on the responses as well as their interactions. We can see that, of course, time has a very significant impact, as does temperature, both on conversion and on impurity formation. By defining desirabilities, where we want to maximize product formation and obviously minimize side product and starting material occurrence, we can optimize the reaction conditions, which then leads to optimized conditions where we can see a clearly favored time point and reaction temperature. From the prediction formulas we obtained from the regression, we can then create a second profiler, which allows us to do a Design Space Analysis. That helps us to identify factor limits within which the underlying process would furnish in-spec results. For this, we first need to specify specification limits for all responses. After creating a random table connected to this Design Space Profiler, we get a plot that shows us within which factor ranges the process furnishes in-spec results. In this case, 80% of the results would be in spec, and we can see that there are factors that can be applied over a very broad range, like potassium carbonate, isopropanol volume, the temperature ramp in this case, or the ligand. On the other hand, there are factors that need to be kept within a very specific range to obtain in-spec results. Of course, one of these is time, because the conversion only becomes high enough after a specific reaction time, and in this case temperature also seems to be very sensitive. That allows us to reliably optimize the reaction conditions and investigate process robustness within a very short time frame, conducting only 30 experiments to get sufficient information about 10 parameters, which then allows us to make reliable statements about the underlying process. With that, I will summarize: we showed you the method screening based on in silico data generated from chemical rate laws. This in silico data was used to evaluate combinations of different design types and regression methods, which ultimately led to the result that the best-performing combination is Gaussian Processes with Space Filling Designs. This approach was implemented as the standard method for DOE-based optimization of chemical reactions in chemical development at BI. For the experimental conduction of this statistical approach, robotic autonomous systems are used, and on one reaction from one of BI's current development projects the approach was demonstrated to be an efficient and very accurate method for reaction optimization and process robustness screening. With that, we'd like to thank you for your attention and for the possibility to present these results here.
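For readers who want to see the overall in-silico benchmarking idea in code, the sketch below is a heavily simplified Python analogue of the workflow described in this talk: a toy rate law integrated over time stands in for the fitted kinetics, a Latin hypercube design with several sampling times per run stands in for the Dynamic DOE, a Gaussian process is fitted, and the RMSE is normalized against a mean-prediction model. The rate law, factor ranges, and noise level are all invented; the real work was done in JMP Pro with JSL on measured kinetics.

# Minimal sketch: in-silico Dynamic DOE benchmark with a toy rate law and a Gaussian process
import numpy as np
from scipy.stats import qmc
from scipy.integrate import solve_ivp
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern, WhiteKernel

rng = np.random.default_rng(7)

def yield_at(temp, conc, times):
    # toy rate law: A + B -> P with an Arrhenius-like rate constant (assumed numbers)
    k = 0.05 * np.exp(0.08 * (temp - 20.0))
    sol = solve_ivp(lambda t, y: [-k * y[0] * y[1], -k * y[0] * y[1], k * y[0] * y[1]],
                    (0.0, times.max()), [conc, 1.0, 0.0], t_eval=times)
    return sol.y[2] / conc * 100.0               # percent yield of product P

def build_doe(n_runs, n_times, noise_sd):
    # Latin hypercube over two factors (temperature, concentration), several samples per run
    design = qmc.scale(qmc.LatinHypercube(d=2, seed=1).random(n_runs),
                       l_bounds=[20.0, 0.5], u_bounds=[60.0, 1.5])
    X, y = [], []
    for temp, conc in design:
        times = np.sort(rng.uniform(5, 180, n_times))
        ys = yield_at(temp, conc, times) + rng.normal(0, noise_sd, n_times)
        X += [[temp, conc, t] for t in times]
        y += list(ys)
    return np.array(X), np.array(y)

X_train, y_train = build_doe(n_runs=18, n_times=12, noise_sd=2.0)
X_test, y_test = build_doe(n_runs=60, n_times=5, noise_sd=0.0)   # noiseless validation set

gp = GaussianProcessRegressor(kernel=Matern(length_scale=[10, 0.3, 30], nu=2.5) + WhiteKernel(),
                              normalize_y=True).fit(X_train, y_train)
rmse = np.sqrt(np.mean((gp.predict(X_test) - y_test) ** 2))
rmse_mean = np.sqrt(np.mean((y_train.mean() - y_test) ** 2))      # mean-prediction baseline
print("normalized RMSE:", round(rmse / rmse_mean, 3))             # below 1 beats a mean model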
Thursday, May 30, 2024
Conference Room 1, 2, or 3
Speaker: 沈佳苹, Process Support Engineer, Applied Materials. Topic: Application of JMP DOE analysis and modeling optimization to predicting epitaxial silicon growth rate. Abstract: In semiconductor manufacturing, epitaxial silicon is widely used in wafer substrates, pMOS SiGe, nMOS SiP, trench filling, and other applications. The growth rate of the epitaxial silicon layer is affected by multiple steps and multiple process parameters. To build such a strongly interacting predictive model more efficiently, we need a thorough and comprehensive DOE evaluation and a highly orthogonal DOE. This project starts from historical data on epitaxial silicon growth rate and performs a design evaluation. For building a predictive model, this historical data structure has weak power, low D-efficiency, poor design uniformity, and high effect correlation and prediction variance. During modeling, the stepwise RSM algorithm was unstable due to the lack of model degrees of freedom, resulting in poor model repeatability and overly wide confidence intervals at the optimal design point. In addition, two pairs of strong interactions were observed in the RSM model: one interaction may be attributed to high correlation between effects, while the other is related to competing mechanisms in the process. Robust design and Monte Carlo simulation were then used for tolerance design, and the Design Space Profiler was used for tolerance allocation analysis to simulate future technology requirements. To improve the existing DOE structure at minimal cost and thereby optimize the predictive model, instead of running a completely new DOE we used the Augment Design approach, improving the existing DOE structure in three ways: (1) removing non-orthogonal data; (2) the default augmentation algorithm; (3) a center-point augmentation algorithm. Through this very comprehensive augmented design, JMP suggested the best four augmentation points, which ultimately improved the DOE structure, greatly increased the efficiency of using historical data, and significantly shortened process development cycle time and cost.
Tuesday, October 22, 2024
Executive Briefing Center 8
Once you’ve learned how easy it is to design an experiment in JMP, you never look at the world around you the same. Everything becomes an opportunity for an experiment! This presentation uses a practical example to demonstrate the process of design of experiments (DOE), including designing the experiment, modeling the results, and optimizing the inputs to provide the most desirable output. Attendees at last year’s Discovery conference were treated to an evening of unique fun: hitting glow-in-the-dark golf balls on the driving range at Indian Wells Golf Resort. The driving range has Toptracer technology that monitors each shot. Total distance, carry, ball speed, launch angle, and curve are some of the variables reported with each shot. A driving range that provides so much data provided a perfect opportunity to design an experiment using JMP! After an evening with fellow JMP users and friends, an experiment was designed using the Custom Designer in JMP. The design took only minutes to create. Input variables based on the golf stance setup were used in the design. These included variables such as grip, club head alignment, stance width, and ball location. The designed experiment was executed on the driving range, a model was created, and optimum settings to create the longest and straightest shot were discovered. The modeling and optimization were completed in minutes, while still on the driving range! This allowed confirmation runs to be performed immediately. The benefits were later transferred to the golf course as well.
Tuesday, October 22, 2024
Executive Briefing Center 9
There have been numerous studies showing the efficacy of strategies in process optimization. The common comparisons made are usually between ‘one factor at a time’ (OFAT) experiments and a design of experiments (DOE) approach. When faced with an unfamiliar, high-dimensional process space (e.g., >10 factors), researchers often resort to OFAT methods as they are easy to interpret. Generally, it would be cost-prohibitive and logistically challenging to run multiple experiments geared towards the same objective just to evaluate which strategy outperforms others. To circumvent these issues, we used a Polymerase Chain Reaction (PCR) simulator with 12 unfamiliar continuous and categorical factors to explore these questions. Our team has decades of experience in process optimization in the electronic materials industry (former employees of Apple and others). We intentionally sought and selected a simulator from a research area completely unknown to us that has the ability to simulate a large number of factors and their complex interactions on many responses. To automate experimentation, we used a Python web automation script. By using a simulator and our script, we can run through many experiments while mimicking real-life constraints and experimental budgets as seen in our own professional careers. While adhering to run budget rules, we compare the efficiency and accuracy of four strategies: two OFAT-type strategies as commonly used in industry, and two strategies from the DOE and advanced DOE genre. JMP is used for all experimental analyses and modeling, and an objective attempt is made to compare the strategies.
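The simulator and web-automation script from this study are not reproduced here, but the comparison logic can be sketched against a generic black-box function: run an OFAT sweep and a space-filling design under the same run budget and compare the best responses found. The factor count, budget, and black-box coefficients below are invented for illustration, and the space-filling arm is shown without the follow-up modeling that a full DOE strategy would include.

# Minimal sketch: OFAT sweep vs. space-filling design under an equal run budget
import numpy as np
from scipy.stats import qmc

rng = np.random.default_rng(11)
N_FACTORS, BUDGET = 12, 60

def black_box(x):
    # stand-in for the simulator: main effects plus interactions, with noise
    return (3 * x[0] + 2 * x[3] - 1.5 * x[7]
            + 4 * x[1] * x[5] - 3 * x[2] * x[9] + rng.normal(0, 0.3))

# Strategy 1: OFAT, vary one factor at a time around a baseline and keep the best level
baseline = np.full(N_FACTORS, 0.5)
best_x = baseline.copy()
for j in range(N_FACTORS):
    levels = np.linspace(0, 1, BUDGET // N_FACTORS)     # 5 runs per factor here
    scores = []
    for lv in levels:
        x = best_x.copy(); x[j] = lv
        scores.append(black_box(x))
    best_x[j] = levels[int(np.argmax(scores))]
ofat_best = black_box(best_x)                           # confirmation run at the chosen settings

# Strategy 2: space-filling design over all factors with the same budget
design = qmc.LatinHypercube(d=N_FACTORS, seed=2).random(BUDGET)
sf_best = max(black_box(x) for x in design)

print("OFAT best response:", round(ofat_best, 2), "| space-filling best:", round(sf_best, 2))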
Wednesday, October 23, 2024
Executive Briefing Center 7
Design of experiments (DOE) is a statistical method that guides the execution of experiments, analyzes them to detect the relevant variables, and optimizes the process or phenomenon under investigation. The use of DOE in product development can result in products that are easier and cheaper to manufacture, have enhanced performance and reliability, and require shorter product design and development times. Nowadays, machine learning (ML) is widely adopted as a data analytics tool due to the increasing availability of large and complex sets of data. However, not all applications can afford big data. For example, in the pharma and chemical industries, experimental data sets are typically small due to cost constraints and the time needed to generate the valuable data. Nevertheless, incorporating machine learning into experimental design has proved to be an effective way to optimize formulations with a small data set that can be collected more cheaply and quickly. There are three parts in this presentation. First, the literature relevant to machine learning-assisted experimental design is briefly summarized. Next, an adhesive case is presented to illustrate the efficiency of combining experimental design and machine learning to reduce the number of experiments needed for identifying the design space with an optimized catalyst package. In the third part, which pertains to an industrial sealant application, we use response surface data to compare the prediction error of the RSM model with models from various machine learning algorithms (RF, SVR, Lasso, SVEM, and XGBoost) using validation data runs within and outside the design space.
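As a rough illustration of the kind of comparison described in the third part, the Python sketch below fits a quadratic RSM model and two machine-learning regressors to synthetic response-surface data and reports validation RMSE inside and outside the design region. The data, factor ranges, and model settings are invented and do not represent the adhesive or sealant studies or their results.

# Minimal sketch: RSM vs. machine-learning regressors on synthetic response-surface data
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression, Lasso
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(4)

def truth(X):
    # assumed underlying response surface (quadratic with one interaction) plus noise
    return (5 + 3 * X[:, 0] - 2 * X[:, 1] + 1.5 * X[:, 0] * X[:, 1]
            - 2 * X[:, 0] ** 2 + rng.normal(0, 0.2, len(X)))

X_train = rng.uniform(-1, 1, (20, 3)); y_train = truth(X_train)    # design-region runs
X_in    = rng.uniform(-1, 1, (50, 3)); y_in    = truth(X_in)       # validation inside region
X_out   = rng.uniform(1, 1.5, (50, 3)); y_out  = truth(X_out)      # validation outside region

models = {
    "RSM (quadratic)  ": make_pipeline(PolynomialFeatures(2), LinearRegression()),
    "Random forest    ": RandomForestRegressor(n_estimators=300, random_state=0),
    "Lasso (quadratic)": make_pipeline(PolynomialFeatures(2), Lasso(alpha=0.01, max_iter=50000)),
}
rmse = lambda m, X, y: np.sqrt(np.mean((m.predict(X) - y) ** 2))
for name, m in models.items():
    m.fit(X_train, y_train)
    print(f"{name} RMSE inside: {rmse(m, X_in, y_in):.2f}  outside: {rmse(m, X_out, y_out):.2f}")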