<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What type of analysis for an infrequently occurring event in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/What-type-of-analysis-for-an-infrequently-occurring-event/m-p/880974#M104484</link>
    <description>&lt;P&gt;First welcome to the community. &amp;nbsp;The question regarding how to develop causal relationships with rare events is very challenging. &amp;nbsp;The response variable of frequency of spills is not a very efficient response variable. &amp;nbsp;It also is not very discriminant in terms of understanding causation. &amp;nbsp;It is an aggregate of many possible failure modes/mechanisms.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;These are challenging to investigate with experimental design (while you can likely make the lorry spill, that may not be why it is currently spilling). &amp;nbsp;My bias would be to use directed sampling (component of variation and stability studies to study the process as is). &amp;nbsp;There is the question, are these actually &lt;EM&gt;&lt;STRONG&gt;special cause&lt;/STRONG&gt;&lt;/EM&gt; events as defined by Deming? &amp;nbsp;If so, his advice is to react specifically and locally to the events, rather than spend time and effort to predict the events (common cause action). &amp;nbsp;Are they actually higher order effects (e.g., &amp;gt;4th order interaction effects)? &amp;nbsp;Where it is a combination of factors that combine to create the event. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How confident are you in the existing data set? &amp;nbsp;Are there spillages that are unrecorded (perhaps they were small or corrected)? &amp;nbsp;I don't know what the load is, but the clue from your description is some sort of bottle. &amp;nbsp;If the bottles fall out and do not break, is it a spill? &amp;nbsp;Does size of spill matter? &amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here are things you can do:&lt;/P&gt;
&lt;P&gt;1. First start with &lt;STRONG&gt;hypotheses&lt;/STRONG&gt; as to what may cause these events and why. For example: Spills occur because the lorry becomes unstable due to uneven loading. &amp;nbsp;You may be able to do some scatter plots/correlation off the existing data set, but that would just be to stimulate your mind to develop hypotheses that would need to be investigated with future data.&lt;/P&gt;
&lt;P&gt;2. Develop an exhaustive &lt;STRONG&gt;list of factors&lt;/STRONG&gt;. Process mapping and FMEA often helps to do this. &amp;nbsp;Make sure you actually watch the process of loading, moving and unloading the lorry. It is likely your current data set does not contain information about all of those x's.&lt;/P&gt;
&lt;P&gt;3. Are there other &lt;STRONG&gt;response variables&lt;/STRONG&gt; that could be measured that might correlate with the phenomena that would provide better insight to failures(e.g., number of situations that might increase the chance of a spill, location of spill, direction of spill &amp;nbsp;from the lorry)? Or, for example, perhaps you hypothesize about the effect of weight balance of the load in a lorry. &amp;nbsp;Perhaps measure the weight distribution within and between lorry loads. Or you suspect it is the road conditions, perhaps you add accelerometers to the lorry. &amp;nbsp;Or speed, speedometers...etc.&lt;/P&gt;
&lt;P&gt;4. Make the lorry &lt;STRONG&gt;robust&lt;/STRONG&gt; to conditions you hypothesize increase the likelihood of spillage (e.g., suspension that absorbs changing road conditions). &amp;nbsp;This can be done with experimentation as long as the noise in the process is included in the study. &amp;nbsp;Again, you must weight the resources necessary and determine if indeed common cause action is cost effective. &amp;nbsp;Do you need to be robust to a rare event?&lt;/P&gt;</description>
    <pubDate>Mon, 23 Jun 2025 15:19:04 GMT</pubDate>
    <dc:creator>statman</dc:creator>
    <dc:date>2025-06-23T15:19:04Z</dc:date>
    <item>
      <title>What type of analysis for an infrequently occurring event</title>
      <link>https://community.jmp.com/t5/Discussions/What-type-of-analysis-for-an-infrequently-occurring-event/m-p/880935#M104478</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have a set of data that has recorded the frequency of spillages from lorry loads. I have recorded for the period of two years: load, date, time, number of bottles on the load, driver, weather, dusk/dawn etc. I want to identify risk factors for when spillages are most likely to occur. However, these spillages happen so infrequently that most of my data is 0. For example, I have 29,000 rows of data, and approx. 28,000 of these loads have no spillages associated. I have calculated Mean Spillages per load, and mean spillages per million bottles. However, it is hard to know whether to do an ANOVA when I have so many zero values. Please could someone advise on what is the best type of modelling/test to use? I just want to determine whether any specific risk factor is associated with the number of spillages if and when they occur.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Abbie&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jun 2025 13:41:31 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/What-type-of-analysis-for-an-infrequently-occurring-event/m-p/880935#M104478</guid>
      <dc:creator>AbbieGraham333</dc:creator>
      <dc:date>2025-06-23T13:41:31Z</dc:date>
    </item>
    <item>
      <title>Re: What type of analysis for an infrequently occurring event</title>
      <link>https://community.jmp.com/t5/Discussions/What-type-of-analysis-for-an-infrequently-occurring-event/m-p/880974#M104484</link>
      <description>&lt;P&gt;First welcome to the community. &amp;nbsp;The question regarding how to develop causal relationships with rare events is very challenging. &amp;nbsp;The response variable of frequency of spills is not a very efficient response variable. &amp;nbsp;It also is not very discriminant in terms of understanding causation. &amp;nbsp;It is an aggregate of many possible failure modes/mechanisms.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;These are challenging to investigate with experimental design (while you can likely make the lorry spill, that may not be why it is currently spilling). &amp;nbsp;My bias would be to use directed sampling (component of variation and stability studies to study the process as is). &amp;nbsp;There is the question, are these actually &lt;EM&gt;&lt;STRONG&gt;special cause&lt;/STRONG&gt;&lt;/EM&gt; events as defined by Deming? &amp;nbsp;If so, his advice is to react specifically and locally to the events, rather than spend time and effort to predict the events (common cause action). &amp;nbsp;Are they actually higher order effects (e.g., &amp;gt;4th order interaction effects)? &amp;nbsp;Where it is a combination of factors that combine to create the event. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How confident are you in the existing data set? &amp;nbsp;Are there spillages that are unrecorded (perhaps they were small or corrected)? &amp;nbsp;I don't know what the load is, but the clue from your description is some sort of bottle. &amp;nbsp;If the bottles fall out and do not break, is it a spill? &amp;nbsp;Does size of spill matter? &amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here are things you can do:&lt;/P&gt;
&lt;P&gt;1. First start with &lt;STRONG&gt;hypotheses&lt;/STRONG&gt; as to what may cause these events and why. For example: Spills occur because the lorry becomes unstable due to uneven loading. &amp;nbsp;You may be able to do some scatter plots/correlation off the existing data set, but that would just be to stimulate your mind to develop hypotheses that would need to be investigated with future data.&lt;/P&gt;
&lt;P&gt;2. Develop an exhaustive &lt;STRONG&gt;list of factors&lt;/STRONG&gt;. Process mapping and FMEA often helps to do this. &amp;nbsp;Make sure you actually watch the process of loading, moving and unloading the lorry. It is likely your current data set does not contain information about all of those x's.&lt;/P&gt;
&lt;P&gt;3. Are there other &lt;STRONG&gt;response variables&lt;/STRONG&gt; that could be measured that might correlate with the phenomena that would provide better insight to failures(e.g., number of situations that might increase the chance of a spill, location of spill, direction of spill &amp;nbsp;from the lorry)? Or, for example, perhaps you hypothesize about the effect of weight balance of the load in a lorry. &amp;nbsp;Perhaps measure the weight distribution within and between lorry loads. Or you suspect it is the road conditions, perhaps you add accelerometers to the lorry. &amp;nbsp;Or speed, speedometers...etc.&lt;/P&gt;
&lt;P&gt;4. Make the lorry &lt;STRONG&gt;robust&lt;/STRONG&gt; to conditions you hypothesize increase the likelihood of spillage (e.g., suspension that absorbs changing road conditions). &amp;nbsp;This can be done with experimentation as long as the noise in the process is included in the study. &amp;nbsp;Again, you must weight the resources necessary and determine if indeed common cause action is cost effective. &amp;nbsp;Do you need to be robust to a rare event?&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jun 2025 15:19:04 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/What-type-of-analysis-for-an-infrequently-occurring-event/m-p/880974#M104484</guid>
      <dc:creator>statman</dc:creator>
      <dc:date>2025-06-23T15:19:04Z</dc:date>
    </item>
    <item>
      <title>Re: What type of analysis for an infrequently occurring event</title>
      <link>https://community.jmp.com/t5/Discussions/What-type-of-analysis-for-an-infrequently-occurring-event/m-p/881033#M104492</link>
      <description>&lt;P&gt;Just to add a bit of a different perspective to what&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/4358"&gt;@statman&lt;/a&gt;&amp;nbsp;recommends...which I also endorse...you might want to try zero inflated Poisson regression as a fitting personality in the JMP Fit Model platform.&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jun 2025 20:18:22 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/What-type-of-analysis-for-an-infrequently-occurring-event/m-p/881033#M104492</guid>
      <dc:creator>P_Bartell</dc:creator>
      <dc:date>2025-06-23T20:18:22Z</dc:date>
    </item>
  </channel>
</rss>

