<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Statistical Significance in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216763#M43311</link>
    <description>Nicely done</description>
    <pubDate>Tue, 09 Jul 2019 14:53:37 GMT</pubDate>
    <dc:creator>txnelson</dc:creator>
    <dc:date>2019-07-09T14:53:37Z</dc:date>
    <item>
      <title>Statistical Significance</title>
      <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216698#M43295</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have the following information only:&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have Failure Rates (FR) from two sets of data. The experiment set has 4 fails out of 2500 (FR:0.16%) samples and the control set has 34 fails out of 35000 samples (FR:0.097%). Due to such a big difference in the sample size, is there a way JMP can help to determine if these two FRs are statistocally similar or different?&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;Ravi&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2019 05:26:59 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216698#M43295</guid>
      <dc:creator>RaviK</dc:creator>
      <dc:date>2019-07-09T05:26:59Z</dc:date>
    </item>
    <item>
      <title>Re: Statistical Significance</title>
      <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216752#M43304</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/15319"&gt;@RaviK&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;There are a few ways you could approach this, but perhaps the most straightforward is a Chi-Square Test of Independence, a type of contingency analysis. This test is appropriate because what you have are two categorical variables, a grouping variable, and a categorical outcome (success or fail), and you are interested in whether the observed proportions of the categorical outcome for your two groups provide evidence that the process generating the outcomes differs between the groups.&amp;nbsp;You can obtain this test using Analyze &amp;gt; Fit Y by X. But first, you'll first need your data entered in a particular way (also attached):&amp;nbsp;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="Screen Shot 2019-07-09 at 8.45.34 AM.png" style="width: 775px;"&gt;&lt;img src="https://community.jmp.com/t5/image/serverpage/image-id/18242iB29169361184EB70/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2019-07-09 at 8.45.34 AM.png" alt="Screen Shot 2019-07-09 at 8.45.34 AM.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;What I've done is taken the numbers you provided me and made columns for &lt;EM&gt;Group&lt;/EM&gt;, &lt;EM&gt;Outcome&lt;/EM&gt;, and &lt;EM&gt;N&lt;/EM&gt;, the number of observations in each. For the number of successes, I simply took the total you gave minus the failures. Next, we run Analyze &amp;gt; Fit Y by X:&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="Screen Shot 2019-07-09 at 8.45.44 AM.png" style="width: 791px;"&gt;&lt;img src="https://community.jmp.com/t5/image/serverpage/image-id/18243i9A82DAD4B49A24BA/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2019-07-09 at 8.45.44 AM.png" alt="Screen Shot 2019-07-09 at 8.45.44 AM.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Here I've cast &lt;EM&gt;Outcome&lt;/EM&gt; in the Y role, &lt;EM&gt;Group&lt;/EM&gt; to the X, and &lt;EM&gt;N&lt;/EM&gt; as the Freq, or frequency of occurrence. When we hit OK, we get the output below (I've hidden the mosaic plot since your observed frequencies of fail are so low that the plot is not helpful). &lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="Screen Shot 2019-07-09 at 8.48.55 AM.png" style="width: 737px;"&gt;&lt;img src="https://community.jmp.com/t5/image/serverpage/image-id/18244iF715C9CA74F4F781/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2019-07-09 at 8.48.55 AM.png" alt="Screen Shot 2019-07-09 at 8.48.55 AM.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Our p-value of interest is the Likelihood ratio, or Pearson (classical Chi-Square test of independence), both of which are around p = ~0.30, indicating that if there isn't any true difference in the experimental and control group processes, a difference in the proportion of failures you observed in these sample data (or a difference more extreme) would occur about 30% of the time when taking samples of the sizes you had. In other words, not very convincing evidence that there is a true difference in these sets.&amp;nbsp; Given what appears to be a large difference in the proportion of failures this may be surprising; but, given the low failure count overall, it's relatively easy to observe differences in the proportions of this magnitude or greater simply by chance (which is what this statistical significance test is telling us directly).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I hope this helps!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/2026"&gt;@jules&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2019 13:01:54 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216752#M43304</guid>
      <dc:creator>jules</dc:creator>
      <dc:date>2019-07-09T13:01:54Z</dc:date>
    </item>
    <item>
      <title>Re: Statistical Significance</title>
      <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216761#M43309</link>
      <description>&lt;P&gt;Don't know if it helps understanding, but here's some JSL that gets the pValue directly through simulation:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-jsl"&gt;NamesDefaultToHere(1);

n1 = 2500;		// Size of sample one
nf1 = 4;		// Number of failures in sample one
n2 = 35000;	// Size of sample two
nf2 = 34;		// Number of failures in sample two

n = n1 + n2;
nf = nf1 + nf2;

nSim = 10000;					// Number of simulations
np1 = J(nSim, 1, .);			// Vector to hold the number of passes counted in sample one

// Randomly allocate nf failures to n units
for(s=1, s&amp;lt;=nSim, s++,
	result$ = J(n, 1, 1);					// All n units pass initially
	result$[randomIndex(n, nf)] = 0;		// Simulate nf failures at random
	np1$ = VSum(result$[1::n1]);			// Count the nummber of passes in sample one
	np1[s] = np1$;							// Store this result
);

// Now evaluate how extreme the observed number of passes, (n1 - nf1), is in relation to the
// reference distribution constructed under the null hypothesis of random allocation of
// failures to samples
np1Observed = n1 - nf1;
pValue = NRow(Loc(np1 &amp;lt;= np1Observed)) / nSim;
Print("Estimated pValue is "||Char(Round(pValue, 3)));&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 09 Jul 2019 14:44:34 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216761#M43309</guid>
      <dc:creator>ian_jmp</dc:creator>
      <dc:date>2019-07-09T14:44:34Z</dc:date>
    </item>
    <item>
      <title>Re: Statistical Significance</title>
      <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216763#M43311</link>
      <description>Nicely done</description>
      <pubDate>Tue, 09 Jul 2019 14:53:37 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216763#M43311</guid>
      <dc:creator>txnelson</dc:creator>
      <dc:date>2019-07-09T14:53:37Z</dc:date>
    </item>
    <item>
      <title>Re: Statistical Significance</title>
      <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216797#M43320</link>
      <description>&lt;P&gt;Thanks Julian. It was useful indeed.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could I also use "Hypothesis Test for two Proportions"?&lt;/P&gt;&lt;P&gt;Ravi&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2019 21:04:02 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216797#M43320</guid>
      <dc:creator>RaviK</dc:creator>
      <dc:date>2019-07-09T21:04:02Z</dc:date>
    </item>
    <item>
      <title>Re: Statistical Significance</title>
      <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216798#M43321</link>
      <description>&lt;P&gt;Thanks Ian.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ravi&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2019 21:05:44 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216798#M43321</guid>
      <dc:creator>RaviK</dc:creator>
      <dc:date>2019-07-09T21:05:44Z</dc:date>
    </item>
    <item>
      <title>Re: Statistical Significance</title>
      <link>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216799#M43322</link>
      <description>Absolutely— those calculators are useful when working from summarized statistics. That said, we intended them mostly for classroom use, and you’ll usually get much more value from using the native JMP platforms.</description>
      <pubDate>Tue, 09 Jul 2019 21:10:01 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Statistical-Significance/m-p/216799#M43322</guid>
      <dc:creator>jules</dc:creator>
      <dc:date>2019-07-09T21:10:01Z</dc:date>
    </item>
  </channel>
</rss>

