<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to do hypothesis test with highly right skewed data that contains many zeros? in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/220162#M44038</link>
    <description>&lt;P&gt;The data we got is "defect count on substract",&amp;nbsp;&lt;SPAN style="font-family: inherit;"&gt;and let's say we implemant a new clean method, and want to know if the new method is better than the original method, that is, we want to perform a hypothesis test to judge it.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;But the problem is: for both sample set, the major number is zero, and right skewed to several defect count, in this case is there any good method to perform hypothesis test?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My original idea is transfrom data to normal distribution then perform two sample t test, and since the majority number is zero, I tried to use log(x+1) to transform my data, but it still failed to fit normal distribtution from JMP continuous fit&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 06 Aug 2019 07:00:28 GMT</pubDate>
    <dc:creator>Hans_Hsu</dc:creator>
    <dc:date>2019-08-06T07:00:28Z</dc:date>
    <item>
      <title>How to do hypothesis test with highly right skewed data that contains many zeros?</title>
      <link>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/220162#M44038</link>
      <description>&lt;P&gt;The data we got is "defect count on substract",&amp;nbsp;&lt;SPAN style="font-family: inherit;"&gt;and let's say we implemant a new clean method, and want to know if the new method is better than the original method, that is, we want to perform a hypothesis test to judge it.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;But the problem is: for both sample set, the major number is zero, and right skewed to several defect count, in this case is there any good method to perform hypothesis test?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My original idea is transfrom data to normal distribution then perform two sample t test, and since the majority number is zero, I tried to use log(x+1) to transform my data, but it still failed to fit normal distribtution from JMP continuous fit&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Aug 2019 07:00:28 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/220162#M44038</guid>
      <dc:creator>Hans_Hsu</dc:creator>
      <dc:date>2019-08-06T07:00:28Z</dc:date>
    </item>
    <item>
      <title>Re: How to do hypothesis test with highly right skewed data that contains many zeros?</title>
      <link>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/220189#M44044</link>
      <description>&lt;P&gt;I can think of two approaches. The first is a non-parametric test. They are also available in the Oneway platform along with the t tests. The second way is to define a meaningful sample statistic (e.g., 0.9 quantile) and use a bootstrap to obtain a p-value for the difference..&lt;/P&gt;</description>
      <pubDate>Tue, 06 Aug 2019 11:30:04 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/220189#M44044</guid>
      <dc:creator>Mark_Bailey</dc:creator>
      <dc:date>2019-08-06T11:30:04Z</dc:date>
    </item>
    <item>
      <title>Re: How to do hypothesis test with highly right skewed data that contains many zeros?</title>
      <link>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/221098#M44126</link>
      <description>&lt;P&gt;Thank you for your reply, for nonparametric test, I look up the JMP help, and it says that:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Wilcoxon Test --&amp;gt; powerful for&amp;nbsp;logistic distributions&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Median Test --&amp;gt; powerful for double-exponential distributions&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;van der Waerden Test --&amp;gt; powerful for normal distributions&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Kolmogorov Smirnov Test --&amp;gt; not so sure&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So my question is for the extreme right skew distribution, which nonparametric method will be more suitbale?&lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2019 01:39:58 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/221098#M44126</guid>
      <dc:creator>Hans_Hsu</dc:creator>
      <dc:date>2019-08-09T01:39:58Z</dc:date>
    </item>
    <item>
      <title>Re: How to do hypothesis test with highly right skewed data that contains many zeros?</title>
      <link>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/221134#M44132</link>
      <description>&lt;P&gt;The &lt;A href="https://en.wikipedia.org/wiki/Logistic_distribution" target="_self"&gt;logistic distribution&lt;/A&gt; is symmetric, so that choice is not the best. for your data. I would not use the Wilcoxon test.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The double exponential distribution (also known as the &lt;A href="https://en.wikipedia.org/wiki/Gumbel_distribution" target="_self"&gt;Gumbel distribution&lt;/A&gt;) is skewed, like your data, so it would be a better choice. I would use the Median test.&lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2019 11:25:11 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/How-to-do-hypothesis-test-with-highly-right-skewed-data-that/m-p/221134#M44132</guid>
      <dc:creator>Mark_Bailey</dc:creator>
      <dc:date>2019-08-09T11:25:11Z</dc:date>
    </item>
  </channel>
</rss>

