<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Lack of Fit in Logistic Rgression Report in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Lack-of-Fit-in-Logistic-Rgression-Report/m-p/62244#M33542</link>
    <description>&lt;P&gt;I do not recommend replacing a continuous predictor with a binary predictor. Binary variables are less informative.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The difference you observe is due to the change in the degrees of freedom. The first analysis includes 1.85E+7 degrees of freedom in the test of the sample statistic of 8867.525 while the second analysis includes only 120 DF for the corresponding sample statistic of 174.2951.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The lack of fit test is based on a comparison between the selected model and the saturated model (unbiased). The null hypothesis assumes that they are the same. The expected&amp;nbsp;value of chi square under the null hypothesis is equal to the DF.&amp;nbsp;Chi square exceeds the DF under the null hypothesis.&amp;nbsp;The associated p-value informs how many such results exceed the sample statistic from the analysis.&lt;/P&gt;</description>
    <pubDate>Fri, 06 Jul 2018 13:09:35 GMT</pubDate>
    <dc:creator>Mark_Bailey</dc:creator>
    <dc:date>2018-07-06T13:09:35Z</dc:date>
    <item>
      <title>Lack of Fit in Logistic Rgression Report</title>
      <link>https://community.jmp.com/t5/Discussions/Lack-of-Fit-in-Logistic-Rgression-Report/m-p/62227#M33529</link>
      <description>&lt;P&gt;I have performed Logistic Regression analysis on a data set that contains 6 binary factors and 1 continuous factor. Then I repeat the analysis after converting the continuous parameter to binary by thresholding (0 if &amp;lt;= threshold, 1 otherwise).&lt;/P&gt;&lt;P&gt;With the first analysis I get the following Lack of Fit table in the report&lt;/P&gt;&lt;P&gt;Lack Of Fit&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;Source&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;DF&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;-LogLikelihood&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;ChiSquare&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Lack Of Fit&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;1.85e+7&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;4433.7627&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;8867.525&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Saturated&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;1.85e+7&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;387.7085&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;Prob&amp;gt;ChiSq&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Fitted&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;7&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;4821.4712&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;1.0000&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I repeat the analysis after converting the continuous variable to binary I get the following Lack of Fit table in which Prob&amp;gt;ChiSq is now 0.0009 in place of the earlier value of 1.0000.&lt;/P&gt;&lt;P&gt;Lack Of Fit&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;Source&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;DF&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;-LogLikelihood&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;ChiSquare&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Lack Of Fit&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;120&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;87.1476&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;174.2951&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Saturated&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;127&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;4778.6278&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;STRONG&gt;Prob&amp;gt;ChiSq&lt;/STRONG&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Fitted&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;7&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;4865.7753&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;0.0009*&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I cannot find the description of Lack of Fit in the documentation for Nominal Logistic Fit Report.&amp;nbsp;In the context of "Lack of Fit", do I want the value to be close to 1 for the model to be fitting well to the data? What does the asterisk next to 0.0009 mean?&lt;/P&gt;</description>
      <pubDate>Thu, 05 Jul 2018 20:05:44 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Lack-of-Fit-in-Logistic-Rgression-Report/m-p/62227#M33529</guid>
      <dc:creator>ranjan_mitre_or</dc:creator>
      <dc:date>2018-07-05T20:05:44Z</dc:date>
    </item>
    <item>
      <title>Re: Lack of Fit in Logistic Rgression Report</title>
      <link>https://community.jmp.com/t5/Discussions/Lack-of-Fit-in-Logistic-Rgression-Report/m-p/62244#M33542</link>
      <description>&lt;P&gt;I do not recommend replacing a continuous predictor with a binary predictor. Binary variables are less informative.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The difference you observe is due to the change in the degrees of freedom. The first analysis includes 1.85E+7 degrees of freedom in the test of the sample statistic of 8867.525 while the second analysis includes only 120 DF for the corresponding sample statistic of 174.2951.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The lack of fit test is based on a comparison between the selected model and the saturated model (unbiased). The null hypothesis assumes that they are the same. The expected&amp;nbsp;value of chi square under the null hypothesis is equal to the DF.&amp;nbsp;Chi square exceeds the DF under the null hypothesis.&amp;nbsp;The associated p-value informs how many such results exceed the sample statistic from the analysis.&lt;/P&gt;</description>
      <pubDate>Fri, 06 Jul 2018 13:09:35 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Lack-of-Fit-in-Logistic-Rgression-Report/m-p/62244#M33542</guid>
      <dc:creator>Mark_Bailey</dc:creator>
      <dc:date>2018-07-06T13:09:35Z</dc:date>
    </item>
  </channel>
</rss>

