<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Boosted Neural Network in JMP 13 Pro in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53573#M30288</link>
    <description>Hi, can you clarify: are the 100 “measurements” 100 variables (columns) or 100 observations (rows)?&lt;BR /&gt;Thanks,&lt;BR /&gt;Phil&lt;BR /&gt;</description>
    <pubDate>Wed, 21 Mar 2018 16:41:07 GMT</pubDate>
    <dc:creator>Phil_Kay</dc:creator>
    <dc:date>2018-03-21T16:41:07Z</dc:date>
    <item>
      <title>Boosted Neural Network in JMP 13 Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53411#M30225</link>
      <description>&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;I find the JMP 13 manual description&amp;nbsp;of the boosting procedure in neural networks to be vague. Here is a snap of it:&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;"The first step is to fit a one-layer, two-node model. The predicted values from that model are scaled by the learning rate, then subtracted from the actual values to form a scaled residual. The next step is to fit a different one-layer, &lt;/FONT&gt;two-node&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt; model on the scaled residuals of the previous model."&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;I don't understand how JMP is using residuals from the predicted values. A mathematical formula or a diagram&amp;nbsp;could be extremely&amp;nbsp;helpful.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;Is JMP adding the&amp;nbsp;residuals to defined features and train the network again? or does if train the chained networks of residuals alone? For example, if I have 10 features and I am trying to predict a single&amp;nbsp;continuous response and my network has one hidden layer of 3 neurons (aka nodes); the first neural net in a boosted network would be of shape&amp;nbsp;[10, 3, 1]. Now JMP calculates the residuals (True values - Predicted values) from the last layer. Now, what happens to the residuals? What is the shape of the second network, what is its inputs and what is it predicting?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;Thank you for your time in advance.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial,helvetica,sans-serif" size="3"&gt;&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Mar 2018 18:02:26 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53411#M30225</guid>
      <dc:creator>pans</dc:creator>
      <dc:date>2018-03-19T18:02:26Z</dc:date>
    </item>
    <item>
      <title>Re: Boosted Neural Network in JMP 13 Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53457#M30244</link>
      <description>Hi, the general approach of boosting in machine learning is to first fit a "weak predictor" model to Y. The next model then fits to the residuals of the first model. The combination of these two models is then a better model than either of the original models. Assuming there is a still a residual between Y(predicted) and Y(actual) the third model will then fit to these residuals and is then combined with the models 1 and 2. This continues until the goodness-of-fit stops improving (according to whichever measure you are using).</description>
      <pubDate>Tue, 20 Mar 2018 13:14:55 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53457#M30244</guid>
      <dc:creator>Phil_Kay</dc:creator>
      <dc:date>2018-03-20T13:14:55Z</dc:date>
    </item>
    <item>
      <title>Re: Boosted Neural Network in JMP 13 Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53572#M30287</link>
      <description>&lt;P&gt;Hi Phil, thank you for your reply. The input to the second neural network is still a little unclear to me. So would you say that the second neural network would be working with only 1 feature (the residuals) even if my initial dataset had 100 measurements and I was using them as features?&lt;/P&gt;</description>
      <pubDate>Wed, 21 Mar 2018 16:37:01 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53572#M30287</guid>
      <dc:creator>pans</dc:creator>
      <dc:date>2018-03-21T16:37:01Z</dc:date>
    </item>
    <item>
      <title>Re: Boosted Neural Network in JMP 13 Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53573#M30288</link>
      <description>Hi, can you clarify: are the 100 “measurements” 100 variables (columns) or 100 observations (rows)?&lt;BR /&gt;Thanks,&lt;BR /&gt;Phil&lt;BR /&gt;</description>
      <pubDate>Wed, 21 Mar 2018 16:41:07 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53573#M30288</guid>
      <dc:creator>Phil_Kay</dc:creator>
      <dc:date>2018-03-21T16:41:07Z</dc:date>
    </item>
    <item>
      <title>Re: Boosted Neural Network in JMP 13 Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53574#M30289</link>
      <description>&lt;P&gt;let's say I start with 100 different measurements (columns) and 200 observations (rows)&amp;nbsp;trying to predict 1 continuous&amp;nbsp;response (column). I assume that the first neural net would take in all 100 columns and output 1 prediction. Would the second neural net take in 1 column of residuals and also output 1 column of predictions?&lt;/P&gt;</description>
      <pubDate>Wed, 21 Mar 2018 16:44:32 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53574#M30289</guid>
      <dc:creator>pans</dc:creator>
      <dc:date>2018-03-21T16:44:32Z</dc:date>
    </item>
    <item>
      <title>Re: Boosted Neural Network in JMP 13 Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53575#M30290</link>
      <description>Okay, thanks for clarifying.&lt;BR /&gt;&lt;BR /&gt;The second stage in boosting is using the same predictors as in the first stage. The difference is in the response that it is fitting. In the second stage the response is the residual from the first stage.&lt;BR /&gt;&lt;BR /&gt;The result is that you add the same number of hidden nodes again for each boost.&lt;BR /&gt;&lt;BR /&gt;Phil&lt;BR /&gt;</description>
      <pubDate>Wed, 21 Mar 2018 16:53:07 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Boosted-Neural-Network-in-JMP-13-Pro/m-p/53575#M30290</guid>
      <dc:creator>Phil_Kay</dc:creator>
      <dc:date>2018-03-21T16:53:07Z</dc:date>
    </item>
  </channel>
</rss>

