<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: XGbbost training, validation and testing in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/XGboost-training-validation-and-testing/m-p/864012#M102790</link>
    <description>&lt;P&gt;I was just about to launch the same question&lt;/P&gt;</description>
    <pubDate>Tue, 01 Apr 2025 14:52:35 GMT</pubDate>
    <dc:creator>frankderuyck</dc:creator>
    <dc:date>2025-04-01T14:52:35Z</dc:date>
    <item>
      <title>XGboost training, validation and testing</title>
      <link>https://community.jmp.com/t5/Discussions/XGboost-training-validation-and-testing/m-p/807837#M98725</link>
      <description>&lt;P class=""&gt;Hello,&lt;/P&gt;
&lt;P class=""&gt;I have a question about handling test datasets in XGBoost. Here's my current approach:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;I'm using 10-fold cross-validation for model training and validation&lt;/LI&gt;
&lt;LI&gt;I keep a separate test dataset completely excluded during this training phase (hide and exclude in the table before launching the model)&lt;/LI&gt;
&lt;LI&gt;After training, I manually generate predictions on this test dataset (save predicted to table and I unhide the test data)&lt;/LI&gt;
&lt;/OL&gt;
&lt;P class=""&gt;My issue is: By handling the test data separately, I'm missing out on the automatic performance metrics that XGBoost can calculate. Is there a way to:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Keep the test data completely segregated during training (using hide and exclude)&lt;/LI&gt;
&lt;LI&gt;BUT still have XGBoost automatically calculate performance metrics on this test data after training is complete?&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Sat, 06 Sep 2025 11:35:43 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/XGboost-training-validation-and-testing/m-p/807837#M98725</guid>
      <dc:creator>Sburel</dc:creator>
      <dc:date>2025-09-06T11:35:43Z</dc:date>
    </item>
    <item>
      <title>Re: XGbbost training, validation and testing</title>
      <link>https://community.jmp.com/t5/Discussions/XGboost-training-validation-and-testing/m-p/864012#M102790</link>
      <description>&lt;P&gt;I was just about to launch the same question&lt;/P&gt;</description>
      <pubDate>Tue, 01 Apr 2025 14:52:35 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/XGboost-training-validation-and-testing/m-p/864012#M102790</guid>
      <dc:creator>frankderuyck</dc:creator>
      <dc:date>2025-04-01T14:52:35Z</dc:date>
    </item>
    <item>
      <title>Re: XGbbost training, validation and testing</title>
      <link>https://community.jmp.com/t5/Discussions/XGboost-training-validation-and-testing/m-p/898870#M105875</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/18041"&gt;@Sburel&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Just saw your post now, sorry for late reply.&lt;/P&gt;
&lt;P&gt;For any model evaluation on test set, you can use&amp;nbsp;&lt;A href="https://www.jmp.com/support/help/en/18.1/#page/jmp/model-comparison.shtml?_gl=1*kzsrm0*_up*MQ..*_ga*ODM5NzAwMjgyLjE3NTcwNzg1MDg.*_ga_BRNVBEC1RS*czE3NTcwNzg1MDckbzEkZzAkdDE3NTcwNzg1MDckajYwJGwwJGgw#" target="_blank" rel="noopener"&gt;Model Comparison&lt;/A&gt;&amp;nbsp;platform (on test rows only) : simply select your XGBoost prediction formula as Y Predictor and you'll get Rsquare, RASE (=RMSE) and Average Absolute Error calculated automatically. Some other metrics like Correlation can be calculated with other platform : for Correlation, you can use&amp;nbsp;&lt;A href="https://www.jmp.com/support/help/en/18.1/#page/jmp/introduction-to-multivariate-analysis.shtml?_gl=1*1t28c2l*_up*MQ..*_ga*ODM5NzAwMjgyLjE3NTcwNzg1MDg.*_ga_BRNVBEC1RS*czE3NTcwNzg1MDckbzEkZzAkdDE3NTcwNzg1MDckajYwJGwwJGgw#" target="_blank" rel="noopener"&gt;Multivariate&lt;/A&gt;&amp;nbsp;platform with the predicted response and measured response to see the actual vs. predicted plot (also available in Model Comparison by default) as well as correlation value.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this answer will help you or other members,&lt;/P&gt;</description>
      <pubDate>Fri, 05 Sep 2025 16:48:15 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/XGboost-training-validation-and-testing/m-p/898870#M105875</guid>
      <dc:creator>Victor_G</dc:creator>
      <dc:date>2025-09-05T16:48:15Z</dc:date>
    </item>
  </channel>
</rss>

