<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Split data into training and validation without JMP Pro in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17610#M16060</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You may find the "Initialize Column" feature helpful for this task. &lt;SPAN style="font-size: 10pt;"&gt;To create a "data usage" column add a new column to your table, right click and select "column info" and then there will be an "Initial Data" option towards the bottom of the dialog box (once initialized this will not appear in the column info dialog box). From the drop down menu select "Random" and then from the radio options "Random Indicator" . Now you have three lines that have the default of 0, 1, and 2.&amp;nbsp; You can rename as Train, Test, Validate or whatever you like.&amp;nbsp; Finally you can select the proportion of the data that you would like in each group.&amp;nbsp; The result will be an indicator column that you can use to filter your data for fitting, testing, and validating.&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 24 Mar 2016 20:13:15 GMT</pubDate>
    <dc:creator>KarenC</dc:creator>
    <dc:date>2016-03-24T20:13:15Z</dc:date>
    <item>
      <title>Split data into training and validation without JMP Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17608#M16058</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;What is&amp;nbsp; the easiest and convenient way to split data into training, test and validation without using JMP Pro?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 24 Mar 2016 15:37:49 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17608#M16058</guid>
      <dc:creator>tajrida</dc:creator>
      <dc:date>2016-03-24T15:37:49Z</dc:date>
    </item>
    <item>
      <title>Re: Split data into training and validation without JMP Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17609#M16059</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The easiest way to do this interactively, is to open the data table in question, go to the pull down menu and select:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Rows==&amp;gt;Row Selection==&amp;gt;Select Randomly&lt;/P&gt;&lt;P&gt;Then go to&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Tables==&amp;gt;Subset&lt;/P&gt;&lt;P&gt;Create the new data table&lt;/P&gt;&lt;P&gt;In original data table, rt click on one of the selected rows and select&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Invert Selection&lt;/P&gt;&lt;P&gt;Then go back to&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Tables==&amp;gt;Subset&lt;/P&gt;&lt;P&gt;and create your second data table&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;This can also be simply scripted&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #0000dd; font-family: Consolas; font-size: 10pt;"&gt;Names Default To Here&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;( &lt;/SPAN&gt;&lt;SPAN style="color: teal; font-family: Consolas; font-size: 10pt;"&gt;1&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; );&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;dt = &lt;/SPAN&gt;&lt;SPAN style="color: #0000dd; font-family: Consolas; font-size: 10pt;"&gt;Current Data Table&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;();&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: green; font-family: Consolas; font-size: 10pt;"&gt;// Create a Uniformly Random column&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;dt &amp;lt;&amp;lt; &lt;/SPAN&gt;&lt;SPAN style="color: navy; font-family: Consolas; font-size: 10pt;"&gt;New Column&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;( &lt;/SPAN&gt;&lt;SPAN style="color: purple; font-family: Consolas; font-size: 10pt;"&gt;"my random sample"&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;, formula( &lt;/SPAN&gt;&lt;SPAN style="color: #0000dd; font-family: Consolas; font-size: 10pt;"&gt;Random Uniform&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;() ) );&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: green; font-family: Consolas; font-size: 10pt;"&gt;// Select 20% of the data&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;dt &amp;lt;&amp;lt; &lt;/SPAN&gt;&lt;SPAN style="color: navy; font-family: Consolas; font-size: 10pt;"&gt;select where&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;( :my random sample &amp;lt;= &lt;/SPAN&gt;&lt;SPAN style="color: teal; font-family: Consolas; font-size: 10pt;"&gt;.2&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; );&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: green; font-family: Consolas; font-size: 10pt;"&gt;// Delete the random number column since it is no longer needed&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;dt &amp;lt;&amp;lt; &lt;/SPAN&gt;&lt;SPAN style="color: navy; font-family: Consolas; font-size: 10pt;"&gt;delete columns&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;( &lt;/SPAN&gt;&lt;SPAN style="color: purple; font-family: Consolas; font-size: 10pt;"&gt;"my random sample"&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; );&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: green; font-family: Consolas; font-size: 10pt;"&gt;// Put those selected rows into a Validate data table&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;dt &amp;lt;&amp;lt; &lt;/SPAN&gt;&lt;SPAN style="color: navy; font-family: Consolas; font-size: 10pt;"&gt;subset&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;( selected rows( &lt;/SPAN&gt;&lt;SPAN style="color: teal; font-family: Consolas; font-size: 10pt;"&gt;1&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; ), selected columns( &lt;/SPAN&gt;&lt;SPAN style="color: teal; font-family: Consolas; font-size: 10pt;"&gt;0&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; ), output table name( &lt;/SPAN&gt;&lt;SPAN style="color: purple; font-family: Consolas; font-size: 10pt;"&gt;"Validate"&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; ) );&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: green; font-family: Consolas; font-size: 10pt;"&gt;// Invert the row selection&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;dt &amp;lt;&amp;lt; &lt;/SPAN&gt;&lt;SPAN style="color: navy; font-family: Consolas; font-size: 10pt;"&gt;invert row selection&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;;&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: green; font-family: Consolas; font-size: 10pt;"&gt;// Place all of those rows into the Training data table&lt;/SPAN&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;dt &amp;lt;&amp;lt; &lt;/SPAN&gt;&lt;SPAN style="color: navy; font-family: Consolas; font-size: 10pt;"&gt;subset&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt;( selected rows( &lt;/SPAN&gt;&lt;SPAN style="color: teal; font-family: Consolas; font-size: 10pt;"&gt;1&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; ), selected columns( &lt;/SPAN&gt;&lt;SPAN style="color: teal; font-family: Consolas; font-size: 10pt;"&gt;0&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; ), output table name( &lt;/SPAN&gt;&lt;SPAN style="color: purple; font-family: Consolas; font-size: 10pt;"&gt;"Training"&lt;/SPAN&gt;&lt;SPAN style="color: black; font-family: Consolas; font-size: 10pt;"&gt; ) );&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #000000; font-family: Times New Roman; font-size: 12pt;"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 24 Mar 2016 19:24:49 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17609#M16059</guid>
      <dc:creator>txnelson</dc:creator>
      <dc:date>2016-03-24T19:24:49Z</dc:date>
    </item>
    <item>
      <title>Re: Split data into training and validation without JMP Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17610#M16060</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You may find the "Initialize Column" feature helpful for this task. &lt;SPAN style="font-size: 10pt;"&gt;To create a "data usage" column add a new column to your table, right click and select "column info" and then there will be an "Initial Data" option towards the bottom of the dialog box (once initialized this will not appear in the column info dialog box). From the drop down menu select "Random" and then from the radio options "Random Indicator" . Now you have three lines that have the default of 0, 1, and 2.&amp;nbsp; You can rename as Train, Test, Validate or whatever you like.&amp;nbsp; Finally you can select the proportion of the data that you would like in each group.&amp;nbsp; The result will be an indicator column that you can use to filter your data for fitting, testing, and validating.&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 24 Mar 2016 20:13:15 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17610#M16060</guid>
      <dc:creator>KarenC</dc:creator>
      <dc:date>2016-03-24T20:13:15Z</dc:date>
    </item>
    <item>
      <title>Re: Split data into training and validation without JMP Pro</title>
      <link>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17611#M16061</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;In JMP Pro 12 you can use &lt;A href="http://www.jmp.com/support/help/Make_Validation_Column_Utility.shtml" rel="noopener noreferrer" target="_blank"&gt;Cols-&amp;gt;Modeling Utilities-&amp;gt;Make Validation Column&lt;/A&gt; utility.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="11175_JMPScreenSnapz008.png" style="width: 664px;"&gt;&lt;img src="https://community.jmp.com/t5/image/serverpage/image-id/2855i634F1656B0988CFC/image-size/medium?v=v2&amp;amp;px=400" role="button" title="11175_JMPScreenSnapz008.png" alt="11175_JMPScreenSnapz008.png" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="11176_JMPScreenSnapz009.png" style="width: 574px;"&gt;&lt;img src="https://community.jmp.com/t5/image/serverpage/image-id/2856i8E1CEF12F0C01E9E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="11176_JMPScreenSnapz009.png" alt="11176_JMPScreenSnapz009.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-Jeff&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 19 Oct 2016 02:11:27 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Split-data-into-training-and-validation-without-JMP-Pro/m-p/17611#M16061</guid>
      <dc:creator>Jeff_Perkinson</dc:creator>
      <dc:date>2016-10-19T02:11:27Z</dc:date>
    </item>
  </channel>
</rss>

