<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Get and Filter Files in Directory in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11980#M11463</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks Craige, that works great!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 22 Apr 2015 18:31:47 GMT</pubDate>
    <dc:creator>robot</dc:creator>
    <dc:date>2015-04-22T18:31:47Z</dc:date>
    <item>
      <title>Get and Filter Files in Directory</title>
      <link>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11977#M11460</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I have a JMP script to prompt the user for a directory, and then filter for specific file types from within that directory.&amp;nbsp; My problem is that the filtering process often takes longer than just getting the files.&amp;nbsp; If the number of files is large (&amp;gt;100,000 files), the total time to collect and filter files can be more than 30 minutes.&amp;nbsp; Is is there a faster or more efficient way to do this?&amp;nbsp; &lt;SPAN style="font-size: 13.3333330154419px;"&gt;I am using JMP11.&amp;nbsp; Thanks!&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 13.3333330154419px;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 13.3333330154419px;"&gt;// Example.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Names Default To Here( 1 );&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;dir = Pick Directory( "Select a directory", "/C:/Program Files/SAS/" );&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;t1 = Tick Seconds();&lt;/P&gt;&lt;P&gt;files = Files In Directory( dir, Recursive );&lt;/P&gt;&lt;P&gt;t2 = Tick Seconds();&lt;/P&gt;&lt;P&gt;t_getfiles = t2 - t1;&lt;/P&gt;&lt;P&gt;n_getfiles = N Items( files );&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;t3 = Tick Seconds();&lt;/P&gt;&lt;P&gt;For( i = N Items( files ), i &amp;gt;= 1, i--,&lt;/P&gt;&lt;P&gt;&amp;nbsp; If( !Ends With( files&lt;I&gt;, ".jmp" ),&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp; Remove From( files, i )&lt;/P&gt;&lt;P&gt;&amp;nbsp; )&lt;/P&gt;&lt;P&gt;);&lt;/P&gt;&lt;P&gt;t4 = Tick Seconds();&lt;/P&gt;&lt;P&gt;t_filterfiles = t4 - t3;&lt;/P&gt;&lt;P&gt;n_filterfiles = N Items( files );&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Show( t_getfiles, n_getfiles, t_filterfiles, n_filterfiles );&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 21 Apr 2015 22:51:18 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11977#M11460</guid>
      <dc:creator>robot</dc:creator>
      <dc:date>2015-04-21T22:51:18Z</dc:date>
    </item>
    <item>
      <title>Re: Get and Filter Files in Directory</title>
      <link>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11978#M11461</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Yes, rework the filtering loop to remove the N^2 behavior.&amp;nbsp; JSL { lists } access elements by starting at the front.&amp;nbsp; You started at the back to prevent the deleted elements from messing up the indexing.&amp;nbsp; That leads to pretty much the worst case time behavior for manipulating the list, walking i elements to reach the i'th element.&amp;nbsp; Here's a reworked version that removes the front-most element from the list of files, checks it, and inserts it as the front-most element of the filtered list.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE __default_attr="c++" __jive_macro_name="code" class="jive_macro_code _jivemacro_uid_1429662039112990 jive_text_macro" jivemacro_uid="_1429662039112990"&gt;&lt;BR /&gt;&lt;P&gt;dir = "c:\";&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t1 = Tick Seconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;files = Files In Directory( dir, Recursive );&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t2 = Tick Seconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t_getfiles = t2 - t1;&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;n_getfiles = N Items( files );&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;filteredFiles = {};&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t3 = Tick Seconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;while( (testname = Remove From( files, 1 )) != {},&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; If( Ends With( testname[1], ".jmp" ),&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; insertinto(filteredFiles,testname,1);&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; )&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;);&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t4 = Tick Seconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t_filterfiles = t4 - t3;&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;n_filterfiles = N Items( filteredFiles );&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;Show( t_getfiles, n_getfiles, t_filterfiles, n_filterfiles );&lt;/P&gt;&lt;BR /&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;t_getfiles = 4.0333333333333;&lt;/P&gt;&lt;P&gt;n_getfiles = 171630;&lt;/P&gt;&lt;P&gt;t_filterfiles = 258.5;&lt;/P&gt;&lt;P&gt;n_filterfiles = 975;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;About 5 minutes.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Another idea:&amp;nbsp; you can load a data table like this:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE __default_attr="c++" __jive_macro_name="code" class="jive_macro_code _jivemacro_uid_14296625205762417 jive_text_macro" jivemacro_uid="_14296625205762417" modifiedtitle="true"&gt;&lt;BR /&gt;&lt;P&gt;dir = "c:\";&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t1 = Tick Seconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;files = Files In Directory( dir, Recursive );&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t2 = Tick Seconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t_getfiles = t2 - t1;&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;n_getfiles = N Items( files );&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t3 = tickseconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;dt = New Table( "directory",&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; Add Rows( 0 ),&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; New Column( "filename", Character, Nominal, Set Values( files ) ),&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; New Column( "isTable",&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; Numeric,&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; Continuous,&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; Format( "Best", 12 ),&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; Formula( Ends With( :filename, ".jmp" ) )&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;&amp;nbsp; )&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;);&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;dt&amp;lt;&amp;lt;runformulas;&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;dt&amp;lt;&amp;lt;selectwhere(isTable==1);&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;dtFiltered = dt&amp;lt;&amp;lt;subset(selectedrows(1));&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t4=tickseconds();&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;t_filterfiles = t4 - t3;&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;n_filterfiles = N rows( dtFiltered );&lt;/P&gt;&lt;BR /&gt;&lt;P&gt;Show( t_getfiles, n_getfiles, t_filterfiles, n_filterfiles );&lt;/P&gt;&lt;BR /&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;t_getfiles = 4.01666666666642;&lt;/P&gt;&lt;P&gt;n_getfiles = 171635;&lt;/P&gt;&lt;P&gt;t_filterfiles = 0.716666666666697;&lt;/P&gt;&lt;P&gt;n_filterfiles = 975;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;About 5 seconds.&amp;nbsp; The &amp;lt;&amp;lt;runFormulas is required; the data table will still be evaluating the formula for the isTable column and the selectwhere won't find anything and the subset will be empty without it.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 22 Apr 2015 00:50:24 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11978#M11461</guid>
      <dc:creator>Craige_Hales</dc:creator>
      <dc:date>2015-04-22T00:50:24Z</dc:date>
    </item>
    <item>
      <title>Re: Get and Filter Files in Directory</title>
      <link>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11979#M11462</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The first example is much faster in JMP 12 (similar speed to the data table example); it appears to still be showing some N^2 behavior in JMP 11.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 22 Apr 2015 01:19:21 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11979#M11462</guid>
      <dc:creator>Craige_Hales</dc:creator>
      <dc:date>2015-04-22T01:19:21Z</dc:date>
    </item>
    <item>
      <title>Re: Get and Filter Files in Directory</title>
      <link>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11980#M11463</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks Craige, that works great!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 22 Apr 2015 18:31:47 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Get-and-Filter-Files-in-Directory/m-p/11980#M11463</guid>
      <dc:creator>robot</dc:creator>
      <dc:date>2015-04-22T18:31:47Z</dc:date>
    </item>
  </channel>
</rss>

