<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Text Explorer: find similar documents starting from a given document in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766594#M94663</link>
    <description>&lt;P&gt;Jed:&lt;BR /&gt;&lt;BR /&gt;Thanks. I tried that. Instead of Authors I had manufacturing platform type (think processing unit type in a refinery or chemical plant). Can we predict the manufacturing platform from the words in the description text? The overall misclassification rate was 60%. Looking at the classification summary, the highest predicted rate was ~0.6. So, it's safe to conclude that one platform's problems are not unique to that platform, which makes sense since there is a lot of equipment/processes that are common across platforms. My hypothesis that we should look across platforms for solutions seems reasonable...&lt;/P&gt;</description>
    <pubDate>Tue, 18 Jun 2024 04:47:36 GMT</pubDate>
    <dc:creator>markschahl</dc:creator>
    <dc:date>2024-06-18T04:47:36Z</dc:date>
    <item>
      <title>Text Explorer: find similar documents starting from a given document</title>
      <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/764905#M94429</link>
      <description>&lt;P&gt;So, I have a corpus of thousands of documents containing the details of problems that have been solved in the past. It's a vast pile of knowledge.&amp;nbsp; Each document also has metadata (of dubious quality, with omissions). Each document has 100+ tokens that tell the story of what was done.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Imagine this use case: through filters and metadata, I've found a document (ID #5678) that is pretty close to the problem that I am trying to solve now. &lt;STRONG&gt;How can I easily find all the documents that are similar to #5678?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With my current knowledge of the Text Explorer platform, I could do a Topic Analysis, then look up which topic #5678 belongs to (say, #10), then look at all the documents where Topic=10.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Is there a better way to do this?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance from stormy Kuala Lumpur, Malaysia!&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jun 2024 07:40:24 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/764905#M94429</guid>
      <dc:creator>markschahl</dc:creator>
      <dc:date>2024-06-12T07:40:24Z</dc:date>
    </item>
    <item>
      <title>Re: Text Explorer: find similar documents starting from a given document</title>
      <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/765931#M94575</link>
      <description>&lt;P&gt;Hey,&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/1051"&gt;@markschahl&lt;/a&gt;!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Topic analysis is one way. I'm also a fan of the SVD plots themselves (example in the image below; see also &lt;A href="https://www.jmp.com/support/help/en/18.0/#page/jmp/example-of-the-text-explorer-platform.shtml%23" target="_self"&gt;JMP Docs 1&lt;/A&gt;&amp;nbsp;and &lt;A href="https://www.jmp.com/support/help/en/18.0/#page/jmp/svd-report-options.shtml%23" target="_self"&gt;JMP Docs 2&lt;/A&gt;). &amp;nbsp;The tendrils going away from the center of the plot tend to have similar themes. &amp;nbsp;The doc SVD and term SVD plots provide slightly different views of the data, so you'll want to have a look at both. &amp;nbsp;With any luck, your document of interest will sit far out on a tendril, which should give you some good candidates. &amp;nbsp;&lt;/P&gt;
&lt;DIV id="tinyMceEditor_7ad7a61ea05d3bMikeD_Anderson_0" class="mceNonEditable lia-copypaste-placeholder"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="MikeD_Anderson_1-1718368454088.png" style="width: 400px;"&gt;&lt;img src="https://community.jmp.com/t5/image/serverpage/image-id/65221i25D115678AED0A4D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="MikeD_Anderson_1-1718368454088.png" alt="MikeD_Anderson_1-1718368454088.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Since you have metadata and tokens, you might also give the Torch add-in a try. &amp;nbsp;It has some language models that might do the trick. &amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2024 12:39:21 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/765931#M94575</guid>
      <dc:creator>MikeD_Anderson</dc:creator>
      <dc:date>2024-06-14T12:39:21Z</dc:date>
    </item>
    <item>
      <title>Re: Text Explorer: find similar documents starting from a given document</title>
      <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/765967#M94585</link>
      <description>&lt;P&gt;In addition to&amp;nbsp;&lt;a href="https://community.jmp.com/t5/user/viewprofilepage/user-id/4530"&gt;@MikeD_Anderson&lt;/a&gt;'s suggestion, you may also get good results using Discriminant and Correspondence Analysis. &lt;A href="https://community.jmp.com/t5/JMP-Blog/Predicting-book-authors-in-JMP-Pro/ba-p/635103" target="_self"&gt;Here's a blog entry&lt;/A&gt; that details the process.&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2024 14:55:43 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/765967#M94585</guid>
      <dc:creator>Jed_Campbell</dc:creator>
      <dc:date>2024-06-14T14:55:43Z</dc:date>
    </item>
    <item>
      <title>Re: Text Explorer: find similar documents starting from a given document</title>
      <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766154#M94614</link>
      <description>&lt;P&gt;I think you want to exclude tokens describing the solution because you are looking for similar problems that had different solutions.&lt;/P&gt;</description>
      <pubDate>Sun, 16 Jun 2024 02:16:19 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766154#M94614</guid>
      <dc:creator>Craige_Hales</dc:creator>
      <dc:date>2024-06-16T02:16:19Z</dc:date>
    </item>
    <item>
      <title>Re: Text Explorer: find similar documents starting from a given document</title>
      <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766594#M94663</link>
      <description>&lt;P&gt;Jed:&lt;BR /&gt;&lt;BR /&gt;Thanks. I tried that. Instead of Authors I had manufacturing platform type (think processing unit type in a refinery or chemical plant). Can we predict the manufacturing platform from the words in the description text? The overall misclassification rate was 60%. Looking at the classification summary, the highest predicted rate was ~0.6. So, it's safe to conclude that one platform's problems are not unique to that platform, which makes sense since there is a lot of equipment/processes that are common across platforms. My hypothesis that we should look across platforms for solutions seems reasonable...&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2024 04:47:36 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766594#M94663</guid>
      <dc:creator>markschahl</dc:creator>
      <dc:date>2024-06-18T04:47:36Z</dc:date>
    </item>
    <item>
      <title>Re: Text Explorer: find similar documents starting from a given document</title>
      <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766598#M94666</link>
      <description>&lt;P&gt;Mike:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What's interesting is comparing the Topic Analysis to SVD. SVD shows that ~56% are talking about the same thing(s).&lt;BR /&gt;I'm rethinking how many topics/clusters I should ask for. Any guidance? Is there a scree-plot equivalent for this?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Distribution of Topic.Cluster, SVD.Cluster.png" style="width: 670px;"&gt;&lt;img src="https://community.jmp.com/t5/image/serverpage/image-id/65310i66411400372B9890/image-size/large?v=v2&amp;amp;px=999" role="button" title="Distribution of Topic.Cluster, SVD.Cluster.png" alt="Distribution of Topic.Cluster, SVD.Cluster.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2024 04:56:09 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766598#M94666</guid>
      <dc:creator>markschahl</dc:creator>
      <dc:date>2024-06-18T04:56:09Z</dc:date>
    </item>
    <item>
      <title>Re: Text Explorer: find similar documents starting from a given document</title>
      <link>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766919#M94699</link>
      <description>&lt;P&gt;Craige:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have plenty of metadata on what tools/techniques were used to solve the problem. I have been cross-checking that against the free-form text story, i.e., can I trust the metadata, or is the story the real truth? I will give your suggestion a try: JMP is a high-frequency token in the corpus...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope retirement is treating you well!&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2024 22:47:15 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Text-Explorer-find-similar-documents-starting-from-a-given/m-p/766919#M94699</guid>
      <dc:creator>markschahl</dc:creator>
      <dc:date>2024-06-18T22:47:15Z</dc:date>
    </item>
  </channel>
</rss>

