<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Memory Demands of Multiple Correspondence Analysis in Discussions</title>
    <link>https://community.jmp.com/t5/Discussions/Memory-Demands-of-Multiple-Correspondence-Analysis/m-p/488764#M73232</link>
    <description>&lt;P&gt;I think it might help if you can share some illustrative data.&lt;/P&gt;
&lt;P&gt;I don't really understand what the objective of your analysis is and what the data that you are trying to analyse looks like. I am not sure if MCA is appropriate.&lt;/P&gt;
&lt;P&gt;If I understand, you have a table with 4257 rows. That is not a problem for MCA in JMP Pro.&lt;/P&gt;
&lt;P&gt;A more important factor will be the number of levels within each variable.&lt;/P&gt;</description>
    <pubDate>Fri, 20 May 2022 07:52:28 GMT</pubDate>
    <dc:creator>Phil_Kay</dc:creator>
    <dc:date>2022-05-20T07:52:28Z</dc:date>
    <item>
      <title>Memory Demands of Multiple Correspondence Analysis</title>
      <link>https://community.jmp.com/t5/Discussions/Memory-Demands-of-Multiple-Correspondence-Analysis/m-p/488306#M73179</link>
      <description>&lt;P&gt;A couple of days ago I tried, somewhat thoughtlessly, an MCA on four variables from the Twitter record. These were all user names, user screen names, in-response-to names, and the like. All are alphabetic of random length.&amp;nbsp; The procedure ran perfecty.&lt;/P&gt;&lt;P&gt;Now I have started to redo the analysis, thinking first. My obvious mistake.&amp;nbsp; on a 4257 case file, I cannot get the MCA to finish executing (I have not waited long enough).The&amp;nbsp; Windows Task Manager tells me the program is not responding, but, of course, core size and CPU usage continually change.&lt;/P&gt;&lt;P&gt;I speculated about the number of categories that might be generated from these variables. So, I picked the user ID and the ID of the in-response-to entity that occasioned the tweet.&amp;nbsp; This is running on a Dell&amp;nbsp; 8 core machine with 64GB of memory.&amp;nbsp; It has been running for two hours, using 8% of the CPU (varying)a couple of percent) and under 5,000&amp;nbsp; MB memory.&amp;nbsp; The comment line lists "not responding.."&amp;nbsp; I can let the job run all night at no extra cost.&amp;nbsp; But the underling logic bothes me, because I do not know it well enough.&lt;/P&gt;&lt;P&gt;a. If a variable uses numbers, but is designated as nominal modeling--the maximum number of categories is the number of cases (e.g. the number of tweet authors in the file).&lt;/P&gt;&lt;P&gt;b. If a variable uses arbitrary characters, words, or names, the same number holds--the maximum dimension in the direction fo the cases is the number of observations.&amp;nbsp; So no difference.&lt;/P&gt;&lt;P&gt;c. If a variable is modeled as a number, even if ordinal, the number of positions might be the number of places between the lowest and highest number in the set.&lt;/P&gt;&lt;P&gt;d. One of the JMP Community members did a 600,000 case study and produced neat graphs.&amp;nbsp; He&amp;nbsp; must have used a magic wand--or had very few values in the target variable.&lt;/P&gt;&lt;P&gt;e. If I must reckon with creating a who-to-whom matrix, I must be prepared for (in my case) a 4257 x 4256 or thereabouts matrix which is then to be simplified.&amp;nbsp; But at first guess, this matrix would be under 19 million words.&amp;nbsp; That is not much for a computer.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What am I failing to understand?&amp;nbsp; Am I limited to something like my 4257 cases&amp;nbsp; times a variable with only a dozen or two values?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, since you folks have been so kind in the past, I thought I might toss this out.&amp;nbsp; If someone wants, I can upload a table with all the cases and a few variables, but don't want to flood somebody else's storage.&lt;BR /&gt;:&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 10 Jun 2023 23:48:34 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Memory-Demands-of-Multiple-Correspondence-Analysis/m-p/488306#M73179</guid>
      <dc:creator>LNitz</dc:creator>
      <dc:date>2023-06-10T23:48:34Z</dc:date>
    </item>
    <item>
      <title>Re: Memory Demands of Multiple Correspondence Analysis</title>
      <link>https://community.jmp.com/t5/Discussions/Memory-Demands-of-Multiple-Correspondence-Analysis/m-p/488764#M73232</link>
      <description>&lt;P&gt;I think it might help if you can share some illustrative data.&lt;/P&gt;
&lt;P&gt;I don't really understand what the objective of your analysis is and what the data that you are trying to analyse looks like. I am not sure if MCA is appropriate.&lt;/P&gt;
&lt;P&gt;If I understand, you have a table with 4257 rows. That is not a problem for MCA in JMP Pro.&lt;/P&gt;
&lt;P&gt;A more important factor will be the number of levels within each variable.&lt;/P&gt;</description>
      <pubDate>Fri, 20 May 2022 07:52:28 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Memory-Demands-of-Multiple-Correspondence-Analysis/m-p/488764#M73232</guid>
      <dc:creator>Phil_Kay</dc:creator>
      <dc:date>2022-05-20T07:52:28Z</dc:date>
    </item>
    <item>
      <title>Re: Memory Demands of Multiple Correspondence Analysis</title>
      <link>https://community.jmp.com/t5/Discussions/Memory-Demands-of-Multiple-Correspondence-Analysis/m-p/488773#M73233</link>
      <description>&lt;P&gt;Thanks, Phil.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The issue is the number of categories.&amp;nbsp; It is not that the program cannot compute with a thousand or so categories, but it takes a really long time.&amp;nbsp; The question I am asking is who responds to whom.&amp;nbsp; I will play with this a bit more to see if I can screen one of the variables to reduce the number of categories.&amp;nbsp; If anything comes out of it, I will post results.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Larry&lt;/P&gt;</description>
      <pubDate>Fri, 20 May 2022 08:08:17 GMT</pubDate>
      <guid>https://community.jmp.com/t5/Discussions/Memory-Demands-of-Multiple-Correspondence-Analysis/m-p/488773#M73233</guid>
      <dc:creator>LNitz</dc:creator>
      <dc:date>2022-05-20T08:08:17Z</dc:date>
    </item>
  </channel>
</rss>

