statlover · Nov 22, 2019 03:44 PM

In the multivariate analysis toolkit, Latent Class Analysis (LCA) is used to predict clusters on categorical data (for example, the Health Risk Survey data in the JMP sample data library). How is that principle applied to text data? Text Explorer gives us a document term matrix (DTM) that is binary, frequency, or TF-IDF, and all three are numeric. We do not get a categorical DTM. So how does LCA work in this case? Is the binary DTM converted internally to a categorical DTM?
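For readers unfamiliar with the three DTM weightings mentioned above, here is a minimal sketch on a made-up three-document corpus. The documents, the whitespace tokenizer, and the variable names are illustrative assumptions, and the TF-IDF line uses one common variant (term count times the natural log of the inverse document frequency), which is not necessarily the weighting JMP applies.

```python
import math
from collections import Counter

# Hypothetical toy corpus; real text would need proper tokenizing and stemming.
docs = ["pain in knee", "knee pain and knee swelling", "billing question"]
tokens = [d.split() for d in docs]
vocab = sorted({t for doc in tokens for t in doc})

# Frequency DTM: how many times each term occurs in each document.
counts = [Counter(doc) for doc in tokens]
frequency_dtm = [[c[t] for t in vocab] for c in counts]

# Binary DTM: 1 if the term occurs in the document at all, else 0.
binary_dtm = [[1 if v > 0 else 0 for v in row] for row in frequency_dtm]

# TF-IDF DTM (one common variant, assumed here): count * ln(N / document frequency).
n_docs = len(docs)
doc_freq = [sum(row[j] for row in binary_dtm) for j in range(len(vocab))]
tfidf_dtm = [
    [row[j] * math.log(n_docs / doc_freq[j]) for j in range(len(vocab))]
    for row in frequency_dtm
]

for name, dtm in [("binary", binary_dtm), ("frequency", frequency_dtm)]:
    print(name, dict(zip(vocab, dtm[1])))
```

Each column of the binary DTM takes only the values 0 and 1, so it can be read as a two-level categorical variable (term absent or term present).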

Mark_Zwald · Nov 22, 2019 04:02 PM

You can learn more about LCA within Text Explorer here: https://www.jmp.com/support/help/en/15.0/#page/jmp/latent-class-analysis.shtml

And more about the LCA platform here: https://www.jmp.com/support/help/en/15.0/#page/jmp/latent-class-model-fit.shtml#ww383604

Refer to the note at the bottom: The LCA algorithm that is used in the Text Explorer platform takes advantage of the specific structure of the document term matrix. For this reason, the LCA results in the Text Explorer platform do not exactly match the results in the Latent Class Analysis platform.
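To make the idea concrete, the sketch below fits a textbook latent class model to a binary DTM with the EM algorithm, treating each term column as a two-level categorical variable (absent or present). This is a generic illustration only. It is not the algorithm JMP uses in either platform, and the function name, the smoothing constants, and the toy data are assumptions made for the example.

```python
import math
import random

def fit_binary_lca(dtm, n_classes, n_iter=100, seed=0):
    """Fit a latent class model (mixture of independent Bernoullis) by EM.

    dtm is a list of rows, one per document, each a list of 0/1 term flags.
    Returns (class_weights, term_probs, responsibilities).
    """
    rng = random.Random(seed)
    n_docs, n_terms = len(dtm), len(dtm[0])
    weights = [1.0 / n_classes] * n_classes
    # Random start away from 0 and 1 so the log terms stay finite.
    probs = [[rng.uniform(0.25, 0.75) for _ in range(n_terms)]
             for _ in range(n_classes)]
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability of each class for each document.
        resp = []
        for row in dtm:
            log_post = []
            for k in range(n_classes):
                lp = math.log(weights[k])
                for x, p in zip(row, probs[k]):
                    lp += math.log(p) if x else math.log(1.0 - p)
                log_post.append(lp)
            top = max(log_post)
            unnorm = [math.exp(lp - top) for lp in log_post]
            total = sum(unnorm)
            resp.append([u / total for u in unnorm])
        # M-step: update class weights and per-class term probabilities.
        for k in range(n_classes):
            mass = sum(r[k] for r in resp)
            # Tiny smoothing keeps every weight positive.
            weights[k] = (mass + 1e-6) / (n_docs + n_classes * 1e-6)
            for j in range(n_terms):
                hits = sum(r[k] * row[j] for r, row in zip(resp, dtm))
                # Light smoothing keeps probabilities strictly inside (0, 1).
                probs[k][j] = (hits + 0.01) / (mass + 0.02)
    return weights, probs, resp

# Hypothetical binary DTM: rows are documents, columns are terms.
dtm = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
weights, probs, resp = fit_binary_lca(dtm, n_classes=2)
# Hard assignment: the most probable class for each document.
labels = [max(range(len(r)), key=r.__getitem__) for r in resp]
print(labels)
```

Class numbering is arbitrary, so only the grouping of documents is meaningful. An implementation specialized for sparse document term matrices may use different starting values, smoothing, or convergence rules, which is one plausible reason two LCA implementations would not match exactly.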

statlover · Nov 25, 2019 01:28 PM

Thank you very much. I tried to reproduce the results in the Multivariate LCA platform after converting the DTM to a categorical matrix. Because the output was very different, I could not draw any conclusion. But your feedback is valuable.

Mark_Bailey · Nov 23, 2019 11:41 AM

The LCA is an unsupervised learning method in Text Explorer. It discovers clusters of documents. It is not a classifier.

statlover · Nov 25, 2019 01:31 PM

Thanks, Mark Bailey. I understand that it is an unsupervised learning algorithm. I was trying to see whether I could get the same result from the Multivariate platform and the Text Explorer platform. It appears the mechanism is different. Thanks for responding to my question.

How does Latent Class Analysis work on Text Explorer
